Vision-Language Pre-Training
https://github.com/salesforce/lavis
https://github.com/uta-smile/TCL
https://github.com/YehLi/xmodaler
https://github.com/sangminwoo/awesome-vision-and-language
Masked Vision and Language Modeling for Multi-modal Representation Learning (2022-08-03)
GitHub - guilk/VLC: Research code for "Training Vision-Language Transformers from Captions Alone"
GitHub - RERV/UniAdapter
![[Screen Shot 2023-04-19 at 1.13.36 PM.png]]
Contrastive
CLIP
GitHub - baaivision/EVA: EVA Series: Vision Foundation Model Fanatics from BAAI
e-CLIP: Large-Scale Vision-Language Representation Learning in E-commerce