Transformers in Vision

[2209.07399] A Light Recipe to Train Robust Vision Transformers

Architectures

https://github.com/cmhungsteve/Awesome-Transformer-Attention

ViT

Swin

DEIT

GitHub - facebookresearch/deit: Official DeiT repository

CiT

GitHub - facebookresearch/CiT: Code for the paper titled “CiT Curation in Training for Effective Vision-Language Data”.

Training

Small Data

https://github.com/hananshafi/vits-for-small-scale-datasets

Detection

VitDet

detectron2/projects/ViTDet at main · facebookresearch/detectron2 · GitHub

Links

Code

Videos

Papers