Transformers in Vision
- [2209.07399] A Light Recipe to Train Robust Vision Transformers
- [2412.04786] Slicing Vision Transformer for Flexible Inference
Architectures
https://github.com/cmhungsteve/Awesome-Transformer-Attention
ViT
- Vision Transformer (ViT) under the magnifying glass, Part 1 | by Kate Yurkova | Medium
- Transformers-Tutorials/VisionTransformer at master · NielsRogge/Transformers-Tutorials · GitHub
Swin
DEIT
GitHub - facebookresearch/deit: Official DeiT repository
CiT
Training
Small Data
https://github.com/hananshafi/vits-for-small-scale-datasets
Detection
VitDet
detectron2/projects/ViTDet at main · facebookresearch/detectron2 · GitHub
Links
Code
- GitHub - lucidrains/vit-pytorch: Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
- GitHub - ShoufaChen/AdaptFormer: [NeurIPS 2022] Implementation of “AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition”