- GitHub - pavelsuma/ames: AMES: Asymmetric and Memory-Efficient Similarity
- [2412.00784] EDTformer: An Efficient Decoder Transformer for Visual Place Recognition
- [2406.16204] Breaking the Frame: Visual Place Recognition by Overlap Prediction
Local Feature Based
VLAD
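A minimal sketch of VLAD aggregation, assuming the local descriptors and a k-means codebook (`centroids`) are already available: each descriptor is assigned to its nearest centroid, the residuals are summed per centroid, and the concatenated result is L2-normalized. The function name and the toy data are illustrative only; common refinements such as power normalization or intra-normalization are left out.

```python
import math
from typing import List, Sequence


def vlad(descriptors: Sequence[Sequence[float]],
         centroids: Sequence[Sequence[float]]) -> List[float]:
    """Aggregate local descriptors into one VLAD vector of length k * d."""
    k, d = len(centroids), len(centroids[0])
    residuals = [[0.0] * d for _ in range(k)]
    for desc in descriptors:
        # Hard-assign the descriptor to its nearest centroid (squared L2).
        nearest = min(
            range(k),
            key=lambda i: sum((a - b) ** 2 for a, b in zip(desc, centroids[i])),
        )
        # Accumulate the residual (descriptor minus centroid).
        for j in range(d):
            residuals[nearest][j] += desc[j] - centroids[nearest][j]
    # Concatenate the per-centroid residual sums and L2-normalize.
    flat = [v for row in residuals for v in row]
    norm = math.sqrt(sum(v * v for v in flat))
    return [v / norm for v in flat] if norm > 0 else flat


# Toy example: 2 centroids, 3 two-dimensional descriptors (made-up values).
toy_centroids = [[0.0, 0.0], [10.0, 10.0]]
toy_descriptors = [[1.0, 0.0], [0.0, 1.0], [11.0, 10.0]]
toy_vlad = vlad(toy_descriptors, toy_centroids)
```

The output dimension is the number of centroids times the descriptor dimension, so the codebook size directly controls the size of the global descriptor.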
Deep Learning Based
DELF
DELG
NetVLAD
MAC
GeM
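MAC and GeM are both global pooling operators over a CNN feature map. A hedged sketch, assuming non-negative (post-ReLU) activations and a feature map already flattened to one list of activations per channel: GeM computes the generalized mean $\left(\frac{1}{N}\sum_x x^p\right)^{1/p}$ per channel, which reduces to average pooling at $p = 1$ and approaches max pooling (MAC) as $p$ grows. The function names, the default `p = 3.0`, and the toy values are assumptions for illustration.

```python
from typing import List, Sequence


def gem_pool(channels: Sequence[Sequence[float]],
             p: float = 3.0, eps: float = 1e-6) -> List[float]:
    """Generalized-mean pooling: one scalar per channel."""
    pooled = []
    for activations in channels:
        # Clamp to eps so that fractional powers stay well defined.
        clamped = [max(a, eps) for a in activations]
        mean_pow = sum(a ** p for a in clamped) / len(clamped)
        pooled.append(mean_pow ** (1.0 / p))
    return pooled


def mac_pool(channels: Sequence[Sequence[float]]) -> List[float]:
    """MAC: maximum activation per channel (the large-p limit of GeM)."""
    return [max(activations) for activations in channels]


# Toy feature map: 2 channels, 4 spatial positions each (made-up values).
feature_map = [[1.0, 2.0, 3.0, 4.0], [0.5, 0.5, 0.5, 0.5]]
avg_like = gem_pool(feature_map, p=1.0)   # behaves like average pooling
gem_desc = gem_pool(feature_map, p=3.0)   # between average and max
mac_desc = mac_pool(feature_map)
```

In practice the pooled vector is usually L2-normalized (and often whitened) before being used for retrieval, and `p` can be fixed or learned.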
Losses / Objectives
Datasets / Benchmarks
Oxford5K
ROxford5K
Google Landmarks
MET
Products-10K
Large scale product recognition dataset
AIcrowd Products
AIcrowd Visual Product Recognition Challenge 2023
Amazon Reviews
Tricks of the Trade
Query Expansion
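One common form is average query expansion (AQE): run the query once, sum the query descriptor with its top-k retrieved descriptors, renormalize, and search again with the result. A minimal sketch under the assumption that all descriptors are L2-normalized global descriptors compared by dot product; the helper names and toy database are hypothetical.

```python
import math
from typing import List, Sequence


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    return sum(x * y for x, y in zip(a, b))


def l2_normalize(v: Sequence[float]) -> List[float]:
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def average_query_expansion(query: Sequence[float],
                            database: Sequence[Sequence[float]],
                            top_k: int = 2) -> List[float]:
    """Return a new query built from the original query and its top-k neighbors."""
    scores = [dot(query, d) for d in database]
    top = sorted(range(len(database)), key=lambda i: scores[i], reverse=True)[:top_k]
    summed = list(query)
    for i in top:
        for j in range(len(summed)):
            summed[j] += database[i][j]
    # Summing then L2-normalizing gives the same direction as averaging.
    return l2_normalize(summed)


# Toy 2-D example (made-up, unit-length descriptors).
toy_db = [[0.8, 0.6], [0.6, 0.8], [0.0, 1.0], [-1.0, 0.0]]
toy_query = [1.0, 0.0]
expanded = average_query_expansion(toy_query, toy_db, top_k=2)
```

The expanded query moves toward the retrieved neighbors, which helps recall when the top results are correct and hurts when they are not; weighted variants such as αQE down-weight less similar neighbors.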
Index Augmentation
Reranking
Spatial Verification
Ideas
- tokenformer for instance recognition (learned codebook)
- VLMs for instance recognition (same-or-not decision, prompting with multiple examples ⇒ generate a query token with PrefixLM-style masking)
Links
- [2405.18065v1] EffoVPR: Effective Foundation Model Utilization for Visual Place Recognition
- [2204.02287v2] Rethinking Visual Geo-localization for Large-Scale Applications
- [2405.07364v2] BoQ: A Place is Worth a Bag of Learnable Queries
- ILR2024 instance level recognition