RoBERTa: A Robustly Optimized BERT Pretraining Approach Paper โข 1907.11692 โข Published Jul 26, 2019 โข 7
Visual Transformers: Token-based Image Representation and Processing for Computer Vision Paper โข 2006.03677 โข Published Jun 5, 2020 โข 1
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale Paper โข 2010.11929 โข Published Oct 22, 2020 โข 7
CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor Paper โข 2312.07661 โข Published Dec 12, 2023 โข 16