File:Results-ViT.jpg
Size of this preview: 800 × 259 pixels. Other resolution: 1,027 × 332 pixels.
Original file (1,027 × 332 pixels, file size: 88 KB, MIME type: image/jpeg)
Summary
Description | English: Results-ViT |
Date | 31 January 2020 |
File source | Dosovitskiy, Alexey, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani et al. "An image is worth 16x16 words: Transformers for image recognition at scale." arXiv preprint arXiv:2010.11929 (2020). |
Author | Dosovitskiy, Alexey, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani |
Licensing
|
File history
Click on a date/time to view the file as it appeared at that time.
Date/Time | Thumbnail | Dimensions | User | Comment | |
---|---|---|---|---|---|
current | 18:20, 17 April 2023 | 1,027 × 332 (88 KB) | HarshineeSriram (talk | contribs) | Uploaded a work by Dosovitskiy, Alexey, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani from Dosovitskiy, Alexey, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani et al. "An image is worth 16x16 words: Transformers for image recognition at scale." arXiv preprint arXiv:2010.11929 (2020). with UploadWizard |
You cannot overwrite this file.
File usage
The following 2 pages use this file: