Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.03478
Cited By
Convolutional Neural Networks and Vision Transformers for Fashion MNIST Classification: A Literature Review
5 June 2024
Sonia Bbouzidi
Ghazala Hcini
Imen Jdey
Fadoua Drira
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Convolutional Neural Networks and Vision Transformers for Fashion MNIST Classification: A Literature Review"
22 / 22 papers shown
Title
Large Language Models Implicitly Learn to See and Hear Just By Reading
Prateek Verma
Mert Pilanci
148
0
0
20 May 2025
HSViT: Horizontally Scalable Vision Transformer
Chenhao Xu
Chang-Tsun Li
Chee Peng Lim
Douglas Creighton
ViT
45
2
0
08 Apr 2024
A Comprehensive Survey of Transformers for Computer Vision
Sonain Jamil
Md. Jalil Piran
Oh-Jin Kwon
ViT
45
52
0
11 Nov 2022
PatchRot: A Self-Supervised Technique for Training Vision Transformers
S. Chhabra
Prabal Bijoy Dutta
Hemanth Venkateswara
Baoxin Li
SSL
ViT
34
2
0
27 Oct 2022
Dilated Neighborhood Attention Transformer
Ali Hassani
Humphrey Shi
ViT
MedIm
56
73
0
29 Sep 2022
DeiT III: Revenge of the ViT
Hugo Touvron
Matthieu Cord
Hervé Jégou
ViT
116
409
0
14 Apr 2022
Rethinking Semantic Segmentation: A Prototype View
Tianfei Zhou
Wenguan Wang
E. Konukoglu
Luc Van Gool
SSeg
99
269
0
28 Mar 2022
AdaViT: Adaptive Tokens for Efficient Vision Transformer
Hongxu Yin
Arash Vahdat
J. Álvarez
Arun Mallya
Jan Kautz
Pavlo Molchanov
ViT
79
335
0
14 Dec 2021
Post-Training Quantization for Vision Transformer
Zhenhua Liu
Yunhe Wang
Kai Han
Siwei Ma
Wen Gao
ViT
MQ
89
337
0
27 Jun 2021
CoAtNet: Marrying Convolution and Attention for All Data Sizes
Zihang Dai
Hanxiao Liu
Quoc V. Le
Mingxing Tan
ViT
102
1,196
0
09 Jun 2021
Segmenter: Transformer for Semantic Segmentation
Robin Strudel
Ricardo Garcia Pinel
Ivan Laptev
Cordelia Schmid
ViT
184
1,460
0
12 May 2021
Conformer: Local Features Coupling Global Representations for Visual Recognition
Zhiliang Peng
Wei Huang
Shanzhi Gu
Lingxi Xie
Yaowei Wang
Jianbin Jiao
QiXiang Ye
ViT
56
538
0
09 May 2021
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
611
6,029
0
29 Apr 2021
Going deeper with Image Transformers
Hugo Touvron
Matthieu Cord
Alexandre Sablayrolles
Gabriel Synnaeve
Hervé Jégou
ViT
133
1,006
0
31 Mar 2021
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
Chun-Fu Chen
Quanfu Fan
Yikang Shen
ViT
68
1,469
0
27 Mar 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
415
21,347
0
25 Mar 2021
ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases
Stéphane dÁscoli
Hugo Touvron
Matthew L. Leavitt
Ari S. Morcos
Giulio Biroli
Levent Sagun
ViT
110
824
0
19 Mar 2021
LambdaNetworks: Modeling Long-Range Interactions Without Attention
Irwan Bello
322
180
0
17 Feb 2021
Transformers in Vision: A Survey
Salman Khan
Muzammal Naseer
Munawar Hayat
Syed Waqas Zamir
Fahad Shahbaz Khan
M. Shah
ViT
274
2,492
0
04 Jan 2021
Squeeze-and-Attention Networks for Semantic Segmentation
Zilong Zhong
Z. Q. Lin
Rene Bidart
Xiaodan Hu
Ibrahim Ben Daya
Zhifeng Li
Wei-Shi Zheng
Jonathan Li
A. Wong
SSeg
61
235
0
08 Sep 2019
Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms
Han Xiao
Kashif Rasul
Roland Vollgraf
256
8,856
0
25 Aug 2017
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan
Andrew Zisserman
FAtt
MDE
1.5K
100,213
0
04 Sep 2014
1