ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2105.01601
  4. Cited By
MLP-Mixer: An all-MLP Architecture for Vision

MLP-Mixer: An all-MLP Architecture for Vision

4 May 2021
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
Thomas Unterthiner
Jessica Yung
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
ArXivPDFHTML

Papers citing "MLP-Mixer: An all-MLP Architecture for Vision"

50 / 1,119 papers shown
Title
Unsupervised Learning of Full-Waveform Inversion: Connecting CNN and
  Partial Differential Equation in a Loop
Unsupervised Learning of Full-Waveform Inversion: Connecting CNN and Partial Differential Equation in a Loop
Peng Jin
Xitong Zhang
Yinpeng Chen
Sharon X. Huang
Zicheng Liu
Youzuo Lin
64
48
0
14 Oct 2021
Two-argument activation functions learn soft XOR operations like
  cortical neurons
Two-argument activation functions learn soft XOR operations like cortical neurons
Kijung Yoon
Emin Orhan
Juhyeon Kim
Xaq Pitkow
MLT
27
0
0
13 Oct 2021
Open-Set Recognition: a Good Closed-Set Classifier is All You Need?
Open-Set Recognition: a Good Closed-Set Classifier is All You Need?
S. Vaze
Kai Han
Andrea Vedaldi
Andrew Zisserman
BDL
167
404
0
12 Oct 2021
Phase Collapse in Neural Networks
Phase Collapse in Neural Networks
Florentin Guth
J. Zarka
S. Mallat
6
7
0
11 Oct 2021
Vision Transformer based COVID-19 Detection using Chest X-rays
Vision Transformer based COVID-19 Detection using Chest X-rays
Koushik Sivarama Krishnan
Karthik Sivarama Krishnan
ViT
MedIm
22
54
0
09 Oct 2021
Adversarial Token Attacks on Vision Transformers
Adversarial Token Attacks on Vision Transformers
Ameya Joshi
Gauri Jagatap
C. Hegde
ViT
30
19
0
08 Oct 2021
UniNet: Unified Architecture Search with Convolution, Transformer, and
  MLP
UniNet: Unified Architecture Search with Convolution, Transformer, and MLP
Jihao Liu
Hongsheng Li
Guanglu Song
Xin Huang
Yu Liu
ViT
29
35
0
08 Oct 2021
MC-LCR: Multi-modal contrastive classification by locally correlated
  representations for effective face forgery detection
MC-LCR: Multi-modal contrastive classification by locally correlated representations for effective face forgery detection
Gaojian Wang
Qian Jiang
Xin Jin
Wei Li
Xiaohui Cui
CVBM
26
25
0
07 Oct 2021
Attention is All You Need? Good Embeddings with Statistics are
  enough:Large Scale Audio Understanding without Transformers/ Convolutions/
  BERTs/ Mixers/ Attention/ RNNs or ....
Attention is All You Need? Good Embeddings with Statistics are enough:Large Scale Audio Understanding without Transformers/ Convolutions/ BERTs/ Mixers/ Attention/ RNNs or ....
Prateek Verma
AI4TS
24
2
0
07 Oct 2021
Adversarial Robustness Comparison of Vision Transformer and MLP-Mixer to
  CNNs
Adversarial Robustness Comparison of Vision Transformer and MLP-Mixer to CNNs
Philipp Benz
Soomin Ham
Chaoning Zhang
Adil Karjauv
In So Kweon
AAML
ViT
29
78
0
06 Oct 2021
PoNet: Pooling Network for Efficient Token Mixing in Long Sequences
PoNet: Pooling Network for Efficient Token Mixing in Long Sequences
Chao-Hong Tan
Qian Chen
Wen Wang
Qinglin Zhang
Siqi Zheng
Zhenhua Ling
ViT
17
11
0
06 Oct 2021
Exploring the Limits of Large Scale Pre-training
Exploring the Limits of Large Scale Pre-training
Samira Abnar
Mostafa Dehghani
Behnam Neyshabur
Hanie Sedghi
AI4CE
55
114
0
05 Oct 2021
Deep Instance Segmentation with Automotive Radar Detection Points
Deep Instance Segmentation with Automotive Radar Detection Points
Jianan Liu
Weiyi Xiong
Liping Bai
Yu Xia
Tao Huang
Wanli Ouyang
Bing Zhu
46
53
0
05 Oct 2021
Efficient Identification of Butterfly Sparse Matrix Factorizations
Efficient Identification of Butterfly Sparse Matrix Factorizations
Léon Zheng
E. Riccietti
Rémi Gribonval
30
6
0
04 Oct 2021
ResNet strikes back: An improved training procedure in timm
ResNet strikes back: An improved training procedure in timm
Ross Wightman
Hugo Touvron
Hervé Jégou
AI4TS
209
487
0
01 Oct 2021
UFO-ViT: High Performance Linear Vision Transformer without Softmax
UFO-ViT: High Performance Linear Vision Transformer without Softmax
Jeonggeun Song
ViT
106
20
0
29 Sep 2021
Scale Efficiently: Insights from Pre-training and Fine-tuning
  Transformers
Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers
Yi Tay
Mostafa Dehghani
J. Rao
W. Fedus
Samira Abnar
Hyung Won Chung
Sharan Narang
Dani Yogatama
Ashish Vaswani
Donald Metzler
198
110
0
22 Sep 2021
Sparse MLP for Image Recognition: Is Self-Attention Really Necessary?
Sparse MLP for Image Recognition: Is Self-Attention Really Necessary?
Chuanxin Tang
Yucheng Zhao
Guangting Wang
Chong Luo
Wenxuan Xie
Wenjun Zeng
MoE
ViT
27
98
0
12 Sep 2021
RobustART: Benchmarking Robustness on Architecture Design and Training
  Techniques
RobustART: Benchmarking Robustness on Architecture Design and Training Techniques
Shiyu Tang
Ruihao Gong
Yan Wang
Aishan Liu
Jiakai Wang
...
Xianglong Liu
D. Song
Alan Yuille
Philip H. S. Torr
Dacheng Tao
VLM
AAML
21
106
0
11 Sep 2021
ConvMLP: Hierarchical Convolutional MLPs for Vision
ConvMLP: Hierarchical Convolutional MLPs for Vision
Jiachen Li
Ali Hassani
Steven Walton
Humphrey Shi
38
55
0
09 Sep 2021
Axial multi-layer perceptron architecture for automatic segmentation of
  choroid plexus in multiple sclerosis
Axial multi-layer perceptron architecture for automatic segmentation of choroid plexus in multiple sclerosis
Marius Schmidt-Mengin
Vito A G Ricigliano
B. Bodini
E. Morena
A. Colombi
Mariem Hamzaoui
Arya Yazdan Panah
B. Stankoff
O. Colliot
15
15
0
08 Sep 2021
Cross-token Modeling with Conditional Computation
Cross-token Modeling with Conditional Computation
Yuxuan Lou
Fuzhao Xue
Zangwei Zheng
Yang You
MoE
22
19
0
05 Sep 2021
CrypTen: Secure Multi-Party Computation Meets Machine Learning
CrypTen: Secure Multi-Party Computation Meets Machine Learning
Brian Knott
Shobha Venkataraman
Awni Y. Hannun
Shubho Sengupta
Mark Ibrahim
L. V. D. van der Maaten
16
346
0
02 Sep 2021
SANSformers: Self-Supervised Forecasting in Electronic Health Records
  with Attention-Free Models
SANSformers: Self-Supervised Forecasting in Electronic Health Records with Attention-Free Models
Yogesh Kumar
Alexander Ilin
H. Salo
S. Kulathinal
M. Leinonen
Pekka Marttinen
AI4TS
MedIm
20
0
0
31 Aug 2021
Hire-MLP: Vision MLP via Hierarchical Rearrangement
Hire-MLP: Vision MLP via Hierarchical Rearrangement
Jianyuan Guo
Yehui Tang
Kai Han
Xinghao Chen
Han Wu
Chao Xu
Chang Xu
Yunhe Wang
38
105
0
30 Aug 2021
A Battle of Network Structures: An Empirical Study of CNN, Transformer,
  and MLP
A Battle of Network Structures: An Empirical Study of CNN, Transformer, and MLP
Yucheng Zhao
Guangting Wang
Chuanxin Tang
Chong Luo
Wenjun Zeng
Zhengjun Zha
26
69
0
30 Aug 2021
SERF: Towards better training of deep neural networks using log-Softplus
  ERror activation Function
SERF: Towards better training of deep neural networks using log-Softplus ERror activation Function
Sayan Nag
Mayukh Bhattacharyya
LLMSV
25
22
0
21 Aug 2021
PatchCleanser: Certifiably Robust Defense against Adversarial Patches
  for Any Image Classifier
PatchCleanser: Certifiably Robust Defense against Adversarial Patches for Any Image Classifier
Chong Xiang
Saeed Mahloujifar
Prateek Mittal
VLM
AAML
11
73
0
20 Aug 2021
Do Vision Transformers See Like Convolutional Neural Networks?
Do Vision Transformers See Like Convolutional Neural Networks?
M. Raghu
Thomas Unterthiner
Simon Kornblith
Chiyuan Zhang
Alexey Dosovitskiy
ViT
41
922
0
19 Aug 2021
Modulating Language Models with Emotions
Modulating Language Models with Emotions
Ruibo Liu
Jason W. Wei
Chenyan Jia
Soroush Vosoughi
16
20
0
17 Aug 2021
MOI-Mixer: Improving MLP-Mixer with Multi Order Interactions in
  Sequential Recommendation
MOI-Mixer: Improving MLP-Mixer with Multi Order Interactions in Sequential Recommendation
Hojoon Lee
Dongyoon Hwang
Sunghwan Hong
Changyeon Kim
Seungryong Kim
Jaegul Choo
11
10
0
17 Aug 2021
3D High-Fidelity Mask Face Presentation Attack Detection Challenge
3D High-Fidelity Mask Face Presentation Attack Detection Challenge
Ajian Liu
Chenxu Zhao
Zitong Yu
Anyang Su
Xing Liu
...
Jun Wan
Sergio Escalera
Hugo Jair Escalante
Zhen Lei
G. Guo
CVBM
18
34
0
16 Aug 2021
Mobile-Former: Bridging MobileNet and Transformer
Mobile-Former: Bridging MobileNet and Transformer
Yinpeng Chen
Xiyang Dai
Dongdong Chen
Mengchen Liu
Xiaoyi Dong
Lu Yuan
Zicheng Liu
ViT
172
476
0
12 Aug 2021
RaftMLP: How Much Can Be Done Without Attention and with Less Spatial
  Locality?
RaftMLP: How Much Can Be Done Without Attention and with Less Spatial Locality?
Yuki Tatsunami
Masato Taki
19
12
0
09 Aug 2021
Understanding the computational demands underlying visual reasoning
Understanding the computational demands underlying visual reasoning
Mohit Vaishnav
Rémi Cadène
A. Alamia
Drew Linsley
Rufin VanRullen
Thomas Serre
GNN
CoGe
32
16
0
08 Aug 2021
Large-Scale Differentially Private BERT
Large-Scale Differentially Private BERT
Rohan Anil
Badih Ghazi
Vineet Gupta
Ravi Kumar
Pasin Manurangsi
22
131
0
03 Aug 2021
Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer
Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer
Yifan Xu
Zhijie Zhang
Mengdan Zhang
Kekai Sheng
Ke Li
Weiming Dong
Liqing Zhang
Changsheng Xu
Xing Sun
ViT
24
201
0
03 Aug 2021
S$^2$-MLPv2: Improved Spatial-Shift MLP Architecture for Vision
S2^22-MLPv2: Improved Spatial-Shift MLP Architecture for Vision
Tan Yu
Xu Li
Yunfeng Cai
Mingming Sun
Ping Li
37
50
0
02 Aug 2021
Structure and Performance of Fully Connected Neural Networks: Emerging
  Complex Network Properties
Structure and Performance of Fully Connected Neural Networks: Emerging Complex Network Properties
Leonardo F. S. Scabini
Odemir M. Bruno
GNN
6
51
0
29 Jul 2021
Experiments on Properties of Hidden Structures of Sparse Neural Networks
Experiments on Properties of Hidden Structures of Sparse Neural Networks
Julian Stier
Harsh Darji
Michael Granitzer
14
2
0
27 Jul 2021
Pointer Value Retrieval: A new benchmark for understanding the limits of
  neural network generalization
Pointer Value Retrieval: A new benchmark for understanding the limits of neural network generalization
Chiyuan Zhang
M. Raghu
Jon M. Kleinberg
Samy Bengio
OOD
19
30
0
27 Jul 2021
CycleMLP: A MLP-like Architecture for Dense Prediction
CycleMLP: A MLP-like Architecture for Dense Prediction
Shoufa Chen
Enze Xie
Chongjian Ge
Runjian Chen
Ding Liang
Ping Luo
19
231
0
21 Jul 2021
Deep learning for temporal data representation in electronic health
  records: A systematic review of challenges and methodologies
Deep learning for temporal data representation in electronic health records: A systematic review of challenges and methodologies
F. Xie
Han Yuan
Yilin Ning
M. Ong
Mengling Feng
W. Hsu
B. Chakraborty
Nan Liu
19
83
0
21 Jul 2021
AS-MLP: An Axial Shifted MLP Architecture for Vision
AS-MLP: An Axial Shifted MLP Architecture for Vision
Dongze Lian
Zehao Yu
Xing Sun
Shenghua Gao
17
189
0
18 Jul 2021
SA-GD: Improved Gradient Descent Learning Strategy with Simulated
  Annealing
SA-GD: Improved Gradient Descent Learning Strategy with Simulated Annealing
Zhicheng Cai
10
4
0
15 Jul 2021
Hierarchical Associative Memory
Hierarchical Associative Memory
Dmitry Krotov
BDL
91
31
0
14 Jul 2021
The Brownian motion in the transformer model
The Brownian motion in the transformer model
Yingshi Chen
19
1
0
12 Jul 2021
ViTGAN: Training GANs with Vision Transformers
ViTGAN: Training GANs with Vision Transformers
Kwonjoon Lee
Huiwen Chang
Lu Jiang
Han Zhang
Z. Tu
Ce Liu
ViT
18
183
0
09 Jul 2021
Vision Xformers: Efficient Attention for Image Classification
Vision Xformers: Efficient Attention for Image Classification
Pranav Jeevan
Amit Sethi
ViT
17
13
0
05 Jul 2021
What Makes for Hierarchical Vision Transformer?
What Makes for Hierarchical Vision Transformer?
Yuxin Fang
Xinggang Wang
Rui Wu
Wenyu Liu
ViT
11
9
0
05 Jul 2021
Previous
123...20212223
Next