ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2105.01601
  4. Cited By
MLP-Mixer: An all-MLP Architecture for Vision

MLP-Mixer: An all-MLP Architecture for Vision

4 May 2021
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
Thomas Unterthiner
Jessica Yung
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
ArXivPDFHTML

Papers citing "MLP-Mixer: An all-MLP Architecture for Vision"

50 / 1,119 papers shown
Title
Augmenting Convolutional networks with attention-based aggregation
Augmenting Convolutional networks with attention-based aggregation
Hugo Touvron
Matthieu Cord
Alaaeldin El-Nouby
Piotr Bojanowski
Armand Joulin
Gabriel Synnaeve
Hervé Jégou
ViT
30
47
0
27 Dec 2021
RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality
RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality
Xiaohan Ding
Honghao Chen
X. Zhang
Jungong Han
Guiguang Ding
17
71
0
21 Dec 2021
Couplformer:Rethinking Vision Transformer with Coupling Attention Map
Couplformer:Rethinking Vision Transformer with Coupling Attention Map
Hai Lan
Xihao Wang
Xian Wei
ViT
26
3
0
10 Dec 2021
Spatio-temporal Relation Modeling for Few-shot Action Recognition
Spatio-temporal Relation Modeling for Few-shot Action Recognition
Anirudh Thatipelli
Sanath Narayan
Salman Khan
Rao Muhammad Anwer
F. Khan
Bernard Ghanem
ViT
22
87
0
09 Dec 2021
3D Medical Point Transformer: Introducing Convolution to Attention
  Networks for Medical Point Cloud Analysis
3D Medical Point Transformer: Introducing Convolution to Attention Networks for Medical Point Cloud Analysis
Jianhui Yu
Chaoyi Zhang
Heng Wang
Dingxin Zhang
Yang Song
Tiange Xiang
Dongnan Liu
Weidong (Tom) Cai
ViT
MedIm
19
32
0
09 Dec 2021
MLP Architectures for Vision-and-Language Modeling: An Empirical Study
MLP Architectures for Vision-and-Language Modeling: An Empirical Study
Yi-Liang Nie
Linjie Li
Zhe Gan
Shuohang Wang
Chenguang Zhu
Michael Zeng
Zicheng Liu
Mohit Bansal
Lijuan Wang
12
6
0
08 Dec 2021
Constrained Adaptive Projection with Pretrained Features for Anomaly
  Detection
Constrained Adaptive Projection with Pretrained Features for Anomaly Detection
Xingtai Gui
Di Wu
Yang Chang
Shicai Fan
11
5
0
05 Dec 2021
A Novel Deep Parallel Time-series Relation Network for Fault Diagnosis
A Novel Deep Parallel Time-series Relation Network for Fault Diagnosis
Chun Yang
AI4TS
AI4CE
19
4
0
03 Dec 2021
Probabilistic Approach for Road-Users Detection
Probabilistic Approach for Road-Users Detection
Gledson Melotti
Weihao Lu
Pedro Conde
Dezong Zhao
A. Asvadi
Nuno Gonçalves
C. Premebida
21
2
0
02 Dec 2021
Pixelated Butterfly: Simple and Efficient Sparse training for Neural
  Network Models
Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network Models
Tri Dao
Beidi Chen
Kaizhao Liang
Jiaming Yang
Zhao-quan Song
Atri Rudra
Christopher Ré
25
75
0
30 Nov 2021
Pyramid Adversarial Training Improves ViT Performance
Pyramid Adversarial Training Improves ViT Performance
Charles Herrmann
Kyle Sargent
Lu Jiang
Ramin Zabih
Huiwen Chang
Ce Liu
Dilip Krishnan
Deqing Sun
ViT
20
56
0
30 Nov 2021
UBoCo : Unsupervised Boundary Contrastive Learning for Generic Event
  Boundary Detection
UBoCo : Unsupervised Boundary Contrastive Learning for Generic Event Boundary Detection
Hyolim Kang
Jinwoo Kim
Taehyun Kim
Seon Joo Kim
34
25
0
29 Nov 2021
SWAT: Spatial Structure Within and Among Tokens
SWAT: Spatial Structure Within and Among Tokens
Kumara Kahatapitiya
Michael S. Ryoo
23
6
0
26 Nov 2021
Global Interaction Modelling in Vision Transformer via Super Tokens
Global Interaction Modelling in Vision Transformer via Super Tokens
Ammarah Farooq
Muhammad Awais
S. Ahmed
J. Kittler
ViT
28
6
0
25 Nov 2021
Domain Prompt Learning for Efficiently Adapting CLIP to Unseen Domains
Domain Prompt Learning for Efficiently Adapting CLIP to Unseen Domains
X. Zhang
S. Gu
Yutaka Matsuo
Yusuke Iwasawa
VLM
32
36
0
25 Nov 2021
MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal
  Representation Learning
MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning
David Junhao Zhang
Kunchang Li
Yali Wang
Yuxiang Chen
Shashwat Chandra
Yu Qiao
Luoqi Liu
Mike Zheng Shou
AI4TS
19
30
0
24 Nov 2021
An Image Patch is a Wave: Phase-Aware Vision MLP
An Image Patch is a Wave: Phase-Aware Vision MLP
Yehui Tang
Kai Han
Jianyuan Guo
Chang Xu
Yanxi Li
Chao Xu
Yunhe Wang
22
133
0
24 Nov 2021
Adaptive Fourier Neural Operators: Efficient Token Mixers for
  Transformers
Adaptive Fourier Neural Operators: Efficient Token Mixers for Transformers
John Guibas
Morteza Mardani
Zong-Yi Li
Andrew Tao
Anima Anandkumar
Bryan Catanzaro
19
227
0
24 Nov 2021
Critical Initialization of Wide and Deep Neural Networks through Partial
  Jacobians: General Theory and Applications
Critical Initialization of Wide and Deep Neural Networks through Partial Jacobians: General Theory and Applications
Darshil Doshi
Tianyu He
Andrey Gromov
25
8
0
23 Nov 2021
SimpleTRON: Simple Transformer with O(N) Complexity
SimpleTRON: Simple Transformer with O(N) Complexity
Uladzislau Yorsh
Alexander Kovalenko
Vojtvech Vanvcura
Daniel Vavsata
Pavel Kordík
Tomávs Mikolov
28
1
0
23 Nov 2021
GenReg: Deep Generative Method for Fast Point Cloud Registration
GenReg: Deep Generative Method for Fast Point Cloud Registration
Xiaoshui Huang
Zongyi Xu
Guofeng Mei
Sheng R. Li
Jian Andrew Zhang
Yifan Zuo
Yucheng Wang
3DPC
28
1
0
23 Nov 2021
MetaFormer Is Actually What You Need for Vision
MetaFormer Is Actually What You Need for Vision
Weihao Yu
Mi Luo
Pan Zhou
Chenyang Si
Yichen Zhou
Xinchao Wang
Jiashi Feng
Shuicheng Yan
26
872
0
22 Nov 2021
PointMixer: MLP-Mixer for Point Cloud Understanding
PointMixer: MLP-Mixer for Point Cloud Understanding
Jaesung Choe
Chunghyun Park
François Rameau
Jaesik Park
In So Kweon
3DPC
36
98
0
22 Nov 2021
Are Vision Transformers Robust to Patch Perturbations?
Are Vision Transformers Robust to Patch Perturbations?
Jindong Gu
Volker Tresp
Yao Qin
AAML
ViT
30
60
0
20 Nov 2021
Discrete Representations Strengthen Vision Transformer Robustness
Discrete Representations Strengthen Vision Transformer Robustness
Chengzhi Mao
Lu Jiang
Mostafa Dehghani
Carl Vondrick
Rahul Sukthankar
Irfan Essa
ViT
25
43
0
20 Nov 2021
Global and Local Alignment Networks for Unpaired Image-to-Image
  Translation
Global and Local Alignment Networks for Unpaired Image-to-Image Translation
Guanglei Yang
H. Tang
Humphrey Shi
M. Ding
N. Sebe
Radu Timofte
Luc Van Gool
Elisa Ricci
11
1
0
19 Nov 2021
UFO: A UniFied TransfOrmer for Vision-Language Representation Learning
UFO: A UniFied TransfOrmer for Vision-Language Representation Learning
Jianfeng Wang
Xiaowei Hu
Zhe Gan
Zhengyuan Yang
Xiyang Dai
Zicheng Liu
Yumao Lu
Lijuan Wang
ViT
27
57
0
19 Nov 2021
Benchmarking and scaling of deep learning models for land cover image
  classification
Benchmarking and scaling of deep learning models for land cover image classification
Ioannis Papoutsis
N. Bountos
Angelos Zavras
Dimitrios Michail
Christos Tryfonopoulos
11
55
0
18 Nov 2021
INTERN: A New Learning Paradigm Towards General Vision
INTERN: A New Learning Paradigm Towards General Vision
Jing Shao
Siyu Chen
Yangguang Li
Kun Wang
Zhen-fei Yin
...
F. Yu
Junjie Yan
Dahua Lin
Xiaogang Wang
Yu Qiao
13
34
0
16 Nov 2021
LiT: Zero-Shot Transfer with Locked-image text Tuning
LiT: Zero-Shot Transfer with Locked-image text Tuning
Xiaohua Zhai
Xiao Wang
Basil Mustafa
Andreas Steiner
Daniel Keysers
Alexander Kolesnikov
Lucas Beyer
VLM
28
539
0
15 Nov 2021
Attention Mechanisms in Computer Vision: A Survey
Attention Mechanisms in Computer Vision: A Survey
Meng-Hao Guo
Tianhan Xu
Jiangjiang Liu
Zheng-Ning Liu
Peng-Tao Jiang
Tai-Jiang Mu
Song-Hai Zhang
Ralph Robert Martin
Ming-Ming Cheng
Shimin Hu
19
1,633
0
15 Nov 2021
A Survey on Hyperdimensional Computing aka Vector Symbolic
  Architectures, Part II: Applications, Cognitive Models, and Challenges
A Survey on Hyperdimensional Computing aka Vector Symbolic Architectures, Part II: Applications, Cognitive Models, and Challenges
Denis Kleyko
D. Rachkovskij
Evgeny Osipov
A. Rahim
18
126
0
12 Nov 2021
Are Transformers More Robust Than CNNs?
Are Transformers More Robust Than CNNs?
Yutong Bai
Jieru Mei
Alan Yuille
Cihang Xie
ViT
AAML
186
258
0
10 Nov 2021
Sliced Recursive Transformer
Sliced Recursive Transformer
Zhiqiang Shen
Zechun Liu
Eric P. Xing
ViT
17
27
0
09 Nov 2021
Personalized Benchmarking with the Ludwig Benchmarking Toolkit
Personalized Benchmarking with the Ludwig Benchmarking Toolkit
A. Narayan
Piero Molino
Karan Goel
W. Neiswanger
Christopher Ré
8
11
0
08 Nov 2021
Are we ready for a new paradigm shift? A Survey on Visual Deep MLP
Are we ready for a new paradigm shift? A Survey on Visual Deep MLP
Ruiyang Liu
Yinghui Li
Li Tao
Dun Liang
Haitao Zheng
85
96
0
07 Nov 2021
TermiNeRF: Ray Termination Prediction for Efficient Neural Rendering
TermiNeRF: Ray Termination Prediction for Efficient Neural Rendering
Martin Piala
R. Clark
18
79
0
05 Nov 2021
Efficiently Modeling Long Sequences with Structured State Spaces
Efficiently Modeling Long Sequences with Structured State Spaces
Albert Gu
Karan Goel
Christopher Ré
28
1,648
0
31 Oct 2021
Blending Anti-Aliasing into Vision Transformer
Blending Anti-Aliasing into Vision Transformer
Shengju Qian
Hao Shao
Yi Zhu
Mu Li
Jiaya Jia
21
20
0
28 Oct 2021
Arbitrary Distribution Modeling with Censorship in Real-Time Bidding
  Advertising
Arbitrary Distribution Modeling with Censorship in Real-Time Bidding Advertising
Xu Li
Michelle Ma Zhang
Youjun Tong
Zhenya Wang
22
9
0
26 Oct 2021
K-Lane: Lidar Lane Dataset and Benchmark for Urban Roads and Highways
K-Lane: Lidar Lane Dataset and Benchmark for Urban Roads and Highways
Dong-Hee Paek
Seung-Hyung Kong
Kevin Tirta Wijaya
3DV
17
21
0
21 Oct 2021
A channel attention based MLP-Mixer network for motor imagery decoding
  with EEG
A channel attention based MLP-Mixer network for motor imagery decoding with EEG
Yan-Nan He
Zhiyang Lu
Jun Wang
Jun Shi
15
9
0
21 Oct 2021
A Data-Centric Optimization Framework for Machine Learning
A Data-Centric Optimization Framework for Machine Learning
Oliver Rausch
Tal Ben-Nun
Nikoli Dryden
Andrei Ivanov
Shigang Li
Torsten Hoefler
AI4CE
17
16
0
20 Oct 2021
Ranking and Tuning Pre-trained Models: A New Paradigm for Exploiting
  Model Hubs
Ranking and Tuning Pre-trained Models: A New Paradigm for Exploiting Model Hubs
Kaichao You
Yong Liu
Ziyang Zhang
Jianmin Wang
Michael I. Jordan
Mingsheng Long
107
30
0
20 Oct 2021
Cascaded Cross MLP-Mixer GANs for Cross-View Image Translation
Cascaded Cross MLP-Mixer GANs for Cross-View Image Translation
Bin Ren
Hao Tang
N. Sebe
27
30
0
19 Oct 2021
Inductive Biases and Variable Creation in Self-Attention Mechanisms
Inductive Biases and Variable Creation in Self-Attention Mechanisms
Benjamin L. Edelman
Surbhi Goel
Sham Kakade
Cyril Zhang
27
115
0
19 Oct 2021
SCENIC: A JAX Library for Computer Vision Research and Beyond
SCENIC: A JAX Library for Computer Vision Research and Beyond
Mostafa Dehghani
A. Gritsenko
Anurag Arnab
Matthias Minderer
Yi Tay
41
68
0
18 Oct 2021
Graph-less Neural Networks: Teaching Old MLPs New Tricks via
  Distillation
Graph-less Neural Networks: Teaching Old MLPs New Tricks via Distillation
Shichang Zhang
Yozen Liu
Yizhou Sun
Neil Shah
31
173
0
17 Oct 2021
Sharpness-Aware Minimization Improves Language Model Generalization
Sharpness-Aware Minimization Improves Language Model Generalization
Dara Bahri
H. Mobahi
Yi Tay
119
98
0
16 Oct 2021
Attention-Free Keyword Spotting
Attention-Free Keyword Spotting
Mashrur M. Morshed
Ahmad Omar Ahsan
25
9
0
14 Oct 2021
Previous
123...1920212223
Next