Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2105.01601
Cited By
v1
v2
v3
v4 (latest)
MLP-Mixer: An all-MLP Architecture for Vision
4 May 2021
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
Thomas Unterthiner
Jessica Yung
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"MLP-Mixer: An all-MLP Architecture for Vision"
50 / 1,144 papers shown
Title
MLP-ASR: Sequence-length agnostic all-MLP architectures for speech recognition
Jin Sakuma
Tatsuya Komatsu
Robin Scheibler
44
6
0
17 Feb 2022
Taking a Step Back with KCal: Multi-Class Kernel-Based Calibration for Deep Neural Networks
Zhen Lin
Shubhendu Trivedi
Jimeng Sun
60
5
0
15 Feb 2022
CATs++: Boosting Cost Aggregation with Convolutions and Transformers
Seokju Cho
Sunghwan Hong
Seung Wook Kim
ViT
94
40
0
14 Feb 2022
How Do Vision Transformers Work?
Namuk Park
Songkuk Kim
ViT
124
484
0
14 Feb 2022
Mixing and Shifting: Exploiting Global and Local Dependencies in Vision MLPs
Huangjie Zheng
Pengcheng He
Weizhu Chen
Mingyuan Zhou
61
14
0
14 Feb 2022
BViT: Broad Attention based Vision Transformer
Nannan Li
Yaran Chen
Weifan Li
Zixiang Ding
Dong Zhao
ViT
94
23
0
13 Feb 2022
A Modern Self-Referential Weight Matrix That Learns to Modify Itself
Kazuki Irie
Imanol Schlag
Róbert Csordás
Jürgen Schmidhuber
50
28
0
11 Feb 2022
Universal Hopfield Networks: A General Framework for Single-Shot Associative Memory Models
Beren Millidge
Tommaso Salvatori
Yuhang Song
Thomas Lukasiewicz
Rafal Bogacz
VLM
71
54
0
09 Feb 2022
pNLP-Mixer: an Efficient all-MLP Architecture for Language
Francesco Fusco
Damian Pascual
Peter W. J. Staar
Diego Antognini
86
30
0
09 Feb 2022
Calibrated Learning to Defer with One-vs-All Classifiers
Rajeev Verma
Eric Nalisnick
70
45
0
08 Feb 2022
Towards an Analytical Definition of Sufficient Data
Adam Byerly
T. Kalganova
104
4
0
07 Feb 2022
Image-to-Image MLP-mixer for Image Reconstruction
Youssef Mansour
Kang Lin
Reinhard Heckel
SupR
75
15
0
04 Feb 2022
Keyword localisation in untranscribed speech using visually grounded speech models
Kayode Olaleye
Dan Oneaţă
Herman Kamper
60
7
0
02 Feb 2022
AtmoDist: Self-supervised Representation Learning for Atmospheric Dynamics
Sebastian Hoffmann
C. Lessig
AI4Cl
68
10
0
02 Feb 2022
When Do Flat Minima Optimizers Work?
Jean Kaddour
Linqing Liu
Ricardo M. A. Silva
Matt J. Kusner
ODL
130
64
0
01 Feb 2022
Plug-In Inversion: Model-Agnostic Inversion for Vision with Data Augmentations
Amin Ghiasi
Hamid Kazemi
Steven Reich
Chen Zhu
Micah Goldblum
Tom Goldstein
88
16
0
31 Jan 2022
DynaMixer: A Vision MLP Architecture with Dynamic Mixing
Ziyu Wang
Wenhao Jiang
Yiming Zhu
Li Yuan
Yibing Song
Wei Liu
95
44
0
28 Jan 2022
When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention Mechanism
Guangting Wang
Yucheng Zhao
Chuanxin Tang
Chong Luo
Wenjun Zeng
99
72
0
26 Jan 2022
AggMatch: Aggregating Pseudo Labels for Semi-Supervised Learning
Jiwon Kim
Kwang-seok Ryoo
Gyuseong Lee
Seokju Cho
Junyoung Seo
Daehwan Kim
Hansang Cho
Seung Wook Kim
70
1
0
25 Jan 2022
Convolutional Xformers for Vision
Pranav Jeevan
Amit Sethi
ViT
86
12
0
25 Jan 2022
Patches Are All You Need?
Asher Trockman
J. Zico Kolter
ViT
272
412
0
24 Jan 2022
Learning to Minimize the Remainder in Supervised Learning
Yan Luo
Yongkang Wong
Mohan S. Kankanhalli
Qi Zhao
90
1
0
23 Jan 2022
AiTLAS: Artificial Intelligence Toolbox for Earth Observation
I. Dimitrovski
Ivan Kitanovski
P. Panov
Nikola Simidjievski
D. Kocev
94
10
0
21 Jan 2022
Continual Transformers: Redundancy-Free Attention for Online Inference
Lukas Hedegaard
Arian Bakhtiarnia
Alexandros Iosifidis
CLL
80
12
0
17 Jan 2022
Video Transformers: A Survey
Javier Selva
A. S. Johansen
Sergio Escalera
Kamal Nasrollahi
T. Moeslund
Albert Clapés
ViT
141
107
0
16 Jan 2022
ConvMixer: Feature Interactive Convolution with Curriculum Learning for Small Footprint and Noisy Far-field Keyword Spotting
Dianwen Ng
Yunqi Chen
Biao Tian
Qiang Fu
Chng Eng Siong
53
46
0
15 Jan 2022
Hand-Object Interaction Reasoning
Jian Ma
Dima Damen
48
7
0
13 Jan 2022
MAXIM: Multi-Axis MLP for Image Processing
Zhengzhong Tu
Hossein Talebi
Han Zhang
Feng Yang
P. Milanfar
A. Bovik
Yinxiao Li
140
481
0
09 Jan 2022
Lawin Transformer: Improving Semantic Segmentation Transformer with Multi-Scale Representations via Large Window Attention
Haotian Yan
Chuang Zhang
Ming Wu
ViT
134
63
0
05 Jan 2022
PyramidTNT: Improved Transformer-in-Transformer Baselines with Pyramid Architecture
Kai Han
Jianyuan Guo
Yehui Tang
Yunhe Wang
ViT
131
22
0
04 Jan 2022
Facial-Sketch Synthesis: A New Challenge
Deng-Ping Fan
Ziling Huang
Peng Zheng
Hong Liu
Xue Qin
Luc Van Gool
CVBM
100
35
0
31 Dec 2021
SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos
Ailing Zeng
Lei Yang
Xu Ju
Jiefeng Li
Jianyi Wang
Qiang Xu
3DH
81
71
0
27 Dec 2021
Augmenting Convolutional networks with attention-based aggregation
Hugo Touvron
Matthieu Cord
Alaaeldin El-Nouby
Piotr Bojanowski
Armand Joulin
Gabriel Synnaeve
Hervé Jégou
ViT
114
49
0
27 Dec 2021
RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality
Xiaohan Ding
Honghao Chen
Xinming Zhang
Jungong Han
Guiguang Ding
84
74
0
21 Dec 2021
Couplformer:Rethinking Vision Transformer with Coupling Attention Map
Hai Lan
Xihao Wang
Xian Wei
ViT
84
3
0
10 Dec 2021
Spatio-temporal Relation Modeling for Few-shot Action Recognition
Anirudh Thatipelli
Sanath Narayan
Salman Khan
Rao Muhammad Anwer
Fahad Shahbaz Khan
Guohao Li
ViT
83
92
0
09 Dec 2021
3D Medical Point Transformer: Introducing Convolution to Attention Networks for Medical Point Cloud Analysis
Jianhui Yu
Chaoyi Zhang
Heng Wang
Dingxin Zhang
Yang Song
Tiange Xiang
Dongnan Liu
Weidong (Tom) Cai
ViT
MedIm
81
32
0
09 Dec 2021
MLP Architectures for Vision-and-Language Modeling: An Empirical Study
Yi-Liang Nie
Linjie Li
Zhe Gan
Shuohang Wang
Chenguang Zhu
Michael Zeng
Zicheng Liu
Joey Tianyi Zhou
Lijuan Wang
58
6
0
08 Dec 2021
Constrained Adaptive Projection with Pretrained Features for Anomaly Detection
Xingtai Gui
Di Wu
Yang Chang
Shicai Fan
33
5
0
05 Dec 2021
A Novel Deep Parallel Time-series Relation Network for Fault Diagnosis
Chun Yang
AI4TS
AI4CE
38
4
0
03 Dec 2021
Probabilistic Approach for Road-Users Detection
Gledson Melotti
Weihao Lu
Pedro Conde
Dezong Zhao
A. Asvadi
Nuno Gonçalves
C. Premebida
76
2
0
02 Dec 2021
Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network Models
Tri Dao
Beidi Chen
Kaizhao Liang
Jiaming Yang
Zhao Song
Atri Rudra
Christopher Ré
130
79
0
30 Nov 2021
Pyramid Adversarial Training Improves ViT Performance
Charles Herrmann
Kyle Sargent
Lu Jiang
Ramin Zabih
Huiwen Chang
Ce Liu
Dilip Krishnan
Deqing Sun
ViT
116
59
0
30 Nov 2021
UBoCo : Unsupervised Boundary Contrastive Learning for Generic Event Boundary Detection
Hyolim Kang
Jinwoo Kim
Taehyun Kim
Seon Joo Kim
74
25
0
29 Nov 2021
SWAT: Spatial Structure Within and Among Tokens
Kumara Kahatapitiya
Michael S. Ryoo
72
6
0
26 Nov 2021
Global Interaction Modelling in Vision Transformer via Super Tokens
Ammarah Farooq
Muhammad Awais
S. Ahmed
J. Kittler
ViT
59
7
0
25 Nov 2021
Domain Prompt Learning for Efficiently Adapting CLIP to Unseen Domains
Xinyu Zhang
S. Gu
Yutaka Matsuo
Yusuke Iwasawa
VLM
104
40
0
25 Nov 2021
MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning
David Junhao Zhang
Kunchang Li
Yali Wang
Yuxiang Chen
Shashwat Chandra
Yu Qiao
Luoqi Liu
Mike Zheng Shou
AI4TS
88
30
0
24 Nov 2021
An Image Patch is a Wave: Phase-Aware Vision MLP
Yehui Tang
Kai Han
Jianyuan Guo
Chang Xu
Yanxi Li
Chao Xu
Yunhe Wang
91
135
0
24 Nov 2021
Adaptive Fourier Neural Operators: Efficient Token Mixers for Transformers
John Guibas
Morteza Mardani
Zong-Yi Li
Andrew Tao
Anima Anandkumar
Bryan Catanzaro
107
245
0
24 Nov 2021
Previous
1
2
3
...
19
20
21
22
23
Next