Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2105.01601
Cited By
MLP-Mixer: An all-MLP Architecture for Vision
4 May 2021
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
Thomas Unterthiner
Jessica Yung
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MLP-Mixer: An all-MLP Architecture for Vision"
50 / 1,123 papers shown
Title
Sequencer: Deep LSTM for Image Classification
Yuki Tatsunami
Masato Taki
VLM
ViT
31
78
0
04 May 2022
Synthesized Speech Detection Using Convolutional Transformer-Based Spectrogram Analysis
Emily R. Bartusiak
Edward J. Delp
ViT
36
18
0
03 May 2022
Better plain ViT baselines for ImageNet-1k
Lucas Beyer
Xiaohua Zhai
Alexander Kolesnikov
ViT
VLM
33
112
0
03 May 2022
SideRT: A Real-time Pure Transformer Architecture for Single Image Depth Estimation
Chang Shu
Zi-Chun Chen
Lei Chen
Kuan Ma
Minghui Wang
Haibing Ren
ViT
32
14
0
29 Apr 2022
Improving the Transferability of Adversarial Examples with Restructure Embedded Patches
Huipeng Zhou
Yu-an Tan
Yajie Wang
Haoran Lyu
Shan-Hung Wu
Yuan-zhang Li
ViT
24
4
0
27 Apr 2022
SCGC : Self-Supervised Contrastive Graph Clustering
Gayan K. Kulatilleke
Marius Portmann
Shekhar S. Chandra
39
8
0
27 Apr 2022
Boosting Adversarial Transferability of MLP-Mixer
Haoran Lyu
Yajie Wang
Yu-an Tan
Huipeng Zhou
Yuhang Zhao
Quan-xin Zhang
AAML
32
1
0
26 Apr 2022
Selective Cross-Task Distillation
Su Lu
Han-Jia Ye
De-Chuan Zhan
36
0
0
25 Apr 2022
A Spatio-Temporal Multilayer Perceptron for Gesture Recognition
Adrian Holzbock
Alexander Tsaregorodtsev
Youssef Dawoud
Klaus C. J. Dietmayer
Vasileios Belagiannis
37
12
0
25 Apr 2022
Spacing Loss for Discovering Novel Categories
K. J. Joseph
S. Paul
Gaurav Aggarwal
Soma Biswas
Piyush Rai
Kai Han
V. Balasubramanian
22
14
0
22 Apr 2022
Few-Shot Object Detection with Proposal Balance Refinement
Sueyeon Kim
Woo-Jeoung Nam
Seong-Whan Lee
ObjD
27
3
0
22 Apr 2022
Cylin-Painting: Seamless {360\textdegree} Panoramic Image Outpainting and Beyond
K. Liao
Xiangyu Xu
Chunyu Lin
Wenqi Ren
Yunchao Wei
Yao Zhao
42
8
0
18 Apr 2022
Application of Transfer Learning and Ensemble Learning in Image-level Classification for Breast Histopathology
Yuchao Zheng
Chen Li
Xiaomin Zhou
Hao Chen
Hao Xu
...
Haiqing Zhang
Xirong Li
Hongzan Sun
Xinyu Huang
M. Grzegorzek
36
55
0
18 Apr 2022
DeiT III: Revenge of the ViT
Hugo Touvron
Matthieu Cord
Hervé Jégou
ViT
48
391
0
14 Apr 2022
3D Shuffle-Mixer: An Efficient Context-Aware Vision Learner of Transformer-MLP Paradigm for Dense Prediction in Medical Volume
Jianye Pang
Cheng Jiang
Yihao Chen
Jianbo Chang
M. Feng
Renzhi Wang
Jianhua Yao
ViT
MedIm
28
11
0
14 Apr 2022
Particle Video Revisited: Tracking Through Occlusions Using Point Trajectories
Adam W. Harley
Zhaoyuan Fang
Katerina Fragkiadaki
35
160
0
08 Apr 2022
Are We Really Making Much Progress in Text Classification? A Comparative Review
Lukas Galke
Andor Diera
Bao Xin Lin
Bhakti Khera
Tim Meuser
Tushar Singhal
Fabian Karl
A. Scherp
VLM
32
4
0
08 Apr 2022
DaViT: Dual Attention Vision Transformers
Mingyu Ding
Bin Xiao
Noel Codella
Ping Luo
Jingdong Wang
Lu Yuan
ViT
51
242
0
07 Apr 2022
Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results
T. Ridnik
Hussam Lawen
Emanuel Ben-Baruch
Asaf Noy
40
11
0
07 Apr 2022
Few-Shot Forecasting of Time-Series with Heterogeneous Channels
L. Brinkmeyer
Rafael Rêgo Drumond
Johannes Burchert
Lars Schmidt-Thieme
AI4TS
28
7
0
07 Apr 2022
OccamNets: Mitigating Dataset Bias by Favoring Simpler Hypotheses
Robik Shrestha
Kushal Kafle
Christopher Kanan
CML
33
13
0
05 Apr 2022
MaxViT: Multi-Axis Vision Transformer
Zhengzhong Tu
Hossein Talebi
Han Zhang
Feng Yang
P. Milanfar
A. Bovik
Yinxiao Li
ViT
62
638
0
04 Apr 2022
Supervised Robustness-preserving Data-free Neural Network Pruning
Mark Huasong Meng
Guangdong Bai
Sin Gee Teo
Jin Song Dong
AAML
26
4
0
02 Apr 2022
Monarch: Expressive Structured Matrices for Efficient and Accurate Training
Tri Dao
Beidi Chen
N. Sohoni
Arjun D Desai
Michael Poli
Jessica Grogan
Alexander Liu
Aniruddh Rao
Atri Rudra
Christopher Ré
29
87
0
01 Apr 2022
Physical Deep Learning with Biologically Plausible Training Method
M. Nakajima
Katsuma Inoue
Kenji Tanaka
Yasuo Kuniyoshi
Toshikazu Hashimoto
Kohei Nakajima
AI4CE
36
3
0
01 Apr 2022
Deformable Video Transformer
Jue Wang
Lorenzo Torresani
ViT
30
28
0
31 Mar 2022
Exploring Plain Vision Transformer Backbones for Object Detection
Yanghao Li
Hanzi Mao
Ross B. Girshick
Kaiming He
ViT
36
779
0
30 Mar 2022
AdaMixer: A Fast-Converging Query-Based Object Detector
Ziteng Gao
Limin Wang
Bing Han
Sheng Guo
ObjD
39
105
0
30 Mar 2022
Recognition of polar lows in Sentinel-1 SAR images with deep learning
J. Grahn
F. Bianchi
38
3
0
30 Mar 2022
Investigating Top-
k
k
k
White-Box and Transferable Black-box Attack
Chaoning Zhang
Philipp Benz
Adil Karjauv
Jae-Won Cho
Kang Zhang
In So Kweon
31
42
0
30 Mar 2022
InstaFormer: Instance-Aware Image-to-Image Translation with Transformer
Soohyun Kim
Jongbeom Baek
Jihye Park
Gyeongnyeon Kim
Seung Wook Kim
ViT
39
47
0
30 Mar 2022
FlowFormer: A Transformer Architecture for Optical Flow
Zhaoyang Huang
Xiaoyu Shi
Chao Zhang
Qiang Wang
Ka Chun Cheung
Hongwei Qin
Jifeng Dai
Hongsheng Li
ViT
35
270
0
30 Mar 2022
Understanding out-of-distribution accuracies through quantifying difficulty of test samples
Berfin Simsek
Melissa Hall
Levent Sagun
31
5
0
28 Mar 2022
Brain-inspired Multilayer Perceptron with Spiking Neurons
Wenshuo Li
Hanting Chen
Jianyuan Guo
Ziyang Zhang
Yunhe Wang
35
35
0
28 Mar 2022
FAMLP: A Frequency-Aware MLP-Like Architecture For Domain Generalization
Kecheng Zheng
Yang Cao
Kai Zhu
Ruijing Zhao
Zhengjun Zha
33
5
0
24 Mar 2022
Linearizing Transformer with Key-Value Memory
Yizhe Zhang
Deng Cai
28
5
0
23 Mar 2022
PaCa-ViT: Learning Patch-to-Cluster Attention in Vision Transformers
Ryan Grainger
Thomas Paniagua
Xi Song
Naresh P. Cuntoor
Mun Wai Lee
Tianfu Wu
ViT
15
7
0
22 Mar 2022
Focal Modulation Networks
Jianwei Yang
Chunyuan Li
Xiyang Dai
Lu Yuan
Jianfeng Gao
3DPC
38
263
0
22 Mar 2022
Masked Discrimination for Self-Supervised Learning on Point Clouds
Haotian Liu
Mu Cai
Yong Jae Lee
3DPC
21
164
0
21 Mar 2022
TVConv: Efficient Translation Variant Convolution for Layout-aware Visual Processing
Jie Chen
Tianlang He
Weipeng Zhuo
Li Ma
Sangtae Ha
Shueng-Han Gary Chan
CVBM
21
24
0
20 Mar 2022
Towards Robust Semantic Segmentation of Accident Scenes via Multi-Source Mixed Sampling and Meta-Learning
Xinyu Luo
Jiaming Zhang
Kailun Yang
Alina Roitberg
Kunyu Peng
Rainer Stiefelhagen
25
9
0
19 Mar 2022
Three things everyone should know about Vision Transformers
Hugo Touvron
Matthieu Cord
Alaaeldin El-Nouby
Jakob Verbeek
Hervé Jégou
ViT
24
120
0
18 Mar 2022
Learning Audio Representations with MLPs
Mashrur M. Morshed
Ahmad Omar Ahsan
H. Mahmud
Md. Kamrul Hasan
27
4
0
16 Mar 2022
CrowdMLP: Weakly-Supervised Crowd Counting via Multi-Granularity MLP
Mingjie Wang
Jun Zhou
Hao Cai
Minglun Gong
20
29
0
15 Mar 2022
Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from the Decision Boundary Perspective
Gowthami Somepalli
Liam H. Fowl
Arpit Bansal
Ping Yeh-Chiang
Yehuda Dar
Richard Baraniuk
Micah Goldblum
Tom Goldstein
24
64
0
15 Mar 2022
Surrogate Gap Minimization Improves Sharpness-Aware Training
Juntang Zhuang
Boqing Gong
Liangzhe Yuan
Huayu Chen
Hartwig Adam
Nicha Dvornek
S. Tatikonda
James Duncan
Ting Liu
27
146
0
15 Mar 2022
Self-Promoted Supervision for Few-Shot Transformer
Bowen Dong
Pan Zhou
Shuicheng Yan
W. Zuo
ViT
22
28
0
14 Mar 2022
Efficient Language Modeling with Sparse all-MLP
Ping Yu
Mikel Artetxe
Myle Ott
Sam Shleifer
Hongyu Gong
Ves Stoyanov
Xian Li
MoE
23
11
0
14 Mar 2022
Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs
Xiaohan Ding
Xinming Zhang
Yi Zhou
Jungong Han
Guiguang Ding
Jian Sun
VLM
49
528
0
13 Mar 2022
Active Token Mixer
Guoqiang Wei
Zhizheng Zhang
Cuiling Lan
Yan Lu
Zhibo Chen
24
15
0
11 Mar 2022
Previous
1
2
3
...
17
18
19
...
21
22
23
Next