ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2105.01601
  4. Cited By
MLP-Mixer: An all-MLP Architecture for Vision

MLP-Mixer: An all-MLP Architecture for Vision

4 May 2021
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
Thomas Unterthiner
Jessica Yung
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
ArXivPDFHTML

Papers citing "MLP-Mixer: An all-MLP Architecture for Vision"

50 / 1,123 papers shown
Title
Sequencer: Deep LSTM for Image Classification
Sequencer: Deep LSTM for Image Classification
Yuki Tatsunami
Masato Taki
VLM
ViT
31
78
0
04 May 2022
Synthesized Speech Detection Using Convolutional Transformer-Based
  Spectrogram Analysis
Synthesized Speech Detection Using Convolutional Transformer-Based Spectrogram Analysis
Emily R. Bartusiak
Edward J. Delp
ViT
36
18
0
03 May 2022
Better plain ViT baselines for ImageNet-1k
Better plain ViT baselines for ImageNet-1k
Lucas Beyer
Xiaohua Zhai
Alexander Kolesnikov
ViT
VLM
33
112
0
03 May 2022
SideRT: A Real-time Pure Transformer Architecture for Single Image Depth
  Estimation
SideRT: A Real-time Pure Transformer Architecture for Single Image Depth Estimation
Chang Shu
Zi-Chun Chen
Lei Chen
Kuan Ma
Minghui Wang
Haibing Ren
ViT
32
14
0
29 Apr 2022
Improving the Transferability of Adversarial Examples with Restructure
  Embedded Patches
Improving the Transferability of Adversarial Examples with Restructure Embedded Patches
Huipeng Zhou
Yu-an Tan
Yajie Wang
Haoran Lyu
Shan-Hung Wu
Yuan-zhang Li
ViT
24
4
0
27 Apr 2022
SCGC : Self-Supervised Contrastive Graph Clustering
SCGC : Self-Supervised Contrastive Graph Clustering
Gayan K. Kulatilleke
Marius Portmann
Shekhar S. Chandra
39
8
0
27 Apr 2022
Boosting Adversarial Transferability of MLP-Mixer
Boosting Adversarial Transferability of MLP-Mixer
Haoran Lyu
Yajie Wang
Yu-an Tan
Huipeng Zhou
Yuhang Zhao
Quan-xin Zhang
AAML
32
1
0
26 Apr 2022
Selective Cross-Task Distillation
Selective Cross-Task Distillation
Su Lu
Han-Jia Ye
De-Chuan Zhan
36
0
0
25 Apr 2022
A Spatio-Temporal Multilayer Perceptron for Gesture Recognition
A Spatio-Temporal Multilayer Perceptron for Gesture Recognition
Adrian Holzbock
Alexander Tsaregorodtsev
Youssef Dawoud
Klaus C. J. Dietmayer
Vasileios Belagiannis
37
12
0
25 Apr 2022
Spacing Loss for Discovering Novel Categories
Spacing Loss for Discovering Novel Categories
K. J. Joseph
S. Paul
Gaurav Aggarwal
Soma Biswas
Piyush Rai
Kai Han
V. Balasubramanian
22
14
0
22 Apr 2022
Few-Shot Object Detection with Proposal Balance Refinement
Few-Shot Object Detection with Proposal Balance Refinement
Sueyeon Kim
Woo-Jeoung Nam
Seong-Whan Lee
ObjD
27
3
0
22 Apr 2022
Cylin-Painting: Seamless {360\textdegree} Panoramic Image Outpainting
  and Beyond
Cylin-Painting: Seamless {360\textdegree} Panoramic Image Outpainting and Beyond
K. Liao
Xiangyu Xu
Chunyu Lin
Wenqi Ren
Yunchao Wei
Yao Zhao
42
8
0
18 Apr 2022
Application of Transfer Learning and Ensemble Learning in Image-level
  Classification for Breast Histopathology
Application of Transfer Learning and Ensemble Learning in Image-level Classification for Breast Histopathology
Yuchao Zheng
Chen Li
Xiaomin Zhou
Hao Chen
Hao Xu
...
Haiqing Zhang
Xirong Li
Hongzan Sun
Xinyu Huang
M. Grzegorzek
36
55
0
18 Apr 2022
DeiT III: Revenge of the ViT
DeiT III: Revenge of the ViT
Hugo Touvron
Matthieu Cord
Hervé Jégou
ViT
48
391
0
14 Apr 2022
3D Shuffle-Mixer: An Efficient Context-Aware Vision Learner of
  Transformer-MLP Paradigm for Dense Prediction in Medical Volume
3D Shuffle-Mixer: An Efficient Context-Aware Vision Learner of Transformer-MLP Paradigm for Dense Prediction in Medical Volume
Jianye Pang
Cheng Jiang
Yihao Chen
Jianbo Chang
M. Feng
Renzhi Wang
Jianhua Yao
ViT
MedIm
28
11
0
14 Apr 2022
Particle Video Revisited: Tracking Through Occlusions Using Point
  Trajectories
Particle Video Revisited: Tracking Through Occlusions Using Point Trajectories
Adam W. Harley
Zhaoyuan Fang
Katerina Fragkiadaki
35
160
0
08 Apr 2022
Are We Really Making Much Progress in Text Classification? A Comparative
  Review
Are We Really Making Much Progress in Text Classification? A Comparative Review
Lukas Galke
Andor Diera
Bao Xin Lin
Bhakti Khera
Tim Meuser
Tushar Singhal
Fabian Karl
A. Scherp
VLM
32
4
0
08 Apr 2022
DaViT: Dual Attention Vision Transformers
DaViT: Dual Attention Vision Transformers
Mingyu Ding
Bin Xiao
Noel Codella
Ping Luo
Jingdong Wang
Lu Yuan
ViT
51
242
0
07 Apr 2022
Solving ImageNet: a Unified Scheme for Training any Backbone to Top
  Results
Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results
T. Ridnik
Hussam Lawen
Emanuel Ben-Baruch
Asaf Noy
40
11
0
07 Apr 2022
Few-Shot Forecasting of Time-Series with Heterogeneous Channels
Few-Shot Forecasting of Time-Series with Heterogeneous Channels
L. Brinkmeyer
Rafael Rêgo Drumond
Johannes Burchert
Lars Schmidt-Thieme
AI4TS
28
7
0
07 Apr 2022
OccamNets: Mitigating Dataset Bias by Favoring Simpler Hypotheses
OccamNets: Mitigating Dataset Bias by Favoring Simpler Hypotheses
Robik Shrestha
Kushal Kafle
Christopher Kanan
CML
33
13
0
05 Apr 2022
MaxViT: Multi-Axis Vision Transformer
MaxViT: Multi-Axis Vision Transformer
Zhengzhong Tu
Hossein Talebi
Han Zhang
Feng Yang
P. Milanfar
A. Bovik
Yinxiao Li
ViT
62
638
0
04 Apr 2022
Supervised Robustness-preserving Data-free Neural Network Pruning
Supervised Robustness-preserving Data-free Neural Network Pruning
Mark Huasong Meng
Guangdong Bai
Sin Gee Teo
Jin Song Dong
AAML
26
4
0
02 Apr 2022
Monarch: Expressive Structured Matrices for Efficient and Accurate
  Training
Monarch: Expressive Structured Matrices for Efficient and Accurate Training
Tri Dao
Beidi Chen
N. Sohoni
Arjun D Desai
Michael Poli
Jessica Grogan
Alexander Liu
Aniruddh Rao
Atri Rudra
Christopher Ré
29
87
0
01 Apr 2022
Physical Deep Learning with Biologically Plausible Training Method
Physical Deep Learning with Biologically Plausible Training Method
M. Nakajima
Katsuma Inoue
Kenji Tanaka
Yasuo Kuniyoshi
Toshikazu Hashimoto
Kohei Nakajima
AI4CE
36
3
0
01 Apr 2022
Deformable Video Transformer
Deformable Video Transformer
Jue Wang
Lorenzo Torresani
ViT
30
28
0
31 Mar 2022
Exploring Plain Vision Transformer Backbones for Object Detection
Exploring Plain Vision Transformer Backbones for Object Detection
Yanghao Li
Hanzi Mao
Ross B. Girshick
Kaiming He
ViT
36
779
0
30 Mar 2022
AdaMixer: A Fast-Converging Query-Based Object Detector
AdaMixer: A Fast-Converging Query-Based Object Detector
Ziteng Gao
Limin Wang
Bing Han
Sheng Guo
ObjD
39
105
0
30 Mar 2022
Recognition of polar lows in Sentinel-1 SAR images with deep learning
Recognition of polar lows in Sentinel-1 SAR images with deep learning
J. Grahn
F. Bianchi
38
3
0
30 Mar 2022
Investigating Top-$k$ White-Box and Transferable Black-box Attack
Investigating Top-kkk White-Box and Transferable Black-box Attack
Chaoning Zhang
Philipp Benz
Adil Karjauv
Jae-Won Cho
Kang Zhang
In So Kweon
31
42
0
30 Mar 2022
InstaFormer: Instance-Aware Image-to-Image Translation with Transformer
InstaFormer: Instance-Aware Image-to-Image Translation with Transformer
Soohyun Kim
Jongbeom Baek
Jihye Park
Gyeongnyeon Kim
Seung Wook Kim
ViT
39
47
0
30 Mar 2022
FlowFormer: A Transformer Architecture for Optical Flow
FlowFormer: A Transformer Architecture for Optical Flow
Zhaoyang Huang
Xiaoyu Shi
Chao Zhang
Qiang Wang
Ka Chun Cheung
Hongwei Qin
Jifeng Dai
Hongsheng Li
ViT
35
270
0
30 Mar 2022
Understanding out-of-distribution accuracies through quantifying
  difficulty of test samples
Understanding out-of-distribution accuracies through quantifying difficulty of test samples
Berfin Simsek
Melissa Hall
Levent Sagun
31
5
0
28 Mar 2022
Brain-inspired Multilayer Perceptron with Spiking Neurons
Brain-inspired Multilayer Perceptron with Spiking Neurons
Wenshuo Li
Hanting Chen
Jianyuan Guo
Ziyang Zhang
Yunhe Wang
35
35
0
28 Mar 2022
FAMLP: A Frequency-Aware MLP-Like Architecture For Domain Generalization
FAMLP: A Frequency-Aware MLP-Like Architecture For Domain Generalization
Kecheng Zheng
Yang Cao
Kai Zhu
Ruijing Zhao
Zhengjun Zha
33
5
0
24 Mar 2022
Linearizing Transformer with Key-Value Memory
Linearizing Transformer with Key-Value Memory
Yizhe Zhang
Deng Cai
28
5
0
23 Mar 2022
PaCa-ViT: Learning Patch-to-Cluster Attention in Vision Transformers
PaCa-ViT: Learning Patch-to-Cluster Attention in Vision Transformers
Ryan Grainger
Thomas Paniagua
Xi Song
Naresh P. Cuntoor
Mun Wai Lee
Tianfu Wu
ViT
15
7
0
22 Mar 2022
Focal Modulation Networks
Focal Modulation Networks
Jianwei Yang
Chunyuan Li
Xiyang Dai
Lu Yuan
Jianfeng Gao
3DPC
38
263
0
22 Mar 2022
Masked Discrimination for Self-Supervised Learning on Point Clouds
Masked Discrimination for Self-Supervised Learning on Point Clouds
Haotian Liu
Mu Cai
Yong Jae Lee
3DPC
21
164
0
21 Mar 2022
TVConv: Efficient Translation Variant Convolution for Layout-aware
  Visual Processing
TVConv: Efficient Translation Variant Convolution for Layout-aware Visual Processing
Jie Chen
Tianlang He
Weipeng Zhuo
Li Ma
Sangtae Ha
Shueng-Han Gary Chan
CVBM
21
24
0
20 Mar 2022
Towards Robust Semantic Segmentation of Accident Scenes via Multi-Source
  Mixed Sampling and Meta-Learning
Towards Robust Semantic Segmentation of Accident Scenes via Multi-Source Mixed Sampling and Meta-Learning
Xinyu Luo
Jiaming Zhang
Kailun Yang
Alina Roitberg
Kunyu Peng
Rainer Stiefelhagen
25
9
0
19 Mar 2022
Three things everyone should know about Vision Transformers
Three things everyone should know about Vision Transformers
Hugo Touvron
Matthieu Cord
Alaaeldin El-Nouby
Jakob Verbeek
Hervé Jégou
ViT
24
120
0
18 Mar 2022
Learning Audio Representations with MLPs
Learning Audio Representations with MLPs
Mashrur M. Morshed
Ahmad Omar Ahsan
H. Mahmud
Md. Kamrul Hasan
27
4
0
16 Mar 2022
CrowdMLP: Weakly-Supervised Crowd Counting via Multi-Granularity MLP
CrowdMLP: Weakly-Supervised Crowd Counting via Multi-Granularity MLP
Mingjie Wang
Jun Zhou
Hao Cai
Minglun Gong
20
29
0
15 Mar 2022
Can Neural Nets Learn the Same Model Twice? Investigating
  Reproducibility and Double Descent from the Decision Boundary Perspective
Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from the Decision Boundary Perspective
Gowthami Somepalli
Liam H. Fowl
Arpit Bansal
Ping Yeh-Chiang
Yehuda Dar
Richard Baraniuk
Micah Goldblum
Tom Goldstein
24
64
0
15 Mar 2022
Surrogate Gap Minimization Improves Sharpness-Aware Training
Surrogate Gap Minimization Improves Sharpness-Aware Training
Juntang Zhuang
Boqing Gong
Liangzhe Yuan
Huayu Chen
Hartwig Adam
Nicha Dvornek
S. Tatikonda
James Duncan
Ting Liu
27
146
0
15 Mar 2022
Self-Promoted Supervision for Few-Shot Transformer
Self-Promoted Supervision for Few-Shot Transformer
Bowen Dong
Pan Zhou
Shuicheng Yan
W. Zuo
ViT
22
28
0
14 Mar 2022
Efficient Language Modeling with Sparse all-MLP
Efficient Language Modeling with Sparse all-MLP
Ping Yu
Mikel Artetxe
Myle Ott
Sam Shleifer
Hongyu Gong
Ves Stoyanov
Xian Li
MoE
23
11
0
14 Mar 2022
Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs
Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs
Xiaohan Ding
Xinming Zhang
Yi Zhou
Jungong Han
Guiguang Ding
Jian Sun
VLM
49
528
0
13 Mar 2022
Active Token Mixer
Active Token Mixer
Guoqiang Wei
Zhizheng Zhang
Cuiling Lan
Yan Lu
Zhibo Chen
24
15
0
11 Mar 2022
Previous
123...171819...212223
Next