ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2105.01601
  4. Cited By
MLP-Mixer: An all-MLP Architecture for Vision

MLP-Mixer: An all-MLP Architecture for Vision

4 May 2021
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
Thomas Unterthiner
Jessica Yung
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
ArXivPDFHTML

Papers citing "MLP-Mixer: An all-MLP Architecture for Vision"

50 / 1,124 papers shown
Title
Scaling & Shifting Your Features: A New Baseline for Efficient Model
  Tuning
Scaling & Shifting Your Features: A New Baseline for Efficient Model Tuning
Dongze Lian
Daquan Zhou
Jiashi Feng
Xinchao Wang
36
250
0
17 Oct 2022
Scratching Visual Transformer's Back with Uniform Attention
Scratching Visual Transformer's Back with Uniform Attention
Nam Hyeon-Woo
Kim Yu-Ji
Byeongho Heo
Doonyoon Han
Seong Joon Oh
Tae-Hyun Oh
366
23
0
16 Oct 2022
Transformer-Based Speech Synthesizer Attribution in an Open Set Scenario
Transformer-Based Speech Synthesizer Attribution in an Open Set Scenario
Emily R. Bartusiak
Edward J. Delp
27
12
0
14 Oct 2022
Probabilistic Integration of Object Level Annotations in Chest X-ray
  Classification
Probabilistic Integration of Object Level Annotations in Chest X-ray Classification
Tom van Sonsbeek
Xiantong Zhen
Dwarikanath Mahapatra
M. Worring
40
12
0
13 Oct 2022
Parameter-Efficient Masking Networks
Parameter-Efficient Masking Networks
Yue Bai
Huan Wang
Xu Ma
Yitian Zhang
Zhiqiang Tao
Yun Fu
31
10
0
13 Oct 2022
Token-Label Alignment for Vision Transformers
Token-Label Alignment for Vision Transformers
Han Xiao
Wenzhao Zheng
Zhengbiao Zhu
Jie Zhou
Jiwen Lu
26
4
0
12 Oct 2022
The Lazy Neuron Phenomenon: On Emergence of Activation Sparsity in
  Transformers
The Lazy Neuron Phenomenon: On Emergence of Activation Sparsity in Transformers
Zong-xiao Li
Chong You
Srinadh Bhojanapalli
Daliang Li
A. S. Rawat
...
Kenneth Q Ye
Felix Chern
Felix X. Yu
Ruiqi Guo
Surinder Kumar
MoE
29
87
0
12 Oct 2022
FCT-GAN: Enhancing Table Synthesis via Fourier Transform
FCT-GAN: Enhancing Table Synthesis via Fourier Transform
Zilong Zhao
Robert Birke
L. Chen
27
7
0
12 Oct 2022
The Unreasonable Effectiveness of Fully-Connected Layers for Low-Data
  Regimes
The Unreasonable Effectiveness of Fully-Connected Layers for Low-Data Regimes
Peter Kocsis
Peter Súkeník
Guillem Brasó
Matthias Nießner
Laura Leal-Taixé
Ismail Elezi
16
7
0
11 Oct 2022
EarthNets: Empowering AI in Earth Observation
EarthNets: Empowering AI in Earth Observation
Zhitong Xiong
Fahong Zhang
Yi Wang
Yilei Shi
Xiao Xiang Zhu
101
73
0
10 Oct 2022
LieGG: Studying Learned Lie Group Generators
LieGG: Studying Learned Lie Group Generators
A. Moskalev
A. Sepliarskaia
Ivan Sosnovik
A. Smeulders
33
22
0
09 Oct 2022
Are All Vision Models Created Equal? A Study of the Open-Loop to
  Closed-Loop Causality Gap
Are All Vision Models Created Equal? A Study of the Open-Loop to Closed-Loop Causality Gap
Mathias Lechner
Ramin Hasani
Alexander Amini
Tsun-Hsuan Wang
T. Henzinger
Daniela Rus
CML
OOD
29
7
0
09 Oct 2022
Boost Event-Driven Tactile Learning with Location Spiking Neurons
Boost Event-Driven Tactile Learning with Location Spiking Neurons
Pengxin Kang
Srutarshi Banerjee
Henry H. Chopp
Aggelos K. Katsaggelos
O. Cossairt
23
9
0
09 Oct 2022
Explainable fMRI-based Brain Decoding via Spatial Temporal-pyramid Graph
  Convolutional Network
Explainable fMRI-based Brain Decoding via Spatial Temporal-pyramid Graph Convolutional Network
Ziyuan Ye
Youzhi Qu
Zhichao Liang
Mo Wang
Quanying Liu
18
13
0
08 Oct 2022
ViewFool: Evaluating the Robustness of Visual Recognition to Adversarial
  Viewpoints
ViewFool: Evaluating the Robustness of Visual Recognition to Adversarial Viewpoints
Yinpeng Dong
Shouwei Ruan
Hang Su
Cai Kang
Xingxing Wei
Junyi Zhu
AAML
32
49
0
08 Oct 2022
Towards Light Weight Object Detection System
Towards Light Weight Object Detection System
K. Dharma
V. Dayana
Menglan Wu
Venkateswara Rao Cherukuri
Hau Hwang
18
1
0
08 Oct 2022
PS-ARM: An End-to-End Attention-aware Relation Mixer Network for Person
  Search
PS-ARM: An End-to-End Attention-aware Relation Mixer Network for Person Search
M. Fiaz
Hisham Cholakkal
Sanath Narayan
Rao Muhammad Anwer
Fahad Shahbaz Khan
38
4
0
07 Oct 2022
Scaling Forward Gradient With Local Losses
Scaling Forward Gradient With Local Losses
Mengye Ren
Simon Kornblith
Renjie Liao
Geoffrey E. Hinton
81
49
0
07 Oct 2022
The Lie Derivative for Measuring Learned Equivariance
The Lie Derivative for Measuring Learned Equivariance
Nate Gruver
Marc Finzi
Micah Goldblum
A. Wilson
23
35
0
06 Oct 2022
Centralized Feature Pyramid for Object Detection
Centralized Feature Pyramid for Object Detection
Yu Quan
Dong Zhang
Liyan Zhang
Jinhui Tang
ObjD
36
154
0
05 Oct 2022
The Calibration Generalization Gap
The Calibration Generalization Gap
Annabelle Carrell
Neil Rohit Mallinar
James Lucas
Preetum Nakkiran
UQCV
34
18
0
05 Oct 2022
One-shot Detail Retouching with Patch Space Neural Transformation
  Blending
One-shot Detail Retouching with Patch Space Neural Transformation Blending
Fazilet Gokbudak
Cengiz Öztireli
45
1
0
03 Oct 2022
Rethinking skip connection model as a learnable Markov chain
Rethinking skip connection model as a learnable Markov chain
Dengsheng Chen
Jie Hu
Jingyao Wang
Xiaoming Wei
Enhua Wu
BDL
27
1
0
30 Sep 2022
Neural Methods for Logical Reasoning Over Knowledge Graphs
Neural Methods for Logical Reasoning Over Knowledge Graphs
Alfonso Amayuelas
Shuai Zhang
Susie Xi Rao
Ce Zhang
NAI
78
24
0
28 Sep 2022
A Closer Look at Evaluating the Bit-Flip Attack Against Deep Neural
  Networks
A Closer Look at Evaluating the Bit-Flip Attack Against Deep Neural Networks
Kevin Hector
Mathieu Dumont
Pierre-Alain Moëllic
J. Dutertre
AAML
32
4
0
28 Sep 2022
Exploring the Relationship between Architecture and Adversarially Robust
  Generalization
Exploring the Relationship between Architecture and Adversarially Robust Generalization
Aishan Liu
Shiyu Tang
Siyuan Liang
Ruihao Gong
Boxi Wu
Xianglong Liu
Dacheng Tao
AAML
34
18
0
28 Sep 2022
Scaling Laws For Deep Learning Based Image Reconstruction
Scaling Laws For Deep Learning Based Image Reconstruction
Tobit Klug
Reinhard Heckel
70
12
0
27 Sep 2022
Fast-FNet: Accelerating Transformer Encoder Models via Efficient Fourier
  Layers
Fast-FNet: Accelerating Transformer Encoder Models via Efficient Fourier Layers
Nurullah Sevim
Ege Ozan Özyedek
Furkan Şahinuç
Aykut Koç
40
11
0
26 Sep 2022
Greybox XAI: a Neural-Symbolic learning framework to produce
  interpretable predictions for image classification
Greybox XAI: a Neural-Symbolic learning framework to produce interpretable predictions for image classification
Adrien Bennetot
Gianni Franchi
Javier Del Ser
Raja Chatila
Natalia Díaz Rodríguez
AAML
34
28
0
26 Sep 2022
Optimal Transport-based Identity Matching for Identity-invariant Facial
  Expression Recognition
Optimal Transport-based Identity Matching for Identity-invariant Facial Expression Recognition
D. Kim
B. Song
OT
45
10
0
25 Sep 2022
Neural Clamping: Joint Input Perturbation and Temperature Scaling for
  Neural Network Calibration
Neural Clamping: Joint Input Perturbation and Temperature Scaling for Neural Network Calibration
Yu Tang
Pin-Yu Chen
Tsung-Yi Ho
31
5
0
23 Sep 2022
Implementing and Experimenting with Diffusion Models for Text-to-Image
  Generation
Implementing and Experimenting with Diffusion Models for Text-to-Image Generation
Robin Zbinden
33
3
0
22 Sep 2022
HiFuse: Hierarchical Multi-Scale Feature Fusion Network for Medical
  Image Classification
HiFuse: Hierarchical Multi-Scale Feature Fusion Network for Medical Image Classification
Xiangzuo Huo
Gang Sun
Sheng Tian
Yan Wang
Long Yu
Jun Long
Wendong Zhang
Aolun Li
30
101
0
21 Sep 2022
Adaptable Butterfly Accelerator for Attention-based NNs via Hardware and
  Algorithm Co-design
Adaptable Butterfly Accelerator for Attention-based NNs via Hardware and Algorithm Co-design
Hongxiang Fan
Thomas C. P. Chau
Stylianos I. Venieris
Royson Lee
Alexandros Kouris
Wayne Luk
Nicholas D. Lane
Mohamed S. Abdelfattah
40
58
0
20 Sep 2022
Diffusion Unit: Interpretable Edge Enhancement and Suppression Learning
  for 3D Point Cloud Segmentation
Diffusion Unit: Interpretable Edge Enhancement and Suppression Learning for 3D Point Cloud Segmentation
H. Xiu
Xin Liu
Weimin Wang
Kyoung-Sook Kim
T. Shinohara
Qiong Chang
M. Matsuoka
3DPC
52
11
0
20 Sep 2022
Quantum Vision Transformers
Quantum Vision Transformers
El Amine Cherrat
Iordanis Kerenidis
Natansh Mathur
Jonas Landman
M. Strahm
Yun. Y Li
ViT
39
55
0
16 Sep 2022
HarDNet-DFUS: An Enhanced Harmonically-Connected Network for Diabetic
  Foot Ulcer Image Segmentation and Colonoscopy Polyp Segmentation
HarDNet-DFUS: An Enhanced Harmonically-Connected Network for Diabetic Foot Ulcer Image Segmentation and Colonoscopy Polyp Segmentation
Ting-Yu Liao
Ching-Hui Yang
Y. Lo
Kuan-Ying Lai
Po-Huai Shen
Youn-Long Lin
13
15
0
15 Sep 2022
PaLI: A Jointly-Scaled Multilingual Language-Image Model
PaLI: A Jointly-Scaled Multilingual Language-Image Model
Xi Chen
Tianlin Li
Soravit Changpinyo
A. Piergiovanni
Piotr Padlewski
...
Andreas Steiner
A. Angelova
Xiaohua Zhai
N. Houlsby
Radu Soricut
MLLM
VLM
37
688
0
14 Sep 2022
Analysis of Quantization on MLP-based Vision Models
Analysis of Quantization on MLP-based Vision Models
Lingran Zhao
Zhen Dong
Kurt Keutzer
MQ
32
7
0
14 Sep 2022
Revisiting Neural Scaling Laws in Language and Vision
Revisiting Neural Scaling Laws in Language and Vision
Ibrahim M. Alabdulmohsin
Behnam Neyshabur
Xiaohua Zhai
159
103
0
13 Sep 2022
A lightweight Transformer-based model for fish landmark detection
A lightweight Transformer-based model for fish landmark detection
Alzayat Saleh
David Jones
D. Jerry
M. R. Azghadi
26
1
0
13 Sep 2022
Pre-Training a Graph Recurrent Network for Language Representation
Pre-Training a Graph Recurrent Network for Language Representation
Yile Wang
Linyi Yang
Zhiyang Teng
M. Zhou
Yue Zhang
GNN
38
1
0
08 Sep 2022
Predicting the clinical citation count of biomedical papers using
  multilayer perceptron neural network
Predicting the clinical citation count of biomedical papers using multilayer perceptron neural network
Xin Li
Xuli Tang
Qikai Cheng
19
17
0
07 Sep 2022
A Review of Sparse Expert Models in Deep Learning
A Review of Sparse Expert Models in Deep Learning
W. Fedus
J. Dean
Barret Zoph
MoE
25
145
0
04 Sep 2022
Towards Accurate Binary Neural Networks via Modeling Contextual
  Dependencies
Towards Accurate Binary Neural Networks via Modeling Contextual Dependencies
Xingrun Xing
Yangguang Li
Wei Li
Wenrui Ding
Yalong Jiang
Yufeng Wang
Jinghua Shao
Chunlei Liu
Xianglong Liu
MQ
11
8
0
03 Sep 2022
Generating Coherent Drum Accompaniment With Fills And Improvisations
Generating Coherent Drum Accompaniment With Fills And Improvisations
Rishabh A. Dahale
Vaibhav Talwadker
Preeti Rao
Prateek Verma
32
3
0
01 Sep 2022
MRL: Learning to Mix with Attention and Convolutions
MRL: Learning to Mix with Attention and Convolutions
Shlok Mohta
Hisahiro Suganuma
Yoshiki Tanaka
28
2
0
30 Aug 2022
gSwin: Gated MLP Vision Model with Hierarchical Structure of Shifted
  Window
gSwin: Gated MLP Vision Model with Hierarchical Structure of Shifted Window
Mocho Go
Hideyuki Tachibana
ViT
37
9
0
24 Aug 2022
Efficient Attention-free Video Shift Transformers
Efficient Attention-free Video Shift Transformers
Adrian Bulat
Brais Martínez
Georgios Tzimiropoulos
ViT
34
1
0
23 Aug 2022
CM-MLP: Cascade Multi-scale MLP with Axial Context Relation Encoder for
  Edge Segmentation of Medical Image
CM-MLP: Cascade Multi-scale MLP with Axial Context Relation Encoder for Edge Segmentation of Medical Image
Jinkai Lv
Yuyong Hu
Quanshui Fu
Zhiwang Zhang
Yuqiang Hu
Lin Lv
Guoqing Yang
Jinpeng Li
Yi Zhao
MedIm
28
9
0
23 Aug 2022
Previous
123...141516...212223
Next