ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2105.01601
  4. Cited By
MLP-Mixer: An all-MLP Architecture for Vision

MLP-Mixer: An all-MLP Architecture for Vision

4 May 2021
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
Thomas Unterthiner
Jessica Yung
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
ArXivPDFHTML

Papers citing "MLP-Mixer: An all-MLP Architecture for Vision"

50 / 1,120 papers shown
Title
Deep reinforcement learning uncovers processes for separating azeotropic
  mixtures without prior knowledge
Deep reinforcement learning uncovers processes for separating azeotropic mixtures without prior knowledge
Q. Göttl
Jonathan Pirnay
Jakob Burger
D. G. Grimm
50
0
0
10 Oct 2023
TiC: Exploring Vision Transformer in Convolution
TiC: Exploring Vision Transformer in Convolution
Song Zhang
Qingzhong Wang
Jiang Bian
Haoyi Xiong
ViT
31
1
0
06 Oct 2023
HartleyMHA: Self-Attention in Frequency Domain for Resolution-Robust and
  Parameter-Efficient 3D Image Segmentation
HartleyMHA: Self-Attention in Frequency Domain for Resolution-Robust and Parameter-Efficient 3D Image Segmentation
Ken C. L. Wong
Hongzhi Wang
T. Syeda-Mahmood
42
1
0
05 Oct 2023
Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for
  Decision Making
Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making
Jeonghye Kim
Suyoung Lee
Woojun Kim
Young-Jin Sung
OffRL
37
17
0
04 Oct 2023
Pixel-Inconsistency Modeling for Image Manipulation Localization
Pixel-Inconsistency Modeling for Image Manipulation Localization
Chenqi Kong
Anwei Luo
Shiqi Wang
Haoliang Li
Anderson de Rezende Rocha
Alex C. Kot
AAML
34
15
0
30 Sep 2023
A Survey on Deep Learning Techniques for Action Anticipation
A Survey on Deep Learning Techniques for Action Anticipation
Zeyun Zhong
Manuel Martin
Michael Voit
Juergen Gall
Jürgen Beyerer
24
7
0
29 Sep 2023
Unidirectional brain-computer interface: Artificial neural network
  encoding natural images to fMRI response in the visual cortex
Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex
Ruixing Liang
Xiangyu Zhang
Qiong Li
Lai Wei
Hexin Liu
Avisha Kumar
Kelley M. Kempski Leadingham
Joshua Punnoose
Leibny Paola García
A. Manbachi
35
2
0
26 Sep 2023
Identity-preserving Editing of Multiple Facial Attributes by Learning
  Global Edit Directions and Local Adjustments
Identity-preserving Editing of Multiple Facial Attributes by Learning Global Edit Directions and Local Adjustments
Najmeh Mohammadbagheri
Fardin Ayar
A. Nickabadi
R. Safabakhsh
CVBM
GAN
24
3
0
25 Sep 2023
MLPST: MLP is All You Need for Spatio-Temporal Prediction
MLPST: MLP is All You Need for Spatio-Temporal Prediction
Zijian Zhang
Ze Huang
Zhiwei Hu
Xiangyu Zhao
Wanyu Wang
Zitao Liu
Junbo Zhang
S. Qin
Hongwei Zhao
AI4TS
22
27
0
23 Sep 2023
RBFormer: Improve Adversarial Robustness of Transformer by Robust Bias
RBFormer: Improve Adversarial Robustness of Transformer by Robust Bias
Hao Cheng
Jinhao Duan
Hui Li
Lyutianyang Zhang
Jiahang Cao
Ping Wang
Jize Zhang
Kaidi Xu
Renjing Xu
AAML
32
3
0
23 Sep 2023
DualToken-ViT: Position-aware Efficient Vision Transformer with Dual
  Token Fusion
DualToken-ViT: Position-aware Efficient Vision Transformer with Dual Token Fusion
Zhenzhen Chu
Jiayu Chen
Cen Chen
Chengyu Wang
Ziheng Wu
Jun Huang
Weining Qian
ViT
13
2
0
21 Sep 2023
Bayesian sparsification for deep neural networks with Bayesian model
  reduction
Bayesian sparsification for deep neural networks with Bayesian model reduction
Dimitrije Marković
K. Friston
S. Kiebel
BDL
UQCV
38
1
0
21 Sep 2023
Extreme Image Transformations Facilitate Robust Latent Object
  Representations
Extreme Image Transformations Facilitate Robust Latent Object Representations
Girik Malik
Dakarai Crowder
E. Mingolla
AAML
24
0
0
19 Sep 2023
Pixel Adapter: A Graph-Based Post-Processing Approach for Scene Text
  Image Super-Resolution
Pixel Adapter: A Graph-Based Post-Processing Approach for Scene Text Image Super-Resolution
Wenyu Zhang
Xin Deng
Baojun Jia
Xingtong Yu
Yifan Chen
Jin Ma
Qing Ding
Xinming Zhang
30
11
0
16 Sep 2023
Unveiling Invariances via Neural Network Pruning
Unveiling Invariances via Neural Network Pruning
Derek Xu
Yizhou Sun
Wei Wang
36
0
0
15 Sep 2023
Increasing diversity of omni-directional images generated from single
  image using cGAN based on MLPMixer
Increasing diversity of omni-directional images generated from single image using cGAN based on MLPMixer
Atsuya Nakata
Ryuto Miyazaki
Takao Yamanaka
32
1
0
15 Sep 2023
Auto-Regressive Next-Token Predictors are Universal Learners
Auto-Regressive Next-Token Predictors are Universal Learners
Eran Malach
LRM
24
36
0
13 Sep 2023
Hydra: Multi-head Low-rank Adaptation for Parameter Efficient
  Fine-tuning
Hydra: Multi-head Low-rank Adaptation for Parameter Efficient Fine-tuning
Sanghyeon Kim
Hyunmo Yang
Younghyun Kim
Youngjoon Hong
Eunbyung Park
AI4CE
32
16
0
13 Sep 2023
Dynamic Spectrum Mixer for Visual Recognition
Dynamic Spectrum Mixer for Visual Recognition
Zhiqiang Hu
Tao Yu
30
3
0
13 Sep 2023
Blendshapes GHUM: Real-time Monocular Facial Blendshape Prediction
Blendshapes GHUM: Real-time Monocular Facial Blendshape Prediction
Ivan Grishchenko
Geng Yan
Eduard Gabriel Bazavan
Andrei Zanfir
Nikolai Chinaev
Karthik Raveendran
Matthias Grundmann
C. Sminchisescu
3DH
CVBM
40
0
0
11 Sep 2023
Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and
  Luck
Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck
Benjamin L. Edelman
Surbhi Goel
Sham Kakade
Eran Malach
Cyril Zhang
48
8
0
07 Sep 2023
A Theoretical Explanation of Activation Sparsity through Flat Minima and
  Adversarial Robustness
A Theoretical Explanation of Activation Sparsity through Flat Minima and Adversarial Robustness
Ze Peng
Lei Qi
Yinghuan Shi
Yang Gao
32
3
0
06 Sep 2023
A survey on efficient vision transformers: algorithms, techniques, and
  performance benchmarking
A survey on efficient vision transformers: algorithms, techniques, and performance benchmarking
Lorenzo Papa
Paolo Russo
Irene Amerini
Luping Zhou
33
43
0
05 Sep 2023
Hindering Adversarial Attacks with Multiple Encrypted Patch Embeddings
Hindering Adversarial Attacks with Multiple Encrypted Patch Embeddings
AprilPyone Maungmaung
Isao Echizen
Hitoshi Kiya
AAML
28
2
0
04 Sep 2023
AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute
  Decomposition-Aggregation
AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation
Chaofan Ma
Yu-Hao Yang
Chen Ju
Fei Zhang
Ya Zhang
Yanfeng Wang
VLM
48
17
0
31 Aug 2023
SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for
  Skeleton-based Action Recognition
SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for Skeleton-based Action Recognition
Shaojie Zhang
Jianqin Yin
Yonghao Dang
Jiajun Fu
40
4
0
30 Aug 2023
Spatio-temporal MLP-graph network for 3D human pose estimation
Spatio-temporal MLP-graph network for 3D human pose estimation
T. Hassan
A. Ben Hamza
3DH
35
3
0
29 Aug 2023
LatentDR: Improving Model Generalization Through Sample-Aware Latent
  Degradation and Restoration
LatentDR: Improving Model Generalization Through Sample-Aware Latent Degradation and Restoration
Ran Liu
Sahil Khose
Jingyun Xiao
Lakshmi Sathidevi
Keerthan Ramnath
Z. Kira
Eva L. Dyer
34
3
0
28 Aug 2023
Task-Aware Machine Unlearning and Its Application in Load Forecasting
Task-Aware Machine Unlearning and Its Application in Load Forecasting
Wangkun Xu
Fei Teng
MU
AI4TS
40
3
0
28 Aug 2023
Boosting Residual Networks with Group Knowledge
Boosting Residual Networks with Group Knowledge
Shengji Tang
Peng Ye
Baopu Li
Wei Lin
Tao Chen
Tong He
Chong Yu
Wanli Ouyang
46
5
0
26 Aug 2023
CS-Mixer: A Cross-Scale Vision MLP Model with Spatial-Channel Mixing
CS-Mixer: A Cross-Scale Vision MLP Model with Spatial-Channel Mixing
Jianwei Cui
David A. Araujo
Suman Saha
Md Faisal Kabir
BDL
38
0
0
25 Aug 2023
CoC-GAN: Employing Context Cluster for Unveiling a New Pathway in Image
  Generation
CoC-GAN: Employing Context Cluster for Unveiling a New Pathway in Image Generation
Zihao Wang
Yiming Huang
Ziyu Zhou
29
0
0
23 Aug 2023
A Benchmark Study on Calibration
A Benchmark Study on Calibration
Linwei Tao
Younan Zhu
Haolan Guo
Minjing Dong
Chang Xu
21
9
0
23 Aug 2023
SPANet: Frequency-balancing Token Mixer using Spectral Pooling
  Aggregation Modulation
SPANet: Frequency-balancing Token Mixer using Spectral Pooling Aggregation Modulation
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Dong Hwan Kim
MoE
33
8
0
22 Aug 2023
An Effective Transformer-based Contextual Model and Temporal Gate
  Pooling for Speaker Identification
An Effective Transformer-based Contextual Model and Temporal Gate Pooling for Speaker Identification
Harunori Kawano
Sota Shimizu
30
1
0
22 Aug 2023
A Simple Framework for Multi-mode Spatial-Temporal Data Modeling
A Simple Framework for Multi-mode Spatial-Temporal Data Modeling
Zihang Liu
Le Yu
T. Zhu
Lei Sun
AI4TS
21
0
0
22 Aug 2023
MISSRec: Pre-training and Transferring Multi-modal Interest-aware
  Sequence Representation for Recommendation
MISSRec: Pre-training and Transferring Multi-modal Interest-aware Sequence Representation for Recommendation
Jinpeng Wang
Ziyun Zeng
Yunxiao Wang
Yuting Wang
Xingyu Lu
Tianxiang Li
Jun Yuan
Rui Zhang
Haitao Zheng
Shutao Xia
38
43
0
22 Aug 2023
Disposable Transfer Learning for Selective Source Task Unlearning
Disposable Transfer Learning for Selective Source Task Unlearning
Seunghee Koh
Hyounguk Shon
Janghyeon Lee
H. Hong
Junmo Kim
25
2
0
19 Aug 2023
Understanding Self-attention Mechanism via Dynamical System Perspective
Understanding Self-attention Mechanism via Dynamical System Perspective
Zhongzhan Huang
Mingfu Liang
Jinghui Qin
Shan Zhong
Liang Lin
29
15
0
19 Aug 2023
SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera
  Videos
SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos
Haisong Liu
Yao Teng
Tao Lu
Haiguang Wang
Liming Wang
16
97
0
18 Aug 2023
Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers
Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers
Tobias Christian Nauen
Sebastián M. Palacio
Federico Raue
Andreas Dengel
42
3
0
18 Aug 2023
Agglomerative Transformer for Human-Object Interaction Detection
Agglomerative Transformer for Human-Object Interaction Detection
Danyang Tu
Wei Sun
Guangtao Zhai
Wei Shen
ViT
32
5
0
16 Aug 2023
Computer vision-enriched discrete choice models, with an application to
  residential location choice
Computer vision-enriched discrete choice models, with an application to residential location choice
S. Cranenburgh
Francisco Garrido-Valenzuela
24
2
0
16 Aug 2023
Attention Is Not All You Need Anymore
Attention Is Not All You Need Anymore
Zhe Chen
32
3
0
15 Aug 2023
Block-Wise Encryption for Reliable Vision Transformer models
Block-Wise Encryption for Reliable Vision Transformer models
Hitoshi Kiya
Ryota Iijima
Teru Nagamori
33
1
0
15 Aug 2023
ST-MLP: A Cascaded Spatio-Temporal Linear Framework with
  Channel-Independence Strategy for Traffic Forecasting
ST-MLP: A Cascaded Spatio-Temporal Linear Framework with Channel-Independence Strategy for Traffic Forecasting
Zepu Wang
Yuqi Nie
Peng Sun
Nam H. Nguyen
John M. Mulvey
H. Vincent Poor
AI4TS
24
22
0
14 Aug 2023
On the Importance of Spatial Relations for Few-shot Action Recognition
On the Importance of Spatial Relations for Few-shot Action Recognition
Yilun Zhang
Yu Fu
Xingjun Ma
Lizhe Qi
Jingjing Chen
Zuxuan Wu
Yueping Jiang
ViT
19
6
0
14 Aug 2023
Sensitivity-Aware Mixed-Precision Quantization and Width Optimization of
  Deep Neural Networks Through Cluster-Based Tree-Structured Parzen Estimation
Sensitivity-Aware Mixed-Precision Quantization and Width Optimization of Deep Neural Networks Through Cluster-Based Tree-Structured Parzen Estimation
Seyedarmin Azizi
M. Nazemi
A. Fayyazi
Massoud Pedram
MQ
27
5
0
12 Aug 2023
Spatial Gated Multi-Layer Perceptron for Land Use and Land Cover Mapping
Spatial Gated Multi-Layer Perceptron for Land Use and Land Cover Mapping
Ali Jamali
Swalpa Kumar Roy
Danfeng Hong
P. M. Atkinson
Pedram Ghamisi
17
11
0
09 Aug 2023
Exploring Transformers for Open-world Instance Segmentation
Exploring Transformers for Open-world Instance Segmentation
Jiannan Wu
Yi-Xin Jiang
B. Yan
Huchuan Lu
Zehuan Yuan
Ping Luo
ViT
33
5
0
08 Aug 2023
Previous
123...8910...212223
Next