ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2105.01601
  4. Cited By
MLP-Mixer: An all-MLP Architecture for Vision
v1v2v3v4 (latest)

MLP-Mixer: An all-MLP Architecture for Vision

4 May 2021
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
Thomas Unterthiner
Jessica Yung
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
ArXiv (abs)PDFHTML

Papers citing "MLP-Mixer: An all-MLP Architecture for Vision"

50 / 1,144 papers shown
Title
EarthLoc: Astronaut Photography Localization by Indexing Earth from
  Space
EarthLoc: Astronaut Photography Localization by Indexing Earth from Space
Gabriele Berton
Alex Stoken
Barbara Caputo
Carlo Masone
81
3
0
11 Mar 2024
Joint-Embedding Masked Autoencoder for Self-supervised Learning of Dynamic Functional Connectivity from the Human Brain
Joint-Embedding Masked Autoencoder for Self-supervised Learning of Dynamic Functional Connectivity from the Human Brain
Jungwon Choi
Hyungi Lee
Byung-Hoon Kim
Juho Lee
122
1
0
11 Mar 2024
Fooling Neural Networks for Motion Forecasting via Adversarial Attacks
Fooling Neural Networks for Motion Forecasting via Adversarial Attacks
Edgar Medina
Leyong Loh
AAML
59
0
0
07 Mar 2024
LORS: Low-rank Residual Structure for Parameter-Efficient Network
  Stacking
LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking
Jialin Li
Qiang Nie
Weifu Fu
Yuhuan Lin
Guangpin Tao
Yong-Jin Liu
Chengjie Wang
83
5
0
07 Mar 2024
DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE
  Pre-Training
DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training
Zhongkai Hao
Chang Su
Songming Liu
Julius Berner
Chengyang Ying
Hang Su
A. Anandkumar
Jian Song
Jun Zhu
AI4TSAI4CE
116
37
0
06 Mar 2024
Revisiting Confidence Estimation: Towards Reliable Failure Prediction
Revisiting Confidence Estimation: Towards Reliable Failure Prediction
Fei Zhu
Xu-Yao Zhang
Zhen Cheng
Cheng-Lin Liu
UQCV
100
12
0
05 Mar 2024
What do we learn from inverting CLIP models?
What do we learn from inverting CLIP models?
Hamid Kazemi
Atoosa Malemir Chegini
Jonas Geiping
Soheil Feizi
Tom Goldstein
55
6
0
05 Mar 2024
NiNformer: A Network in Network Transformer with Token Mixing Generated
  Gating Function
NiNformer: A Network in Network Transformer with Token Mixing Generated Gating Function
Abdullah Nazhat Abdullah
Tarkan Aydin
68
0
0
04 Mar 2024
HyenaPixel: Global Image Context with Convolutions
HyenaPixel: Global Image Context with Convolutions
Julian Spravil
Sebastian Houben
Sven Behnke
44
1
0
29 Feb 2024
Deep learning for 3D human pose estimation and mesh recovery: A survey
Deep learning for 3D human pose estimation and mesh recovery: A survey
Yang Liu
Changzhen Qiu
Zhiyong Zhang
3DH
89
9
0
29 Feb 2024
Mixer is more than just a model
Mixer is more than just a model
Qingfeng Ji
Yuxin Wang
Letong Sun
61
0
0
28 Feb 2024
QN-Mixer: A Quasi-Newton MLP-Mixer Model for Sparse-View CT
  Reconstruction
QN-Mixer: A Quasi-Newton MLP-Mixer Model for Sparse-View CT Reconstruction
Ishak Ayad
Nicolas Larue
Mai K. Nguyen
64
4
0
28 Feb 2024
Latent Neural PDE Solver: a reduced-order modelling framework for partial differential equations
Latent Neural PDE Solver: a reduced-order modelling framework for partial differential equations
Zijie Li
Saurabh Patil
Francis Ogoke
Dule Shu
Wilson Zhen
Michael Schneier
John R. Buchanan
A. Farimani
AI4CE
80
5
0
27 Feb 2024
An Efficient MLP-based Point-guided Segmentation Network for Ore Images
  with Ambiguous Boundary
An Efficient MLP-based Point-guided Segmentation Network for Ore Images with Ambiguous Boundary
Guodong Sun
Yuting Peng
Lei Cheng
Mengya Xu
An-Chi Wang
Bo Wu
Hongliang Ren
Yang Zhang
75
2
0
27 Feb 2024
Training Neural Networks from Scratch with Parallel Low-Rank Adapters
Training Neural Networks from Scratch with Parallel Low-Rank Adapters
Minyoung Huh
Brian Cheung
Jeremy Bernstein
Phillip Isola
Pulkit Agrawal
106
12
0
26 Feb 2024
Why Transformers Need Adam: A Hessian Perspective
Why Transformers Need Adam: A Hessian Perspective
Yushun Zhang
Congliang Chen
Tian Ding
Ziniu Li
Ruoyu Sun
Zhimin Luo
124
57
0
26 Feb 2024
Shaving Weights with Occam's Razor: Bayesian Sparsification for Neural
  Networks Using the Marginal Likelihood
Shaving Weights with Occam's Razor: Bayesian Sparsification for Neural Networks Using the Marginal Likelihood
Rayen Dhahri
Alexander Immer
Bertrand Charpentier
Stephan Günnemann
Vincent Fortuin
BDLUQCV
78
5
0
25 Feb 2024
IRConStyle: Image Restoration Framework Using Contrastive Learning and
  Style Transfer
IRConStyle: Image Restoration Framework Using Contrastive Learning and Style Transfer
Dongqi Fan
Xin Zhao
Liang Chang
61
1
0
24 Feb 2024
Mixup Barcodes: Quantifying Geometric-Topological Interactions between
  Point Clouds
Mixup Barcodes: Quantifying Geometric-Topological Interactions between Point Clouds
Hubert Wagner
Nickolas Arustamyan
Matthew Wheeler
Peter Bubenik
3DPC
58
1
0
23 Feb 2024
PaCKD: Pattern-Clustered Knowledge Distillation for Compressing Memory
  Access Prediction Models
PaCKD: Pattern-Clustered Knowledge Distillation for Compressing Memory Access Prediction Models
Neelesh Gupta
Pengmiao Zhang
Rajgopal Kannan
Viktor Prasanna
52
4
0
21 Feb 2024
Conditional Logical Message Passing Transformer for Complex Query
  Answering
Conditional Logical Message Passing Transformer for Complex Query Answering
Chongzhi Zhang
Zhiping Peng
Junhao Zheng
Qianli Ma
74
1
0
20 Feb 2024
BMLP: Behavior-aware MLP for Heterogeneous Sequential Recommendation
BMLP: Behavior-aware MLP for Heterogeneous Sequential Recommendation
Weixin Li
Yuhao Wu
Yang Liu
Weike Pan
Zhong Ming
74
4
0
20 Feb 2024
Multilinear Mixture of Experts: Scalable Expert Specialization through
  Factorization
Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization
James Oldfield
Markos Georgopoulos
Grigorios G. Chrysos
Christos Tzelepis
Yannis Panagakis
M. Nicolaou
Jiankang Deng
Ioannis Patras
MoE
117
10
0
19 Feb 2024
Balanced Data, Imbalanced Spectra: Unveiling Class Disparities with
  Spectral Imbalance
Balanced Data, Imbalanced Spectra: Unveiling Class Disparities with Spectral Imbalance
Chiraag Kaushik
Ran Liu
Chi-Heng Lin
Amrit Khera
Matthew Y Jin
Wenrui Ma
Vidya Muthukumar
Eva L. Dyer
81
3
0
18 Feb 2024
FViT: A Focal Vision Transformer with Gabor Filter
FViT: A Focal Vision Transformer with Gabor Filter
Yulong Shi
Mingwei Sun
Yongshuai Wang
Rui Wang
149
4
0
17 Feb 2024
Associative Memories in the Feature Space
Associative Memories in the Feature Space
Tommaso Salvatori
Beren Millidge
Yuhang Song
Rafal Bogacz
Thomas Lukasiewicz
59
1
0
16 Feb 2024
RPMixer: Shaking Up Time Series Forecasting with Random Projections for
  Large Spatial-Temporal Data
RPMixer: Shaking Up Time Series Forecasting with Random Projections for Large Spatial-Temporal Data
Chin-Chia Michael Yeh
Yujie Fan
Xin Dai
Uday Singh Saini
Vivian Lai
...
Huiyuan Chen
Yan Zheng
Zhongfang Zhuang
Liang Wang
Wei Zhang
AI4TS
46
8
0
16 Feb 2024
Spike-driven Transformer V2: Meta Spiking Neural Network Architecture
  Inspiring the Design of Next-generation Neuromorphic Chips
Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation Neuromorphic Chips
Man Yao
Jiakui Hu
Tianxiang Hu
Yifan Xu
Zhaokun Zhou
Yonghong Tian
Boxing Xu
Guoqi Li
90
64
0
15 Feb 2024
Model Compression and Efficient Inference for Large Language Models: A
  Survey
Model Compression and Efficient Inference for Large Language Models: A Survey
Wenxiao Wang
Wei Chen
Yicong Luo
Yongliu Long
Zhengkai Lin
Liye Zhang
Binbin Lin
Deng Cai
Xiaofei He
MQ
114
58
0
15 Feb 2024
Switch EMA: A Free Lunch for Better Flatness and Sharpness
Switch EMA: A Free Lunch for Better Flatness and Sharpness
Siyuan Li
Zicheng Liu
Juanxi Tian
Ge Wang
Zedong Wang
...
Cheng Tan
Tao Lin
Yang Liu
Baigui Sun
Stan Z. Li
66
6
0
14 Feb 2024
A Closer Look at the Robustness of Contrastive Language-Image
  Pre-Training (CLIP)
A Closer Look at the Robustness of Contrastive Language-Image Pre-Training (CLIP)
Weijie Tu
Weijian Deng
Tom Gedeon
UQCVVLM
70
35
0
12 Feb 2024
Efficient Stagewise Pretraining via Progressive Subnetworks
Efficient Stagewise Pretraining via Progressive Subnetworks
Abhishek Panigrahi
Nikunj Saunshi
Kaifeng Lyu
Sobhan Miryoosefi
Sashank J. Reddi
Satyen Kale
Sanjiv Kumar
65
6
0
08 Feb 2024
Text Role Classification in Scientific Charts Using Multimodal
  Transformers
Text Role Classification in Scientific Charts Using Multimodal Transformers
Hye Jin Kim
N. Lell
A. Scherp
46
0
0
08 Feb 2024
TASER: Temporal Adaptive Sampling for Fast and Accurate Dynamic Graph
  Representation Learning
TASER: Temporal Adaptive Sampling for Fast and Accurate Dynamic Graph Representation Learning
Gangda Deng
Hongkuan Zhou
Hanqing Zeng
Yinglong Xia
Christopher Leung
Jianbo Li
Rajgopal Kannan
Viktor Prasanna
82
1
0
08 Feb 2024
CAST: Clustering Self-Attention using Surrogate Tokens for Efficient
  Transformers
CAST: Clustering Self-Attention using Surrogate Tokens for Efficient Transformers
Adjorn van Engelenhoven
Nicola Strisciuglio
Estefanía Talavera
104
1
0
06 Feb 2024
MOMENT: A Family of Open Time-series Foundation Models
MOMENT: A Family of Open Time-series Foundation Models
Mononito Goswami
Konrad Szafer
Arjun Choudhry
Yifu Cai
Shuo Li
Artur Dubrawski
AIFinAI4TS
109
153
0
06 Feb 2024
MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction
MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction
Heng Zhou
Zhetao Guo
Shuhong Liu
Lechen Zhang
Qihao Wang
Yuxiang Ren
Mingrui Li
MDE
101
14
0
06 Feb 2024
Professional Agents -- Evolving Large Language Models into Autonomous
  Experts with Human-Level Competencies
Professional Agents -- Evolving Large Language Models into Autonomous Experts with Human-Level Competencies
Zhixuan Chu
Yan Wang
Feng Zhu
Lu Yu
Longfei Li
Jinjie Gu
LLMAG
37
9
0
06 Feb 2024
A Survey on Transformer Compression
A Survey on Transformer Compression
Yehui Tang
Yunhe Wang
Jianyuan Guo
Zhijun Tu
Kai Han
Hailin Hu
Dacheng Tao
146
35
0
05 Feb 2024
Time-, Memory- and Parameter-Efficient Visual Adaptation
Time-, Memory- and Parameter-Efficient Visual Adaptation
Otniel-Bogdan Mercea
Alexey Gritsenko
Cordelia Schmid
Anurag Arnab
VLM
77
15
0
05 Feb 2024
Enhancing Transformer RNNs with Multiple Temporal Perspectives
Enhancing Transformer RNNs with Multiple Temporal Perspectives
Razvan-Gabriel Dumitru
Darius Peteleaza
Mihai Surdeanu
AI4TS
32
2
0
04 Feb 2024
Spatio-temporal Prompting Network for Robust Video Feature Extraction
Spatio-temporal Prompting Network for Robust Video Feature Extraction
Guanxiong Sun
Chi Wang
Zhaoyu Zhang
Jiankang Deng
Stefanos Zafeiriou
Yang Hua
ViT
55
4
0
04 Feb 2024
NOAH: Learning Pairwise Object Category Attentions for Image
  Classification
NOAH: Learning Pairwise Object Category Attentions for Image Classification
Chao Li
Aojun Zhou
Anbang Yao
VLM
56
2
0
04 Feb 2024
Polyp-DAM: Polyp segmentation via depth anything model
Polyp-DAM: Polyp segmentation via depth anything model
Zhuoran Zheng
Chen Henry Wu
Wei Wang
Yeying Jin
Xiuyi Jia
VLM
77
6
0
03 Feb 2024
ParZC: Parametric Zero-Cost Proxies for Efficient NAS
ParZC: Parametric Zero-Cost Proxies for Efficient NAS
Peijie Dong
Lujun Li
Xinglin Pan
Zimian Wei
Xiang Liu
Qiang-qiang Wang
Xiaowen Chu
85
3
0
03 Feb 2024
Todyformer: Towards Holistic Dynamic Graph Transformers with
  Structure-Aware Tokenization
Todyformer: Towards Holistic Dynamic Graph Transformers with Structure-Aware Tokenization
Mahdi Biparva
Raika Karimi
Faezeh Faez
Yingxue Zhang
53
3
0
02 Feb 2024
LIR: A Lightweight Baseline for Image Restoration
LIR: A Lightweight Baseline for Image Restoration
Dongqi Fan
Ting Yue
Xin Zhao
Renjing Xu
Liang Chang
58
0
0
02 Feb 2024
Towards Optimal Feature-Shaping Methods for Out-of-Distribution
  Detection
Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection
Qinyu Zhao
Ming Xu
Kartik Gupta
Akshay Asthana
Liang Zheng
Stephen Gould
OODD
34
7
0
01 Feb 2024
A Single Graph Convolution Is All You Need: Efficient Grayscale Image
  Classification
A Single Graph Convolution Is All You Need: Efficient Grayscale Image Classification
Jacob Fein-Ashley
S. Wickramasinghe
Bingyi Zhang
Rajgopal Kannan
Viktor Prasanna
37
5
0
01 Feb 2024
A Manifold Representation of the Key in Vision Transformers
A Manifold Representation of the Key in Vision Transformers
Li Meng
Morten Goodwin
Anis Yazidi
P. Engelstad
86
0
0
01 Feb 2024
Previous
123...678...212223
Next