Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2105.01601
Cited By
v1
v2
v3
v4 (latest)
MLP-Mixer: An all-MLP Architecture for Vision
4 May 2021
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
Thomas Unterthiner
Jessica Yung
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"MLP-Mixer: An all-MLP Architecture for Vision"
50 / 1,144 papers shown
Title
EarthLoc: Astronaut Photography Localization by Indexing Earth from Space
Gabriele Berton
Alex Stoken
Barbara Caputo
Carlo Masone
81
3
0
11 Mar 2024
Joint-Embedding Masked Autoencoder for Self-supervised Learning of Dynamic Functional Connectivity from the Human Brain
Jungwon Choi
Hyungi Lee
Byung-Hoon Kim
Juho Lee
122
1
0
11 Mar 2024
Fooling Neural Networks for Motion Forecasting via Adversarial Attacks
Edgar Medina
Leyong Loh
AAML
59
0
0
07 Mar 2024
LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking
Jialin Li
Qiang Nie
Weifu Fu
Yuhuan Lin
Guangpin Tao
Yong-Jin Liu
Chengjie Wang
83
5
0
07 Mar 2024
DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training
Zhongkai Hao
Chang Su
Songming Liu
Julius Berner
Chengyang Ying
Hang Su
A. Anandkumar
Jian Song
Jun Zhu
AI4TS
AI4CE
116
37
0
06 Mar 2024
Revisiting Confidence Estimation: Towards Reliable Failure Prediction
Fei Zhu
Xu-Yao Zhang
Zhen Cheng
Cheng-Lin Liu
UQCV
100
12
0
05 Mar 2024
What do we learn from inverting CLIP models?
Hamid Kazemi
Atoosa Malemir Chegini
Jonas Geiping
Soheil Feizi
Tom Goldstein
55
6
0
05 Mar 2024
NiNformer: A Network in Network Transformer with Token Mixing Generated Gating Function
Abdullah Nazhat Abdullah
Tarkan Aydin
68
0
0
04 Mar 2024
HyenaPixel: Global Image Context with Convolutions
Julian Spravil
Sebastian Houben
Sven Behnke
44
1
0
29 Feb 2024
Deep learning for 3D human pose estimation and mesh recovery: A survey
Yang Liu
Changzhen Qiu
Zhiyong Zhang
3DH
89
9
0
29 Feb 2024
Mixer is more than just a model
Qingfeng Ji
Yuxin Wang
Letong Sun
61
0
0
28 Feb 2024
QN-Mixer: A Quasi-Newton MLP-Mixer Model for Sparse-View CT Reconstruction
Ishak Ayad
Nicolas Larue
Mai K. Nguyen
64
4
0
28 Feb 2024
Latent Neural PDE Solver: a reduced-order modelling framework for partial differential equations
Zijie Li
Saurabh Patil
Francis Ogoke
Dule Shu
Wilson Zhen
Michael Schneier
John R. Buchanan
A. Farimani
AI4CE
80
5
0
27 Feb 2024
An Efficient MLP-based Point-guided Segmentation Network for Ore Images with Ambiguous Boundary
Guodong Sun
Yuting Peng
Lei Cheng
Mengya Xu
An-Chi Wang
Bo Wu
Hongliang Ren
Yang Zhang
75
2
0
27 Feb 2024
Training Neural Networks from Scratch with Parallel Low-Rank Adapters
Minyoung Huh
Brian Cheung
Jeremy Bernstein
Phillip Isola
Pulkit Agrawal
106
12
0
26 Feb 2024
Why Transformers Need Adam: A Hessian Perspective
Yushun Zhang
Congliang Chen
Tian Ding
Ziniu Li
Ruoyu Sun
Zhimin Luo
124
57
0
26 Feb 2024
Shaving Weights with Occam's Razor: Bayesian Sparsification for Neural Networks Using the Marginal Likelihood
Rayen Dhahri
Alexander Immer
Bertrand Charpentier
Stephan Günnemann
Vincent Fortuin
BDL
UQCV
78
5
0
25 Feb 2024
IRConStyle: Image Restoration Framework Using Contrastive Learning and Style Transfer
Dongqi Fan
Xin Zhao
Liang Chang
61
1
0
24 Feb 2024
Mixup Barcodes: Quantifying Geometric-Topological Interactions between Point Clouds
Hubert Wagner
Nickolas Arustamyan
Matthew Wheeler
Peter Bubenik
3DPC
58
1
0
23 Feb 2024
PaCKD: Pattern-Clustered Knowledge Distillation for Compressing Memory Access Prediction Models
Neelesh Gupta
Pengmiao Zhang
Rajgopal Kannan
Viktor Prasanna
52
4
0
21 Feb 2024
Conditional Logical Message Passing Transformer for Complex Query Answering
Chongzhi Zhang
Zhiping Peng
Junhao Zheng
Qianli Ma
74
1
0
20 Feb 2024
BMLP: Behavior-aware MLP for Heterogeneous Sequential Recommendation
Weixin Li
Yuhao Wu
Yang Liu
Weike Pan
Zhong Ming
74
4
0
20 Feb 2024
Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization
James Oldfield
Markos Georgopoulos
Grigorios G. Chrysos
Christos Tzelepis
Yannis Panagakis
M. Nicolaou
Jiankang Deng
Ioannis Patras
MoE
117
10
0
19 Feb 2024
Balanced Data, Imbalanced Spectra: Unveiling Class Disparities with Spectral Imbalance
Chiraag Kaushik
Ran Liu
Chi-Heng Lin
Amrit Khera
Matthew Y Jin
Wenrui Ma
Vidya Muthukumar
Eva L. Dyer
81
3
0
18 Feb 2024
FViT: A Focal Vision Transformer with Gabor Filter
Yulong Shi
Mingwei Sun
Yongshuai Wang
Rui Wang
149
4
0
17 Feb 2024
Associative Memories in the Feature Space
Tommaso Salvatori
Beren Millidge
Yuhang Song
Rafal Bogacz
Thomas Lukasiewicz
59
1
0
16 Feb 2024
RPMixer: Shaking Up Time Series Forecasting with Random Projections for Large Spatial-Temporal Data
Chin-Chia Michael Yeh
Yujie Fan
Xin Dai
Uday Singh Saini
Vivian Lai
...
Huiyuan Chen
Yan Zheng
Zhongfang Zhuang
Liang Wang
Wei Zhang
AI4TS
46
8
0
16 Feb 2024
Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation Neuromorphic Chips
Man Yao
Jiakui Hu
Tianxiang Hu
Yifan Xu
Zhaokun Zhou
Yonghong Tian
Boxing Xu
Guoqi Li
90
64
0
15 Feb 2024
Model Compression and Efficient Inference for Large Language Models: A Survey
Wenxiao Wang
Wei Chen
Yicong Luo
Yongliu Long
Zhengkai Lin
Liye Zhang
Binbin Lin
Deng Cai
Xiaofei He
MQ
114
58
0
15 Feb 2024
Switch EMA: A Free Lunch for Better Flatness and Sharpness
Siyuan Li
Zicheng Liu
Juanxi Tian
Ge Wang
Zedong Wang
...
Cheng Tan
Tao Lin
Yang Liu
Baigui Sun
Stan Z. Li
66
6
0
14 Feb 2024
A Closer Look at the Robustness of Contrastive Language-Image Pre-Training (CLIP)
Weijie Tu
Weijian Deng
Tom Gedeon
UQCV
VLM
70
35
0
12 Feb 2024
Efficient Stagewise Pretraining via Progressive Subnetworks
Abhishek Panigrahi
Nikunj Saunshi
Kaifeng Lyu
Sobhan Miryoosefi
Sashank J. Reddi
Satyen Kale
Sanjiv Kumar
65
6
0
08 Feb 2024
Text Role Classification in Scientific Charts Using Multimodal Transformers
Hye Jin Kim
N. Lell
A. Scherp
46
0
0
08 Feb 2024
TASER: Temporal Adaptive Sampling for Fast and Accurate Dynamic Graph Representation Learning
Gangda Deng
Hongkuan Zhou
Hanqing Zeng
Yinglong Xia
Christopher Leung
Jianbo Li
Rajgopal Kannan
Viktor Prasanna
82
1
0
08 Feb 2024
CAST: Clustering Self-Attention using Surrogate Tokens for Efficient Transformers
Adjorn van Engelenhoven
Nicola Strisciuglio
Estefanía Talavera
104
1
0
06 Feb 2024
MOMENT: A Family of Open Time-series Foundation Models
Mononito Goswami
Konrad Szafer
Arjun Choudhry
Yifu Cai
Shuo Li
Artur Dubrawski
AIFin
AI4TS
109
153
0
06 Feb 2024
MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction
Heng Zhou
Zhetao Guo
Shuhong Liu
Lechen Zhang
Qihao Wang
Yuxiang Ren
Mingrui Li
MDE
101
14
0
06 Feb 2024
Professional Agents -- Evolving Large Language Models into Autonomous Experts with Human-Level Competencies
Zhixuan Chu
Yan Wang
Feng Zhu
Lu Yu
Longfei Li
Jinjie Gu
LLMAG
37
9
0
06 Feb 2024
A Survey on Transformer Compression
Yehui Tang
Yunhe Wang
Jianyuan Guo
Zhijun Tu
Kai Han
Hailin Hu
Dacheng Tao
146
35
0
05 Feb 2024
Time-, Memory- and Parameter-Efficient Visual Adaptation
Otniel-Bogdan Mercea
Alexey Gritsenko
Cordelia Schmid
Anurag Arnab
VLM
77
15
0
05 Feb 2024
Enhancing Transformer RNNs with Multiple Temporal Perspectives
Razvan-Gabriel Dumitru
Darius Peteleaza
Mihai Surdeanu
AI4TS
32
2
0
04 Feb 2024
Spatio-temporal Prompting Network for Robust Video Feature Extraction
Guanxiong Sun
Chi Wang
Zhaoyu Zhang
Jiankang Deng
Stefanos Zafeiriou
Yang Hua
ViT
55
4
0
04 Feb 2024
NOAH: Learning Pairwise Object Category Attentions for Image Classification
Chao Li
Aojun Zhou
Anbang Yao
VLM
56
2
0
04 Feb 2024
Polyp-DAM: Polyp segmentation via depth anything model
Zhuoran Zheng
Chen Henry Wu
Wei Wang
Yeying Jin
Xiuyi Jia
VLM
77
6
0
03 Feb 2024
ParZC: Parametric Zero-Cost Proxies for Efficient NAS
Peijie Dong
Lujun Li
Xinglin Pan
Zimian Wei
Xiang Liu
Qiang-qiang Wang
Xiaowen Chu
85
3
0
03 Feb 2024
Todyformer: Towards Holistic Dynamic Graph Transformers with Structure-Aware Tokenization
Mahdi Biparva
Raika Karimi
Faezeh Faez
Yingxue Zhang
53
3
0
02 Feb 2024
LIR: A Lightweight Baseline for Image Restoration
Dongqi Fan
Ting Yue
Xin Zhao
Renjing Xu
Liang Chang
58
0
0
02 Feb 2024
Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection
Qinyu Zhao
Ming Xu
Kartik Gupta
Akshay Asthana
Liang Zheng
Stephen Gould
OODD
34
7
0
01 Feb 2024
A Single Graph Convolution Is All You Need: Efficient Grayscale Image Classification
Jacob Fein-Ashley
S. Wickramasinghe
Bingyi Zhang
Rajgopal Kannan
Viktor Prasanna
37
5
0
01 Feb 2024
A Manifold Representation of the Key in Vision Transformers
Li Meng
Morten Goodwin
Anis Yazidi
P. Engelstad
86
0
0
01 Feb 2024
Previous
1
2
3
...
6
7
8
...
21
22
23
Next