Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2105.01601
Cited By
MLP-Mixer: An all-MLP Architecture for Vision
4 May 2021
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
Thomas Unterthiner
Jessica Yung
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MLP-Mixer: An all-MLP Architecture for Vision"
50 / 1,119 papers shown
Title
Heracles: A Hybrid SSM-Transformer Model for High-Resolution Image and Time-Series Analysis
Badri N. Patro
Suhas Ranganath
Vinay P. Namboodiri
Vijay Srinivas Agneeswaran
43
2
0
26 Mar 2024
PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition
Chenhongyi Yang
Zehui Chen
Miguel Espinosa
Linus Ericsson
Zhenyu Wang
Jiaming Liu
Elliot J. Crowley
Mamba
26
86
0
26 Mar 2024
Incorporating Exponential Smoothing into MLP: A Simple but Effective Sequence Model
Jiqun Chu
Zuoquan Lin
AI4TS
25
2
0
26 Mar 2024
Neural Clustering based Visual Representation Learning
Guikun Chen
Xia Li
Yi Yang
Wenguan Wang
SSL
32
8
0
26 Mar 2024
ChebMixer: Efficient Graph Representation Learning with MLP Mixer
Xiaoyan Kui
Haonan Yan
Qinsong Li
Liming Chen
Beiji Zou
30
0
0
25 Mar 2024
SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series
Badri N. Patro
Vijay Srinivas Agneeswaran
Mamba
53
50
0
22 Mar 2024
Not All Attention is Needed: Parameter and Computation Efficient Transfer Learning for Multi-modal Large Language Models
Qiong Wu
Weihao Ye
Yiyi Zhou
Xiaoshuai Sun
Rongrong Ji
MoE
38
1
0
22 Mar 2024
Simple Graph Condensation
Zhenbang Xiao
Yu Wang
Shunyu Liu
Huiqiong Wang
Mingli Song
Tongya Zheng
DD
61
6
0
22 Mar 2024
KeyPoint Relative Position Encoding for Face Recognition
Minchul Kim
Yiyang Su
Feng Liu
Anil Jain
Xiaoming Liu
CVBM
41
7
0
21 Mar 2024
PostoMETRO: Pose Token Enhanced Mesh Transformer for Robust 3D Human Mesh Recovery
Wendi Yang
Zihang Jiang
Shang Zhao
S. Kevin Zhou
33
0
0
19 Mar 2024
Large-scale flood modeling and forecasting with FloodCast
Qingsong Xu
Yilei Shi
Jonathan Bamber
Chaojun Ouyang
Xiao Xiang Zhu
AI4CE
39
12
0
18 Mar 2024
NeoNeXt: Novel neural network operator and architecture based on the patch-wise matrix multiplications
Vladimir Korviakov
Denis Koposov
29
0
0
17 Mar 2024
D-Net: Dynamic Large Kernel with Dynamic Feature Fusion for Volumetric Medical Image Segmentation
Jin Yang
Peijie Qiu
Yichi Zhang
Daniel S. Marcus
Aristeidis Sotiras
MedIm
36
9
0
15 Mar 2024
LabelAId: Just-in-time AI Interventions for Improving Human Labeling Quality and Domain Knowledge in Crowdsourcing Systems
Chu Li
Zhihan Zhang
Michael Saugstad
Esteban Safranchik
Minchu Kulkarni
Xiaoyu Huang
Shwetak N. Patel
Vikram Iyer
Tim Althoff
Jon E. Froehlich
40
5
0
14 Mar 2024
depyf: Open the Opaque Box of PyTorch Compiler for Machine Learning Researchers
Kaichao You
Runsheng Bai
Meng Cao
Jianmin Wang
Ion Stoica
Mingsheng Long
VLM
33
0
0
14 Mar 2024
xMLP: Revolutionizing Private Inference with Exclusive Square Activation
Jiajie Li
Jinjun Xiong
19
0
0
12 Mar 2024
PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution
Honghao Chen
Xiangxiang Chu
Yongjian Ren
Xin Zhao
Kaiqi Huang
33
25
0
12 Mar 2024
EarthLoc: Astronaut Photography Localization by Indexing Earth from Space
Gabriele Berton
Alex Stoken
Barbara Caputo
Carlo Masone
24
3
0
11 Mar 2024
Joint-Embedding Masked Autoencoder for Self-supervised Learning of Dynamic Functional Connectivity from the Human Brain
Jungwon Choi
Hyungi Lee
Byung-Hoon Kim
Juho Lee
72
0
0
11 Mar 2024
Fooling Neural Networks for Motion Forecasting via Adversarial Attacks
Edgar Medina
Leyong Loh
AAML
27
0
0
07 Mar 2024
LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking
Jialin Li
Qiang Nie
Weifu Fu
Yuhuan Lin
Guangpin Tao
Yong-Jin Liu
Chengjie Wang
25
4
0
07 Mar 2024
DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training
Zhongkai Hao
Chang Su
Songming Liu
Julius Berner
Chengyang Ying
Hang Su
A. Anandkumar
Jian Song
Jun Zhu
AI4TS
AI4CE
22
21
0
06 Mar 2024
Revisiting Confidence Estimation: Towards Reliable Failure Prediction
Fei Zhu
Xu-Yao Zhang
Zhen Cheng
Cheng-Lin Liu
UQCV
44
10
0
05 Mar 2024
What do we learn from inverting CLIP models?
Hamid Kazemi
Atoosa Malemir Chegini
Jonas Geiping
S. Feizi
Tom Goldstein
31
3
0
05 Mar 2024
NiNformer: A Network in Network Transformer with Token Mixing Generated Gating Function
Abdullah Nazhat Abdullah
Tarkan Aydin
31
0
0
04 Mar 2024
HyenaPixel: Global Image Context with Convolutions
Julian Spravil
Sebastian Houben
Sven Behnke
29
1
0
29 Feb 2024
Deep learning for 3D human pose estimation and mesh recovery: A survey
Yang Liu
Changzhen Qiu
Zhiyong Zhang
3DH
39
6
0
29 Feb 2024
Mixer is more than just a model
Qingfeng Ji
Yuxin Wang
Letong Sun
30
0
0
28 Feb 2024
QN-Mixer: A Quasi-Newton MLP-Mixer Model for Sparse-View CT Reconstruction
Ishak Ayad
Nicolas Larue
Mai K. Nguyen
33
3
0
28 Feb 2024
Latent Neural PDE Solver: a reduced-order modelling framework for partial differential equations
Zijie Li
Saurabh Patil
Francis Ogoke
Dule Shu
Wilson Zhen
Michael Schneier
John R. Buchanan
A. Farimani
AI4CE
32
5
0
27 Feb 2024
An Efficient MLP-based Point-guided Segmentation Network for Ore Images with Ambiguous Boundary
Guodong Sun
Yuting Peng
Lei Cheng
Mengya Xu
An-Chi Wang
Bo Wu
Hongliang Ren
Yang Zhang
34
2
0
27 Feb 2024
Training Neural Networks from Scratch with Parallel Low-Rank Adapters
Minyoung Huh
Brian Cheung
Jeremy Bernstein
Phillip Isola
Pulkit Agrawal
35
10
0
26 Feb 2024
Why Transformers Need Adam: A Hessian Perspective
Yushun Zhang
Congliang Chen
Tian Ding
Ziniu Li
Ruoyu Sun
Zhimin Luo
32
40
0
26 Feb 2024
Shaving Weights with Occam's Razor: Bayesian Sparsification for Neural Networks Using the Marginal Likelihood
Rayen Dhahri
Alexander Immer
Bertrand Charpentier
Stephan Günnemann
Vincent Fortuin
BDL
UQCV
22
4
0
25 Feb 2024
IRConStyle: Image Restoration Framework Using Contrastive Learning and Style Transfer
Dongqi Fan
Xin Zhao
Liang Chang
24
1
0
24 Feb 2024
Mixup Barcodes: Quantifying Geometric-Topological Interactions between Point Clouds
Hubert Wagner
Nickolas Arustamyan
Matthew Wheeler
Peter Bubenik
3DPC
42
1
0
23 Feb 2024
PaCKD: Pattern-Clustered Knowledge Distillation for Compressing Memory Access Prediction Models
Neelesh Gupta
Pengmiao Zhang
Rajgopal Kannan
Viktor Prasanna
14
3
0
21 Feb 2024
Conditional Logical Message Passing Transformer for Complex Query Answering
Chongzhi Zhang
Zhiping Peng
Junhao Zheng
Qianli Ma
27
1
0
20 Feb 2024
BMLP: Behavior-aware MLP for Heterogeneous Sequential Recommendation
Weixin Li
Yuhao Wu
Yang Liu
Weike Pan
Zhong Ming
25
3
0
20 Feb 2024
Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization
James Oldfield
Markos Georgopoulos
Grigorios G. Chrysos
Christos Tzelepis
Yannis Panagakis
M. Nicolaou
Jiankang Deng
Ioannis Patras
MoE
37
8
0
19 Feb 2024
Balanced Data, Imbalanced Spectra: Unveiling Class Disparities with Spectral Imbalance
Chiraag Kaushik
Ran Liu
Chi-Heng Lin
Amrit Khera
Matthew Y Jin
Wenrui Ma
Vidya Muthukumar
Eva L. Dyer
40
3
0
18 Feb 2024
FViT: A Focal Vision Transformer with Gabor Filter
Yulong Shi
Mingwei Sun
Yongshuai Wang
Rui Wang
47
4
0
17 Feb 2024
Associative Memories in the Feature Space
Tommaso Salvatori
Beren Millidge
Yuhang Song
Rafal Bogacz
Thomas Lukasiewicz
27
1
0
16 Feb 2024
RPMixer: Shaking Up Time Series Forecasting with Random Projections for Large Spatial-Temporal Data
Chin-Chia Michael Yeh
Yujie Fan
Xin Dai
Uday Singh Saini
Vivian Lai
...
Huiyuan Chen
Yan Zheng
Zhongfang Zhuang
Liang Wang
Wei Zhang
AI4TS
18
7
0
16 Feb 2024
Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation Neuromorphic Chips
Man Yao
Jiakui Hu
Tianxiang Hu
Yifan Xu
Zhaokun Zhou
Yonghong Tian
Boxing Xu
Guoqi Li
29
55
0
15 Feb 2024
Model Compression and Efficient Inference for Large Language Models: A Survey
Wenxiao Wang
Wei Chen
Yicong Luo
Yongliu Long
Zhengkai Lin
Liye Zhang
Binbin Lin
Deng Cai
Xiaofei He
MQ
36
47
0
15 Feb 2024
Switch EMA: A Free Lunch for Better Flatness and Sharpness
Siyuan Li
Zicheng Liu
Juanxi Tian
Ge Wang
Zedong Wang
...
Cheng Tan
Tao Lin
Yang Liu
Baigui Sun
Stan Z. Li
30
6
0
14 Feb 2024
A Closer Look at the Robustness of Contrastive Language-Image Pre-Training (CLIP)
Weijie Tu
Weijian Deng
Tom Gedeon
UQCV
VLM
20
32
0
12 Feb 2024
Efficient Stagewise Pretraining via Progressive Subnetworks
Abhishek Panigrahi
Nikunj Saunshi
Kaifeng Lyu
Sobhan Miryoosefi
Sashank J. Reddi
Satyen Kale
Sanjiv Kumar
30
5
0
08 Feb 2024
Text Role Classification in Scientific Charts Using Multimodal Transformers
Hye Jin Kim
N. Lell
A. Scherp
16
0
0
08 Feb 2024
Previous
1
2
3
...
5
6
7
...
21
22
23
Next