Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2105.08050
Cited By
Pay Attention to MLPs
17 May 2021
Hanxiao Liu
Zihang Dai
David R. So
Quoc V. Le
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Pay Attention to MLPs"
50 / 303 papers shown
Title
CU-Mamba: Selective State Space Models with Channel Learning for Image Restoration
Rui Deng
Tianpei Gu
Mamba
42
16
0
17 Apr 2024
HGRN2: Gated Linear RNNs with State Expansion
Zhen Qin
Songlin Yang
Weixuan Sun
Xuyang Shen
Dong Li
Weigao Sun
Yiran Zhong
LRM
47
47
0
11 Apr 2024
SpiralMLP: A Lightweight Vision MLP Architecture
Haojie Mu
Burhan Ul Tayyab
Nicholas Chua
43
0
0
31 Mar 2024
A Survey on Large Language Models from Concept to Implementation
Chen Wang
Jin Zhao
Jiaqi Gong
LLMAG
LM&MA
37
3
0
27 Mar 2024
Heracles: A Hybrid SSM-Transformer Model for High-Resolution Image and Time-Series Analysis
Badri N. Patro
Suhas Ranganath
Vinay P. Namboodiri
Vijay Srinivas Agneeswaran
43
2
0
26 Mar 2024
Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUs
Kai Yuan
Christoph Bauinger
Xiangyi Zhang
Pascal Baehr
Matthias Kirchhart
Darius Dabert
Adrien Tousnakhoff
Pierre Boudier
Michael Paulitsch
34
2
0
26 Mar 2024
Incorporating Exponential Smoothing into MLP: A Simple but Effective Sequence Model
Jiqun Chu
Zuoquan Lin
AI4TS
30
2
0
26 Mar 2024
Neural Clustering based Visual Representation Learning
Guikun Chen
Xia Li
Yi Yang
Wenguan Wang
SSL
37
8
0
26 Mar 2024
State Space Models as Foundation Models: A Control Theoretic Overview
Carmen Amo Alonso
Jerome Sieber
M. Zeilinger
AI4CE
Mamba
36
13
0
25 Mar 2024
A Comparison of Deep Learning Architectures for Spacecraft Anomaly Detection
Daniel Lakey
Tim Schlippe
35
2
0
19 Mar 2024
depyf: Open the Opaque Box of PyTorch Compiler for Machine Learning Researchers
Kaichao You
Runsheng Bai
Meng Cao
Jianmin Wang
Ion Stoica
Mingsheng Long
VLM
33
0
0
14 Mar 2024
NiNformer: A Network in Network Transformer with Token Mixing Generated Gating Function
Abdullah Nazhat Abdullah
Tarkan Aydin
39
0
0
04 Mar 2024
Accelerating Greedy Coordinate Gradient via Probe Sampling
Yiran Zhao
Wenyue Zheng
Tianle Cai
Xuan Long Do
Kenji Kawaguchi
Anirudh Goyal
Michael Shieh
45
11
0
02 Mar 2024
Mixer is more than just a model
Qingfeng Ji
Yuxin Wang
Letong Sun
40
0
0
28 Feb 2024
Learning to See Through Dazzle
Xiaopeng Peng
Erin F. Fleet
A. Watnik
Grover A. Swartzlander
GAN
AAML
32
4
0
24 Feb 2024
IRConStyle: Image Restoration Framework Using Contrastive Learning and Style Transfer
Dongqi Fan
Xin Zhao
Liang Chang
32
1
0
24 Feb 2024
Few-Shot Learning with Uncertainty-based Quadruplet Selection for Interference Classification in GNSS Data
Felix Ott
Lucas Heublein
N. Raichur
Tobias Feigl
Jonathan Hansen
A. Rügamer
Christopher Mutschler
24
7
0
09 Feb 2024
NOAH: Learning Pairwise Object Category Attentions for Image Classification
Chao Li
Aojun Zhou
Anbang Yao
VLM
35
2
0
04 Feb 2024
LIR: A Lightweight Baseline for Image Restoration
Dongqi Fan
Ting Yue
Xin Zhao
Renjing Xu
Liang Chang
30
0
0
02 Feb 2024
Multilinear Operator Networks
Yixin Cheng
Grigorios G. Chrysos
Markos Georgopoulos
V. Cevher
32
7
0
31 Jan 2024
LOCOST: State-Space Models for Long Document Abstractive Summarization
Florian Le Bronnec
Song Duong
Mathieu Ravaut
Alexandre Allauzen
Nancy F. Chen
Vincent Guigue
Alberto Lumbreras
Laure Soulier
Patrick Gallinari
45
8
0
31 Jan 2024
SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation
Zhaohu Xing
Tian-Chun Ye
Yijun Yang
Guang Liu
Lei Zhu
Mamba
42
192
0
24 Jan 2024
Transformers are Multi-State RNNs
Matanel Oren
Michael Hassid
Nir Yarden
Yossi Adi
Roy Schwartz
OffRL
32
35
0
11 Jan 2024
Efficient Image Deblurring Networks based on Diffusion Models
Kang Chen
Yuanjie Liu
DiffM
16
2
0
11 Jan 2024
Learning Generalizable Models via Disentangling Spurious and Enhancing Potential Correlations
Na Wang
Lei Qi
Jintao Guo
Yinghuan Shi
Yang Gao
OOD
32
4
0
11 Jan 2024
Setting the Record Straight on Transformer Oversmoothing
G. Dovonon
M. Bronstein
Matt J. Kusner
28
5
0
09 Jan 2024
Image Super-resolution Reconstruction Network based on Enhanced Swin Transformer via Alternating Aggregation of Local-Global Features
Yuming Huang
Yingpin Chen
Changhui Wu
Hanrong Xie
Binhui Song
Hui Wang
SupR
ViT
37
0
0
30 Dec 2023
SCHEME: Scalable Channel Mixer for Vision Transformers
Deepak Sridhar
Yunsheng Li
Nuno Vasconcelos
44
0
0
01 Dec 2023
Dimension Mixer: A Generalized Method for Structured Sparsity in Deep Neural Networks
Suman Sapkota
Binod Bhattarai
34
0
0
30 Nov 2023
Full-resolution MLPs Empower Medical Dense Prediction
Mingyuan Meng
Yuxin Xue
Da-wei Feng
Lei Bi
Jinman Kim
MedIm
21
4
0
28 Nov 2023
CMFDFormer: Transformer-based Copy-Move Forgery Detection with Continual Learning
Yaqi Liu
Chao Xia
Song Xiao
Qingxiao Guan
Wenqian Dong
Yifan Zhang
Neng H. Yu
35
3
0
22 Nov 2023
4K-Resolution Photo Exposure Correction at 125 FPS with ~8K Parameters
Yijie Zhou
Chao Li
Jin Liang
Tianyi Xu
Xin Liu
Jun Xu
3DV
26
10
0
15 Nov 2023
Two-Stage Aggregation with Dynamic Local Attention for Irregular Time Series
Xingyu Chen
Xiaochen Zheng
Amina Mollaysa
Manuel Schürch
Ahmed Allam
Michael Krauthammer
AI4TS
27
1
0
13 Nov 2023
Hierarchically Gated Recurrent Neural Network for Sequence Modeling
Zhen Qin
Songlin Yang
Yiran Zhong
36
74
0
08 Nov 2023
Scattering Vision Transformer: Spectral Mixing Matters
Badri N. Patro
Vijay Srinivas Agneeswaran
34
14
0
02 Nov 2023
Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model
Karsten Roth
Lukas Thede
Almut Sophia Koepke
Oriol Vinyals
Olivier J. Hénaff
Zeynep Akata
AAML
22
11
0
26 Oct 2023
Unraveling Feature Extraction Mechanisms in Neural Networks
Xiaobing Sun
Jiaxi Li
Wei Lu
18
0
0
25 Oct 2023
Handling Data Heterogeneity via Architectural Design for Federated Visual Recognition
Sara Pieri
Jose Renato Restom
Samuel Horvath
Hisham Cholakkal
FedML
19
8
0
23 Oct 2023
Exploring Driving Behavior for Autonomous Vehicles Based on Gramian Angular Field Vision Transformer
Junwei You
Ying Chen
Zhuoyu Jiang
Zhangchi Liu
Zilin Huang
Yifeng Ding
Bin Ran
16
0
0
21 Oct 2023
Attentive Multi-Layer Perceptron for Non-autoregressive Generation
Shuyang Jiang
Jinchao Zhang
Jiangtao Feng
Lin Zheng
Lingpeng Kong
54
0
0
14 Oct 2023
IFAST: Weakly Supervised Interpretable Face Anti-spoofing from Single-shot Binocular NIR Images
Jiancheng Huang
Donghao Zhou
Shifeng Chen
CVBM
39
2
0
29 Sep 2023
Auto-Regressive Next-Token Predictors are Universal Learners
Eran Malach
LRM
24
36
0
13 Sep 2023
Dynamic Spectrum Mixer for Visual Recognition
Zhiqiang Hu
Tao Yu
30
3
0
13 Sep 2023
Hindering Adversarial Attacks with Multiple Encrypted Patch Embeddings
AprilPyone Maungmaung
Isao Echizen
Hitoshi Kiya
AAML
28
2
0
04 Sep 2023
SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for Skeleton-based Action Recognition
Shaojie Zhang
Jianqin Yin
Yonghao Dang
Jiajun Fu
35
4
0
30 Aug 2023
CS-Mixer: A Cross-Scale Vision MLP Model with Spatial-Channel Mixing
Jianwei Cui
David A. Araujo
Suman Saha
Md Faisal Kabir
BDL
38
0
0
25 Aug 2023
SPANet: Frequency-balancing Token Mixer using Spectral Pooling Aggregation Modulation
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Dong Hwan Kim
MoE
30
8
0
22 Aug 2023
An Effective Transformer-based Contextual Model and Temporal Gate Pooling for Speaker Identification
Harunori Kawano
Sota Shimizu
30
1
0
22 Aug 2023
Attention Is Not All You Need Anymore
Zhe Chen
29
3
0
15 Aug 2023
Block-Wise Encryption for Reliable Vision Transformer models
Hitoshi Kiya
Ryota Iijima
Teru Nagamori
25
1
0
15 Aug 2023
Previous
1
2
3
4
5
6
7
Next