ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2105.08050
  4. Cited By
Pay Attention to MLPs

Pay Attention to MLPs

17 May 2021
Hanxiao Liu
Zihang Dai
David R. So
Quoc V. Le
    AI4CE
ArXivPDFHTML

Papers citing "Pay Attention to MLPs"

50 / 303 papers shown
Title
CU-Mamba: Selective State Space Models with Channel Learning for Image
  Restoration
CU-Mamba: Selective State Space Models with Channel Learning for Image Restoration
Rui Deng
Tianpei Gu
Mamba
42
16
0
17 Apr 2024
HGRN2: Gated Linear RNNs with State Expansion
HGRN2: Gated Linear RNNs with State Expansion
Zhen Qin
Songlin Yang
Weixuan Sun
Xuyang Shen
Dong Li
Weigao Sun
Yiran Zhong
LRM
47
47
0
11 Apr 2024
SpiralMLP: A Lightweight Vision MLP Architecture
SpiralMLP: A Lightweight Vision MLP Architecture
Haojie Mu
Burhan Ul Tayyab
Nicholas Chua
43
0
0
31 Mar 2024
A Survey on Large Language Models from Concept to Implementation
A Survey on Large Language Models from Concept to Implementation
Chen Wang
Jin Zhao
Jiaqi Gong
LLMAG
LM&MA
37
3
0
27 Mar 2024
Heracles: A Hybrid SSM-Transformer Model for High-Resolution Image and
  Time-Series Analysis
Heracles: A Hybrid SSM-Transformer Model for High-Resolution Image and Time-Series Analysis
Badri N. Patro
Suhas Ranganath
Vinay P. Namboodiri
Vijay Srinivas Agneeswaran
43
2
0
26 Mar 2024
Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUs
Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUs
Kai Yuan
Christoph Bauinger
Xiangyi Zhang
Pascal Baehr
Matthias Kirchhart
Darius Dabert
Adrien Tousnakhoff
Pierre Boudier
Michael Paulitsch
34
2
0
26 Mar 2024
Incorporating Exponential Smoothing into MLP: A Simple but Effective
  Sequence Model
Incorporating Exponential Smoothing into MLP: A Simple but Effective Sequence Model
Jiqun Chu
Zuoquan Lin
AI4TS
30
2
0
26 Mar 2024
Neural Clustering based Visual Representation Learning
Neural Clustering based Visual Representation Learning
Guikun Chen
Xia Li
Yi Yang
Wenguan Wang
SSL
37
8
0
26 Mar 2024
State Space Models as Foundation Models: A Control Theoretic Overview
State Space Models as Foundation Models: A Control Theoretic Overview
Carmen Amo Alonso
Jerome Sieber
M. Zeilinger
AI4CE
Mamba
36
13
0
25 Mar 2024
A Comparison of Deep Learning Architectures for Spacecraft Anomaly
  Detection
A Comparison of Deep Learning Architectures for Spacecraft Anomaly Detection
Daniel Lakey
Tim Schlippe
35
2
0
19 Mar 2024
depyf: Open the Opaque Box of PyTorch Compiler for Machine Learning
  Researchers
depyf: Open the Opaque Box of PyTorch Compiler for Machine Learning Researchers
Kaichao You
Runsheng Bai
Meng Cao
Jianmin Wang
Ion Stoica
Mingsheng Long
VLM
33
0
0
14 Mar 2024
NiNformer: A Network in Network Transformer with Token Mixing Generated
  Gating Function
NiNformer: A Network in Network Transformer with Token Mixing Generated Gating Function
Abdullah Nazhat Abdullah
Tarkan Aydin
39
0
0
04 Mar 2024
Accelerating Greedy Coordinate Gradient via Probe Sampling
Accelerating Greedy Coordinate Gradient via Probe Sampling
Yiran Zhao
Wenyue Zheng
Tianle Cai
Xuan Long Do
Kenji Kawaguchi
Anirudh Goyal
Michael Shieh
45
11
0
02 Mar 2024
Mixer is more than just a model
Mixer is more than just a model
Qingfeng Ji
Yuxin Wang
Letong Sun
40
0
0
28 Feb 2024
Learning to See Through Dazzle
Learning to See Through Dazzle
Xiaopeng Peng
Erin F. Fleet
A. Watnik
Grover A. Swartzlander
GAN
AAML
32
4
0
24 Feb 2024
IRConStyle: Image Restoration Framework Using Contrastive Learning and
  Style Transfer
IRConStyle: Image Restoration Framework Using Contrastive Learning and Style Transfer
Dongqi Fan
Xin Zhao
Liang Chang
32
1
0
24 Feb 2024
Few-Shot Learning with Uncertainty-based Quadruplet Selection for
  Interference Classification in GNSS Data
Few-Shot Learning with Uncertainty-based Quadruplet Selection for Interference Classification in GNSS Data
Felix Ott
Lucas Heublein
N. Raichur
Tobias Feigl
Jonathan Hansen
A. Rügamer
Christopher Mutschler
24
7
0
09 Feb 2024
NOAH: Learning Pairwise Object Category Attentions for Image
  Classification
NOAH: Learning Pairwise Object Category Attentions for Image Classification
Chao Li
Aojun Zhou
Anbang Yao
VLM
35
2
0
04 Feb 2024
LIR: A Lightweight Baseline for Image Restoration
LIR: A Lightweight Baseline for Image Restoration
Dongqi Fan
Ting Yue
Xin Zhao
Renjing Xu
Liang Chang
30
0
0
02 Feb 2024
Multilinear Operator Networks
Multilinear Operator Networks
Yixin Cheng
Grigorios G. Chrysos
Markos Georgopoulos
V. Cevher
32
7
0
31 Jan 2024
LOCOST: State-Space Models for Long Document Abstractive Summarization
LOCOST: State-Space Models for Long Document Abstractive Summarization
Florian Le Bronnec
Song Duong
Mathieu Ravaut
Alexandre Allauzen
Nancy F. Chen
Vincent Guigue
Alberto Lumbreras
Laure Soulier
Patrick Gallinari
45
8
0
31 Jan 2024
SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image
  Segmentation
SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation
Zhaohu Xing
Tian-Chun Ye
Yijun Yang
Guang Liu
Lei Zhu
Mamba
42
192
0
24 Jan 2024
Transformers are Multi-State RNNs
Transformers are Multi-State RNNs
Matanel Oren
Michael Hassid
Nir Yarden
Yossi Adi
Roy Schwartz
OffRL
32
35
0
11 Jan 2024
Efficient Image Deblurring Networks based on Diffusion Models
Efficient Image Deblurring Networks based on Diffusion Models
Kang Chen
Yuanjie Liu
DiffM
16
2
0
11 Jan 2024
Learning Generalizable Models via Disentangling Spurious and Enhancing
  Potential Correlations
Learning Generalizable Models via Disentangling Spurious and Enhancing Potential Correlations
Na Wang
Lei Qi
Jintao Guo
Yinghuan Shi
Yang Gao
OOD
32
4
0
11 Jan 2024
Setting the Record Straight on Transformer Oversmoothing
Setting the Record Straight on Transformer Oversmoothing
G. Dovonon
M. Bronstein
Matt J. Kusner
28
5
0
09 Jan 2024
Image Super-resolution Reconstruction Network based on Enhanced Swin
  Transformer via Alternating Aggregation of Local-Global Features
Image Super-resolution Reconstruction Network based on Enhanced Swin Transformer via Alternating Aggregation of Local-Global Features
Yuming Huang
Yingpin Chen
Changhui Wu
Hanrong Xie
Binhui Song
Hui Wang
SupR
ViT
37
0
0
30 Dec 2023
SCHEME: Scalable Channel Mixer for Vision Transformers
SCHEME: Scalable Channel Mixer for Vision Transformers
Deepak Sridhar
Yunsheng Li
Nuno Vasconcelos
44
0
0
01 Dec 2023
Dimension Mixer: A Generalized Method for Structured Sparsity in Deep
  Neural Networks
Dimension Mixer: A Generalized Method for Structured Sparsity in Deep Neural Networks
Suman Sapkota
Binod Bhattarai
34
0
0
30 Nov 2023
Full-resolution MLPs Empower Medical Dense Prediction
Full-resolution MLPs Empower Medical Dense Prediction
Mingyuan Meng
Yuxin Xue
Da-wei Feng
Lei Bi
Jinman Kim
MedIm
21
4
0
28 Nov 2023
CMFDFormer: Transformer-based Copy-Move Forgery Detection with Continual
  Learning
CMFDFormer: Transformer-based Copy-Move Forgery Detection with Continual Learning
Yaqi Liu
Chao Xia
Song Xiao
Qingxiao Guan
Wenqian Dong
Yifan Zhang
Neng H. Yu
35
3
0
22 Nov 2023
4K-Resolution Photo Exposure Correction at 125 FPS with ~8K Parameters
4K-Resolution Photo Exposure Correction at 125 FPS with ~8K Parameters
Yijie Zhou
Chao Li
Jin Liang
Tianyi Xu
Xin Liu
Jun Xu
3DV
26
10
0
15 Nov 2023
Two-Stage Aggregation with Dynamic Local Attention for Irregular Time
  Series
Two-Stage Aggregation with Dynamic Local Attention for Irregular Time Series
Xingyu Chen
Xiaochen Zheng
Amina Mollaysa
Manuel Schürch
Ahmed Allam
Michael Krauthammer
AI4TS
27
1
0
13 Nov 2023
Hierarchically Gated Recurrent Neural Network for Sequence Modeling
Hierarchically Gated Recurrent Neural Network for Sequence Modeling
Zhen Qin
Songlin Yang
Yiran Zhong
36
74
0
08 Nov 2023
Scattering Vision Transformer: Spectral Mixing Matters
Scattering Vision Transformer: Spectral Mixing Matters
Badri N. Patro
Vijay Srinivas Agneeswaran
34
14
0
02 Nov 2023
Fantastic Gains and Where to Find Them: On the Existence and Prospect of
  General Knowledge Transfer between Any Pretrained Model
Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model
Karsten Roth
Lukas Thede
Almut Sophia Koepke
Oriol Vinyals
Olivier J. Hénaff
Zeynep Akata
AAML
22
11
0
26 Oct 2023
Unraveling Feature Extraction Mechanisms in Neural Networks
Unraveling Feature Extraction Mechanisms in Neural Networks
Xiaobing Sun
Jiaxi Li
Wei Lu
18
0
0
25 Oct 2023
Handling Data Heterogeneity via Architectural Design for Federated
  Visual Recognition
Handling Data Heterogeneity via Architectural Design for Federated Visual Recognition
Sara Pieri
Jose Renato Restom
Samuel Horvath
Hisham Cholakkal
FedML
19
8
0
23 Oct 2023
Exploring Driving Behavior for Autonomous Vehicles Based on Gramian
  Angular Field Vision Transformer
Exploring Driving Behavior for Autonomous Vehicles Based on Gramian Angular Field Vision Transformer
Junwei You
Ying Chen
Zhuoyu Jiang
Zhangchi Liu
Zilin Huang
Yifeng Ding
Bin Ran
16
0
0
21 Oct 2023
Attentive Multi-Layer Perceptron for Non-autoregressive Generation
Attentive Multi-Layer Perceptron for Non-autoregressive Generation
Shuyang Jiang
Jinchao Zhang
Jiangtao Feng
Lin Zheng
Lingpeng Kong
54
0
0
14 Oct 2023
IFAST: Weakly Supervised Interpretable Face Anti-spoofing from
  Single-shot Binocular NIR Images
IFAST: Weakly Supervised Interpretable Face Anti-spoofing from Single-shot Binocular NIR Images
Jiancheng Huang
Donghao Zhou
Shifeng Chen
CVBM
39
2
0
29 Sep 2023
Auto-Regressive Next-Token Predictors are Universal Learners
Auto-Regressive Next-Token Predictors are Universal Learners
Eran Malach
LRM
24
36
0
13 Sep 2023
Dynamic Spectrum Mixer for Visual Recognition
Dynamic Spectrum Mixer for Visual Recognition
Zhiqiang Hu
Tao Yu
30
3
0
13 Sep 2023
Hindering Adversarial Attacks with Multiple Encrypted Patch Embeddings
Hindering Adversarial Attacks with Multiple Encrypted Patch Embeddings
AprilPyone Maungmaung
Isao Echizen
Hitoshi Kiya
AAML
28
2
0
04 Sep 2023
SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for
  Skeleton-based Action Recognition
SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for Skeleton-based Action Recognition
Shaojie Zhang
Jianqin Yin
Yonghao Dang
Jiajun Fu
35
4
0
30 Aug 2023
CS-Mixer: A Cross-Scale Vision MLP Model with Spatial-Channel Mixing
CS-Mixer: A Cross-Scale Vision MLP Model with Spatial-Channel Mixing
Jianwei Cui
David A. Araujo
Suman Saha
Md Faisal Kabir
BDL
38
0
0
25 Aug 2023
SPANet: Frequency-balancing Token Mixer using Spectral Pooling
  Aggregation Modulation
SPANet: Frequency-balancing Token Mixer using Spectral Pooling Aggregation Modulation
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Dong Hwan Kim
MoE
30
8
0
22 Aug 2023
An Effective Transformer-based Contextual Model and Temporal Gate
  Pooling for Speaker Identification
An Effective Transformer-based Contextual Model and Temporal Gate Pooling for Speaker Identification
Harunori Kawano
Sota Shimizu
30
1
0
22 Aug 2023
Attention Is Not All You Need Anymore
Attention Is Not All You Need Anymore
Zhe Chen
29
3
0
15 Aug 2023
Block-Wise Encryption for Reliable Vision Transformer models
Block-Wise Encryption for Reliable Vision Transformer models
Hitoshi Kiya
Ryota Iijima
Teru Nagamori
25
1
0
15 Aug 2023
Previous
1234567
Next