Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2105.01601
Cited By
MLP-Mixer: An all-MLP Architecture for Vision
4 May 2021
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
Thomas Unterthiner
Jessica Yung
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MLP-Mixer: An all-MLP Architecture for Vision"
50 / 1,119 papers shown
Title
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer
Young-Hu Park
R.-H. Park
Hyung-Min Park
49
0
0
07 May 2025
Image Recognition with Online Lightweight Vision Transformer: A Survey
Zherui Zhang
Rongtao Xu
Jie Zhou
Changwei Wang
Xingtian Pei
...
Jiguang Zhang
Li Guo
Longxiang Gao
W. Xu
Shibiao Xu
ViT
113
0
0
06 May 2025
Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook
Muyi Bao
Shuchang Lyu
Zhaoyang Xu
Huiyu Zhou
Jinchang Ren
Shiming Xiang
X. Li
Guangliang Cheng
Mamba
77
0
0
01 May 2025
A Survey on Parameter-Efficient Fine-Tuning for Foundation Models in Federated Learning
Jieming Bian
Yuanzhe Peng
Lei Wang
Yin Huang
Jie Xu
FedML
60
0
0
29 Apr 2025
Unsupervised 2D-3D lifting of non-rigid objects using local constraints
Shalini Maiti
Lourdes Agapito
Benjamin Graham
110
0
0
27 Apr 2025
A Spatially-Aware Multiple Instance Learning Framework for Digital Pathology
H. Keshvarikhojasteh
Mihail Tifrea
Sibylle Hess
J. Pluim
M. Veta
49
0
0
24 Apr 2025
QuantBench: Benchmarking AI Methods for Quantitative Investment
Saizhuo Wang
Hao Kong
Jiadong Guo
Fengrui Hua
Yiyan Qi
Wanyun Zhou
Jiahao Zheng
Xinyu Wang
Lionel M. Ni
Jian Guo
118
1
0
24 Apr 2025
Seurat: From Moving Points to Depth
Seokju Cho
Jiahui Huang
S. Kim
Joon-Young Lee
3DPC
MDE
29
0
0
20 Apr 2025
Vision-Centric Representation-Efficient Fine-Tuning for Robust Universal Foreground Segmentation
Guoyi Zhang
Siyang Chen
Guangsheng Xu
Han Wang
Xiaohu Zhang
29
0
0
20 Apr 2025
GFT: Gradient Focal Transformer
Boris Kriuk
Simranjit Kaur Gill
Shoaib Aslam
Amir Fakhrutdinov
31
0
0
14 Apr 2025
Exploring Synergistic Ensemble Learning: Uniting CNNs, MLP-Mixers, and Vision Transformers to Enhance Image Classification
Mk Bashar
Ocean Monjur
Samia Islam
Mohammad Galib Shams
Niamul Quader
UQCV
29
0
0
12 Apr 2025
FLASH: Flexible Learning of Adaptive Sampling from History in Temporal Graph Neural Networks
Or Feldman
Krishna Sri Ipsit Mantri
Carola-Bibiane Schönlieb
Chaim Baskin
Moshe Eliasof
AI4TS
24
0
0
09 Apr 2025
CAT: Circular-Convolutional Attention for Sub-Quadratic Transformers
Yoshihiro Yamada
ViT
21
0
0
09 Apr 2025
TabKAN: Advancing Tabular Data Analysis using Kolmogorov-Arnold Network
Ali Eslamian
Alireza Afzal Aghaei
Qiang Cheng
LMTD
73
0
0
09 Apr 2025
Intermediate Layer Classifiers for OOD generalization
Arnas Uselis
Seong Joon Oh
OOD
49
0
0
07 Apr 2025
Window Token Concatenation for Efficient Visual Large Language Models
Yifan Li
Wentao Bao
Botao Ye
Zhen Tan
Tianlong Chen
Huan Liu
Yu Kong
VLM
41
0
0
05 Apr 2025
EMF: Event Meta Formers for Event-based Real-time Traffic Object Detection
Muhammad Ahmed Ullah Khan
Abdul Hannan Khan
Andreas Dengel
35
0
0
05 Apr 2025
Detecting underdetermination in parameterized quantum circuits
Marie Kempkes
Jakob Spiegelberg
Evert van Nieuwenburg
Vedran Dunjko
34
0
0
04 Apr 2025
Multi-Granularity Vision Fastformer with Fusion Mechanism for Skin Lesion Segmentation
Xuanyu Liu
Huiyun Yao
Jinggui Gao
Zhongyi Guo
Xue Zhang
Yulin Dong
ViT
MedIm
41
0
0
04 Apr 2025
GECKO: Gigapixel Vision-Concept Contrastive Pretraining in Histopathology
S. Kapse
Pushpak Pati
Srikar Yellapragada
Srijan Das
Rajarsi R. Gupta
Joel H. Saltz
Dimitris Samaras
Prateek Prasanna
VLM
41
0
0
01 Apr 2025
Simple yet Effective Node Property Prediction on Edge Streams under Distribution Shifts
Jongha Lee
Taehyung Kwon
Heechan Moon
Kijung Shin
AI4TS
41
0
0
01 Apr 2025
EMForecaster: A Deep Learning Framework for Time Series Forecasting in Wireless Networks with Distribution-Free Uncertainty Quantification
Xavier Mootoo
Hina Tabassum
Luca Chiaraviglio
AI4TS
31
0
0
31 Mar 2025
Spectral-Adaptive Modulation Networks for Visual Perception
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Paul Hongsuck Seo
Dong Hwan Kim
37
0
0
31 Mar 2025
HiPART: Hierarchical Pose AutoRegressive Transformer for Occluded 3D Human Pose Estimation
Hongwei Zheng
Han Li
Wenrui Dai
Ziyang Zheng
Chenglin Li
Junni Zou
Hongkai Xiong
3DH
55
0
0
30 Mar 2025
LSNet: See Large, Focus Small
Ao Wang
Hui Chen
Zijia Lin
J. Han
Guiguang Ding
37
0
0
29 Mar 2025
A Semantic-Enhanced Heterogeneous Graph Learning Method for Flexible Objects Recognition
Kunshan Yang
Wenwei Luo
Yuguo Hu
Jiafu Yan
Mengmeng Jing
Lin Zuo
31
0
0
28 Mar 2025
Enabling Heterogeneous Adversarial Transferability via Feature Permutation Attacks
Tao Wu
Tie Luo
AAML
84
0
0
26 Mar 2025
ModalTune: Fine-Tuning Slide-Level Foundation Models with Multi-Modal Information for Multi-task Learning in Digital Pathology
Vishwesh Ramanathan
Tony Xu
Pushpak Pati
Faruk Ahmed
Maged Goubran
Anne L. Martel
43
0
0
21 Mar 2025
NdLinear Is All You Need for Representation Learning
Alex Reneau
Jerry Yao-Chieh Hu
Zhongfang Zhuang
Ting-Chun Liu
HAI
39
0
0
21 Mar 2025
No Thing, Nothing: Highlighting Safety-Critical Classes for Robust LiDAR Semantic Segmentation in Adverse Weather
J. Park
Hwijeong Lee
Inha Kang
Hyunjung Shim
54
0
0
20 Mar 2025
MeshFleet: Filtered and Annotated 3D Vehicle Dataset for Domain Specific Generative Modeling
Damian Boborzi
Phillip Mueller
Jonas Emrich
Dominik Schmid
Sebastian Mueller
Lars Mikelsons
DiffM
67
0
0
18 Mar 2025
Changing Base Without Losing Pace: A GPU-Efficient Alternative to MatMul in DNNs
Nir Ailon
Akhiad Bercovich
Omri Weinstein
52
0
0
15 Mar 2025
Transformers without Normalization
Jiachen Zhu
Xinlei Chen
Kaiming He
Yann LeCun
Zhuang Liu
ViT
OffRL
51
7
0
13 Mar 2025
Lightweight Models for Emotional Analysis in Video
Quoc-Tien Nguyen
H. Nguyen
V. Huynh
43
0
0
13 Mar 2025
A Multimodal Fusion Model Leveraging MLP Mixer and Handcrafted Features-based Deep Learning Networks for Facial Palsy Detection
Heng Yim Nicole Oo
Min Hun Lee
Jeong Hoon Lim
CVBM
56
0
0
13 Mar 2025
HOTFormerLoc: Hierarchical Octree Transformer for Versatile Lidar Place Recognition Across Ground and Aerial Views
Ethan Griffiths
Maryam Haghighat
Simon Denman
Clinton Fookes
Milad Ramezani
3DPC
57
0
0
11 Mar 2025
JamMa: Ultra-lightweight Local Feature Matching with Joint Mamba
Xiaoyong Lu
Songlin Du
Mamba
68
0
0
05 Mar 2025
Unify and Anchor: A Context-Aware Transformer for Cross-Domain Time Series Forecasting
Xiaobin Hong
J. Zhang
Wenzhong Li
Sanglu Lu
J. Li
AI4TS
63
0
0
03 Mar 2025
Robust and Efficient Writer-Independent IMU-Based Handwriting Recognization
Jindong Li
Tim Hamann
Jens Barth
Peter Kaempf
Dario Zanca
Bjoern M. Eskofier
36
0
0
28 Feb 2025
SciceVPR: Stable Cross-Image Correlation Enhanced Model for Visual Place Recognition
Shanshan Wan
Yingmei Wei
Lai Kang
Tianrui Shen
Haixuan Wang
Yee-Hong Yang
41
0
0
28 Feb 2025
Revisit the Stability of Vanilla Federated Learning Under Diverse Conditions
Youngjoon Lee
J. Gong
Sun Choi
Joonhyuk Kang
FedML
Presented at
ResearchTrend Connect | FedML
on
23 Apr 2025
112
1
0
27 Feb 2025
The FFT Strikes Again: An Efficient Alternative to Self-Attention
Jacob Fein-Ashley
R. Kannan
Viktor Prasanna
63
1
0
25 Feb 2025
TSKANMixer: Kolmogorov-Arnold Networks with MLP-Mixer Model for Time Series Forecasting
Young-Chae Hong
Bei Xiao
Yangho Chen
AI4TS
61
0
0
25 Feb 2025
A Survey of fMRI to Image Reconstruction
Weiyu Guo
Guoying Sun
JianXiang He
Tong Shao
Shaoguang Wang
Ziyang Chen
Meisheng Hong
Ying Sun
Hui Xiong
38
1
0
24 Feb 2025
MindLLM: A Subject-Agnostic and Versatile Model for fMRI-to-Text Decoding
Weikang Qiu
Zheng Huang
Haoyu Hu
Aosong Feng
Yujun Yan
Rex Ying
43
0
0
18 Feb 2025
VRoPE: Rotary Position Embedding for Video Large Language Models
Zikang Liu
Longteng Guo
Yepeng Tang
Junxian Cai
Kai Ma
Xi Chen
J. Liu
49
0
0
17 Feb 2025
TLOB: A Novel Transformer Model with Dual Attention for Price Trend Prediction with Limit Order Book Data
Leonardo Berti
Gjergji Kasneci
AI4TS
40
0
0
12 Feb 2025
Multi-Level Decoupled Relational Distillation for Heterogeneous Architectures
Yaoxin Yang
Peng Ye
Weihao Lin
Kangcong Li
Yan Wen
Jia Hao
Tao Chen
33
0
0
10 Feb 2025
Exploiting Ensemble Learning for Cross-View Isolated Sign Language Recognition
Fei Wang
Kun Li
Yiqi Nie
Zhangling Duan
Peng Zou
Z. Wu
Y. Wang
Yanyan Wei
SLR
62
1
0
04 Feb 2025
Towards Fast Graph Generation via Autoregressive Noisy Filtration Modeling
Markus Krimmel
Jenna Wiens
Karsten M. Borgwardt
Dexiong Chen
91
0
0
04 Feb 2025
1
2
3
4
...
21
22
23
Next