2404.09516
State Space Model for New-Generation Network Alternative to Transformers: A Survey
15 April 2024
Tianlin Li
Shiao Wang
Yuhe Ding
Yuehang Li
Wentao Wu
Yao Rong
Weizhe Kong
Ju Huang
Shihao Li
Haoxiang Yang
Ziwen Wang
Bowei Jiang
Chenglong Li
Yaowei Wang
Yonghong Tian
Jin Tang
Mamba
Papers citing
"State Space Model for New-Generation Network Alternative to Transformers: A Survey"
50 / 106 papers shown
Title
Robustifying State-space Models for Long Sequences via Approximate Diagonalization
Annan Yu
Arnur Nigmetov
Dmitriy Morozov
Michael W. Mahoney
N. Benjamin Erichson
68
14
0
02 Oct 2023
Spiking Structured State Space Model for Monaural Speech Enhancement
Yu Du
Xu Liu
Yansong Chua
62
19
0
07 Sep 2023
Gated recurrent neural networks discover attention
Nicolas Zucchet
Seijin Kobayashi
Yassir Akram
J. Oswald
Maxime Larcher
Angelika Steger
João Sacramento
57
8
0
04 Sep 2023
RWKV: Reinventing RNNs for the Transformer Era
Bo Peng
Eric Alcaide
Quentin G. Anthony
Alon Albalak
Samuel Arcadinho
...
Qihang Zhao
P. Zhou
Qinghua Zhou
Jian Zhu
Rui-Jie Zhu
212
594
0
22 May 2023
Dynamic Graph Enhanced Contrastive Learning for Chest X-ray Report Generation
Mingjie Li
Bingqian Lin
Zicong Chen
Haokun Lin
Xiaodan Liang
Xiaojun Chang
MedIm
59
116
0
18 Mar 2023
Resurrecting Recurrent Neural Networks for Long Sequences
Antonio Orvieto
Samuel L. Smith
Albert Gu
Anushan Fernando
Çağlar Gülçehre
Razvan Pascanu
Soham De
326
288
0
11 Mar 2023
Pretraining Without Attention
Junxiong Wang
J. Yan
Albert Gu
Alexander M. Rush
63
48
0
20 Dec 2022
Efficient Long Sequence Modeling via State Space Augmented Transformer
Simiao Zuo
Xiaodong Liu
Jian Jiao
Denis Xavier Charles
Eren Manavoglu
Tuo Zhao
Jianfeng Gao
169
37
0
15 Dec 2022
Revisiting Color-Event based Tracking: A Unified Network, Dataset, and Metric
Chuanming Tang
Tianlin Li
Ju Huang
Bowei Jiang
Lin Zhu
Jianlin Zhang
Yaowei Wang
Yonghong Tian
87
38
0
20 Nov 2022
TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation
Zhong-Qiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
97
103
0
08 Sep 2022
AiATrack: Attention in Attention for Transformer Visual Tracking
Shenyuan Gao
Chunluan Zhou
Chao Ma
Xing Wang
Junsong Yuan
ViT
74
230
0
20 Jul 2022
Long Range Language Modeling via Gated State Spaces
Harsh Mehta
Ankit Gupta
Ashok Cutkosky
Behnam Neyshabur
Mamba
91
242
0
27 Jun 2022
Competence-based Multimodal Curriculum Learning for Medical Report Generation
Fenglin Liu
Shen Ge
Yuexian Zou
Xian Wu
MedIm
59
139
0
24 Jun 2022
How to Train Your HiPPO: State Space Models with Generalized Orthogonal Basis Projections
Albert Gu
Isys Johnson
Aman Timalsina
Atri Rudra
Christopher Ré
Mamba
157
97
0
24 Jun 2022
On the Parameterization and Initialization of Diagonal State Space Models
Albert Gu
Ankit Gupta
Karan Goel
Christopher Ré
86
316
0
23 Jun 2022
Diagonal State Spaces are as Effective as Structured State Spaces
Ankit Gupta
Albert Gu
Jonathan Berant
116
306
0
27 Mar 2022
Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework
Botao Ye
Hong Chang
Bingpeng Ma
Shiguang Shan
Xilin Chen
ViT
87
472
0
22 Mar 2022
Transforming Model Prediction for Tracking
Christoph Mayer
Martin Danelljan
Goutam Bhat
M. Paul
D. Paudel
Feng Yu
Luc Van Gool
118
237
0
21 Mar 2022
MixFormer: End-to-End Tracking with Iterative Mixed Attention
Yutao Cui
Jiang Cheng
Limin Wang
Gangshan Wu
VOT
114
473
0
21 Mar 2022
Backbone is All Your Need: A Simplified Architecture for Visual Object Tracking
Boyu Chen
Peixia Li
Lei Bai
Leixian Qiao
Qiuhong Shen
Yue Liu
Weihao Gan
Wei Wu
Wanli Ouyang
ViT
VOT
67
193
0
10 Mar 2022
It's Raw! Audio Generation with State-Space Models
Karan Goel
Albert Gu
Chris Donahue
Christopher Ré
55
191
0
20 Feb 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
460
7,757
0
11 Nov 2021
Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey
Bonan Min
Hayley L Ross
Elior Sulem
Amir Pouran Ben Veyseh
Thien Huu Nguyen
Oscar Sainz
Eneko Agirre
Ilana Heinz
Dan Roth
LM&MA
VLM
AI4CE
138
1,074
0
01 Nov 2021
Spatial and Semantic Consistency Regularizations for Pedestrian Attribute Recognition
Jian Jia
Xiaotang Chen
Kaiqi Huang
64
64
0
13 Sep 2021
Combiner: Full Attention Transformer with Sparse Computation Cost
Hongyu Ren
H. Dai
Zihang Dai
Mengjiao Yang
J. Leskovec
Dale Schuurmans
Bo Dai
129
80
0
12 Jul 2021
ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Yu Sun
Shuohuan Wang
Shikun Feng
Siyu Ding
Chao Pang
...
Ouyang Xuan
Dianhai Yu
Hao Tian
Hua Wu
Haifeng Wang
97
470
0
05 Jul 2021
Exploring and Distilling Posterior and Prior Knowledge for Radiology Report Generation
Fenglin Liu
Xian Wu
Shen Ge
Wei Fan
Yuexian Zou
MedIm
81
257
0
13 Jun 2021
Large-Scale Spatio-Temporal Person Re-identification: Algorithms and Benchmark
Xiujun Shu
Tianlin Li
Xian Zhang
Shiliang Zhang
Yuanqi Chen
Gezhong Li
Q. Tian
AI4TS
67
73
0
31 May 2021
Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation
Hu Cao
Yueyue Wang
Jieneng Chen
Dongsheng Jiang
Xiaopeng Zhang
Qi Tian
Manning Wang
ViT
MedIm
129
2,903
0
12 May 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
447
21,418
0
25 Mar 2021
TransReID: Transformer-based Object Re-Identification
Shuting He
Haowen Luo
Pichao Wang
F. Wang
Hao Li
Wei Jiang
ViT
266
817
0
08 Feb 2021
Training data-efficient image transformers & distillation through attention
Hugo Touvron
Matthieu Cord
Matthijs Douze
Francisco Massa
Alexandre Sablayrolles
Hervé Jégou
ViT
384
6,768
0
23 Dec 2020
Generating Radiology Reports via Memory-driven Transformer
Zhihong Chen
Yan Song
Tsung-Hui Chang
Xiang Wan
MedIm
65
479
0
30 Oct 2020
Attention is All You Need in Speech Separation
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Mirko Bronzi
Jianyuan Zhong
95
557
0
25 Oct 2020
Orientation-aware Vehicle Re-identification with Semantics-guided Part Attention Network
Tsai-Shien Chen
Chih-Ting Liu
Chih-Wei Wu
Shao-Yi Chien
3DPC
217
85
0
26 Aug 2020
HiPPO: Recurrent Memory with Optimal Polynomial Projections
Albert Gu
Tri Dao
Stefano Ermon
Atri Rudra
Christopher Ré
112
519
0
17 Aug 2020
Identity-Guided Human Semantic Parsing for Person Re-Identification
Kuan Zhu
Haiyun Guo
Zhiwei Liu
Ming Tang
Jinqiao Wang
251
289
0
27 Jul 2020
Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention
Angelos Katharopoulos
Apoorv Vyas
Nikolaos Pappas
François Fleuret
201
1,765
0
29 Jun 2020
Parsing-based View-aware Embedding Network for Vehicle Re-Identification
Dechao Meng
Liang Li
Xuejing Liu
Yadong Li
Shijie Yang
Zhengjun Zha
Xingyu Gao
Shuhui Wang
Qingming Huang
100
176
0
10 Apr 2020
Probabilistic Regression for Visual Tracking
Martin Danelljan
Luc Van Gool
Radu Timofte
BDL
82
520
0
27 Mar 2020
Know Your Surroundings: Exploiting Scene Information for Object Tracking
Goutam Bhat
Martin Danelljan
Luc Van Gool
Radu Timofte
60
316
0
24 Mar 2020
Stripe-based and Attribute-aware Network: A Two-Branch Deep Model for Vehicle Re-identification
Jingjing Qian
Wei Jiang
Hao Luo
Hongyan Yu
57
102
0
12 Oct 2019
Vehicle Re-identification with Viewpoint-aware Metric Learning
Ruihang Chu
Yifan Sun
Yadong Li
Ziwei Liu
Chi Zhang
Yichen Wei
93
172
0
09 Oct 2019
ABD-Net: Attentive but Diverse Person Re-Identification
Tianlong Chen
Shaojin Ding
Jingyi Xie
Ye Yuan
Wuyang Chen
Yang
Zhou Ren
Zhangyang Wang
95
480
0
03 Aug 2019
Semantics-Aligned Representation Learning for Person Re-identification
Xin Jin
Cuiling Lan
Wenjun Zeng
Guoqiang Wei
Zhibo Chen
68
140
0
30 May 2019
Omni-Scale Feature Learning for Person Re-Identification
Kaiyang Zhou
Yongxin Yang
Andrea Cavallaro
Tao Xiang
79
831
0
02 May 2019
Learning Discriminative Model Prediction for Tracking
Goutam Bhat
Martin Danelljan
Luc Van Gool
Radu Timofte
131
1,038
0
15 Apr 2019
Knowledge-driven Encode, Retrieve, Paraphrase for Medical Image Report Generation
Yuan Li
Xiaodan Liang
Zhiting Hu
Eric Xing
MedIm
57
271
0
25 Mar 2019
Bag of Tricks and A Strong Baseline for Deep Person Re-identification
Hao Luo
Youzhi Gu
Xingyu Liao
Shenqi Lai
Wei Jiang
BDL
3DPC
136
1,176
0
17 Mar 2019
A Comprehensive Survey on Graph Neural Networks
Zonghan Wu
Shirui Pan
Fengwen Chen
Guodong Long
Chengqi Zhang
Philip S. Yu
FaML
GNN
AI4TS
AI4CE
764
8,533
0
03 Jan 2019