ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2206.13947
  4. Cited By
Long Range Language Modeling via Gated State Spaces

Long Range Language Modeling via Gated State Spaces

27 June 2022
Harsh Mehta
Ankit Gupta
Ashok Cutkosky
Behnam Neyshabur
    Mamba
ArXivPDFHTML

Papers citing "Long Range Language Modeling via Gated State Spaces"

50 / 51 papers shown
Title
Efficient Unstructured Pruning of Mamba State-Space Models for Resource-Constrained Environments
Efficient Unstructured Pruning of Mamba State-Space Models for Resource-Constrained Environments
Ibne Farabi Shihab
Sanjeda Akter
Anuj Sharma
Mamba
48
0
0
13 May 2025
Image Recognition with Online Lightweight Vision Transformer: A Survey
Image Recognition with Online Lightweight Vision Transformer: A Survey
Zherui Zhang
Rongtao Xu
Jie Zhou
Changwei Wang
Xingtian Pei
...
Jiguang Zhang
Li Guo
Longxiang Gao
Wenyuan Xu
Shibiao Xu
ViT
148
0
0
06 May 2025
SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations
SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations
Shuting Zhao
Linxin Bai
Liangjing Shao
Ye Zhang
Xinrong Chen
26
0
0
25 Apr 2025
Simultaneous Learning of Optimal Transports for Training All-to-All Flow-Based Condition Transfer Model
Simultaneous Learning of Optimal Transports for Training All-to-All Flow-Based Condition Transfer Model
Kotaro Ikeda
Masanori Koyama
Jinzhe Zhang
Kohei Hayashi
Kenji Fukumizu
OT
142
0
0
04 Apr 2025
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation
Fa-Ting Hong
Zunnan Xu
Zixiang Zhou
Zhiqiang Zhang
Xiu Li
Qin Lin
Qinglin Lu
D. Xu
DiffM
VGen
57
2
0
03 Apr 2025
EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction
EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction
Chengjie Ge
Xueyang Fu
Peng He
Kunyu Wang
Chengzhi Cao
Zheng-jun Zha
Mamba
46
0
0
25 Mar 2025
From S4 to Mamba: A Comprehensive Survey on Structured State Space Models
From S4 to Mamba: A Comprehensive Survey on Structured State Space Models
Shriyank Somvanshi
Md Monzurul Islam
Mahmuda Sultana Mimi
Sazzad Bin Bashar Polock
Gaurab Chhetri
Subasish Das
Mamba
AI4TS
45
0
0
22 Mar 2025
MambaIC: State Space Models for High-Performance Learned Image Compression
MambaIC: State Space Models for High-Performance Learned Image Compression
Fanhu Zeng
Hao Tang
Yihua Shao
Siyu Chen
Ling Shao
Yan Wang
Mamba
149
0
0
16 Mar 2025
BioMamba: Leveraging Spectro-Temporal Embedding in Bidirectional Mamba for Enhanced Biosignal Classification
BioMamba: Leveraging Spectro-Temporal Embedding in Bidirectional Mamba for Enhanced Biosignal Classification
Jian Qian
Teck Lun Goh
Bingyu Xie
Chengyao Zhu
Biao Wan
Yawen Guan
Rachel Ding Chen
Patrick Chiang
Mamba
47
0
0
14 Mar 2025
Future-Aware Interaction Network For Motion Forecasting
Shijie Li
Xun Xu
S. Yeo
Xulei Yang
Mamba
196
0
0
09 Mar 2025
Spectral State Space Model for Rotation-Invariant Visual Representation Learning
Spectral State Space Model for Rotation-Invariant Visual Representation Learning
Sahar Dastani
Ali Bahri
Moslem Yazdanpanah
Mehrdad Noori
David Osowiechi
...
Farzad Beizaee
Milad Cheraghalikhani
A. Mondal
H. Lombaert
Christian Desrosiers
55
0
0
09 Mar 2025
A Reverse Mamba Attention Network for Pathological Liver Segmentation
A Reverse Mamba Attention Network for Pathological Liver Segmentation
Jun Zeng
Debesh Jha
Ertugrul Aktas
Elif Keles
A. Medetalibeyoğlu
...
Robert Lewandowski
Daniela Ladner
Amir Borhani
Gorkem Durak
Ulas Bagci
Mamba
54
1
0
23 Feb 2025
CardiacMamba: A Multimodal RGB-RF Fusion Framework with State Space Models for Remote Physiological Measurement
CardiacMamba: A Multimodal RGB-RF Fusion Framework with State Space Models for Remote Physiological Measurement
Zheng Wu
Yiping Xie
Bo Zhao
Jiguang He
Fei Luo
Ning Deng
Zitong Yu
Mamba
44
1
0
20 Feb 2025
State-space models are accurate and efficient neural operators for dynamical systems
State-space models are accurate and efficient neural operators for dynamical systems
Zheyuan Hu
Nazanin Ahmadi Daryakenari
Qianli Shen
Kenji Kawaguchi
George Karniadakis
Mamba
AI4CE
67
11
0
28 Jan 2025
MambaHSI: Spatial-Spectral Mamba for Hyperspectral Image Classification
MambaHSI: Spatial-Spectral Mamba for Hyperspectral Image Classification
Yapeng Li
Yong Luo
L. Zhang
Zengmao Wang
Bo Du
Mamba
58
59
0
10 Jan 2025
VMamba: Visual State Space Model
VMamba: Visual State Space Model
Yue Liu
Yunjie Tian
Yuzhong Zhao
Hongtian Yu
Lingxi Xie
Yaowei Wang
Qixiang Ye
Jianbin Jiao
Yunfan Liu
Mamba
152
612
0
31 Dec 2024
FACTS: A Factored State-Space Framework For World Modelling
FACTS: A Factored State-Space Framework For World Modelling
Li Nanbo
Firas Laakom
Yucheng Xu
Wenyi Wang
Jürgen Schmidhuber
AI4TS
151
0
0
28 Oct 2024
HASN: Hybrid Attention Separable Network for Efficient Image
  Super-resolution
HASN: Hybrid Attention Separable Network for Efficient Image Super-resolution
Weifeng Cao
Xiaoyan Lei
Jun Shi
Wanyong Liang
Jie Liu
Zongfei Bai
SupR
29
0
0
13 Oct 2024
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Jinhao Li
Jiaming Xu
Shan Huang
Yonghua Chen
Wen Li
...
Jiayi Pan
Li Ding
Hao Zhou
Yu Wang
Guohao Dai
62
16
0
06 Oct 2024
SkinMamba: A Precision Skin Lesion Segmentation Architecture with
  Cross-Scale Global State Modeling and Frequency Boundary Guidance
SkinMamba: A Precision Skin Lesion Segmentation Architecture with Cross-Scale Global State Modeling and Frequency Boundary Guidance
Shun Zou
Mingya Zhang
Bingjian Fan
Zhengyi Zhou
Xiuguo Zou
Mamba
29
3
0
17 Sep 2024
Shuffle Mamba: State Space Models with Random Shuffle for Multi-Modal
  Image Fusion
Shuffle Mamba: State Space Models with Random Shuffle for Multi-Modal Image Fusion
Ke Cao
Xuanhua He
Tao Hu
Chengjun Xie
Jie Zhang
Man Zhou
Danfeng Hong
Mamba
34
3
0
03 Sep 2024
Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models
Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models
Aviv Bick
Kevin Y. Li
Eric P. Xing
J. Zico Kolter
Albert Gu
Mamba
53
24
0
19 Aug 2024
DeciMamba: Exploring the Length Extrapolation Potential of Mamba
DeciMamba: Exploring the Length Extrapolation Potential of Mamba
Assaf Ben-Kish
Itamar Zimerman
Shady Abu Hussein
Nadav Cohen
Amir Globerson
Lior Wolf
Raja Giryes
Mamba
77
13
0
20 Jun 2024
MambaLRP: Explaining Selective State Space Sequence Models
MambaLRP: Explaining Selective State Space Sequence Models
F. Jafari
G. Montavon
Klaus-Robert Müller
Oliver Eberle
Mamba
59
9
0
11 Jun 2024
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
Liliang Ren
Yang Liu
Yadong Lu
Yelong Shen
Chen Liang
Weizhu Chen
Mamba
74
56
0
11 Jun 2024
LCM: Locally Constrained Compact Point Cloud Model for Masked Point
  Modeling
LCM: Locally Constrained Compact Point Cloud Model for Masked Point Modeling
Yaohua Zha
Naiqi Li
Yanzi Wang
Tao Dai
Hang Guo
Bin Chen
Zhi Wang
Zhihao Ouyang
Shu-Tao Xia
Mamba
42
8
0
27 May 2024
MambaTS: Improved Selective State Space Models for Long-term Time Series
  Forecasting
MambaTS: Improved Selective State Space Models for Long-term Time Series Forecasting
Xiuding Cai
Yaoyao Zhu
Xueyao Wang
Yu Yao
Mamba
AI4TS
38
7
0
26 May 2024
3DSS-Mamba: 3D-Spectral-Spatial Mamba for Hyperspectral Image
  Classification
3DSS-Mamba: 3D-Spectral-Spatial Mamba for Hyperspectral Image Classification
Yan He
Bing Tu
Bo Liu
Jun Li
Antonio J. Plaza
Mamba
33
24
0
21 May 2024
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection
Haoyang He
Yuhu Bai
Jiangning Zhang
Qingdong He
Hongxu Chen
Zhenye Gan
Chengjie Wang
Xiangtai Li
Guanzhong Tian
Lei Xie
Mamba
66
35
0
09 Apr 2024
Learning Enriched Features via Selective State Spaces Model for
  Efficient Image Deblurring
Learning Enriched Features via Selective State Spaces Model for Efficient Image Deblurring
Huiyu Gao
Depeng Dang
40
13
0
29 Mar 2024
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models
Zunnan Xu
Yukang Lin
Haonan Han
Sicheng Yang
Ronghui Li
Yachao Zhang
Xiu Li
Mamba
46
25
0
14 Mar 2024
Activating Wider Areas in Image Super-Resolution
Activating Wider Areas in Image Super-Resolution
Cheng Cheng
Hang Wang
Hongbin Sun
37
10
0
13 Mar 2024
Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling
Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling
Mahdi Karami
Ali Ghodsi
VLM
45
6
0
28 Feb 2024
Investigating Recurrent Transformers with Dynamic Halt
Investigating Recurrent Transformers with Dynamic Halt
Jishnu Ray Chowdhury
Cornelia Caragea
39
1
0
01 Feb 2024
Efficiency-oriented approaches for self-supervised speech representation
  learning
Efficiency-oriented approaches for self-supervised speech representation learning
Luis Lugo
Valentin Vielzeuf
SSL
26
1
0
18 Dec 2023
StableSSM: Alleviating the Curse of Memory in State-space Models through
  Stable Reparameterization
StableSSM: Alleviating the Curse of Memory in State-space Models through Stable Reparameterization
Shida Wang
Qianxiao Li
22
13
0
24 Nov 2023
Hieros: Hierarchical Imagination on Structured State Space Sequence
  World Models
Hieros: Hierarchical Imagination on Structured State Space Sequence World Models
Paul Mattes
Rainer Schlosser
R. Herbrich
21
4
0
08 Oct 2023
Transformer-VQ: Linear-Time Transformers via Vector Quantization
Transformer-VQ: Linear-Time Transformers via Vector Quantization
Albert Mohwald
28
15
0
28 Sep 2023
Long-range Language Modeling with Self-retrieval
Long-range Language Modeling with Self-retrieval
Ohad Rubin
Jonathan Berant
RALM
KELM
19
18
0
23 Jun 2023
Plug-and-Play Document Modules for Pre-trained Models
Plug-and-Play Document Modules for Pre-trained Models
Chaojun Xiao
Zhengyan Zhang
Xu Han
Chi-Min Chan
Yankai Lin
Zhiyuan Liu
Xiangyang Li
Zhonghua Li
Zhao Cao
Maosong Sun
KELM
22
5
0
28 May 2023
Focus Your Attention (with Adaptive IIR Filters)
Focus Your Attention (with Adaptive IIR Filters)
Shahar Lutati
Itamar Zimerman
Lior Wolf
32
9
0
24 May 2023
Dialogue-Contextualized Re-ranking for Medical History-Taking
Dialogue-Contextualized Re-ranking for Medical History-Taking
Jian Zhu
Ilya Valmianski
Anitha Kannan
19
1
0
04 Apr 2023
Simple Hardware-Efficient Long Convolutions for Sequence Modeling
Simple Hardware-Efficient Long Convolutions for Sequence Modeling
Daniel Y. Fu
Elliot L. Epstein
Eric N. D. Nguyen
A. Thomas
Michael Zhang
Tri Dao
Atri Rudra
Christopher Ré
16
52
0
13 Feb 2023
Hungry Hungry Hippos: Towards Language Modeling with State Space Models
Hungry Hungry Hippos: Towards Language Modeling with State Space Models
Daniel Y. Fu
Tri Dao
Khaled Kamal Saab
A. Thomas
Atri Rudra
Christopher Ré
73
370
0
28 Dec 2022
Pretraining Without Attention
Pretraining Without Attention
Junxiong Wang
J. Yan
Albert Gu
Alexander M. Rush
27
48
0
20 Dec 2022
Mega: Moving Average Equipped Gated Attention
Mega: Moving Average Equipped Gated Attention
Xuezhe Ma
Chunting Zhou
Xiang Kong
Junxian He
Liangke Gui
Graham Neubig
Jonathan May
Luke Zettlemoyer
14
183
0
21 Sep 2022
Efficient Methods for Natural Language Processing: A Survey
Efficient Methods for Natural Language Processing: A Survey
Marcos Vinícius Treviso
Ji-Ung Lee
Tianchu Ji
Betty van Aken
Qingqing Cao
...
Emma Strubell
Niranjan Balasubramanian
Leon Derczynski
Iryna Gurevych
Roy Schwartz
28
109
0
31 Aug 2022
Efficient Long-Text Understanding with Short-Text Models
Efficient Long-Text Understanding with Short-Text Models
Maor Ivgi
Uri Shaham
Jonathan Berant
VLM
22
75
0
01 Aug 2022
Transformer Quality in Linear Time
Transformer Quality in Linear Time
Weizhe Hua
Zihang Dai
Hanxiao Liu
Quoc V. Le
78
222
0
21 Feb 2022
Big Bird: Transformers for Longer Sequences
Big Bird: Transformers for Longer Sequences
Manzil Zaheer
Guru Guruganesh
Kumar Avinava Dubey
Joshua Ainslie
Chris Alberti
...
Philip Pham
Anirudh Ravula
Qifan Wang
Li Yang
Amr Ahmed
VLM
280
2,015
0
28 Jul 2020
12
Next