ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2401.04081
  4. Cited By
MoE-Mamba: Efficient Selective State Space Models with Mixture of
  Experts

MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts

8 January 2024
Maciej Pióro
Kamil Ciebiera
Krystian Król
Jan Ludziejewski
Michał Krutul
Jakub Krajewski
Szymon Antoniak
Piotr Miłoś
Marek Cygan
Sebastian Jaszczur
    MoE
    Mamba
ArXivPDFHTML

Papers citing "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts"

11 / 11 papers shown
Title
BioMamba: Leveraging Spectro-Temporal Embedding in Bidirectional Mamba for Enhanced Biosignal Classification
BioMamba: Leveraging Spectro-Temporal Embedding in Bidirectional Mamba for Enhanced Biosignal Classification
Jian Qian
Teck Lun Goh
Bingyu Xie
Chengyao Zhu
Biao Wan
Yawen Guan
Rachel Ding Chen
Patrick Chiang
Mamba
50
0
0
14 Mar 2025
Spectral Informed Mamba for Robust Point Cloud Processing
Spectral Informed Mamba for Robust Point Cloud Processing
Ali Bahri
Moslem Yazdanpanah
Mehrdad Noori
Sahar Dastani
Milad Cheraghalikhani
David Osowiechi
G. A. V. Hakim
Farzad Beizaee
Ismail ben Ayed
Christian Desrosiers
Mamba
3DPC
76
0
0
06 Mar 2025
Mambular: A Sequential Model for Tabular Deep Learning
Mambular: A Sequential Model for Tabular Deep Learning
Anton Thielmann
Manish Kumar
Christoph Weisser
Arik Reuter
Benjamin Säfken
Soheila Samiee
Mamba
LMTD
76
6
0
12 Aug 2024
DeciMamba: Exploring the Length Extrapolation Potential of Mamba
DeciMamba: Exploring the Length Extrapolation Potential of Mamba
Assaf Ben-Kish
Itamar Zimerman
Shady Abu Hussein
Nadav Cohen
Amir Globerson
Lior Wolf
Raja Giryes
Mamba
77
13
0
20 Jun 2024
MambaLRP: Explaining Selective State Space Sequence Models
MambaLRP: Explaining Selective State Space Sequence Models
F. Jafari
G. Montavon
Klaus-Robert Müller
Oliver Eberle
Mamba
62
9
0
11 Jun 2024
Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL
Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL
Qi Lv
Xiang Deng
Gongwei Chen
Michael Yu Wang
Liqiang Nie
75
7
0
08 Jun 2024
CMViM: Contrastive Masked Vim Autoencoder for 3D Multi-modal
  Representation Learning for AD classification
CMViM: Contrastive Masked Vim Autoencoder for 3D Multi-modal Representation Learning for AD classification
Guangqian Yang
Kangrui Du
Zhihan Yang
Ye Du
Yongping Zheng
Shujun Wang
42
16
0
25 Mar 2024
Scaling Laws for Fine-Grained Mixture of Experts
Scaling Laws for Fine-Grained Mixture of Experts
Jakub Krajewski
Jan Ludziejewski
Kamil Adamczewski
Maciej Pióro
Michal Krutul
...
Krystian Król
Tomasz Odrzygó'zd'z
Piotr Sankowski
Marek Cygan
Sebastian Jaszczur
MoE
51
54
0
12 Feb 2024
Resurrecting Recurrent Neural Networks for Long Sequences
Resurrecting Recurrent Neural Networks for Long Sequences
Antonio Orvieto
Samuel L. Smith
Albert Gu
Anushan Fernando
Çağlar Gülçehre
Razvan Pascanu
Soham De
88
268
0
11 Mar 2023
In-context Learning and Induction Heads
In-context Learning and Induction Heads
Catherine Olsson
Nelson Elhage
Neel Nanda
Nicholas Joseph
Nova Dassarma
...
Tom B. Brown
Jack Clark
Jared Kaplan
Sam McCandlish
C. Olah
250
463
0
24 Sep 2022
Scaling Laws for Neural Language Models
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
264
4,489
0
23 Jan 2020
1