ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2405.16486
  4. Cited By
Decomposing the Neurons: Activation Sparsity via Mixture of Experts for
  Continual Test Time Adaptation

Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation

26 May 2024
Rongyu Zhang
Aosong Cheng
Yulin Luo
Gaole Dai
Huanrui Yang
Jiaming Liu
Ran Xu
Li Du
Yuan Du
Yanbing Jiang
Shanghang Zhang
    MoE
    TTA
ArXivPDFHTML

Papers citing "Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation"

27 / 27 papers shown
Title
MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation
MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation
Rongyu Zhang
Menghang Dong
Yuan Zhang
Liang Heng
Xiaowei Chi
Gaole Dai
Li Du
Dan Wang
Yuan Du
MoE
124
1
0
26 Mar 2025
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications
Siyuan Mu
Sen Lin
MoE
388
5
0
10 Mar 2025
Geometric Analysis of Reasoning Trajectories: A Phase Space Approach to Understanding Valid and Invalid Multi-Hop Reasoning in LLMs
Geometric Analysis of Reasoning Trajectories: A Phase Space Approach to Understanding Valid and Invalid Multi-Hop Reasoning in LLMs
Javier Marin
LRM
105
0
0
06 Oct 2024
Multi-level Personalized Federated Learning on Heterogeneous and
  Long-Tailed Data
Multi-level Personalized Federated Learning on Heterogeneous and Long-Tailed Data
Rongyu Zhang
Yun Chen
Chenrui Wu
Fangxin Wang
Boyan Li
62
12
0
10 May 2024
ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity
  within Large Language Models
ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models
Chenyang Song
Xu Han
Zhengyan Zhang
Shengding Hu
Xiyu Shi
...
Chen Chen
Zhiyuan Liu
Guanglin Li
Tao Yang
Maosong Sun
81
29
0
21 Feb 2024
MoEC: Mixture of Experts Implicit Neural Compression
MoEC: Mixture of Experts Implicit Neural Compression
Jianchen Zhao
Cheng-Ching Tseng
Ming Lu
Ruichuan An
Xiaobao Wei
He Sun
Shanghang Zhang
59
3
0
03 Dec 2023
Distribution-Aware Continual Test-Time Adaptation for Semantic
  Segmentation
Distribution-Aware Continual Test-Time Adaptation for Semantic Segmentation
Jiayin Ni
Senqiao Yang
Ran Xu
Jiaming Liu
Xiaoqi Li
Wenyu Jiao
Zehui Chen
Yi Liu
Shanghang Zhang
TTA
47
7
0
24 Sep 2023
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution
  Vision Transformer
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer
Xuanyao Chen
Zhijian Liu
Haotian Tang
Li Yi
Hang Zhao
Song Han
ViT
156
48
0
30 Mar 2023
EcoTTA: Memory-Efficient Continual Test-time Adaptation via
  Self-distilled Regularization
EcoTTA: Memory-Efficient Continual Test-time Adaptation via Self-distilled Regularization
Jun S. Song
Jungsoo Lee
In So Kweon
Sungha Choi
TTA
64
90
0
03 Mar 2023
Robust Mean Teacher for Continual and Gradual Test-Time Adaptation
Robust Mean Teacher for Continual and Gradual Test-Time Adaptation
Mario Döbler
Robert A. Marsden
Bin Yang
OOD
TTA
54
90
0
23 Nov 2022
M$^3$ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task
  Learning with Model-Accelerator Co-design
M3^33ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design
Hanxue Liang
Zhiwen Fan
Rishov Sarkar
Ziyu Jiang
Tianlong Chen
Kai Zou
Yu Cheng
Cong Hao
Zhangyang Wang
MoE
62
86
0
26 Oct 2022
Continual Test-Time Domain Adaptation
Continual Test-Time Domain Adaptation
Qin Wang
Olga Fink
Luc Van Gool
Dengxin Dai
OOD
TTA
101
419
0
25 Mar 2022
Mixture-of-Experts with Expert Choice Routing
Mixture-of-Experts with Expert Choice Routing
Yan-Quan Zhou
Tao Lei
Han-Chu Liu
Nan Du
Yanping Huang
Vincent Zhao
Andrew M. Dai
Zhifeng Chen
Quoc V. Le
James Laudon
MoE
257
350
0
18 Feb 2022
Scaling Vision with Sparse Mixture of Experts
Scaling Vision with Sparse Mixture of Experts
C. Riquelme
J. Puigcerver
Basil Mustafa
Maxim Neumann
Rodolphe Jenatton
André Susano Pinto
Daniel Keysers
N. Houlsby
MoE
76
597
0
10 Jun 2021
Hash Layers For Large Sparse Models
Hash Layers For Large Sparse Models
Stephen Roller
Sainbayar Sukhbaatar
Arthur Szlam
Jason Weston
MoE
154
210
0
08 Jun 2021
SegFormer: Simple and Efficient Design for Semantic Segmentation with
  Transformers
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers
Enze Xie
Wenhai Wang
Zhiding Yu
Anima Anandkumar
J. Álvarez
Ping Luo
ViT
231
4,990
0
31 May 2021
Adversarial Learning for Zero-Shot Stance Detection on Social Media
Adversarial Learning for Zero-Shot Stance Detection on Social Media
Emily Allaway
Malavika Srikanth
Kathleen McKeown
ObjD
VLM
33
93
0
14 May 2021
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective
  with Transformers
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
Sixiao Zheng
Jiachen Lu
Hengshuang Zhao
Xiatian Zhu
Zekun Luo
...
Yanwei Fu
Jianfeng Feng
Tao Xiang
Philip Torr
Li Zhang
ViT
170
2,893
0
31 Dec 2020
GShard: Scaling Giant Models with Conditional Computation and Automatic
  Sharding
GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding
Dmitry Lepikhin
HyoukJoong Lee
Yuanzhong Xu
Dehao Chen
Orhan Firat
Yanping Huang
M. Krikun
Noam M. Shazeer
Zhiwen Chen
MoE
86
1,156
0
30 Jun 2020
FDA: Fourier Domain Adaptation for Semantic Segmentation
FDA: Fourier Domain Adaptation for Semantic Segmentation
Yanchao Yang
Stefano Soatto
OOD
81
895
0
11 Apr 2020
OccuSeg: Occupancy-aware 3D Instance Segmentation
OccuSeg: Occupancy-aware 3D Instance Segmentation
Lei Han
Tian Zheng
Lan Xu
Lu Fang
3DPC
214
260
0
14 Mar 2020
A Survey of Autonomous Driving: Common Practices and Emerging
  Technologies
A Survey of Autonomous Driving: Common Practices and Emerging Technologies
Ekim Yurtsever
Jacob Lambert
Alexander Carballo
K. Takeda
83
1,370
0
12 Jun 2019
Benchmarking Neural Network Robustness to Common Corruptions and
  Perturbations
Benchmarking Neural Network Robustness to Common Corruptions and Perturbations
Dan Hendrycks
Thomas G. Dietterich
OOD
VLM
144
3,423
0
28 Mar 2019
The Cityscapes Dataset for Semantic Urban Scene Understanding
The Cityscapes Dataset for Semantic Urban Scene Understanding
Marius Cordts
Mohamed Omran
Sebastian Ramos
Timo Rehfeld
Markus Enzweiler
Rodrigo Benenson
Uwe Franke
Stefan Roth
Bernt Schiele
904
11,587
0
06 Apr 2016
Learning Deep Features for Discriminative Localization
Learning Deep Features for Discriminative Localization
Bolei Zhou
A. Khosla
Àgata Lapedriza
A. Oliva
Antonio Torralba
SSL
SSeg
FAtt
223
9,298
0
14 Dec 2015
Domain-Adversarial Training of Neural Networks
Domain-Adversarial Training of Neural Networks
Yaroslav Ganin
E. Ustinova
Hana Ajakan
Pascal Germain
Hugo Larochelle
François Laviolette
M. Marchand
Victor Lempitsky
GAN
OOD
366
9,467
0
28 May 2015
Delving Deep into Rectifiers: Surpassing Human-Level Performance on
  ImageNet Classification
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
VLM
276
18,587
0
06 Feb 2015
1