Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2411.02265
Cited By
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
4 November 2024
Xingchen Sun
Yanfeng Chen
Yanwen Huang
Ruobing Xie
Jiaqi Zhu
Kaipeng Zhang
Shuaipeng Li
Zhen Yang
J. N. Han
Xiaobo Shu
Jiahao Bu
Z. Chen
Xuemeng Huang
Fengzong Lian
Steve Yang
Jianfeng Yan
Yuyuan Zeng
Xiaoqin Ren
Chao Yu
Lulu Wu
Yue Mao
Jun Xia
Tao Yang
S. Zheng
Kan Wu
Dian Jiao
Jinbao Xue
Xinsong Zhang
Decheng Wu
Kai Liu
Dengpeng Wu
Guanghui Xu
S. Chen
Shuang Chen
Xiao Feng
Yigeng Hong
Junqiang Zheng
Chengcheng Xu
Zehan Li
Xiong Kuang
Jianglu Hu
Yiqi Chen
Yuchi Deng
Guiyang Li
Ao Liu
Chenchen Zhang
Shihui Hu
Zilong Zhao
Zifan Wu
Yao Ding
Wei Wang
Han Liu
R. Wang
Hao Fei
Peijie Yu
Ze Zhao
Xun Cao
Hai Wang
Fusheng Xiang
Mengyuan Huang
Zhiyuan Xiong
Bin Hu
Xuebin Hou
Lei Jiang
Jianqiang Ma
Jiajia Wu
Yaping Deng
Yi Shen
Qian Wang
Weijie Liu
Jie Liu
Meng Chen
Liang Dong
W. Jia
Hongyu Chen
Fengyuan Liu
Rui Yuan
Huilin Xu
Zhenxiang Yan
Tengfei Cao
Zhichao Hu
Xinhua Feng
Dong Du
T. Yu
Yangyu Tao
Feng Zhang
Jianchen Zhu
C. Xu
X. Li
Chong Zha
Wen Ouyang
Yinben Xia
Xiang Li
Zekun He
Rongpeng Chen
Jiawei Song
Ruibin Chen
F. Jiang
Chongqing Zhao
Binghui Wang
Hao Gong
Rong Gan
Winston Hu
Zhanhui Kang
Yong Yang
Yuhong Liu
Di Wang
Jie Jiang
MoE
ALM
ELM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent"
6 / 6 papers shown
Title
Dense Backpropagation Improves Training for Sparse Mixture-of-Experts
Ashwinee Panda
Vatsal Baherwani
Zain Sarwar
Benjamin Thérien
Supriyo Chakraborty
Tom Goldstein
MoE
42
0
0
16 Apr 2025
Multi-Mission Tool Bench: Assessing the Robustness of LLM based Agents through Related and Dynamic Missions
Peijie Yu
Yifan Yang
Jiyang Li
Zelong Zhang
Haorui Wang
Xiao Feng
Feng Zhang
LLMAG
117
0
0
03 Apr 2025
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications
Siyuan Mu
Sen Lin
MoE
153
2
0
10 Mar 2025
DSMoE: Matrix-Partitioned Experts with Dynamic Routing for Computation-Efficient Dense LLMs
Minxuan Lv
Zhenpeng Su
Leiyu Pan
Yizhe Xiong
Zijia Lin
...
Guiguang Ding
Cheng Luo
Di Zhang
Kun Gai
Songlin Hu
MoE
41
0
0
18 Feb 2025
video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model
Guangzhi Sun
Yudong Yang
Jimin Zhuang
Changli Tang
Yong Li
W. Li
Z. Ma
Chao Zhang
LRM
MLLM
VLM
64
4
0
17 Feb 2025
Scaling Laws for Floating Point Quantization Training
Xingchen Sun
Shuaipeng Li
Ruobing Xie
Weidong Han
Kan Wu
...
Yangyu Tao
Zhanhui Kang
C. Xu
Di Wang
Jie Jiang
MQ
AIFin
62
0
0
05 Jan 2025
1