Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2011.09208
Cited By
v1
v2
v3 (latest)
Whale: Efficient Giant Model Training over Heterogeneous GPUs
18 November 2020
Xianyan Jia
Le Jiang
Ang Wang
Wencong Xiao
Ziji Shi
Jie Zhang
Xinyuan Li
Lan-yue Chen
Yong Li
Zhen Zheng
Xiaoyong Liu
Wei Lin
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Whale: Efficient Giant Model Training over Heterogeneous GPUs"
5 / 5 papers shown
Title
Optimizing Large Model Training through Overlapped Activation Recomputation
Ping Chen
Wenjie Zhang
Shuibing He
Yingjie Gu
Zhuwei Peng
...
Yi Zheng
Zhefeng Wang
Yanlong Yin
Gang Chen
Gang Chen
127
6
0
13 Jun 2024
Auto-Parallelizing Large Models with Rhino: A Systematic Approach on Production AI Platform
Shiwei Zhang
Lansong Diao
Siyu Wang
Zongyan Cao
Yiliang Gu
Chang Si
Ziji Shi
Zhen Zheng
Chuan Wu
W. Lin
AI4CE
54
4
0
16 Feb 2023
Learning Large-scale Universal User Representation with Sparse Mixture of Experts
Caigao Jiang
Siqiao Xue
James Y. Zhang
Lingyue Liu
Zhibo Zhu
Hongyan Hao
65
4
0
11 Jul 2022
Merak: An Efficient Distributed DNN Training Framework with Automated 3D Parallelism for Giant Foundation Models
Zhiquan Lai
Shengwei Li
Xudong Tang
Ke-shi Ge
Weijie Liu
Yabo Duan
Linbo Qiao
Dongsheng Li
89
46
0
10 Jun 2022
HeterPS: Distributed Deep Learning With Reinforcement Learning Based Scheduling in Heterogeneous Environments
Ji Liu
Zhihua Wu
Dianhai Yu
Yanjun Ma
Danlei Feng
Minxu Zhang
Xinxuan Wu
Xuefeng Yao
Dejing Dou
76
49
0
20 Nov 2021
1