2403.16125
A Codesign of Scheduling and Parallelization for Large Model Training in Heterogeneous Clusters
24 March 2024
Chunyu Xue, Weihao Cui, Han Zhao, Quan Chen, Shulai Zhang, Peng Yang, Jing Yang, Shaobo Li, Minyi Guo
Papers citing "A Codesign of Scheduling and Parallelization for Large Model Training in Heterogeneous Clusters"
2 / 2 papers shown
Efficient Training of Large Language Models on Distributed Infrastructures: A Survey
Jiangfei Duan, Shuo Zhang, Zerui Wang, Lijuan Jiang, Wenwen Qu, ..., Dahua Lin, Yonggang Wen, Xin Jin, Tianwei Zhang, Peng Sun
73
8
0
29 Jul 2024
Online Evolutionary Batch Size Orchestration for Scheduling Deep Learning Workloads in GPU Clusters
Chen Sun, Shenggui Li, Jinyue Wang, Jun Yu
54
47
0
08 Aug 2021