2205.02473
dPRO: A Generic Profiling and Optimization System for Expediting Distributed DNN Training
5 May 2022
Han Hu
Chenyu Jiang
Yuchen Zhong
Size Zheng
Chuan Wu
Yibo Zhu
Yanghua Peng
Chuanxiong Guo
Papers citing "dPRO: A Generic Profiling and Optimization System for Expediting Distributed DNN Training"
4 / 4 papers shown
Title
Phantora: Live GPU Cluster Simulation for Machine Learning System Performance Estimation
Jianxing Qin
Jingrong Chen
Xinhao Kong
Yongji Wu
Liang Luo
Ziyi Wang
Ying Zhang
Tingjun Chen
Alvin R. Lebeck
Danyang Zhuo
199
0
0
02 May 2025
PipeWeaver: Addressing Data Dynamicity in Large Multimodal Model Training with Dynamic Interleaved Pipeline
Zhenliang Xue
Hanpeng Hu
Xing Chen
Yimin Jiang
Yixin Song
Zeyu Mi
Yibo Zhu
Daxin Jiang
Yubin Xia
Haibo Chen
49
0
0
19 Apr 2025
A Codesign of Scheduling and Parallelization for Large Model Training in Heterogeneous Clusters
Chunyu Xue
Weihao Cui
Han Zhao
Quan Chen
Shulai Zhang
Peng Yang
Jing Yang
Shaobo Li
Minyi Guo
61
2
0
24 Mar 2024
An Overview of the Data-Loader Landscape: Comparative Performance Analysis
Iason Ofeidis
Diego Kiedanski
Leandros Tassiulas
43
7
0
27 Sep 2022