2109.01313
Characterization and Prediction of Deep Learning Workloads in Large-Scale GPU Datacenters
3 September 2021
Qi Hu
Peng Sun
Shengen Yan
Yonggang Wen
Tianwei Zhang
3DH
GNN
Papers citing
"Characterization and Prediction of Deep Learning Workloads in Large-Scale GPU Datacenters"
15 / 15 papers shown
Resource Heterogeneity-Aware and Utilization-Enhanced Scheduling for Deep Learning Clusters
Abeda Sultana
Nabin Pakka
F. Xu
Xu Yuan
Li Chen
N. Tzeng
93
0
0
13 Mar 2025
THOR: A Generic Energy Estimation Approach for On-Device Training
Jiaru Zhang
Zesong Wang
Hao Wang
Tao Song
Huai-an Su
...
Yang Hua
Xiangwei Zhou
Ruhui Ma
Miao Pan
Haibing Guan
97
0
0
27 Jan 2025
Energy-aware Task Scheduling with Deadline Constraint in DVFS-enabled Heterogeneous Clusters
Xinxin Mei
Qiang-qiang Wang
Xiaowen Chu
Hai Liu
Y. Leung
Zongpeng Li
27
8
0
01 Apr 2021
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
W. Fedus
Barret Zoph
Noam M. Shazeer
MoE
85
2,181
0
11 Jan 2021
Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep Learning
Aurick Qiao
Sang Keun Choe
Suhas Jayaram Subramanya
Willie Neiswanger
Qirong Ho
Hao Zhang
G. Ganger
Eric Xing
VLM
55
181
0
27 Aug 2020
Heterogeneity-Aware Cluster Scheduling Policies for Deep Learning Workloads
Deepak Narayanan
Keshav Santhanam
Fiodar Kazhamiaka
Amar Phanishayee
Matei A. Zaharia
53
210
0
20 Aug 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
743
41,932
0
28 May 2020
Serverless in the Wild: Characterizing and Optimizing the Serverless Workload at a Large Cloud Provider
Mohammad Shahrad
Rodrigo Fonseca
Íñigo Goiri
G. Chaudhry
Paul Batum
Jason Cooke
Eduardo Laureano
Colby Tresness
M. Russinovich
Ricardo Bianchini
117
618
0
06 Mar 2020
Characterizing Deep Learning Training Workloads on Alibaba-PAI
Mengdi Wang
Chen Meng
Guoping Long
Chuan Wu
Jun Yang
Wei Lin
Yangqing Jia
55
54
0
14 Oct 2019
Themis: Fair and Efficient GPU Cluster Scheduling
Kshiteej S. Mahajan
Arjun Balasubramanian
Arjun Singhvi
Shivaram Venkataraman
Aditya Akella
Amar Phanishayee
Shuchi Chawla
54
182
0
02 Jul 2019
Salus: Fine-Grained GPU Sharing Primitives for Deep Learning Applications
Peifeng Yu
Mosharaf Chowdhury
53
72
0
12 Feb 2019
Analysis of Large-Scale Multi-Tenant GPU Clusters for DNN Training Workloads
Myeongjae Jeon
Shivaram Venkataraman
Amar Phanishayee
Junjie Qian
Wencong Xiao
Fan Yang
GNN
65
353
0
17 Jan 2019
A Workload Analysis of NSF's Innovative HPC Resources Using XDMoD
N. Simakov
Joseph P. White
R. L. Deleon
S. Gallo
Matthew D. Jones
Jeffrey T. Palmer
Benjamin D. Plessinger
T. Furlani
33
38
0
12 Jan 2018
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
434
20,541
0
10 Sep 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
546
27,300
0
01 Sep 2014