ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2401.16492
  4. Cited By
GPU Cluster Scheduling for Network-Sensitive Deep Learning

GPU Cluster Scheduling for Network-Sensitive Deep Learning

29 January 2024
Aakash Sharma
Vivek M. Bhasi
Sonali Singh
G. Kesidis
M. Kandemir
Chita R. Das
ArXivPDFHTML

Papers citing "GPU Cluster Scheduling for Network-Sensitive Deep Learning"

10 / 10 papers shown
Title
Themis: A Network Bandwidth-Aware Collective Scheduling Policy for
  Distributed Training of DL Models
Themis: A Network Bandwidth-Aware Collective Scheduling Policy for Distributed Training of DL Models
Saeed Rashidi
William Won
Sudarshan Srinivasan
Srinivas Sridharan
T. Krishna
GNN
47
31
0
09 Oct 2021
Characterization and Prediction of Deep Learning Workloads in
  Large-Scale GPU Datacenters
Characterization and Prediction of Deep Learning Workloads in Large-Scale GPU Datacenters
Qi Hu
Peng Sun
Shengen Yan
Yonggang Wen
Tianwei Zhang
3DH
GNN
43
131
0
03 Sep 2021
Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep
  Learning
Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep Learning
Aurick Qiao
Sang Keun Choe
Suhas Jayaram Subramanya
Willie Neiswanger
Qirong Ho
Hao Zhang
G. Ganger
Eric Xing
VLM
55
181
0
27 Aug 2020
MLPerf Training Benchmark
MLPerf Training Benchmark
Arya D. McCarthy
Christine Cheng
Cody Coleman
Greg Diamos
Paulius Micikevicius
...
Carole-Jean Wu
Lingjie Xu
Masafumi Yamazaki
C. Young
Matei A. Zaharia
86
314
0
02 Oct 2019
Megatron-LM: Training Multi-Billion Parameter Language Models Using
  Model Parallelism
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
Mohammad Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
324
1,899
0
17 Sep 2019
Themis: Fair and Efficient GPU Cluster Scheduling
Themis: Fair and Efficient GPU Cluster Scheduling
Kshiteej S. Mahajan
Arjun Balasubramanian
Arjun Singhvi
Shivaram Venkataraman
Aditya Akella
Amar Phanishayee
Shuchi Chawla
54
182
0
02 Jul 2019
Evaluating Modern GPU Interconnect: PCIe, NVLink, NV-SLI, NVSwitch and
  GPUDirect
Evaluating Modern GPU Interconnect: PCIe, NVLink, NV-SLI, NVSwitch and GPUDirect
Ang Li
Shuaiwen Leon Song
Jieyang Chen
Jiajia Li
Xu Liu
Nathan R. Tallent
Kevin J. Barker
GNN
63
214
0
11 Mar 2019
Analysis of Large-Scale Multi-Tenant GPU Clusters for DNN Training
  Workloads
Analysis of Large-Scale Multi-Tenant GPU Clusters for DNN Training Workloads
Myeongjae Jeon
Shivaram Venkataraman
Amar Phanishayee
Junjie Qian
Wencong Xiao
Fan Yang
GNN
60
353
0
17 Jan 2019
MobileNetV2: Inverted Residuals and Linear Bottlenecks
MobileNetV2: Inverted Residuals and Linear Bottlenecks
Mark Sandler
Andrew G. Howard
Menglong Zhu
A. Zhmoginov
Liang-Chieh Chen
178
19,271
0
13 Jan 2018
Identity Mappings in Deep Residual Networks
Identity Mappings in Deep Residual Networks
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
354
10,180
0
16 Mar 2016
1