1906.08879
Cited By
Placeto: Learning Generalizable Device Placement Algorithms for Distributed Machine Learning
20 June 2019
Ravichandra Addanki
S. Venkatakrishnan
Shreyan Gupta
Hongzi Mao
Mohammad Alizadeh
OOD
OffRL
Papers citing
"Placeto: Learning Generalizable Device Placement Algorithms for Distributed Machine Learning"
17 / 17 papers shown
Title
A Structure-Aware Framework for Learning Device Placements on Computation Graphs
Shukai Duan
Heng Ping
Nikos Kanakaris
Xiongye Xiao
Panagiotis Kyriakis
...
Guixiang Ma
Mihai Capota
Shahin Nazarian
Theodore L. Willke
Paul Bogdan
45
2
0
23 May 2024
HAP: SPMD DNN Training on Heterogeneous GPU Clusters with Automated Program Synthesis
Shiwei Zhang
Lansong Diao
Chuan Wu
Zongyan Cao
Siyu Wang
Wei Lin
43
12
0
11 Jan 2024
Moirai: Towards Optimal Placement for Distributed Inference on Heterogeneous Devices
Beibei Zhang
Hongwei Zhu
Feng Gao
Zhihui Yang
Xiaoyang Sean Wang
29
1
0
07 Dec 2023
Automated Tensor Model Parallelism with Overlapped Communication for Efficient Foundation Model Training
Shengwei Li
Zhiquan Lai
Yanqi Hao
Weijie Liu
Ke-shi Ge
Xiaoge Deng
Dongsheng Li
KaiCheng Lu
19
10
0
25 May 2023
Auto-Parallelizing Large Models with Rhino: A Systematic Approach on Production AI Platform
Shiwei Zhang
Lansong Diao
Siyu Wang
Zongyan Cao
Yiliang Gu
Chang Si
Ziji Shi
Zhen Zheng
Chuan Wu
W. Lin
AI4CE
32
4
0
16 Feb 2023
Expediting Distributed DNN Training with Device Topology-Aware Graph Deployment
Shiwei Zhang
Xiaodong Yi
Lansong Diao
Chuan Wu
Siyu Wang
W. Lin
GNN
22
5
0
13 Feb 2023
Robust Scheduling with GFlowNets
David W. Zhang
Corrado Rainone
M. Peschl
Roberto Bondesan
34
50
0
17 Jan 2023
DreamShard: Generalizable Embedding Table Placement for Recommender Systems
Daochen Zha
Louis Feng
Qiaoyu Tan
Zirui Liu
Kwei-Herng Lai
Bhargav Bhushanam
Yuandong Tian
A. Kejariwal
Xia Hu
LMTD
OffRL
33
28
0
05 Oct 2022
Celeritas: Fast Optimizer for Large Dataflow Graphs
Hengwei Xu
Yong Liao
Haiyong Xie
Pengyuan Zhou
GNN
17
1
0
30 Jul 2022
Accelerate Model Parallel Training by Using Efficient Graph Traversal Order in Device Placement
Tianze Wang
A. H. Payberah
D. Hagos
Vladimir Vlassov
GNN
28
0
0
21 Jan 2022
Automated Deep Learning: Neural Architecture Search Is Not the End
Xuanyi Dong
D. Kedziora
Katarzyna Musial
Bogdan Gabrys
29
26
0
16 Dec 2021
A Learned Performance Model for Tensor Processing Units
Samuel J. Kaufman
P. Phothilimthana
Yanqi Zhou
Charith Mendis
Sudip Roy
Amit Sabne
Mike Burrows
21
8
0
03 Aug 2020
DeepSoCS: A Neural Scheduler for Heterogeneous System-on-Chip (SoC) Resource Scheduling
Tegg Taekyong Sung
J. Ha
Jeewoo Kim
Alex Yahja
Chae-Bong Sohn
Bo Ryu
21
9
0
15 May 2020
Wield: Systematic Reinforcement Learning With Progressive Randomization
Michael Schaarschmidt
Kai Fricke
Eiko Yoneki
19
2
0
15 Sep 2019
Learning Scheduling Algorithms for Data Processing Clusters
Hongzi Mao
Malte Schwarzkopf
S. Venkatakrishnan
Zili Meng
Mohammad Alizadeh
OffRL
20
637
0
03 Oct 2018
Geometric deep learning: going beyond Euclidean data
M. Bronstein
Joan Bruna
Yann LeCun
Arthur Szlam
P. Vandergheynst
GNN
264
3,243
0
24 Nov 2016
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
718
6,748
0
26 Sep 2016