Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.05799
Cited By
v1
v2
v3 (latest)
Horovod: fast and easy distributed deep learning in TensorFlow
15 February 2018
Alexander Sergeev
Mike Del Balso
Re-assign community
ArXiv (abs)
PDF
HTML
Github (14494★)
Papers citing
"Horovod: fast and easy distributed deep learning in TensorFlow"
50 / 454 papers shown
Title
Graph Generative Models for Fast Detector Simulations in High Energy Physics
A. Hariri
Darya Dyachkova
Sergei Gleyzer
AI4CE
63
6
0
05 Apr 2021
MergeComp: A Compression Scheduler for Scalable Communication-Efficient Distributed Training
Zhuang Wang
X. Wu
T. Ng
GNN
32
4
0
28 Mar 2021
Czert -- Czech BERT-like Model for Language Representation
Jakub Sido
O. Pražák
P. Pribán
Jan Pasek
Michal Seják
Miloslav Konopík
76
44
0
24 Mar 2021
Federated Quantum Machine Learning
Samuel Yen-Chi Chen
Shinjae Yoo
FedML
AI4CE
79
121
0
22 Mar 2021
Distributed Deep Learning Using Volunteer Computing-Like Paradigm
Medha Atre
B. Jha
Ashwini Rao
91
11
0
16 Mar 2021
CrossoverScheduler: Overlapping Multiple Distributed Training Applications in a Crossover Manner
Cheng Luo
L. Qu
Youshan Miao
Peng Cheng
Y. Xiong
46
0
0
14 Mar 2021
Performance of a Geometric Deep Learning Pipeline for HL-LHC Particle Tracking
X. Ju
D. Murnane
P. Calafiura
Nicholas Choma
S. Conlon
...
Aditi Chauhan
A. Schuy
Shih-Chieh Hsu
A. Ballow
A. Lazar
50
65
0
11 Mar 2021
On the Utility of Gradient Compression in Distributed Training Systems
Saurabh Agarwal
Hongyi Wang
Shivaram Venkataraman
Dimitris Papailiopoulos
107
47
0
28 Feb 2021
Peering Beyond the Gradient Veil with Distributed Auto Differentiation
Bradley T. Baker
Aashis Khanal
Vince D. Calhoun
Barak A. Pearlmutter
Sergey Plis
47
1
0
18 Feb 2021
Oscars: Adaptive Semi-Synchronous Parallel Model for Distributed Deep Learning with Global View
Sheng-Jun Huang
38
0
0
17 Feb 2021
User Embedding based Neighborhood Aggregation Method for Inductive Recommendation
Rahul Ragesh
Sundararajan Sellamanickam
Vijay Lingam
Arun Shankar Iyer
Ramakrishna Bairi
GNN
71
6
0
15 Feb 2021
GradPIM: A Practical Processing-in-DRAM Architecture for Gradient Descent
Heesu Kim
Hanmin Park
Taehyun Kim
Kwanheum Cho
Eojin Lee
Soojung Ryu
Hyuk-Jae Lee
Kiyoung Choi
Jinho Lee
66
37
0
15 Feb 2021
On the Impact of Device and Behavioral Heterogeneity in Federated Learning
A. Abdelmoniem
Chen-Yu Ho
Pantelis Papageorgiou
Muhammad Bilal
Marco Canini
FedML
59
18
0
15 Feb 2021
DeepReduce: A Sparse-tensor Communication Framework for Distributed Deep Learning
Kelly Kostopoulou
Hang Xu
Aritra Dutta
Xin Li
A. Ntoulas
Panos Kalnis
43
7
0
05 Feb 2021
An Efficient Statistical-based Gradient Compression Technique for Distributed Training Systems
A. Abdelmoniem
Ahmed Elzanaty
Mohamed-Slim Alouini
Marco Canini
132
77
0
26 Jan 2021
Training Multilingual Pre-trained Language Model with Byte-level Subwords
Junqiu Wei
Qun Liu
Yinpeng Guo
Xin Jiang
63
20
0
23 Jan 2021
Clairvoyant Prefetching for Distributed Machine Learning I/O
Nikoli Dryden
Roman Böhringer
Tal Ben-Nun
Torsten Hoefler
79
58
0
21 Jan 2021
SceneGen: Learning to Generate Realistic Traffic Scenes
Shuhan Tan
K. Wong
Shenlong Wang
S. Manivasagam
Mengye Ren
R. Urtasun
92
107
0
16 Jan 2021
A deep learning modeling framework to capture mixing patterns in reactive-transport systems
N. V. Jagtap
M. Mudunuru
K. Nakshatrala
34
5
0
11 Jan 2021
Deeplite Neutrino: An End-to-End Framework for Constrained Deep Learning Model Optimization
A. Sankaran
Olivier Mastropietro
Ehsan Saboori
Yasser Idris
Davis Sawyer
Mohammadhossein Askarihemmat
G. B. Hacene
73
4
0
11 Jan 2021
Crossover-SGD: A gossip-based communication in distributed deep learning for alleviating large mini-batch problem and enhancing scalability
Sangho Yeo
Minho Bae
Minjoong Jeong
Oh-Kyoung Kwon
Sangyoon Oh
59
3
0
30 Dec 2020
NeurST: Neural Speech Translation Toolkit
Chengqi Zhao
Mingxuan Wang
Qianqian Dong
Rong Ye
Lei Li
89
32
0
18 Dec 2020
Data optimization for large batch distributed training of deep neural networks
Shubhankar Gahlot
Junqi Yin
Mallikarjun Shankar
23
1
0
16 Dec 2020
Cyclic orthogonal convolutions for long-range integration of features
Federica Freddi
Jezabel R. Garcia
Michael Bromberg
Sepehr Jalali
Da-shan Shiu
Alvin Chua
A. Bernacchia
39
0
0
11 Dec 2020
Parallel Training of Deep Networks with Local Updates
Michael Laskin
Luke Metz
Seth Nabarrao
Mark Saroufim
Badreddine Noune
Carlo Luschi
Jascha Narain Sohl-Dickstein
Pieter Abbeel
FedML
122
27
0
07 Dec 2020
Accumulated Decoupled Learning: Mitigating Gradient Staleness in Inter-Layer Model Parallelization
Huiping Zhuang
Zhiping Lin
Kar-Ann Toh
124
4
0
03 Dec 2020
Distributed Training and Optimization Of Neural Networks
J. Vlimant
Junqi Yin
AI4CE
30
2
0
03 Dec 2020
A Study of Checkpointing in Large Scale Training of Deep Neural Networks
Elvis Rojas
A. Kahira
Esteban Meneses
L. Bautista-Gomez
Rosa M. Badia
53
26
0
01 Dec 2020
Scalable Deep-Learning-Accelerated Topology Optimization for Additively Manufactured Materials
Sirui Bi
Jiaxin Zhang
Guannan Zhang
AI4CE
32
8
0
28 Nov 2020
Protein model quality assessment using rotation-equivariant, hierarchical neural networks
Stephan Eismann
Patricia Suriana
Bowen Jing
Raphael J. L. Townshend
R. Dror
41
13
0
27 Nov 2020
TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning
Peng Sun
Jiechao Xiong
Lei Han
Xinghai Sun
Shuxing Li
Jiawei Xu
Meng Fang
Zhengyou Zhang
OffRL
LRM
73
19
0
25 Nov 2020
Integrating Deep Learning in Domain Sciences at Exascale
Rick Archibald
E. Chow
E. DÁzevedo
Jack J. Dongarra
M. Eisenbach
...
Florent Lopez
Daniel Nichols
S. Tomov
Kwai Wong
Junqi Yin
PINN
48
5
0
23 Nov 2020
Emergent Road Rules In Multi-Agent Driving Environments
Avik Pal
Jonah Philion
Yuan-Hong Liao
Sanja Fidler
79
11
0
21 Nov 2020
Logically Consistent Loss for Visual Question Answering
Anh-Cat Le-Ngo
T. Tran
Santu Rana
Sunil R. Gupta
Svetha Venkatesh
OOD
74
0
0
19 Nov 2020
Whale: Efficient Giant Model Training over Heterogeneous GPUs
Xianyan Jia
Le Jiang
Ang Wang
Wencong Xiao
Ziji Shi
...
Lan-yue Chen
Yong Li
Zhen Zheng
Xiaoyong Liu
Wei Lin
90
56
0
18 Nov 2020
A Novel Memory-Efficient Deep Learning Training Framework via Error-Bounded Lossy Compression
Sian Jin
Guanpeng Li
Shuaiwen Leon Song
Dingwen Tao
AI4CE
72
12
0
18 Nov 2020
Cascade RNN-Transducer: Syllable Based Streaming On-device Mandarin Speech Recognition with a Syllable-to-Character Converter
Xiong Wang
Zhuoyuan Yao
Xian Shi
Lei Xie
68
30
0
17 Nov 2020
MuSCLE: Multi Sweep Compression of LiDAR using Deep Entropy Models
Sourav Biswas
Jerry Liu
K. Wong
Shenlong Wang
R. Urtasun
47
75
0
15 Nov 2020
AgEBO-Tabular: Joint Neural Architecture and Hyperparameter Search with Autotuned Data-Parallel Training for Tabular Data
Romain Egele
Prasanna Balaprakash
V. Vishwanath
Isabelle M Guyon
Zhengying Liu
LMTD
58
21
0
30 Oct 2020
SLM: Learning a Discourse Language Representation with Sentence Unshuffling
Haejun Lee
Drew A. Hudson
Kangwook Lee
Christopher D. Manning
SSL
119
52
0
30 Oct 2020
Accordion: Adaptive Gradient Communication via Critical Learning Regime Identification
Saurabh Agarwal
Hongyi Wang
Kangwook Lee
Shivaram Venkataraman
Dimitris Papailiopoulos
85
25
0
29 Oct 2020
Hierarchical Federated Learning through LAN-WAN Orchestration
Jinliang Yuan
Mengwei Xu
Xiao Ma
Ao Zhou
Xuanzhe Liu
Shangguang Wang
FedML
60
38
0
22 Oct 2020
Context-Aware Drive-thru Recommendation Service at Fast Food Restaurants
Luyang Wang
Kai-Qi Huang
Jiao Wang
Shengsheng Huang
J. Dai
Zhuang Yue
29
1
0
13 Oct 2020
DistDGL: Distributed Graph Neural Network Training for Billion-Scale Graphs
Da Zheng
Chao Ma
Minjie Wang
Jinjing Zhou
Qidong Su
Xiang Song
Quan Gan
Zheng Zhang
George Karypis
FedML
GNN
71
250
0
11 Oct 2020
A Predictive Autoscaler for Elastic Batch Jobs
Peng Gao
8
1
0
10 Oct 2020
Accelerating Finite-temperature Kohn-Sham Density Functional Theory with Deep Neural Networks
J. Ellis
Lenz Fiedler
G. Popoola
N. Modine
J. A. Stephens
A. Thompson
A. Cangi
S. Rajamanickam
AI4CE
58
40
0
10 Oct 2020
A Tensor Compiler for Unified Machine Learning Prediction Serving
Supun Nakandala Karla Saur
Karla Saur
Gyeong-In Yu
Konstantinos Karanasos
Carlo Curino
Markus Weimer
Matteo Interlandi
98
53
0
09 Oct 2020
Towards a Scalable and Distributed Infrastructure for Deep Learning Applications
Bita Hasheminezhad
S. Shirzad
Nanmiao Wu
Patrick Diehl
Hannes Schulz
Hartmut Kaiser
GNN
AI4CE
85
4
0
06 Oct 2020
HetSeq: Distributed GPU Training on Heterogeneous Infrastructure
Yifan Ding
Nicholas Botzer
Tim Weninger
VLM
MoE
37
7
0
25 Sep 2020
VirtualFlow: Decoupling Deep Learning Models from the Underlying Hardware
Andrew Or
Haoyu Zhang
M. Freedman
73
10
0
20 Sep 2020
Previous
1
2
3
...
10
5
6
7
8
9
Next