2110.10548
Cited By
Synthesizing Optimal Parallelism Placement and Reduction Strategies on Hierarchical Systems for Deep Learning
20 October 2021
Ningning Xie
Tamara Norman
Dominik Grewe
Dimitrios Vytiniotis
Papers citing "Synthesizing Optimal Parallelism Placement and Reduction Strategies on Hierarchical Systems for Deep Learning"
2 / 2 papers shown
Title
PID-Comm: A Fast and Flexible Collective Communication Framework for Commodity Processing-in-DIMM Devices
Si Ung Noh
Junguk Hong
Chaemin Lim
Seong-Yeol Park
Jeehyun Kim
Hanjun Kim
Youngsok Kim
Jinho Lee
34
6
0
13 Apr 2024
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
245
1,821
0
17 Sep 2019