Fault Tolerance in Iterative-Convergent Machine Learning

17 October 2018
Aurick Qiao
Bryon Aragam
Bingjing Zhang
Eric Xing

Papers citing "Fault Tolerance in Iterative-Convergent Machine Learning"

2 / 2 papers shown
Poseidon: An Efficient Communication Architecture for Distributed Deep Learning on GPU Clusters
Hao Zhang
Zeyu Zheng
Shizhen Xu
Wei Dai
Qirong Ho
Xiaodan Liang
Zhiting Hu
Jinliang Wei
P. Xie
Eric Xing
GNN
47
343
0
11 Jun 2017
Speeding Up Distributed Machine Learning Using Codes
Kangwook Lee
Maximilian Lam
Ramtin Pedarsani
Dimitris Papailiopoulos
Kannan Ramchandran
126
856
0
08 Dec 2015