ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2208.14228
  4. Cited By
EasyScale: Accuracy-consistent Elastic Training for Deep Learning

EasyScale: Accuracy-consistent Elastic Training for Deep Learning

30 August 2022
Mingzhen Li
Wencong Xiao
Biao Sun
Hanyu Zhao
Hailong Yang
S. Ren
Zhongzhi Luan
Xianyan Jia
Yi Liu
Yong Li
Wei Lin
D. Qian
ArXivPDFHTML

Papers citing "EasyScale: Accuracy-consistent Elastic Training for Deep Learning"

5 / 5 papers shown
Title
LoongServe: Efficiently Serving Long-context Large Language Models with
  Elastic Sequence Parallelism
LoongServe: Efficiently Serving Long-context Large Language Models with Elastic Sequence Parallelism
Bingya Wu
Shengyu Liu
Yinmin Zhong
Peng Sun
Xuanzhe Liu
Xin Jin
RALM
43
53
0
15 Apr 2024
SuperBench: Improving Cloud AI Infrastructure Reliability with Proactive
  Validation
SuperBench: Improving Cloud AI Infrastructure Reliability with Proactive Validation
Yifan Xiong
Yuting Jiang
Ziyue Yang
L. Qu
Guoshuai Zhao
...
Luke Melton
Joe Chau
Peng Cheng
Yongqiang Xiong
Lidong Zhou
57
6
0
09 Feb 2024
Scavenger: A Cloud Service for Optimizing Cost and Performance of ML
  Training
Scavenger: A Cloud Service for Optimizing Cost and Performance of ML Training
S. Tyagi
Prateek Sharma
24
5
0
12 Mar 2023
Varuna: Scalable, Low-cost Training of Massive Deep Learning Models
Varuna: Scalable, Low-cost Training of Massive Deep Learning Models
Sanjith Athlur
Nitika Saran
Muthian Sivathanu
Ramachandran Ramjee
Nipun Kwatra
GNN
31
80
0
07 Nov 2021
Online Evolutionary Batch Size Orchestration for Scheduling Deep
  Learning Workloads in GPU Clusters
Online Evolutionary Batch Size Orchestration for Scheduling Deep Learning Workloads in GPU Clusters
Chen Sun
Shenggui Li
Jinyue Wang
Jun Yu
54
47
0
08 Aug 2021
1