Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2208.14228
Cited By
EasyScale: Accuracy-consistent Elastic Training for Deep Learning
30 August 2022
Mingzhen Li
Wencong Xiao
Biao Sun
Hanyu Zhao
Hailong Yang
S. Ren
Zhongzhi Luan
Xianyan Jia
Yi Liu
Yong Li
Wei Lin
D. Qian
Re-assign community
ArXiv
PDF
HTML
Papers citing
"EasyScale: Accuracy-consistent Elastic Training for Deep Learning"
5 / 5 papers shown
Title
LoongServe: Efficiently Serving Long-context Large Language Models with Elastic Sequence Parallelism
Bingya Wu
Shengyu Liu
Yinmin Zhong
Peng Sun
Xuanzhe Liu
Xin Jin
RALM
35
53
0
15 Apr 2024
SuperBench: Improving Cloud AI Infrastructure Reliability with Proactive Validation
Yifan Xiong
Yuting Jiang
Ziyue Yang
L. Qu
Guoshuai Zhao
...
Luke Melton
Joe Chau
Peng Cheng
Yongqiang Xiong
Lidong Zhou
54
6
0
09 Feb 2024
Scavenger: A Cloud Service for Optimizing Cost and Performance of ML Training
S. Tyagi
Prateek Sharma
21
5
0
12 Mar 2023
Varuna: Scalable, Low-cost Training of Massive Deep Learning Models
Sanjith Athlur
Nitika Saran
Muthian Sivathanu
Ramachandran Ramjee
Nipun Kwatra
GNN
31
80
0
07 Nov 2021
Online Evolutionary Batch Size Orchestration for Scheduling Deep Learning Workloads in GPU Clusters
Chen Sun
Shenggui Li
Jinyue Wang
Jun Yu
54
47
0
08 Aug 2021
1