Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2110.07029
Cited By
Adaptive Elastic Training for Sparse Deep Learning on Heterogeneous Multi-GPU Servers
13 October 2021
Yujing Ma
Florin Rusu
Kesheng Wu
A. Sim
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Adaptive Elastic Training for Sparse Deep Learning on Heterogeneous Multi-GPU Servers"
4 / 4 papers shown
Title
LoongServe: Efficiently Serving Long-context Large Language Models with Elastic Sequence Parallelism
Bingya Wu
Shengyu Liu
Yinmin Zhong
Peng Sun
Xuanzhe Liu
Xin Jin
RALM
43
53
0
15 Apr 2024
Saturn: An Optimized Data System for Large Model Deep Learning Workloads
Kabir Nagrecha
Arun Kumar
16
6
0
03 Sep 2023
Consistent Lock-free Parallel Stochastic Gradient Descent for Fast and Stable Convergence
Karl Bäckström
Ivan Walulya
Marina Papatriantafilou
P. Tsigas
26
5
0
17 Feb 2021
LightXML: Transformer with Dynamic Negative Sampling for High-Performance Extreme Multi-label Text Classification
Ting Jiang
Deqing Wang
Leilei Sun
Huayi Yang
Zhengyang Zhao
Fuzhen Zhuang
VLM
122
136
0
09 Jan 2021
1