Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2408.12857
Cited By
Memory-Efficient LLM Training with Online Subspace Descent
23 August 2024
Kaizhao Liang
Bo Liu
Lizhang Chen
Qiang Liu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Memory-Efficient LLM Training with Online Subspace Descent"
6 / 6 papers shown
Title
PLUMAGE: Probabilistic Low rank Unbiased Min Variance Gradient Estimator for Efficient Large Model Training
Matan Haroush
Daniel Soudry
80
0
0
23 May 2025
Cautious Optimizers: Improving Training with One Line of Code
Kaizhao Liang
Lizhang Chen
B. Liu
Qiang Liu
ODL
141
8
0
25 Nov 2024
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Jiawei Zhao
Zhenyu Zhang
Beidi Chen
Zhangyang Wang
A. Anandkumar
Yuandong Tian
75
205
0
06 Mar 2024
Symbolic Discovery of Optimization Algorithms
Xiangning Chen
Chen Liang
Da Huang
Esteban Real
Kaiyuan Wang
...
Xuanyi Dong
Thang Luong
Cho-Jui Hsieh
Yifeng Lu
Quoc V. Le
107
367
0
13 Feb 2023
8-bit Optimizers via Block-wise Quantization
Tim Dettmers
M. Lewis
Sam Shleifer
Luke Zettlemoyer
MQ
96
286
0
06 Oct 2021
LoRA: Low-Rank Adaptation of Large Language Models
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
230
10,099
0
17 Jun 2021
1