Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2312.14567
Cited By
Accelerated Convergence of Stochastic Heavy Ball Method under Anisotropic Gradient Noise
22 December 2023
Rui Pan
Yuxing Liu
Xiaoyu Wang
Tong Zhang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Accelerated Convergence of Stochastic Heavy Ball Method under Anisotropic Gradient Noise"
5 / 5 papers shown
Title
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
330
11,953
0
04 Mar 2022
Last Iterate Risk Bounds of SGD with Decaying Stepsize for Overparameterized Linear Regression
Jingfeng Wu
Difan Zou
Vladimir Braverman
Quanquan Gu
Sham Kakade
104
20
0
12 Oct 2021
DecentLaM: Decentralized Momentum SGD for Large-batch Deep Training
Kun Yuan
Yiming Chen
Xinmeng Huang
Yingya Zhang
Pan Pan
Yinghui Xu
W. Yin
MoE
55
61
0
24 Apr 2021
Densely Connected Convolutional Networks
Gao Huang
Zhuang Liu
L. V. D. van der Maaten
Kilian Q. Weinberger
PINN
3DV
312
36,371
0
25 Aug 2016
Stochastic Gradient Descent for Non-smooth Optimization: Convergence Results and Optimal Averaging Schemes
Ohad Shamir
Tong Zhang
101
570
0
08 Dec 2012
1