Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.08009
Cited By
Iterate averaging as regularization for stochastic gradient descent
22 February 2018
Gergely Neu
Lorenzo Rosasco
MoMe
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Iterate averaging as regularization for stochastic gradient descent"
11 / 11 papers shown
Title
Design Considerations in Offline Preference-based RL
Alekh Agarwal
Christoph Dann
T. V. Marinov
OffRL
56
0
0
08 Feb 2025
Generalized equivalences between subsampling and ridge regularization
Pratik V. Patil
Jin-Hong Du
34
5
0
29 May 2023
Average of Pruning: Improving Performance and Stability of Out-of-Distribution Detection
Zhen Cheng
Fei Zhu
Xu-Yao Zhang
Cheng-Lin Liu
MoMe
OODD
40
11
0
02 Mar 2023
Stop Wasting My Time! Saving Days of ImageNet and BERT Training with Latest Weight Averaging
Jean Kaddour
MoMe
3DH
24
39
0
29 Sep 2022
Stochastic Weight Averaging Revisited
Hao Guo
Jiyong Jin
B. Liu
35
29
0
03 Jan 2022
Scalable Bayesian Approach for the DINA Q-matrix Estimation Combining Stochastic Optimization and Variational Inference
Motonori Oka
Kensuke Okada
18
6
0
20 May 2021
RATT: Leveraging Unlabeled Data to Guarantee Generalization
Saurabh Garg
Sivaraman Balakrishnan
J. Zico Kolter
Zachary Chase Lipton
30
30
0
01 May 2021
Obtaining Adjustable Regularization for Free via Iterate Averaging
Jingfeng Wu
Vladimir Braverman
Lin F. Yang
30
2
0
15 Aug 2020
Detached Error Feedback for Distributed SGD with Random Sparsification
An Xu
Heng-Chiao Huang
39
9
0
11 Apr 2020
The Step Decay Schedule: A Near Optimal, Geometrically Decaying Learning Rate Procedure For Least Squares
Rong Ge
Sham Kakade
Rahul Kidambi
Praneeth Netrapalli
34
149
0
29 Apr 2019
Stochastic Gradient Descent for Non-smooth Optimization: Convergence Results and Optimal Averaging Schemes
Ohad Shamir
Tong Zhang
101
571
0
08 Dec 2012
1