Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.05033
Cited By
Gradient Descent on Logistic Regression with Non-Separable Data and Large Step Sizes
7 June 2024
Si Yi Meng
Antonio Orvieto
Daniel Yiming Cao
Christopher De Sa
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Gradient Descent on Logistic Regression with Non-Separable Data and Large Step Sizes"
8 / 8 papers shown
Title
Directional Smoothness and Gradient Methods: Convergence and Adaptivity
Aaron Mishkin
Ahmed Khaled
Yuanhao Wang
Aaron Defazio
Robert Mansel Gower
95
9
0
06 Mar 2024
Good regularity creates large learning rate implicit biases: edge of stability, balancing, and catapult
Yuqing Wang
Zhenghao Xu
Tuo Zhao
Molei Tao
72
11
0
26 Oct 2023
Trajectory Alignment: Understanding the Edge of Stability Phenomenon via Bifurcation Theory
Minhak Song
Chulhee Yun
74
11
1
09 Jul 2023
Catapults in SGD: spikes in the training loss and their impact on generalization through feature learning
Libin Zhu
Chaoyue Liu
Adityanarayanan Radhakrishnan
M. Belkin
102
15
0
07 Jun 2023
Learning threshold neurons via the "edge of stability"
Kwangjun Ahn
Sébastien Bubeck
Sinho Chewi
Y. Lee
Felipe Suarez
Yi Zhang
MLT
82
41
0
14 Dec 2022
Understanding Edge-of-Stability Training Dynamics with a Minimalist Example
Xingyu Zhu
Zixuan Wang
Xiang Wang
Mo Zhou
Rong Ge
109
39
0
07 Oct 2022
Understanding the unstable convergence of gradient descent
Kwangjun Ahn
J.N. Zhang
S. Sra
74
63
0
03 Apr 2022
Squareplus: A Softplus-Like Algebraic Rectifier
Jonathan T. Barron
63
20
0
22 Dec 2021
1