Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.15850
Cited By
Stochastic Modified Equations and Dynamics of Dropout Algorithm
25 May 2023
Zhongwang Zhang
Yuqing Li
Tao Luo
Z. Xu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Stochastic Modified Equations and Dynamics of Dropout Algorithm"
4 / 4 papers shown
Title
Reasoning Bias of Next Token Prediction Training
Pengxiao Lin
Zhongwang Zhang
Zhi-Qin John Xu
LRM
94
2
0
21 Feb 2025
On the SDEs and Scaling Rules for Adaptive Gradient Algorithms
Sadhika Malladi
Kaifeng Lyu
A. Panigrahi
Sanjeev Arora
92
42
0
20 May 2022
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
308
2,892
0
15 Sep 2016
Improving neural networks by preventing co-adaptation of feature detectors
Geoffrey E. Hinton
Nitish Srivastava
A. Krizhevsky
Ilya Sutskever
Ruslan Salakhutdinov
VLM
266
7,638
0
03 Jul 2012
1