
A Theory on Adam Instability in Large-Scale Machine Learning
Igor Molybog
Sharan Narang
Papers citing "A Theory on Adam Instability in Large-Scale Machine Learning"
28 / 28 papers shown
Title |
---|
XGen-7B Technical Report. Erik Nijkamp, Tian Xie, Hiroaki Hayashi, Bo Pang, Congying Xia, ... Chien-Sheng Wu, Silvio Savarese, Yingbo Zhou, Shafiq Joty, Caiming Xiong |