
v1v2 (latest)
On the validity of pre-trained transformers for natural language processing in the software engineering domain
Papers citing "On the validity of pre-trained transformers for natural language processing in the software engineering domain"
30 / 30 papers shown
Title |
---|
![]() Large Batch Optimization for Deep Learning: Training BERT in 76 minutes Yang You Jing Li Sashank J. Reddi Jonathan Hseu Sanjiv Kumar Srinadh Bhojanapalli Xiaodan Song J. Demmel Kurt Keutzer Cho-Jui Hsieh |
![]() Mixed Precision Training Paulius Micikevicius Sharan Narang Jonah Alben G. Diamos Erich Elsen ...Boris Ginsburg Michael Houston Oleksii Kuchaiev Ganesh Venkatesh Hao Wu |