
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"
14 / 414 papers shown
Title |
---|
![]() Mixed Precision Training Paulius Micikevicius Sharan Narang Jonah Alben G. Diamos Erich Elsen ...Boris Ginsburg Michael Houston Oleksii Kuchaiev Ganesh Venkatesh Hao Wu |