
Demystifying BERT: Implications for Accelerator Design
Suchita Pati
Shaizeen Aga
Nuwan Jayasena
Matthew D. Sinclair
Papers citing "Demystifying BERT: Implications for Accelerator Design"
39 / 39 papers shown
Title |
---|
![]() SeqPoint: Identifying Representative Iterations of Sequence-based Neural
Networks Suchita Pati Shaizeen Aga Matthew D. Sinclair Nuwan Jayasena |
![]() MLPerf Training Benchmark Arya D. McCarthy Christine Cheng Cody Coleman Greg Diamos Paulius Micikevicius ...Carole-Jean Wu Lingjie Xu Masafumi Yamazaki C. Young Matei A. Zaharia |
![]() Large Batch Optimization for Deep Learning: Training BERT in 76 minutes Yang You Jing Li Sashank J. Reddi Jonathan Hseu Sanjiv Kumar Srinadh Bhojanapalli Xiaodan Song J. Demmel Kurt Keutzer Cho-Jui Hsieh |