
Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training Benchmark
Xiaodan Liang
Wei Zhang
Hang Xu
Papers citing "Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training Benchmark"
48 / 48 papers shown
Title |
---|
Large Batch Optimization for Deep Learning: Training BERT in 76 minutes. Yang You, Jing Li, Sashank J. Reddi, Jonathan Hseu, Sanjiv Kumar, Srinadh Bhojanapalli, Xiaodan Song, J. Demmel, Kurt Keutzer, Cho-Jui Hsieh |