2311.04799
DACBERT: Leveraging Dependency Agreement for Cost-Efficient Bert Pretraining
8 November 2023
Martin Kuo
Jianyi Zhang
Yiran Chen
Papers citing "DACBERT: Leveraging Dependency Agreement for Cost-Efficient Bert Pretraining"
3 / 3 papers shown
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi, M. Patwary, Raul Puri, P. LeGresley, Jared Casper, Bryan Catanzaro
MoE | 245 | 1,826 | 0 | 17 Sep 2019
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Wang, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, Samuel R. Bowman
ELM | 297 | 6,984 | 0 | 20 Apr 2018
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu, M. Schuster, Z. Chen, Quoc V. Le, Mohammad Norouzi, ..., Alex Rudnick, Oriol Vinyals, G. Corrado, Macduff Hughes, J. Dean
AIMat | 716 | 6,746 | 0 | 26 Sep 2016