Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.10499
Cited By
Optimal Subarchitecture Extraction For BERT
20 October 2020
Adrian de Wynter
Daniel J. Perry
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Optimal Subarchitecture Extraction For BERT"
4 / 4 papers shown
Title
Curriculum learning for language modeling
Daniel Fernando Campos
16
32
0
04 Aug 2021
How to Train BERT with an Academic Budget
Peter Izsak
Moshe Berchansky
Omer Levy
23
113
0
15 Apr 2021
BERT-of-Theseus: Compressing BERT by Progressive Module Replacing
Canwen Xu
Wangchunshu Zhou
Tao Ge
Furu Wei
Ming Zhou
223
197
0
07 Feb 2020
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
299
6,984
0
20 Apr 2018
1