Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2212.05998
Cited By
Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization
12 December 2022
A. Jafari
I. Kobyzev
Mehdi Rezagholizadeh
Pascal Poupart
A. Ghodsi
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization"
2 / 2 papers shown
Title
GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model
Shicheng Tan
Weng Lam Tam
Yuanchun Wang
Wenwen Gong
Yang Yang
...
Jiahao Liu
Jingang Wang
Shuo Zhao
Peng-Zhen Zhang
Jie Tang
ALM
MoE
33
11
0
11 Jun 2023
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
299
6,984
0
20 Apr 2018
1