Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1811.01088
Cited By
Sentence Encoders on STILTs: Supplementary Training on Intermediate Labeled-data Tasks
2 November 2018
Jason Phang
Thibault Févry
Samuel R. Bowman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Sentence Encoders on STILTs: Supplementary Training on Intermediate Labeled-data Tasks"
22 / 122 papers shown
Title
AdapterFusion: Non-Destructive Task Composition for Transfer Learning
Jonas Pfeiffer
Aishwarya Kamath
Andreas Rucklé
Kyunghyun Cho
Iryna Gurevych
CLL
MoMe
47
819
0
01 May 2020
Pre-training Is (Almost) All You Need: An Application to Commonsense Reasoning
Alexandre Tamborrino
Nicola Pellicanò
B. Pannier
Pascal Voitot
Louise Naudin
LRM
22
62
0
29 Apr 2020
Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting
Sanyuan Chen
Yutai Hou
Yiming Cui
Wanxiang Che
Ting Liu
Xiangzhan Yu
KELM
CLL
21
213
0
27 Apr 2020
Train No Evil: Selective Masking for Task-Guided Pre-Training
Yuxian Gu
Zhengyan Zhang
Xiaozhi Wang
Zhiyuan Liu
Maosong Sun
32
59
0
21 Apr 2020
Beyond Fine-tuning: Few-Sample Sentence Embedding Transfer
Siddhant Garg
Rohit Kumar Sharma
Yingyu Liang
36
4
0
10 Apr 2020
Pre-trained Models for Natural Language Processing: A Survey
Xipeng Qiu
Tianxiang Sun
Yige Xu
Yunfan Shao
Ning Dai
Xuanjing Huang
LM&MA
VLM
243
1,452
0
18 Mar 2020
HUBERT Untangles BERT to Improve Transfer across NLP Tasks
M. Moradshahi
Hamid Palangi
M. Lam
P. Smolensky
Jianfeng Gao
26
16
0
25 Oct 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
129
19,529
0
23 Oct 2019
MMM: Multi-stage Multi-task Learning for Multi-choice Reading Comprehension
Di Jin
Shuyang Gao
Jiun-Yu Kao
Tagyoung Chung
Dilek Z. Hakkani-Tür
29
69
0
01 Oct 2019
Mixout: Effective Regularization to Finetune Large-scale Pretrained Language Models
Cheolhyoung Lee
Kyunghyun Cho
Wanmo Kang
MoE
249
208
0
25 Sep 2019
Supervised Multimodal Bitransformers for Classifying Images and Text
Douwe Kiela
Suvrat Bhooshan
Hamed Firooz
Ethan Perez
Davide Testuggine
59
242
0
06 Sep 2019
Task Selection Policies for Multitask Learning
John Glover
Chris Hokamp
OffRL
29
7
0
14 Jul 2019
BAM! Born-Again Multi-Task Networks for Natural Language Understanding
Kevin Clark
Minh-Thang Luong
Urvashi Khandelwal
Christopher D. Manning
Quoc V. Le
24
228
0
10 Jul 2019
Transfer Learning for Risk Classification of Social Media Posts: Model Evaluation Study
Derek Howard
M. Maslej
Justin Lee
Jacob Ritchie
G. Woollard
L. French
AI4MH
23
30
0
04 Jul 2019
MultiQA: An Empirical Investigation of Generalization and Transfer in Reading Comprehension
Alon Talmor
Jonathan Berant
20
172
0
31 May 2019
Human vs. Muppet: A Conservative Estimate of Human Performance on the GLUE Benchmark
Nikita Nangia
Samuel R. Bowman
ELM
ALM
34
75
0
24 May 2019
BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions
Christopher Clark
Kenton Lee
Ming-Wei Chang
Tom Kwiatkowski
Michael Collins
Kristina Toutanova
96
1,413
0
24 May 2019
Story Ending Prediction by Transferable BERT
Zhongyang Li
Xiao Ding
Ting Liu
34
52
0
17 May 2019
Multi-Task Deep Neural Networks for Natural Language Understanding
Xiaodong Liu
Pengcheng He
Weizhu Chen
Jianfeng Gao
AI4CE
24
1,261
0
31 Jan 2019
Linguistic Analysis of Pretrained Sentence Encoders with Acceptability Judgments
Alex Warstadt
Samuel R. Bowman
22
23
0
11 Jan 2019
What you can cram into a single vector: Probing sentence embeddings for linguistic properties
Alexis Conneau
Germán Kruszewski
Guillaume Lample
Loïc Barrault
Marco Baroni
201
883
0
03 May 2018
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
299
6,996
0
20 Apr 2018
Previous
1
2
3