Sentence Encoders on STILTs: Supplementary Training on Intermediate Labeled-data Tasks

2 November 2018

Papers citing "Sentence Encoders on STILTs: Supplementary Training on Intermediate Labeled-data Tasks"

22 / 122 papers shown

Title
AdapterFusion: Non-Destructive Task Composition for Transfer Learning Jonas Pfeiffer Aishwarya Kamath Andreas Rucklé Kyunghyun Cho Iryna Gurevych CLL MoMe 47 819 0 01 May 2020
Pre-training Is (Almost) All You Need: An Application to Commonsense Reasoning Alexandre Tamborrino Nicola Pellicanò B. Pannier Pascal Voitot Louise Naudin LRM 22 62 0 29 Apr 2020
Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting Sanyuan Chen Yutai Hou Yiming Cui Wanxiang Che Ting Liu Xiangzhan Yu KELM CLL 21 213 0 27 Apr 2020
Train No Evil: Selective Masking for Task-Guided Pre-Training Yuxian Gu Zhengyan Zhang Xiaozhi Wang Zhiyuan Liu Maosong Sun 32 59 0 21 Apr 2020
Beyond Fine-tuning: Few-Sample Sentence Embedding Transfer Siddhant Garg Rohit Kumar Sharma Yingyu Liang 36 4 0 10 Apr 2020
Pre-trained Models for Natural Language Processing: A Survey Xipeng Qiu Tianxiang Sun Yige Xu Yunfan Shao Ning Dai Xuanjing Huang LM&MA VLM 243 1,452 0 18 Mar 2020
HUBERT Untangles BERT to Improve Transfer across NLP Tasks M. Moradshahi Hamid Palangi M. Lam P. Smolensky Jianfeng Gao 26 16 0 25 Oct 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer Colin Raffel Noam M. Shazeer Adam Roberts Katherine Lee Sharan Narang Michael Matena Yanqi Zhou Wei Li Peter J. Liu AIMat 129 19,529 0 23 Oct 2019
MMM: Multi-stage Multi-task Learning for Multi-choice Reading Comprehension Di Jin Shuyang Gao Jiun-Yu Kao Tagyoung Chung Dilek Z. Hakkani-Tür 29 69 0 01 Oct 2019
Mixout: Effective Regularization to Finetune Large-scale Pretrained Language Models Cheolhyoung Lee Kyunghyun Cho Wanmo Kang MoE 249 208 0 25 Sep 2019
Supervised Multimodal Bitransformers for Classifying Images and Text Douwe Kiela Suvrat Bhooshan Hamed Firooz Ethan Perez Davide Testuggine 59 242 0 06 Sep 2019
Task Selection Policies for Multitask Learning John Glover Chris Hokamp OffRL 29 7 0 14 Jul 2019
BAM! Born-Again Multi-Task Networks for Natural Language Understanding Kevin Clark Minh-Thang Luong Urvashi Khandelwal Christopher D. Manning Quoc V. Le 24 228 0 10 Jul 2019
Transfer Learning for Risk Classification of Social Media Posts: Model Evaluation Study Derek Howard M. Maslej Justin Lee Jacob Ritchie G. Woollard L. French AI4MH 23 30 0 04 Jul 2019
MultiQA: An Empirical Investigation of Generalization and Transfer in Reading Comprehension Alon Talmor Jonathan Berant 20 172 0 31 May 2019
Human vs. Muppet: A Conservative Estimate of Human Performance on the GLUE Benchmark Nikita Nangia Samuel R. Bowman ELM ALM 34 75 0 24 May 2019
BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions Christopher Clark Kenton Lee Ming-Wei Chang Tom Kwiatkowski Michael Collins Kristina Toutanova 96 1,413 0 24 May 2019
Story Ending Prediction by Transferable BERT Zhongyang Li Xiao Ding Ting Liu 34 52 0 17 May 2019
Multi-Task Deep Neural Networks for Natural Language Understanding Xiaodong Liu Pengcheng He Weizhu Chen Jianfeng Gao AI4CE 24 1,261 0 31 Jan 2019
Linguistic Analysis of Pretrained Sentence Encoders with Acceptability Judgments Alex Warstadt Samuel R. Bowman 22 23 0 11 Jan 2019
What you can cram into a single vector: Probing sentence embeddings for linguistic properties Alexis Conneau Germán Kruszewski Guillaume Lample Loïc Barrault Marco Baroni 201 883 0 03 May 2018
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding Alex Jinpeng Wang Amanpreet Singh Julian Michael Felix Hill Omer Levy Samuel R. Bowman ELM 299 6,996 0 20 Apr 2018