Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,843 papers shown
Title
Beneath the Tip of the Iceberg: Current Challenges and New Directions in Sentiment Analysis Research
Soujanya Poria
Devamanyu Hazarika
Navonil Majumder
Rada Mihalcea
135
221
0
01 May 2020
HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training
Linjie Li
Yen-Chun Chen
Yu Cheng
Zhe Gan
Licheng Yu
Jingjing Liu
MLLM
VLM
OffRL
AI4TS
133
506
0
01 May 2020
Progressively Pretrained Dense Corpus Index for Open-Domain Question Answering
Wenhan Xiong
Hong Wang
Wenjie Wang
RALM
86
17
0
30 Apr 2020
TLDR: Extreme Summarization of Scientific Documents
Isabel Cachola
Kyle Lo
Arman Cohan
Daniel S. Weld
139
217
0
30 Apr 2020
A Matter of Framing: The Impact of Linguistic Formalism on Probing Results
Ilia Kuznetsov
Iryna Gurevych
49
26
0
30 Apr 2020
Recipes for Adapting Pre-trained Monolingual and Multilingual Models to Machine Translation
Asa Cooper Stickland
Xian Li
Marjan Ghazvininejad
101
46
0
30 Apr 2020
Few-Shot Learning for Opinion Summarization
Arthur Brazinskas
Mirella Lapata
Ivan Titov
45
2
0
30 Apr 2020
Mind Your Inflections! Improving NLP for Non-Standard Englishes with Base-Inflection Encoding
Samson Tan
Shafiq Joty
Lav Varshney
Min-Yen Kan
133
35
0
30 Apr 2020
Logic2Text: High-Fidelity Natural Language Generation from Logical Forms
Zhiyu Zoey Chen
Wenhu Chen
Hanwen Zha
Xiyou Zhou
Yunkai Zhang
Sairam Sundaresan
William Yang Wang
NAI
67
66
0
30 Apr 2020
WT5?! Training Text-to-Text Models to Explain their Predictions
Sharan Narang
Colin Raffel
Katherine Lee
Adam Roberts
Noah Fiedel
Karishma Malkan
82
201
0
30 Apr 2020
What Happens To BERT Embeddings During Fine-tuning?
Amil Merchant
Elahe Rahimtoroghi
Ellie Pavlick
Ian Tenney
110
189
0
29 Apr 2020
General Purpose Text Embeddings from Pre-trained Language Models for Scalable Inference
Jingfei Du
Myle Ott
Haoran Li
Xing Zhou
Veselin Stoyanov
AI4CE
66
10
0
29 Apr 2020
Pre-training Is (Almost) All You Need: An Application to Commonsense Reasoning
Alexandre Tamborrino
Nicola Pellicanò
B. Pannier
Pascal Voitot
Louise Naudin
LRM
81
63
0
29 Apr 2020
Multi-Task Learning for Dense Prediction Tasks: A Survey
Simon Vandenhende
Stamatios Georgoulis
Wouter Van Gansbeke
Marc Proesmans
Dengxin Dai
Luc Van Gool
CVBM
71
73
0
28 Apr 2020
ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT
Omar Khattab
Matei A. Zaharia
145
1,383
0
27 Apr 2020
Syntactic Data Augmentation Increases Robustness to Inference Heuristics
Junghyun Min
R. Thomas McCoy
Dipanjan Das
Emily Pitler
Tal Linzen
96
180
0
24 Apr 2020
Generative Data Augmentation for Commonsense Reasoning
Yiben Yang
Chaitanya Malaviya
Jared Fernandez
Swabha Swayamdipta
Ronan Le Bras
Ji-ping Wang
Chandra Bhagavatula
Yejin Choi
Doug Downey
LRM
82
90
0
24 Apr 2020
Rapidly Bootstrapping a Question Answering Dataset for COVID-19
Raphael Tang
Rodrigo Nogueira
Edwin Zhang
Nikhil Gupta
P. Cẩm
Kyunghyun Cho
Jimmy J. Lin
62
72
0
23 Apr 2020
A Review of Winograd Schema Challenge Datasets and Approaches
Vid Kocijan
Thomas Lukasiewicz
E. Davis
G. Marcus
L. Morgenstern
72
44
0
23 Apr 2020
Don't Stop Pretraining: Adapt Language Models to Domains and Tasks
Suchin Gururangan
Ana Marasović
Swabha Swayamdipta
Kyle Lo
Iz Beltagy
Doug Downey
Noah A. Smith
VLM
AI4CE
CLL
205
2,450
0
23 Apr 2020
Syntactic Structure from Deep Learning
Tal Linzen
Marco Baroni
NAI
93
184
0
22 Apr 2020
CORD-19: The COVID-19 Open Research Dataset
Lucy Lu Wang
Kyle Lo
Yoganand Chandrasekhar
Russell Reas
Jiangjiang Yang
...
Boya Xie
Douglas A. Raymond
Daniel S. Weld
Oren Etzioni
Sebastian Kohlmeier
150
812
0
22 Apr 2020
MT-Clinical BERT: Scaling Clinical Information Extraction with Multitask Learning
Andriy Mulyar
Bridget T. McInnes
71
56
0
21 Apr 2020
MPNet: Masked and Permuted Pre-training for Language Understanding
Kaitao Song
Xu Tan
Tao Qin
Jianfeng Lu
Tie-Yan Liu
111
1,142
0
20 Apr 2020
Fine-tuning Multi-hop Question Answering with Hierarchical Graph Network
Guanming Xiong
128
0
0
20 Apr 2020
The Cost of Training NLP Models: A Concise Overview
Or Sharir
Barak Peleg
Y. Shoham
110
214
0
19 Apr 2020
ETC: Encoding Long and Structured Inputs in Transformers
Joshua Ainslie
Santiago Ontanon
Chris Alberti
Vaclav Cvicek
Zachary Kenneth Fisher
Philip Pham
Anirudh Ravula
Sumit Sanghai
Qifan Wang
Li Yang
75
55
0
17 Apr 2020
Can You Put it All Together: Evaluating Conversational Agents' Ability to Blend Skills
Eric Michael Smith
Mary Williamson
Kurt Shuster
Jason Weston
Y-Lan Boureau
93
226
0
17 Apr 2020
Understanding the Difficulty of Training Transformers
Liyuan Liu
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
Jiawei Han
AI4CE
76
258
0
17 Apr 2020
The Right Tool for the Job: Matching Model and Instance Complexities
Roy Schwartz
Gabriel Stanovsky
Swabha Swayamdipta
Jesse Dodge
Noah A. Smith
144
170
0
16 Apr 2020
Entities as Experts: Sparse Memory Access with Entity Supervision
Thibault Févry
Livio Baldini Soares
Nicholas FitzGerald
Eunsol Choi
Tom Kwiatkowski
RALM
120
155
0
15 Apr 2020
TOD-BERT: Pre-trained Natural Language Understanding for Task-Oriented Dialogue
Chien-Sheng Wu
Guosheng Lin
R. Socher
Caiming Xiong
104
324
0
15 Apr 2020
CrisisBench: Benchmarking Crisis-related Social Media Datasets for Humanitarian Information Processing
Firoj Alam
Hassan Sajjad
Muhammad Imran
Ferda Ofli
24
14
0
14 Apr 2020
CLUE: A Chinese Language Understanding Evaluation Benchmark
Liang Xu
Hai Hu
Xuanwei Zhang
Lu Li
Chenjie Cao
...
Cong Yue
Xinrui Zhang
Zhen-Yi Yang
Kyle Richardson
Zhenzhong Lan
ELM
110
388
0
13 Apr 2020
XtremeDistil: Multi-stage Distillation for Massive Multilingual Models
Subhabrata Mukherjee
Ahmed Hassan Awadallah
80
59
0
12 Apr 2020
Explaining Question Answering Models through Text Generation
Veronica Latcinnik
Jonathan Berant
LRM
96
51
0
12 Apr 2020
Unsupervised Commonsense Question Answering with Self-Talk
Vered Shwartz
Peter West
Ronan Le Bras
Chandra Bhagavatula
Yejin Choi
ReLM
SSL
AI4MH
LRM
72
263
0
11 Apr 2020
Longformer: The Long-Document Transformer
Iz Beltagy
Matthew E. Peters
Arman Cohan
RALM
VLM
216
4,109
0
10 Apr 2020
Rapidly Deploying a Neural Search Engine for the COVID-19 Open Research Dataset: Preliminary Thoughts and Lessons Learned
Edwin Zhang
Nikhil Gupta
Rodrigo Nogueira
Kyunghyun Cho
Jimmy J. Lin
55
58
0
10 Apr 2020
Dense Passage Retrieval for Open-Domain Question Answering
Vladimir Karpukhin
Barlas Oğuz
Sewon Min
Patrick Lewis
Ledell Yu Wu
Sergey Edunov
Danqi Chen
Wen-tau Yih
RALM
239
3,810
0
10 Apr 2020
Improving Readability for Automatic Speech Recognition Transcription
Junwei Liao
Sefik Emre Eskimez
Liyang Lu
Yu Shi
Ming Gong
Linjun Shou
Hong Qu
Michael Zeng
67
56
0
09 Apr 2020
Rapformer: Conditional Rap Lyrics Generation with Denoising Autoencoders
Nikola I. Nikolov
Eric Malmi
Curtis G. Northcutt
Loreto Parisi
AI4CE
57
6
0
08 Apr 2020
Exploring Versatile Generative Language Model Via Parameter-Efficient Transfer Learning
Zhaojiang Lin
Andrea Madotto
Pascale Fung
105
163
0
08 Apr 2020
Downstream Model Design of Pre-trained Language Model for Relation Extraction Task
Cheng-rong Li
Ye Tian
72
36
0
08 Apr 2020
Byte Pair Encoding is Suboptimal for Language Model Pretraining
Kaj Bostrom
Greg Durrett
69
214
0
07 Apr 2020
"You are grounded!": Latent Name Artifacts in Pre-trained Language Models
Vered Shwartz
Rachel Rudinger
Oyvind Tafjord
53
51
0
06 Apr 2020
Deep Learning Based Text Classification: A Comprehensive Review
Shervin Minaee
Nal Kalchbrenner
Min Zhang
Narjes Nikzad
M. Asgari-Chenaghlu
Jianfeng Gao
AILaw
VLM
AI4TS
116
1,115
0
06 Apr 2020
FastBERT: a Self-distilling BERT with Adaptive Inference Time
Weijie Liu
Peng Zhou
Zhe Zhao
Zhiruo Wang
Haotang Deng
Qi Ju
97
361
0
05 Apr 2020
Unsupervised Domain Clusters in Pretrained Language Models
Roee Aharoni
Yoav Goldberg
101
252
0
05 Apr 2020
Optimus: Organizing Sentences via Pre-trained Modeling of a Latent Space
Chunyuan Li
Xiang Gao
Yuan Li
Baolin Peng
Xiujun Li
Yizhe Zhang
Jianfeng Gao
SSL
DRL
86
182
0
05 Apr 2020
Previous
1
2
3
...
194
195
196
197
Next