Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,866 papers shown
Title
Scientific Claim Verification with VERT5ERINI
Ronak Pradeep
Xueguang Ma
Rodrigo Nogueira
Jimmy J. Lin
75
63
0
22 Oct 2020
Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data
Lingkai Kong
Haoming Jiang
Yuchen Zhuang
Jie Lyu
T. Zhao
Chao Zhang
OODD
91
26
0
22 Oct 2020
DuoRAT: Towards Simpler Text-to-SQL Models
Torsten Scholak
Raymond Li
Dzmitry Bahdanau
H. D. Vries
C. Pal
AI4TS
100
28
0
21 Oct 2020
Open-Domain Frame Semantic Parsing Using Transformers
Aditya Kalyanpur
Or Biran
Tom Breloff
Jennifer Chu-Carroll
Ariel Diertani
Owen Rambow
Mark Sammons
68
21
0
21 Oct 2020
Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition
Yangyang Shi
Yongqiang Wang
Chunyang Wu
Ching-Feng Yeh
Julian Chan
Frank Zhang
Duc Le
M. Seltzer
195
172
0
21 Oct 2020
An Empirical Investigation of Contextualized Number Prediction
Daniel M. Spokoyny
Taylor Berg-Kirkpatrick
AI4TS
80
38
0
20 Oct 2020
SKATE: A Natural Language Interface for Encoding Structured Knowledge
C. McFate
Aditya Kalyanpur
D. Ferrucci
Andrea Bradshaw
Ariel Diertani
David O. Melville
Lori Moon
34
0
0
20 Oct 2020
Optimal Subarchitecture Extraction For BERT
Adrian de Wynter
Daniel J. Perry
MQ
96
18
0
20 Oct 2020
Local Knowledge Powered Conversational Agents
Sashank Santhanam
Ming-Yu Liu
Raul Puri
Mohammad Shoeybi
M. Patwary
Bryan Catanzaro
93
4
0
20 Oct 2020
Neural Language Modeling for Contextualized Temporal Graph Generation
Aman Madaan
Yiming Yang
104
20
0
20 Oct 2020
Anti-Distillation: Improving reproducibility of deep networks
G. Shamir
Lorenzo Coviello
106
19
0
19 Oct 2020
ChemBERTa: Large-Scale Self-Supervised Pretraining for Molecular Property Prediction
Seyone Chithrananda
Gabriel Grand
Bharath Ramsundar
AI4CE
128
415
0
19 Oct 2020
Mixed-Lingual Pre-training for Cross-lingual Summarization
Ruochen Xu
Chenguang Zhu
Yu Shi
Michael Zeng
Xuedong Huang
57
26
0
18 Oct 2020
Knowledge-Grounded Dialogue Generation with Pre-trained Language Models
Xueliang Zhao
Wei Wu
Can Xu
Chongyang Tao
Dongyan Zhao
Rui Yan
260
193
0
17 Oct 2020
CoDA: Contrast-enhanced and Diversity-promoting Data Augmentation for Natural Language Understanding
Yanru Qu
Dinghan Shen
Yelong Shen
Sandra Sajeev
Jiawei Han
Weizhu Chen
204
69
0
16 Oct 2020
Delaying Interaction Layers in Transformer-based Encoders for Efficient Open Domain Question Answering
W. Siblini
Mohamed Challal
Charlotte Pasqual
56
3
0
16 Oct 2020
The Deep Bootstrap Framework: Good Online Learners are Good Offline Generalizers
Preetum Nakkiran
Behnam Neyshabur
Hanie Sedghi
OffRL
97
11
0
16 Oct 2020
Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self-Training Approach
Yue Yu
Simiao Zuo
Haoming Jiang
Wendi Ren
T. Zhao
Chao Zhang
AI4MH
73
133
0
15 Oct 2020
Re-evaluating Evaluation in Text Summarization
Manik Bhandari
Pranav Narayan Gour
A. Ashfaq
Pengfei Liu
Graham Neubig
178
178
0
14 Oct 2020
Length-Adaptive Transformer: Train Once with Length Drop, Use Anytime with Search
Gyuwan Kim
Kyunghyun Cho
94
98
0
14 Oct 2020
Neural Databases
James Thorne
Majid Yazdani
Marzieh Saeidi
Fabrizio Silvestri
Sebastian Riedel
A. Halevy
NAI
96
9
0
14 Oct 2020
Vokenization: Improving Language Understanding with Contextualized, Visual-Grounded Supervision
Hao Tan
Joey Tianyi Zhou
CLIP
89
121
0
14 Oct 2020
With Little Power Comes Great Responsibility
Dallas Card
Peter Henderson
Urvashi Khandelwal
Robin Jia
Kyle Mahowald
Dan Jurafsky
277
119
0
13 Oct 2020
Pretrained Transformers for Text Ranking: BERT and Beyond
Jimmy J. Lin
Rodrigo Nogueira
Andrew Yates
VLM
393
628
0
13 Oct 2020
BioMegatron: Larger Biomedical Domain Language Model
Hoo-Chang Shin
Yang Zhang
Evelina Bakhturina
Raul Puri
M. Patwary
Mohammad Shoeybi
Raghav Mani
AI4CE
63
148
0
12 Oct 2020
Measuring and Reducing Gendered Correlations in Pre-trained Models
Kellie Webster
Xuezhi Wang
Ian Tenney
Alex Beutel
Emily Pitler
Ellie Pavlick
Jilin Chen
Ed Chi
Slav Petrov
FaML
97
260
0
12 Oct 2020
End-to-End Synthetic Data Generation for Domain Adaptation of Question Answering Systems
Siamak Shakeri
Cicero Nogueira dos Santos
He Zhu
Patrick Ng
Feng Nan
Zhiguo Wang
Ramesh Nallapati
Bing Xiang
OOD
82
104
0
12 Oct 2020
Chatbot Interaction with Artificial Intelligence: Human Data Augmentation with T5 and Language Transformer Ensemble for Text Classification
Jordan J. Bird
Anikó Ekárt
Diego Resende Faria
61
60
0
12 Oct 2020
SLEDGE-Z: A Zero-Shot Baseline for COVID-19 Literature Search
Sean MacAvaney
Arman Cohan
Nazli Goharian
66
21
0
12 Oct 2020
COMET-ATOMIC 2020: On Symbolic and Neural Commonsense Knowledge Graphs
Jena D. Hwang
Chandra Bhagavatula
Ronan Le Bras
Jeff Da
Keisuke Sakaguchi
Antoine Bosselut
Yejin Choi
84
415
0
12 Oct 2020
Controlled Hallucinations: Learning to Generate Faithfully from Noisy Data
Katja Filippova
68
113
0
12 Oct 2020
Probing Pretrained Language Models for Lexical Semantics
Ivan Vulić
Edoardo Ponti
Robert Litschko
Goran Glavaš
Anna Korhonen
KELM
86
246
0
12 Oct 2020
Reformulating Unsupervised Style Transfer as Paraphrase Generation
Kalpesh Krishna
John Wieting
Mohit Iyyer
85
242
0
12 Oct 2020
OCNLI: Original Chinese Natural Language Inference
Hai Hu
Kyle Richardson
Liang Xu
Lu Li
Sandra Kübler
L. Moss
95
118
0
12 Oct 2020
Learning Which Features Matter: RoBERTa Acquires a Preference for Linguistic Generalizations (Eventually)
Alex Warstadt
Yian Zhang
Haau-Sing Li
Haokun Liu
Samuel R. Bowman
SSL
AI4CE
78
21
0
11 Oct 2020
SMYRF: Efficient Attention using Asymmetric Clustering
Giannis Daras
Nikita Kitaev
Augustus Odena
A. Dimakis
106
46
0
11 Oct 2020
Towards Accurate and Reliable Energy Measurement of NLP Models
Qingqing Cao
A. Balasubramanian
Niranjan Balasubramanian
36
33
0
11 Oct 2020
What Do Position Embeddings Learn? An Empirical Study of Pre-Trained Language Model Positional Encoding
Yu-An Wang
Yun-Nung Chen
SSL
57
95
0
10 Oct 2020
Artificial Intelligence (AI) in Action: Addressing the COVID-19 Pandemic with Natural Language Processing (NLP)
Qingyu Chen
Robert Leaman
Alexis Allot
Ling Luo
Chih-Hsuan Wei
Shankai Yan
Zhiyong Lu
80
38
0
09 Oct 2020
A Survey of Knowledge-Enhanced Text Generation
Wenhao Yu
Chenguang Zhu
Zaitang Li
Zhiting Hu
Qingyun Wang
Heng Ji
Meng Jiang
132
290
0
09 Oct 2020
Style Attuned Pre-training and Parameter Efficient Fine-tuning for Spoken Language Understanding
Jin Cao
Jun Wang
Wael Hamza
Kelly Vanee
Shang-Wen Li
30
10
0
09 Oct 2020
Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?
Peter Hase
Shiyue Zhang
Harry Xie
Joey Tianyi Zhou
88
102
0
08 Oct 2020
On the importance of pre-training data volume for compact language models
Vincent Micheli
Martin d'Hoffschmidt
Franccois Fleuret
67
42
0
08 Oct 2020
TextSETTR: Few-Shot Text Style Extraction and Tunable Targeted Restyling
Parker Riley
Noah Constant
Mandy Guo
Girish Kumar
David C. Uthus
Zarana Parekh
105
54
0
08 Oct 2020
Assessing Phrasal Representation and Composition in Transformers
Lang-Chi Yu
Allyson Ettinger
CoGe
90
68
0
08 Oct 2020
Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name Recognition
Yun He
Ziwei Zhu
Yin Zhang
Qin Chen
James Caverlee
AI4MH
87
109
0
08 Oct 2020
PARADE: A New Dataset for Paraphrase Identification Requiring Computer Science Domain Knowledge
Yun He
Zhuoer Wang
Yin Zhang
Ruihong Huang
James Caverlee
46
23
0
08 Oct 2020
A Cascade Approach to Neural Abstractive Summarization with Content Selection and Fusion
Logan Lebanoff
Franck Dernoncourt
Doo Soon Kim
W. Chang
Fei Liu
CVBM
64
18
0
08 Oct 2020
A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks
Nikunj Saunshi
Sadhika Malladi
Sanjeev Arora
87
89
0
07 Oct 2020
Uncovering the Limits of Adversarial Training against Norm-Bounded Adversarial Examples
Sven Gowal
Chongli Qin
J. Uesato
Timothy A. Mann
Pushmeet Kohli
AAML
73
331
0
07 Oct 2020
Previous
1
2
3
...
190
191
192
...
196
197
198
Next