Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,870 papers shown
Title
Defending Against Backdoor Attacks in Natural Language Generation
Xiaofei Sun
Xiaoya Li
Yuxian Meng
Xiang Ao
Leilei Gan
Jiwei Li
Tianwei Zhang
AAML
SILM
103
52
0
03 Jun 2021
Reordering Examples Helps during Priming-based Few-Shot Learning
Sawan Kumar
Partha P. Talukdar
82
58
0
03 Jun 2021
Few-shot Knowledge Graph-to-Text Generation with Pretrained Language Models
Junyi Li
Tianyi Tang
Wayne Xin Zhao
Zhicheng Wei
N. Yuan
Ji-Rong Wen
79
49
0
03 Jun 2021
Can Generative Pre-trained Language Models Serve as Knowledge Bases for Closed-book QA?
Cunxiang Wang
Pai Liu
Yue Zhang
RALM
104
84
0
03 Jun 2021
Question Answering Over Temporal Knowledge Graphs
Apoorv Saxena
Soumen Chakrabarti
Partha P. Talukdar
AI4MH
118
139
0
03 Jun 2021
Ethical-Advice Taker: Do Language Models Understand Natural Language Interventions?
Jieyu Zhao
Daniel Khashabi
Tushar Khot
Ashish Sabharwal
Kai-Wei Chang
KELM
87
53
0
02 Jun 2021
multiPRover: Generating Multiple Proofs for Improved Interpretability in Rule Reasoning
Swarnadeep Saha
Prateek Yadav
Joey Tianyi Zhou
ReLM
LRM
92
26
0
02 Jun 2021
A Unified Generative Framework for Various NER Subtasks
Hang Yan
Tao Gui
Junqi Dai
Qipeng Guo
Zheng Zhang
Xipeng Qiu
92
298
0
02 Jun 2021
COM2SENSE: A Commonsense Reasoning Benchmark with Complementary Sentences
Shikhar Singh
Nuan Wen
Yu Hou
Pegah Alipoormolabashi
Te-Lin Wu
Xuezhe Ma
Nanyun Peng
LRM
98
59
0
02 Jun 2021
Answer Generation for Retrieval-based Question Answering Systems
Chao-Chun Hsu
Eric Lind
Luca Soldaini
Alessandro Moschitti
68
26
0
02 Jun 2021
Claim Matching Beyond English to Scale Global Fact-Checking
Ashkan Kazemi
Kiran Garimella
Devin Gaffney
Scott A. Hale
77
60
0
01 Jun 2021
ConvoSumm: Conversation Summarization Benchmark and Improved Abstractive Summarization with Argument Mining
Alexander R. Fabbri
Faiaz Rahman
Imad Rizvi
Borui Wang
Haoran Li
Yashar Mehdad
Dragomir R. Radev
96
65
0
01 Jun 2021
Implicit Representations of Meaning in Neural Language Models
Belinda Z. Li
Maxwell Nye
Jacob Andreas
NAI
MILM
67
177
0
01 Jun 2021
CIDER: Commonsense Inference for Dialogue Explanation and Reasoning
Deepanway Ghosal
Pengfei Hong
Siqi Shen
Navonil Majumder
Rada Mihalcea
Soujanya Poria
86
23
0
01 Jun 2021
PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World
Rowan Zellers
Ari Holtzman
Matthew E. Peters
Roozbeh Mottaghi
Aniruddha Kembhavi
Ali Farhadi
Yejin Choi
108
69
0
01 Jun 2021
HiddenCut: Simple Data Augmentation for Natural Language Understanding with Better Generalization
Jiaao Chen
Dinghan Shen
Weizhu Chen
Diyi Yang
BDL
74
48
0
31 May 2021
MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens
Jiemin Fang
Lingxi Xie
Xinggang Wang
Xiaopeng Zhang
Wenyu Liu
Qi Tian
ViT
73
77
0
31 May 2021
M6-T: Exploring Sparse Expert Models and Beyond
An Yang
Junyang Lin
Rui Men
Chang Zhou
Le Jiang
...
Dingyang Zhang
Wei Lin
Lin Qu
Jingren Zhou
Hongxia Yang
MoE
122
24
0
31 May 2021
SemEval-2021 Task 4: Reading Comprehension of Abstract Meaning
Boyuan Zheng
Xiaoyu Yang
Yu-Ping Ruan
Zhen-Hua Ling
Quan Liu
Si Wei
Xiao-Dan Zhu
ELM
44
13
0
31 May 2021
Transfer Learning for Sequence Generation: from Single-source to Multi-source
Xuancheng Huang
Jingfang Xu
Maosong Sun
Yang Liu
57
5
0
31 May 2021
On Compositional Generalization of Neural Machine Translation
Yafu Li
Yongjing Yin
Yulong Chen
Yue Zhang
233
46
0
31 May 2021
On the Interplay Between Fine-tuning and Composition in Transformers
Lang-Chi Yu
Allyson Ettinger
77
14
0
31 May 2021
LEAP: Learnable Pruning for Transformer-based Models
Z. Yao
Xiaoxia Wu
Linjian Ma
Sheng Shen
Kurt Keutzer
Michael W. Mahoney
Yuxiong He
60
7
0
30 May 2021
StyTr
2
^2
2
: Image Style Transfer with Transformers
Yingying Deng
Fan Tang
Weiming Dong
Chongyang Ma
Xingjia Pan
Lei Wang
Changsheng Xu
ViT
117
268
0
30 May 2021
Gaze Estimation using Transformer
Yihua Cheng
Feng Lu
ViT
80
94
0
30 May 2021
Towards mental time travel: a hierarchical memory for reinforcement learning agents
Andrew Kyle Lampinen
Stephanie C. Y. Chan
Andrea Banino
Felix Hill
92
47
0
28 May 2021
Knowledge Inheritance for Pre-trained Language Models
Yujia Qin
Yankai Lin
Jing Yi
Jiajie Zhang
Xu Han
...
Yusheng Su
Zhiyuan Liu
Peng Li
Maosong Sun
Jie Zhou
VLM
78
50
0
28 May 2021
Early Exiting with Ensemble Internal Classifiers
Tianxiang Sun
Yunhua Zhou
Xiangyang Liu
Xinyu Zhang
Hao Jiang
Bo Zhao
Xuanjing Huang
Xipeng Qiu
70
31
0
28 May 2021
ByT5: Towards a token-free future with pre-trained byte-to-byte models
Linting Xue
Aditya Barua
Noah Constant
Rami Al-Rfou
Sharan Narang
Mihir Kale
Adam Roberts
Colin Raffel
141
508
0
28 May 2021
SciFive: a text-to-text transformer model for biomedical literature
Long Phan
J. Anibal
H. Tran
Shaurya Chanana
Erol Bahadroglu
Alec Peltekian
G. Altan-Bonnet
MedIm
71
151
0
28 May 2021
Inspecting the concept knowledge graph encoded by modern language models
Carlos Aspillaga
Marcelo Mendoza
Alvaro Soto
72
13
0
27 May 2021
Synthetic Data Generation for Grammatical Error Correction with Tagged Corruption Models
Felix Stahlberg
Shankar Kumar
SyDa
137
97
0
27 May 2021
Contrastive Fine-tuning Improves Robustness for Neural Rankers
Xiaofei Ma
Cicero Nogueira dos Santos
Andrew O. Arnold
111
20
0
27 May 2021
Sequence Parallelism: Long Sequence Training from System Perspective
Shenggui Li
Fuzhao Xue
Chaitanya Baranwal
Yongbin Li
Yang You
92
103
0
26 May 2021
TreeBERT: A Tree-Based Pre-Trained Model for Programming Language
Xue Jiang
Zhuoran Zheng
Chen Lyu
Liang Li
Lei Lyu
85
91
0
26 May 2021
LMMS Reloaded: Transformer-based Sense Embeddings for Disambiguation and Beyond
Daniel Loureiro
A. Jorge
Jose Camacho-Collados
88
26
0
26 May 2021
Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization
Chen Liang
Simiao Zuo
Minshuo Chen
Haoming Jiang
Xiaodong Liu
Pengcheng He
T. Zhao
Weizhu Chen
59
69
0
25 May 2021
Focus Attention: Promoting Faithfulness and Diversity in Summarization
Rahul Aralikatte
Shashi Narayan
Joshua Maynez
S. Rothe
Ryan T. McDonald
112
46
0
25 May 2021
Estimating Redundancy in Clinical Text
Thomas Searle
Zina M. Ibrahim
J. Teo
Richard J. B. Dobson
67
21
0
25 May 2021
PTR: Prompt Tuning with Rules for Text Classification
Xu Han
Weilin Zhao
Ning Ding
Zhiyuan Liu
Maosong Sun
VLM
106
532
0
24 May 2021
Pretrained Language Models for Text Generation: A Survey
Junyi Li
Tianyi Tang
Wayne Xin Zhao
Ji-Rong Wen
LM&MA
VLM
SyDa
106
191
0
21 May 2021
Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking
Zhiyi Ma
Kawin Ethayarajh
Tristan Thrush
Somya Jain
Ledell Yu Wu
Robin Jia
Christopher Potts
Adina Williams
Douwe Kiela
ELM
115
59
0
21 May 2021
KLUE: Korean Language Understanding Evaluation
Sungjoon Park
Jihyung Moon
Sungdong Kim
Won Ik Cho
Jiyoon Han
...
Seonghyun Kim
Lucy Park
Alice Oh
Jung-Woo Ha
Kyunghyun Cho
ELM
VLM
121
198
0
20 May 2021
DeepDebug: Fixing Python Bugs Using Stack Traces, Backtranslation, and Code Skeletons
Dawn Drain
Colin B. Clement
Guillermo Serrato
Neel Sundaresan
71
31
0
19 May 2021
Exploring Text-to-Text Transformers for English to Hinglish Machine Translation with Synthetic Code-Mixing
Ganesh Jawahar
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
L. Lakshmanan
95
30
0
18 May 2021
CoTexT: Multi-task Learning with Code-Text Transformer
Long Phan
H. Tran
Daniel Le
Hieu Duy Nguyen
J. Anibal
Alec Peltekian
Yanfang Ye
92
136
0
18 May 2021
Relative Positional Encoding for Transformers with Linear Complexity
Antoine Liutkus
Ondřej Cífka
Shih-Lun Wu
Umut Simsekli
Yi-Hsuan Yang
Gaël Richard
84
48
0
18 May 2021
BookSum: A Collection of Datasets for Long-form Narrative Summarization
Wojciech Kry'sciñski
Nazneen Rajani
Divyansh Agarwal
Caiming Xiong
Dragomir R. Radev
RALM
116
154
0
18 May 2021
SHARE: a System for Hierarchical Assistive Recipe Editing
Shuyang Li
Yufei Li
Jianmo Ni
Julian McAuley
52
20
0
17 May 2021
Pay Attention to MLPs
Hanxiao Liu
Zihang Dai
David R. So
Quoc V. Le
AI4CE
159
669
0
17 May 2021
Previous
1
2
3
...
181
182
183
...
196
197
198
Next