1910.10683
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,870 papers shown
Title
Deep Extrapolation for Attribute-Enhanced Generation
Alvin Chan
Ali Madani
Ben Krause
Nikhil Naik
111
26
0
07 Jul 2021
PhotoChat: A Human-Human Dialogue Dataset with Photo Sharing Behavior for Joint Image-Text Modeling
Xiaoxue Zang
Lijuan Liu
Maria Wang
Yang Song
Hao Zhang
Jindong Chen
VLM
99
60
0
06 Jul 2021
Rethinking Positional Encoding
Jianqiao Zheng
Sameera Ramasinghe
Simon Lucey
85
52
0
06 Jul 2021
FaVIQ: FAct Verification from Information-seeking Questions
Jungsoo Park
Sewon Min
Jaewoo Kang
Luke Zettlemoyer
Hannaneh Hajishirzi
HILM
77
40
0
05 Jul 2021
ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Yu Sun
Shuohuan Wang
Shikun Feng
Siyu Ding
Chao Pang
...
Ouyang Xuan
Dianhai Yu
Hao Tian
Hua Wu
Haifeng Wang
114
475
0
05 Jul 2021
Training Adaptive Computation for Open-Domain Question Answering with Computational Constraints
Yuxiang Wu
Pasquale Minervini
Pontus Stenetorp
Sebastian Riedel
55
5
0
05 Jul 2021
Sentence-level Online Handwritten Chinese Character Recognition
Yunxin Li
Qian Yang
Qingcai Chen
Lin Ma
Baotian Hu
Xiaolong Wang
Yuxin Ding
18
0
0
04 Jul 2021
Can Transformers Jump Around Right in Natural Language? Assessing Performance Transfer from SCAN
Rahma Chaabouni
Roberto Dessì
Eugene Kharitonov
86
20
0
03 Jul 2021
Solving Machine Learning Problems
Sunny Tran
P. Krishna
Ishan Pakuwal
Prabhakar Kafle
Nikhil Singh
J. Lynch
Iddo Drori
VLM
120
11
0
02 Jul 2021
CrowdSpeech and VoxDIY: Benchmark Datasets for Crowdsourced Audio Transcription
Nikita Pavlichenko
Ivan Stelmakh
Dmitry Ustalov
74
19
0
02 Jul 2021
An Investigation of the (In)effectiveness of Counterfactually Augmented Data
Nitish Joshi
He He
OODD
86
47
0
01 Jul 2021
A Primer on Pretrained Multilingual Language Models
Sumanth Doddapaneni
Gowtham Ramesh
Mitesh M. Khapra
Anoop Kunchukuttan
Pratyush Kumar
LRM
119
76
0
01 Jul 2021
Reinforcement Learning for Abstractive Question Summarization with Question-aware Semantic Rewards
S. Yadav
D. Gupta
Asma Ben Abacha
Dina Demner-Fushman
OffRL
63
34
0
01 Jul 2021
Improving Factual Consistency of Abstractive Summarization on Customer Feedback
Yang Liu
Yifei Sun
Vincent Gao
HILM
55
6
0
30 Jun 2021
XLM-E: Cross-lingual Language Model Pre-training via ELECTRA
Zewen Chi
Shaohan Huang
Li Dong
Shuming Ma
Bo Zheng
...
Payal Bajaj
Xia Song
Xian-Ling Mao
Heyan Huang
Furu Wei
111
121
0
30 Jun 2021
SCARF: Self-Supervised Contrastive Learning using Random Feature Corruption
Dara Bahri
Heinrich Jiang
Yi Tay
Donald Metzler
SSL
74
178
0
29 Jun 2021
Time-Aware Language Models as Temporal Knowledge Bases
Bhuwan Dhingra
Jeremy R. Cole
Julian Martin Eisenschlos
D. Gillick
Jacob Eisenstein
William W. Cohen
KELM
128
281
0
29 Jun 2021
Overview of BioASQ 2021: The ninth BioASQ challenge on Large-Scale Biomedical Semantic Indexing and Question Answering
A. Nentidis
K. Bougiatiotis
Carlos Rodríguez-Penagos
Anastasia Krithara
Marta Villegas
Martin Krallinger
George Giannakopoulos
68
45
0
28 Jun 2021
A Knowledge-Grounded Dialog System Based on Pre-Trained Language Models
Weijie Zhang
Jiaoxuan Chen
Haipang Wu
Sanhui Wan
Gongfeng Li
51
4
0
28 Jun 2021
Draw Me a Flower: Processing and Grounding Abstraction in Natural Language
R. Lachmy
Valentina Pyatkin
Avshalom Manevich
Reut Tsarfaty
50
19
0
27 Jun 2021
Multimodal Few-Shot Learning with Frozen Language Models
Maria Tsimpoukelli
Jacob Menick
Serkan Cabi
S. M. Ali Eslami
Oriol Vinyals
Felix Hill
MLLM
226
791
0
25 Jun 2021
Transflower: probabilistic autoregressive dance generation with multimodal attention
Guillermo Valle Pérez
G. Henter
Jonas Beskow
A. Holzapfel
Pierre-Yves Oudeyer
Simon Alexanderson
126
43
0
25 Jun 2021
XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages
Tahmid Hasan
Abhik Bhattacharjee
Md. Saiful Islam
Kazi Samin Mubasshir
Yuan-Fang Li
Yong-Bin Kang
M. Rahman
Rifat Shahriyar
104
372
0
25 Jun 2021
DeltaLM: Encoder-Decoder Pre-training for Language Generation and Translation by Augmenting Pretrained Multilingual Encoders
Shuming Ma
Li Dong
Shaohan Huang
Dongdong Zhang
Alexandre Muzio
Saksham Singhal
Hany Awadalla
Xia Song
Furu Wei
SLR
AI4CE
90
81
0
25 Jun 2021
Domain-Specific Pretraining for Vertical Search: Case Study on Biomedical Literature
Yu Wang
Jinchao Li
Tristan Naumann
Chenyan Xiong
Hao Cheng
...
Yang Qin
Eric Horvitz
Paul N. Bennett
Jianfeng Gao
Hoifung Poon
OOD
85
14
0
25 Jun 2021
Video Swin Transformer
Ze Liu
Jia Ning
Yue Cao
Yixuan Wei
Zheng Zhang
Stephen Lin
Han Hu
ViT
125
1,498
0
24 Jun 2021
Charformer: Fast Character Transformers via Gradient-based Subword Tokenization
Yi Tay
Vinh Q. Tran
Sebastian Ruder
Jai Gupta
Hyung Won Chung
Dara Bahri
Zhen Qin
Simon Baumgartner
Cong Yu
Donald Metzler
155
162
0
23 Jun 2021
Stable, Fast and Accurate: Kernelized Attention with Relative Positional Encoding
Shengjie Luo
Shanda Li
Tianle Cai
Di He
Dinglan Peng
Shuxin Zheng
Guolin Ke
Liwei Wang
Tie-Yan Liu
95
50
0
23 Jun 2021
Towards Knowledge-Grounded Counter Narrative Generation for Hate Speech
Yi-Ling Chung
Serra Sinem Tekiroğlu
Marco Guerini
72
67
0
22 Jun 2021
Learn to Resolve Conversational Dependency: A Consistency Training Framework for Conversational Question Answering
Gangwoo Kim
Hyunjae Kim
Jungsoo Park
Jaewoo Kang
103
38
0
22 Jun 2021
DocFormer: End-to-End Transformer for Document Understanding
Srikar Appalaraju
Bhavan A. Jasani
Bhargava Urala Kota
Yusheng Xie
R. Manmatha
ViT
102
281
0
22 Jun 2021
BARTScore: Evaluating Generated Text as Text Generation
Weizhe Yuan
Graham Neubig
Pengfei Liu
187
851
0
22 Jun 2021
GAIA: A Transfer Learning System of Object Detection that Fits Your Needs
Xingyuan Bu
Junran Peng
Junjie Yan
Tieniu Tan
Zhaoxiang Zhang
ObjD
VLM
144
53
0
21 Jun 2021
VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive Learning
Hao Tan
Jie Lei
Thomas Wolf
Joey Tianyi Zhou
118
66
0
21 Jun 2021
CPM-2: Large-scale Cost-effective Pre-trained Language Models
Zhengyan Zhang
Yuxian Gu
Xu Han
Shengqi Chen
Chaojun Xiao
...
Minlie Huang
Wentao Han
Yang Liu
Xiaoyan Zhu
Maosong Sun
MoE
90
88
0
20 Jun 2021
Multi-Pair Text Style Transfer on Unbalanced Data
Xing Han
J. Lundin
42
0
0
20 Jun 2021
JointGT: Graph-Text Joint Representation Learning for Text Generation from Knowledge Graphs
Pei Ke
Haozhe Ji
Yuanyuan Ran
Xin Cui
Liwei Wang
Linfeng Song
Xiaoyan Zhu
Minlie Huang
114
97
0
19 Jun 2021
Distributed Deep Learning in Open Collaborations
Michael Diskin
Alexey Bukhtiyarov
Max Ryabinin
Lucile Saulnier
Quentin Lhoest
...
Denis Mazur
Ilia Kobelev
Yacine Jernite
Thomas Wolf
Gennady Pekhimenko
FedML
129
59
0
18 Jun 2021
Large-Scale Chemical Language Representations Capture Molecular Structure and Properties
Jerret Ross
Brian M. Belgodere
Vijil Chenthamarakshan
Inkit Padhi
Youssef Mroueh
Payel Das
AI4CE
91
302
0
17 Jun 2021
Text2Event: Controllable Sequence-to-Structure Generation for End-to-end Event Extraction
Yaojie Lu
Hongyu Lin
Jin Xu
Xianpei Han
Jialong Tang
Annan Li
Le Sun
M. Liao
Shaoyi Chen
69
280
0
17 Jun 2021
Why Do Pretrained Language Models Help in Downstream Tasks? An Analysis of Head and Prompt Tuning
Colin Wei
Sang Michael Xie
Tengyu Ma
148
100
0
17 Jun 2021
Can I Be of Further Assistance? Using Unstructured Knowledge Access to Improve Task-oriented Conversational Modeling
Di Jin
Seokhwan Kim
Dilek Z. Hakkani-Tür
58
14
0
16 Jun 2021
Automatic Construction of Evaluation Suites for Natural Language Generation Datasets
Simon Mille
Kaustubh D. Dhole
Saad Mahamood
Laura Perez-Beltrachini
Varun Gangal
Mihir Kale
Emiel van Miltenburg
Sebastian Gehrmann
ELM
87
23
0
16 Jun 2021
Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data
Haoming Jiang
Danqing Zhang
Tianyu Cao
Bing Yin
T. Zhao
NoLa
80
45
0
16 Jun 2021
Eigen Analysis of Self-Attention and its Reconstruction from Partial Computation
Srinadh Bhojanapalli
Ayan Chakrabarti
Himanshu Jain
Sanjiv Kumar
Michal Lukasik
Andreas Veit
65
8
0
16 Jun 2021
To Raise or Not To Raise: The Autonomous Learning Rate Question
Xiaomeng Dong
Tao Tan
Michael Potter
Yun-Chan Tsai
Gaurav Kumar
V. R. Saripalli
Theodore Trafalis
OOD
28
2
0
16 Jun 2021
BEiT: BERT Pre-Training of Image Transformers
Hangbo Bao
Li Dong
Songhao Piao
Furu Wei
ViT
334
2,853
0
15 Jun 2021
Interpretable Self-supervised Multi-task Learning for COVID-19 Information Retrieval and Extraction
Nima Ebadi
Peyman Najafirad
39
0
0
15 Jun 2021
Communicating Natural Programs to Humans and Machines
Samuel Acquaviva
Yewen Pu
Marta Kryven
Theo Sechopoulos
Catherine Wong
Gabrielle Ecanow
Maxwell Nye
Michael Henry Tessler
J. Tenenbaum
92
42
0
15 Jun 2021
Improving Paraphrase Detection with the Adversarial Paraphrasing Task
Animesh Nighojkar
John Licato
70
39
0
14 Jun 2021