1910.10683
Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,891 papers shown
Title
Describing Differences between Text Distributions with Natural Language
Ruiqi Zhong
Charles Burton Snell
Dan Klein
Jacob Steinhardt
VLM
198
44
0
28 Jan 2022
Generative Cooperative Networks for Natural Language Generation
Sylvain Lamprier
Thomas Scialom
Antoine Chaffin
Vincent Claveau
Ewa Kijak
Jacopo Staiano
Benjamin Piwowarski
GAN
104
13
0
28 Jan 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
1.0K
9,813
0
28 Jan 2022
Grad2Task: Improved Few-shot Text Classification Using Gradients for Task Representation
Jixuan Wang
Kuan-Chieh Wang
Frank Rudzicz
M. Brudno
VLM
66
22
0
27 Jan 2022
Reasoning Like Program Executors
Xinyu Pi
Qian Liu
Bei Chen
Morteza Ziyadi
Zeqi Lin
Qiang Fu
Yan Gao
Jian-Guang Lou
Weizhu Chen
ReLM
LRM
319
53
0
27 Jan 2022
Pan More Gold from the Sand: Refining Open-domain Dialogue Training with Noisy Self-Retrieval Generation
Yihe Wang
Yitong Li
Yasheng Wang
Fei Mi
Pingyi Zhou
Xin Wang
Jin Liu
Xin Jiang
Qun Liu
RALM
91
3
0
27 Jan 2022
Synchromesh: Reliable code generation from pre-trained language models
Gabriel Poesia
Oleksandr Polozov
Vu Le
A. Tiwari
Gustavo Soares
Christopher Meek
Sumit Gulwani
78
163
0
26 Jan 2022
SCAI-QReCC Shared Task on Conversational Question Answering
S. Vakulenko
Johannes Kiesel
Maik Fröbe
LRM
57
6
0
26 Jan 2022
Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection
Suchin Gururangan
Dallas Card
Sarah K. Drier
E. K. Gade
Leroy Z. Wang
Zeyu Wang
Luke Zettlemoyer
Noah A. Smith
270
81
0
25 Jan 2022
Text and Code Embeddings by Contrastive Pre-Training
Arvind Neelakantan
Tao Xu
Raul Puri
Alec Radford
Jesse Michael Han
...
Tabarak Khan
Toki Sherbakov
Joanne Jang
Peter Welinder
Lilian Weng
SSL
AI4TS
394
445
0
24 Jan 2022
Table Pre-training: A Survey on Model Architectures, Pre-training Objectives, and Downstream Tasks
Haoyu Dong
Zhoujun Cheng
Xinyi He
Mengyuan Zhou
Anda Zhou
Fan Zhou
Ao Liu
Shi Han
Dongmei Zhang
LMTD
151
65
0
24 Jan 2022
Unified Question Generation with Continual Lifelong Learning
Wei Yuan
Hongzhi Yin
Tieke He
Tong Chen
Qiufeng Wang
Li-zhen Cui
98
11
0
24 Jan 2022
Artefact Retrieval: Overview of NLP Models with Knowledge Base Access
Vilém Zouhar
Marius Mosbach
Debanjali Biswas
Dietrich Klakow
KELM
90
4
0
24 Jan 2022
Question rewriting? Assessing its importance for conversational question answering
Gonçalo Raposo
Rui Ribeiro
Bruno Martins
Luísa Coheur
KELM
79
20
0
22 Jan 2022
Leaf: Multiple-Choice Question Generation
Kristiyan Vachev
Momchil Hardalov
Georgi Karadzhov
Georgi Georgiev
Ivan Koychev
Preslav Nakov
AI4Ed
72
23
0
22 Jan 2022
Description-Driven Task-Oriented Dialog Modeling
Jeffrey Zhao
Raghav Gupta
Yuan Cao
Dian Yu
Mingqiu Wang
Harrison Lee
Abhinav Rastogi
Izhak Shafran
Yonghui Wu
106
65
0
21 Jan 2022
GreaseLM: Graph REASoning Enhanced Language Models for Question Answering
Xikun Zhang
Antoine Bosselut
Michihiro Yasunaga
Hongyu Ren
Percy Liang
Christopher D. Manning
J. Leskovec
ReLM
AI4MH
LRM
109
231
0
21 Jan 2022
A Comparative Study on Language Models for Task-Oriented Dialogue Systems
Vinsen Marselino Andreas
Genta Indra Winata
Ayu Purwarianti
58
8
0
21 Jan 2022
Context-Tuning: Learning Contextualized Prompts for Natural Language Generation
Tianyi Tang
Junyi Li
Wayne Xin Zhao
Ji-Rong Wen
83
17
0
21 Jan 2022
Cheating Automatic Short Answer Grading: On the Adversarial Usage of Adjectives and Adverbs
Anna Filighera
Sebastian Ochs
Tim Steuer
Thomas Tregel
AAML
60
11
0
20 Jan 2022
LaMDA: Language Models for Dialog Applications
R. Thoppilan
Daniel De Freitas
Jamie Hall
Noam M. Shazeer
Apoorv Kulshreshtha
...
Blaise Aguera-Arcas
Claire Cui
M. Croak
Ed H. Chi
Quoc Le
ALM
154
1,606
0
20 Jan 2022
A Latent-Variable Model for Intrinsic Probing
Karolina Stańczak
Lucas Torroba Hennigen
Adina Williams
Ryan Cotterell
Isabelle Augenstein
116
4
0
20 Jan 2022
Linguistically-driven Multi-task Pre-training for Low-resource Neural Machine Translation
Zhuoyuan Mao
Chenhui Chu
Sadao Kurohashi
44
7
0
20 Jan 2022
Improving Biomedical Information Retrieval with Neural Retrievers
Man Luo
Arindam Mitra
Tejas Gokhale
Chitta Baral
82
35
0
19 Jan 2022
Uncovering More Shallow Heuristics: Probing the Natural Language Inference Capacities of Transformer-Based Pre-Trained Language Models Using Syllogistic Patterns
Reto Gubelmann
Siegfried Handschuh
ReLM
LRM
80
6
0
19 Jan 2022
CM3: A Causal Masked Multimodal Model of the Internet
Armen Aghajanyan
Po-Yao (Bernie) Huang
Candace Ross
Vladimir Karpukhin
Hu Xu
...
Dmytro Okhonko
Mandar Joshi
Gargi Ghosh
M. Lewis
Luke Zettlemoyer
128
158
0
19 Jan 2022
GAP-Gen: Guided Automatic Python Code Generation
Junchen Zhao
Yurun Song
Junlin Wang
Ian G. Harris
60
6
0
19 Jan 2022
Fooling MOSS Detection with Pretrained Language Models
Stella Biderman
Edward Raff
DeLMO
74
36
0
19 Jan 2022
Unveiling Project-Specific Bias in Neural Code Models
Zhiming Li
Yanzhou Li
Tianlin Li
Mengnan Du
Bozhi Wu
Yushi Cao
Yi Li
Yang Liu
86
5
0
19 Jan 2022
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents
Wenlong Huang
Pieter Abbeel
Deepak Pathak
Igor Mordatch
LM&Ro
126
1,129
0
18 Jan 2022
What Makes the Story Forward? Inferring Commonsense Explanations as Prompts for Future Event Generation
Li Lin
Yixin Cao
Lifu Huang
Shuang Li
Xuming Hu
Lijie Wen
Jianmin Wang
AI4TS
79
16
0
18 Jan 2022
ZeroPrompt: Scaling Prompt-Based Pretraining to 1,000 Tasks Improves Zero-Shot Generalization
Hanwei Xu
Yujun Chen
Yulun Du
Nan Shao
Yanggang Wang
Haiyu Li
Zhilin Yang
VLM
LRM
AI4CE
77
69
0
18 Jan 2022
COPA-SSE: Semi-structured Explanations for Commonsense Reasoning
Ana Brassard
Benjamin Heinzerling
Pride Kavumba
Kentaro Inui
FAtt
LRM
98
11
0
18 Jan 2022
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
Julien Abadji
Pedro Ortiz Suarez
Laurent Romary
Benoît Sagot
CLL
97
159
0
17 Jan 2022
Natural Language Deduction through Search over Statement Compositions
Kaj Bostrom
Zayne Sprague
Swarat Chaudhuri
Greg Durrett
ReLM
LRM
101
46
0
16 Jan 2022
Memory-assisted prompt editing to improve GPT-3 after deployment
Aman Madaan
Niket Tandon
Peter Clark
Yiming Yang
KELM
93
0
0
16 Jan 2022
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models
Tianbao Xie
Chen Henry Wu
Peng Shi
Ruiqi Zhong
Torsten Scholak
...
Lingpeng Kong
Rui Zhang
Noah A. Smith
Luke Zettlemoyer
Tao Yu
LMTD
123
304
0
16 Jan 2022
WANLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation
Alisa Liu
Swabha Swayamdipta
Noah A. Smith
Yejin Choi
206
221
0
16 Jan 2022
Unobserved Local Structures Make Compositional Generalization Hard
Ben Bogin
Shivanshu Gupta
Jonathan Berant
CoGe
100
33
0
15 Jan 2022
Transferability in Deep Learning: A Survey
Junguang Jiang
Yang Shu
Jianmin Wang
Mingsheng Long
OOD
93
105
0
15 Jan 2022
DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale
Samyam Rajbhandari
Conglong Li
Z. Yao
Minjia Zhang
Reza Yazdani Aminabadi
A. A. Awan
Jeff Rasley
Yuxiong He
138
309
0
14 Jan 2022
A Survey of Controllable Text Generation using Transformer-based Pre-trained Language Models
Hanqing Zhang
Haolin Song
Shaoyu Li
Ming Zhou
Dawei Song
141
230
0
14 Jan 2022
CommonsenseQA 2.0: Exposing the Limits of AI through Gamification
Alon Talmor
Ori Yoran
Ronan Le Bras
Chandrasekhar Bhagavatula
Yoav Goldberg
Yejin Choi
Jonathan Berant
ELM
108
146
0
14 Jan 2022
Applying a Generic Sequence-to-Sequence Model for Simple and Effective Keyphrase Generation
Md. Faisal Mahbub Chowdhury
Gaetano Rossiello
Michael R. Glass
Nandana Mihindukulasooriya
A. Gliozzo
56
14
0
14 Jan 2022
Multi-Narrative Semantic Overlap Task: Evaluation and Benchmark
Naman Bansal
Mousumi Akter
Shubhra (Santu) Karmaker
78
0
0
14 Jan 2022
Assemble Foundation Models for Automatic Code Summarization
Jian Gu
P. Salza
H. Gall
91
36
0
13 Jan 2022
Grow-and-Clip: Informative-yet-Concise Evidence Distillation for Answer Explanation
Yuyan Chen
Yanghua Xiao
Bang Liu
80
17
0
13 Jan 2022
LARD: Large-scale Artificial Disfluency Generation
T. Passali
T. Mavropoulos
Grigorios Tsoumakas
G. Meditskos
S. Vrochidis
62
17
0
13 Jan 2022
Multiview Transformers for Video Recognition
Shen Yan
Xuehan Xiong
Anurag Arnab
Zhichao Lu
Mi Zhang
Chen Sun
Cordelia Schmid
ViT
97
221
0
12 Jan 2022
Black-Box Tuning for Language-Model-as-a-Service
Tianxiang Sun
Yunfan Shao
Hong Qian
Xuanjing Huang
Xipeng Qiu
VLM
181
275
0
10 Jan 2022