Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,901 papers shown
Title
ConEntail: An Entailment-based Framework for Universal Zero and Few Shot Classification with Supervised Contrastive Pretraining
H. Zhang
Aysa Xuemo Fan
Rui Zhang
VLM
95
3
0
14 Oct 2022
Is synthetic data from generative models ready for image recognition?
Ruifei He
Shuyang Sun
Xin Yu
Chuhui Xue
Wenqing Zhang
Philip Torr
Song Bai
Xiaojuan Qi
135
302
0
14 Oct 2022
Q-TOD: A Query-driven Task-oriented Dialogue System
Xin Tian
Yingzhan Lin
Mengfei Song
Siqi Bao
Fan Wang
H. He
Shuqi Sun
Hua Wu
65
21
0
14 Oct 2022
Self-Adaptive Named Entity Recognition by Retrieving Unstructured Knowledge
Kosuke Nishida
Naoki Yoshinaga
Kyosuke Nishida
96
2
0
14 Oct 2022
Can Language Representation Models Think in Bets?
Zhi–Bin Tang
Mayank Kejriwal
53
6
0
14 Oct 2022
"John is 50 years old, can his son be 65?" Evaluating NLP Models' Understanding of Feasibility
Himanshu Gupta
Neeraj Varshney
Swaroop Mishra
Kuntal Kumar Pal
Saurabh Arjun Sawant
Kevin Scaria
Siddharth Goyal
Chitta Baral
ELM
102
14
0
14 Oct 2022
Using Graph Algorithms to Pretrain Graph Completion Transformers
Jonathan Pilault
Mikhail Galkin
Bahare Fatemi
Perouz Taslakian
David Vasquez
C. Pal
60
0
0
14 Oct 2022
Behavior Cloned Transformers are Neurosymbolic Reasoners
Ruoyao Wang
Peter Alexander Jansen
Marc-Alexandre Côté
Prithviraj Ammanabrolu
101
12
0
13 Oct 2022
Mind the Labels: Describing Relations in Knowledge Graphs With Pretrained Models
Zdeněk Kasner
Ioannis Konstas
Ondrej Dusek
78
6
0
13 Oct 2022
Machine Generated Text: A Comprehensive Survey of Threat Models and Detection Methods
Evan Crothers
Nathalie Japkowicz
H. Viktor
DeLMO
153
113
0
13 Oct 2022
MTEB: Massive Text Embedding Benchmark
Niklas Muennighoff
Nouamane Tazi
L. Magne
Nils Reimers
575
422
0
13 Oct 2022
Towards a Unified Multi-Dimensional Evaluator for Text Generation
Ming Zhong
Yang Liu
Da Yin
Yuning Mao
Yizhu Jiao
Peng Liu
Chenguang Zhu
Heng Ji
Jiawei Han
ELM
110
276
0
13 Oct 2022
Language Models of Code are Few-Shot Commonsense Learners
Aman Madaan
Shuyan Zhou
Uri Alon
Yiming Yang
Graham Neubig
ReLM
LRM
141
223
0
13 Oct 2022
Towards End-to-End Open Conversational Machine Reading
Sizhe Zhou
Siru Ouyang
Zhuosheng Zhang
AI Institute
LRM
66
2
0
13 Oct 2022
Spontaneous Emerging Preference in Two-tower Language Model
Zhengqi He
Taro Toyoizumi
LRM
50
1
0
13 Oct 2022
LSG Attention: Extrapolation of pretrained Transformers to long sequences
Charles Condevaux
S. Harispe
84
24
0
13 Oct 2022
Tone prediction and orthographic conversion for Basaa
I. Nikitin
Brian O'Connor
Anastasia N. Safonova
47
1
0
13 Oct 2022
An Empirical Study on Finding Spans
Weiwei Gu
Boyuan Zheng
Yunmo Chen
Tongfei Chen
Benjamin Van Durme
54
4
0
13 Oct 2022
Benchmarking Long-tail Generalization with Likelihood Splits
Ameya Godbole
Robin Jia
ALM
79
9
0
13 Oct 2022
Closed-book Question Generation via Contrastive Learning
Xiangjue Dong
Jiaying Lu
Jianling Wang
James Caverlee
87
8
0
13 Oct 2022
Shortcomings of Question Answering Based Factuality Frameworks for Error Localization
Ryo Kamoi
Tanya Goyal
Greg Durrett
HILM
92
14
0
13 Oct 2022
Explanations from Large Language Models Make Small Reasoners Better
Shiyang Li
Jianshu Chen
Yelong Shen
Zhiyu Zoey Chen
Xinlu Zhang
...
Jingu Qian
Baolin Peng
Yi Mao
Wenhu Chen
Xifeng Yan
ReLM
LRM
109
138
0
13 Oct 2022
Large Language Models are few(1)-shot Table Reasoners
Wenhu Chen
LMTD
ReLM
LRM
91
153
0
13 Oct 2022
SubeventWriter: Iterative Sub-event Sequence Generation with Coherence Controller
Zhaowei Wang
Hongming Zhang
Tianqing Fang
Yangqiu Song
Ginny Wong
Simon See
116
14
0
13 Oct 2022
Knowledge-grounded Dialog State Tracking
Dian Yu
Mingqiu Wang
Yuan Cao
Izhak Shafran
Laurent El Shafey
H. Soltau
BDL
85
3
0
13 Oct 2022
Instruction Tuning for Few-Shot Aspect-Based Sentiment Analysis
Siddharth Varia
Shuai Wang
Kishaloy Halder
Robert Vacareanu
Miguel Ballesteros
Yassine Benajiba
Neha Ann John
Rishita Anubhai
Smaranda Muresan
Dan Roth
88
36
0
12 Oct 2022
OpenCQA: Open-ended Question Answering with Charts
Shankar Kantharaj
Do Xuan Long
Rixie Tiffany Ko Leong
J. Tan
Enamul Hoque
Shafiq Joty
83
53
0
12 Oct 2022
Iterative Document-level Information Extraction via Imitation Learning
Yunmo Chen
William Gantt
Weiwei Gu
Tongfei Chen
Aaron Steven White
Benjamin Van Durme
81
11
0
12 Oct 2022
RankT5: Fine-Tuning T5 for Text Ranking with Ranking Losses
Honglei Zhuang
Zhen Qin
R. Jagerman
Kai Hui
Ji Ma
Jing Lu
Jianmo Ni
Xuanhui Wang
Michael Bendersky
AIMat
103
141
0
12 Oct 2022
Developing a general-purpose clinical language inference model from a large corpus of clinical notes
Madhumita Sushil
Dana Ludwig
A. Butte
V. Rudrapatna
LM&MA
77
12
0
12 Oct 2022
RedHOT: A Corpus of Annotated Medical Questions, Experiences, and Claims on Social Media
Somin Wadhwa
Vivek Khetan
Silvio Amir
Byron C. Wallace
68
19
0
12 Oct 2022
The Lazy Neuron Phenomenon: On Emergence of Activation Sparsity in Transformers
Zong-xiao Li
Chong You
Srinadh Bhojanapalli
Daliang Li
A. S. Rawat
...
Kenneth Q Ye
Felix Chern
Felix X. Yu
Ruiqi Guo
Surinder Kumar
MoE
104
97
0
12 Oct 2022
Language Models are Realistic Tabular Data Generators
V. Borisov
Kathrin Seßler
Tobias Leemann
Martin Pawelczyk
Gjergji Kasneci
LMTD
117
255
0
12 Oct 2022
Task Compass: Scaling Multi-task Pre-training with Task Prefix
Zhuosheng Zhang
Shuohang Wang
Yichong Xu
Yuwei Fang
Wenhao Yu
Yang Liu
Han Zhao
Chenguang Zhu
Michael Zeng
SSL
LRM
70
16
0
12 Oct 2022
Zero-Shot On-the-Fly Event Schema Induction
Rotem Dror
Haoyu Wang
Dan Roth
93
18
0
12 Oct 2022
SilverAlign: MT-Based Silver Data Algorithm For Evaluating Word Alignment
Abdullatif Köksal
Silvia Severini
Hinrich Schütze
75
0
0
12 Oct 2022
EduQG: A Multi-format Multiple Choice Dataset for the Educational Domain
Amir Hadifar
Semere Kiros Bitew
Johannes Deleu
Chris Develder
Thomas Demeester
AI4Ed
75
19
0
12 Oct 2022
ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural Priors
Chenjie Cao
Qiaole Dong
Yanwei Fu
127
31
0
12 Oct 2022
Underspecification in Scene Description-to-Depiction Tasks
Ben Hutchinson
Jason Baldridge
Vinodkumar Prabhakaran
DiffM
128
34
0
11 Oct 2022
Designing Robust Transformers using Robust Kernel Density Estimation
Xing Han
Zhaolin Ren
T. Nguyen
Khai Nguyen
Joydeep Ghosh
Nhat Ho
110
6
0
11 Oct 2022
Decoupled Context Processing for Context Augmented Language Modeling
Zonglin Li
Ruiqi Guo
Surinder Kumar
RALM
KELM
86
24
0
11 Oct 2022
A Kernel-Based View of Language Model Fine-Tuning
Sadhika Malladi
Alexander Wettig
Dingli Yu
Danqi Chen
Sanjeev Arora
VLM
157
69
0
11 Oct 2022
Enriching Biomedical Knowledge for Low-resource Language Through Large-Scale Translation
Long Phan
Tai Dang
H. Tran
Trieu H. Trinh
Vy Phan
Lam D. Chau
Minh-Thang Luong
56
8
0
11 Oct 2022
An Exploration of Hierarchical Attention Transformers for Efficient Long Document Classification
Ilias Chalkidis
Xiang Dai
Manos Fergadiotis
Prodromos Malakasiotis
Desmond Elliott
85
35
0
11 Oct 2022
Model Cascading: Towards Jointly Improving Efficiency and Accuracy of NLP Systems
Neeraj Varshney
Chitta Baral
80
28
0
11 Oct 2022
Improving Sharpness-Aware Minimization with Fisher Mask for Better Generalization on Language Models
Qihuang Zhong
Liang Ding
Li Shen
Peng Mi
Juhua Liu
Bo Du
Dacheng Tao
AAML
90
51
0
11 Oct 2022
T5 for Hate Speech, Augmented Data and Ensemble
Tosin Adewumi
Sana Sabah Sabry
Nosheen Abid
F. Liwicki
Marcus Liwicki
72
11
0
11 Oct 2022
Instance Regularization for Discriminative Language Model Pre-training
Zhuosheng Zhang
Hai Zhao
M. Zhou
95
1
0
11 Oct 2022
Mind's Eye: Grounded Language Model Reasoning through Simulation
Ruibo Liu
Jason W. Wei
S. Gu
Te-Yen Wu
Soroush Vosoughi
Claire Cui
Denny Zhou
Andrew M. Dai
ReLM
LRM
219
83
0
11 Oct 2022
A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models
Yuanxin Liu
Fandong Meng
Zheng Lin
JiangNan Li
Peng Fu
Yanan Cao
Weiping Wang
Jie Zhou
87
6
0
11 Oct 2022
Previous
1
2
3
...
149
150
151
...
197
198
199
Next