Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,870 papers shown
Title
Augmenting Pre-trained Language Models with QA-Memory for Open-Domain Question Answering
Wenhu Chen
Pat Verga
Michiel de Jong
John Wieting
William W. Cohen
RALM
KELM
90
27
0
10 Apr 2022
Show, Don't Tell: Demonstrations Outperform Descriptions for Schema-Guided Task-Oriented Dialogue
Raghav Gupta
Harrison Lee
Jeffrey Zhao
Abhinav Rastogi
Yuan Cao
Yonghui Wu
56
24
0
08 Apr 2022
MMTAfrica: Multilingual Machine Translation for African Languages
Chris C. Emezue
Bonaventure F. P. Dossou
73
25
0
08 Apr 2022
Contextual Representation Learning beyond Masked Language Modeling
Zhiyi Fu
Wangchunshu Zhou
Jingjing Xu
Hao Zhou
Lei Li
75
26
0
08 Apr 2022
Improving Tokenisation by Alternative Treatment of Spaces
Edward Gow-Smith
Harish Tayyar Madabushi
Carolina Scarton
Aline Villavicencio
89
21
0
08 Apr 2022
Enhance Incomplete Utterance Restoration by Joint Learning Token Extraction and Text Generation
Shumpei Inoue
Tsun-Jui Liu
Nguyen Hong Son
Minh Le Nguyen
101
17
0
08 Apr 2022
From Rewriting to Remembering: Common Ground for Conversational QA Models
Marco Del Tredici
Xiaoyu Shen
Gianni Barlacchi
Bill Byrne
Adria de Gispert
KELM
29
10
0
08 Apr 2022
BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model
Hongyi Yuan
Zheng Yuan
Ruyi Gan
Jiaxing Zhang
Yutao Xie
Sheng Yu
LM&MA
98
132
0
08 Apr 2022
PharmMT: A Neural Machine Translation Approach to Simplify Prescription Directions
Jiazhao Li
Corey A. Lester
Xinyan Zhao
Yuting Ding
Yun Jiang
V. Vydiswaran
MedIm
57
13
0
08 Apr 2022
IA-GCN: Interactive Graph Convolutional Network for Recommendation
Yinan Zhang
Pei Wang
Congcong Liu
Xiwei Zhao
Hao Qi
Jie He
Junsheng Jin
Changping Peng
Zhangang Lin
Jingping Shao
GNN
55
6
0
08 Apr 2022
BERTuit: Understanding Spanish language in Twitter through a native transformer
Javier Huertas-Tato
Alejandro Martín
David Camacho
49
9
0
07 Apr 2022
Entailment Graph Learning with Textual Entailment and Soft Transitivity
Zhibin Chen
Yansong Feng
Dongyan Zhao
88
14
0
07 Apr 2022
Accelerating Attention through Gradient-Based Learned Runtime Pruning
Zheng Li
Soroush Ghodrati
Amir Yazdanbakhsh
H. Esmaeilzadeh
Mingu Kang
81
18
0
07 Apr 2022
Knowledge Infused Decoding
Ruibo Liu
Guoqing Zheng
Shashank Gupta
Radhika Gaonkar
Chongyang Gao
Soroush Vosoughi
Milad Shokouhi
Ahmed Hassan Awadallah
KELM
85
14
0
06 Apr 2022
ByT5 model for massively multilingual grapheme-to-phoneme conversion
Jian Zhu
Cong Zhang
David Jurgens
56
42
0
06 Apr 2022
The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems
Caleb Ziems
Jane A. Yu
Yi-Chia Wang
A. Halevy
Diyi Yang
87
97
0
06 Apr 2022
Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection
Yuxin Fang
Shusheng Yang
Shijie Wang
Yixiao Ge
Ying Shan
Xinggang Wang
91
57
0
06 Apr 2022
Inducing Positive Perspectives with Text Reframing
Caleb Ziems
Minzhi Li
Anthony Zhang
Diyi Yang
DiffM
92
37
0
06 Apr 2022
Question Generation for Reading Comprehension Assessment by Modeling How and What to Ask
Bilal Ghanem
Lauren Lutz Coleman
Julia Rivard Dexter
Spencer McIntosh von der Ohe
Alona Fyshe
AI4Ed
54
32
0
06 Apr 2022
Match-Prompt: Improving Multi-task Generalization Ability for Neural Text Matching via Prompt Learning
Shicheng Xu
Liang Pang
Huawei Shen
Xueqi Cheng
VLM
85
17
0
06 Apr 2022
DAGAM: Data Augmentation with Generation And Modification
Byeong-Cheol Jo
Tak-Sung Heo
Yeongjoon Park
Yongmin Yoo
Won-Ik Cho
Kyungsun Kim
VLM
65
2
0
06 Apr 2022
On the Transferability of Pre-trained Language Models for Low-Resource Programming Languages
Fuxiang Chen
F. Fard
David Lo
T. Bryksin
81
49
0
05 Apr 2022
Can language models learn from explanations in context?
Andrew Kyle Lampinen
Ishita Dasgupta
Stephanie C. Y. Chan
Kory Matthewson
Michael Henry Tessler
Antonia Creswell
James L. McClelland
Jane X. Wang
Felix Hill
LRM
ReLM
186
302
0
05 Apr 2022
PaLM: Scaling Language Modeling with Pathways
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
...
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
570
6,320
0
05 Apr 2022
Abstractive summarization of hospitalisation histories with transformer networks
Alexander Yalunin
D. Umerenkov
V. Kokh
MedIm
40
8
0
05 Apr 2022
The COVMis-Stance dataset: Stance Detection on Twitter for COVID-19 Misinformation
Yanfang Hou
P. V. D. Putten
Suzan Verberne
83
10
0
05 Apr 2022
MaxViT: Multi-Axis Vision Transformer
Zhengzhong Tu
Hossein Talebi
Han Zhang
Feng Yang
P. Milanfar
A. Bovik
Yinxiao Li
ViT
161
674
0
04 Apr 2022
Long Movie Clip Classification with State-Space Video Models
Md. Mohaiminul Islam
Gedas Bertasius
VLM
136
105
0
04 Apr 2022
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
Michael Ahn
Anthony Brohan
Noah Brown
Yevgen Chebotar
Omar Cortes
...
Ted Xiao
Peng Xu
Sichun Xu
Mengyuan Yan
Andy Zeng
LM&Ro
217
1,990
0
04 Apr 2022
Using Pre-Trained Language Models for Producing Counter Narratives Against Hate Speech: a Comparative Study
Serra Sinem Tekiroğlu
Helena Bonaldi
Margherita Fanton
Marco Guerini
107
48
0
04 Apr 2022
Diverse Text Generation via Variational Encoder-Decoder Models with Gaussian Process Priors
Wanyu Du
Jianqiao Zhao
Liwei Wang
Yangfeng Ji
BDL
70
16
0
04 Apr 2022
PERFECT: Prompt-free and Efficient Few-shot Learning with Language Models
Rabeeh Karimi Mahabadi
Luke Zettlemoyer
James Henderson
Marzieh Saeidi
Lambert Mathias
Ves Stoyanov
Majid Yazdani
VLM
81
72
0
03 Apr 2022
Why Exposure Bias Matters: An Imitation Learning Perspective of Error Accumulation in Language Generation
Kushal Arora
Layla El Asri
Hareesh Bahuleyan
Jackie C.K. Cheung
90
82
0
03 Apr 2022
A sequence-to-sequence approach for document-level relation extraction
John Giorgi
Gary D. Bader
Bo Wang
108
52
0
03 Apr 2022
CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation
Pei Ke
Hao Zhou
Yankai Lin
Peng Li
Jie Zhou
Xiaoyan Zhu
Minlie Huang
95
40
0
02 Apr 2022
Constrained Sequence-to-Tree Generation for Hierarchical Text Classification
Chao Yu
Yi Shen
Yue Mao
Longjun Cai
71
22
0
02 Apr 2022
Structured Pruning Learns Compact and Accurate Models
Mengzhou Xia
Zexuan Zhong
Danqi Chen
VLM
113
189
0
01 Apr 2022
Scaling Up Models and Data with
t5x
\texttt{t5x}
t5x
and
seqio
\texttt{seqio}
seqio
Adam Roberts
Hyung Won Chung
Anselm Levskaya
Gaurav Mishra
James Bradbury
...
Brennan Saeta
Ryan Sepassi
A. Spiridonov
Joshua Newlan
Andrea Gesmundo
ALM
141
199
0
31 Mar 2022
Interpretation of Black Box NLP Models: A Survey
Shivani Choudhary
N. Chatterjee
S. K. Saha
FAtt
86
11
0
31 Mar 2022
Leveraging pre-trained language models for conversational information seeking from text
Patrizio Bellan
M. Dragoni
Chiara Ghidini
50
6
0
31 Mar 2022
BRIO: Bringing Order to Abstractive Summarization
Yixin Liu
Pengfei Liu
Dragomir R. Radev
Graham Neubig
105
287
0
31 Mar 2022
SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks
Kai-Wei Chang
Wei-Cheng Tseng
Shang-Wen Li
Hung-yi Lee
101
23
0
31 Mar 2022
Towards Differential Relational Privacy and its use in Question Answering
Simone Bombari
Alessandro Achille
Zijian Wang
Yu Wang
Yusheng Xie
Kunwar Yashraj Singh
Srikar Appalaraju
Vijay Mahadevan
Stefano Soatto
63
1
0
30 Mar 2022
Transformer Language Models without Positional Encodings Still Learn Positional Information
Adi Haviv
Ori Ram
Ofir Press
Peter Izsak
Omer Levy
112
128
0
30 Mar 2022
Towards Few-shot Entity Recognition in Document Images: A Label-aware Sequence-to-Sequence Framework
Zilong Wang
Jingbo Shang
60
13
0
30 Mar 2022
Exploring Plain Vision Transformer Backbones for Object Detection
Yanghao Li
Hanzi Mao
Ross B. Girshick
Kaiming He
ViT
108
819
0
30 Mar 2022
Towards Interpretable Deep Reinforcement Learning Models via Inverse Reinforcement Learning
Yuansheng Xie
Soroush Vosoughi
Saeed Hassanpour
82
3
0
30 Mar 2022
Position-based Prompting for Health Outcome Generation
Micheal Abaho
Danushka Bollegala
P. Williamson
S. Dodd
61
10
0
30 Mar 2022
TextPruner: A Model Pruning Toolkit for Pre-Trained Language Models
Ziqing Yang
Yiming Cui
Zhigang Chen
SyDa
VLM
73
12
0
30 Mar 2022
Entity-driven Fact-aware Abstractive Summarization of Biomedical Literature
Amanuel Alambo
Tanvi Banerjee
K. Thirunarayan
M. Raymer
MedIm
58
7
0
30 Mar 2022
Previous
1
2
3
...
163
164
165
...
196
197
198
Next