Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,920 papers shown
Title
Finding Skill Neurons in Pre-trained Transformer-based Language Models
Xiaozhi Wang
Kaiyue Wen
Zhengyan Zhang
Lei Hou
Zhiyuan Liu
Juanzi Li
MILM
MoE
88
52
0
14 Nov 2022
Towards Understanding Omission in Dialogue Summarization
Yicheng Zou
Kaitao Song
Xu Tan
Zhongkai Fu
Qi Zhang
Dongsheng Li
Tao Gui
69
2
0
14 Nov 2022
Controllable Citation Sentence Generation with Language Models
Nianlong Gu
Richard H. R. Hahnloser
58
2
0
14 Nov 2022
FPT: Improving Prompt Tuning Efficiency via Progressive Training
Yufei Huang
Yujia Qin
Huadong Wang
Yichun Yin
Maosong Sun
Zhiyuan Liu
Qun Liu
VLM
LRM
63
6
0
13 Nov 2022
Large-scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation
Yusong Wu
Kai Chen
Tianyu Zhang
Yuchen Hui
Marianna Nezhurina
Taylor Berg-Kirkpatrick
Shlomo Dubnov
CLIP
190
546
0
12 Nov 2022
Few-shot Multimodal Sentiment Analysis based on Multimodal Probabilistic Fusion Prompts
Xiaocui Yang
Shi Feng
Daling Wang
Pengfei Hong
Soujanya Poria
82
23
0
12 Nov 2022
DocuT5: Seq2seq SQL Generation with Table Documentation
E. Soare
Iain Mackie
Jeffrey Stephen Dalton
LMTD
79
2
0
11 Nov 2022
A Survey of Knowledge Enhanced Pre-trained Language Models
Linmei Hu
Zeyi Liu
Ziwang Zhao
Lei Hou
Liqiang Nie
Juanzi Li
KELM
VLM
170
137
0
11 Nov 2022
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Wenhai Wang
Jifeng Dai
Zhe Chen
Zhenhang Huang
Zhiqi Li
...
Tong Lu
Lewei Lu
Hongsheng Li
Xiaogang Wang
Yu Qiao
VLM
180
699
0
10 Nov 2022
DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering
Ella Neeman
Roee Aharoni
Or Honovich
Leshem Choshen
Idan Szpektor
Omri Abend
KELM
CML
105
84
0
10 Nov 2022
Can Transformers Reason in Fragments of Natural Language?
Viktor Schlegel
Kamen V. Pavlov
Ian Pratt-Hartmann
LRM
ReLM
77
7
0
10 Nov 2022
ADEPT: A DEbiasing PrompT Framework
Ke Yang
Charles Yu
Yi R. Fung
Manling Li
Heng Ji
124
27
0
10 Nov 2022
Grammatical Error Correction: A Survey of the State of the Art
Christopher Bryant
Zheng Yuan
Muhammad Reza Qorib
Hannan Cao
Hwee Tou Ng
Ted Briscoe
3DV
91
87
0
09 Nov 2022
Uni-Parser: Unified Semantic Parser for Question Answering on Knowledge Base and Database
Ye Liu
Semih Yavuz
Rui Meng
Dragomir R. Radev
Caiming Xiong
Yingbo Zhou
78
33
0
09 Nov 2022
Large Language Models with Controllable Working Memory
Daliang Li
A. S. Rawat
Manzil Zaheer
Xin Wang
Michal Lukasik
Andreas Veit
Felix X. Yu
Surinder Kumar
KELM
146
171
0
09 Nov 2022
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
BigScience Workshop
:
Teven Le Scao
Angela Fan
Christopher Akiki
...
Zhongli Xie
Zifan Ye
M. Bras
Younes Belkada
Thomas Wolf
VLM
486
2,401
0
09 Nov 2022
What is Wrong with Language Models that Can Not Tell a Story?
Ivan P. Yamshchikov
Alexey Tikhonov
69
7
0
09 Nov 2022
Syntax-Aware On-the-Fly Code Completion
Wannita Takerngsaksiri
Chakkrit Tantithamthavorn
Yuankui Li
93
19
0
09 Nov 2022
Discovering the Hidden Facts of User-Dispatcher Interactions via Text-based Reporting Systems for Community Safety
Yiren Liu
Ryan D. W. Mayfield
Yun Huang
47
2
0
09 Nov 2022
Active Example Selection for In-Context Learning
Yiming Zhang
Shi Feng
Chenhao Tan
SILM
LRM
114
207
0
08 Nov 2022
nBIIG: A Neural BI Insights Generation System for Table Reporting
Yotam Perlitz
D. Sheinwald
Noam Slonim
Michal Shmueli-Scheuer
31
2
0
08 Nov 2022
Self-conditioned Embedding Diffusion for Text Generation
Robin Strudel
Corentin Tallec
Florent Altché
Yilun Du
Yaroslav Ganin
...
Will Grathwohl
Nikolay Savinov
Sander Dieleman
Laurent Sifre
Rémi Leblond
DiffM
89
88
0
08 Nov 2022
Conciseness: An Overlooked Language Task
Felix Stahlberg
Aashish Kumar
Chris Alberti
Shankar Kumar
47
1
0
08 Nov 2022
COPEN: Probing Conceptual Knowledge in Pre-trained Language Models
Hao Peng
Xiaozhi Wang
Shengding Hu
Hailong Jin
Lei Hou
Juanzi Li
Zhiyuan Liu
Qun Liu
93
25
0
08 Nov 2022
Reranking Overgenerated Responses for End-to-End Task-Oriented Dialogue Systems
Songbo Hu
Ivan Vulić
Fangyu Liu
Anna Korhonen
93
0
0
07 Nov 2022
ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech
Xiaoran Fan
Chao Pang
Tian Yuan
Richard He Bai
Renjie Zheng
...
Junkun Chen
Zeyu Chen
Liang Huang
Yu Sun
Hua Wu
125
0
0
07 Nov 2022
Knowledge Graph Embedding: A Survey from the Perspective of Representation Spaces
Jiahang Cao
Jinyuan Fang
Zaiqiao Meng
Shangsong Liang
108
76
0
07 Nov 2022
NAPG: Non-Autoregressive Program Generation for Hybrid Tabular-Textual Question Answering
Tengxun Zhang
Hongfei Xu
Josef van Genabith
Deyi Xiong
Hongying Zan
AIMat
LRM
65
5
0
07 Nov 2022
Fixing Model Bugs with Natural Language Patches
Shikhar Murty
Christopher D. Manning
Scott M. Lundberg
Marco Tulio Ribeiro
KELM
80
39
0
07 Nov 2022
Contrastive Learning enhanced Author-Style Headline Generation
Hui Liu
Weidong Guo
Yige Chen
Xiangyang Li
56
5
0
07 Nov 2022
Complex Reading Comprehension Through Question Decomposition
Xiao-Yu Guo
Yuan-Fang Li
Gholamreza Haffari
ReLM
74
10
0
07 Nov 2022
On the Domain Adaptation and Generalization of Pretrained Language Models: A Survey
Xu Guo
Han Yu
LM&MA
VLM
145
30
0
06 Nov 2022
Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning
Yu Meng
Martin Michalski
Jiaxin Huang
Yu Zhang
Tarek Abdelzaher
Jiawei Han
VLM
122
49
0
06 Nov 2022
Robust Lottery Tickets for Pre-trained Language Models
Rui Zheng
Rong Bao
Yuhao Zhou
Di Liang
Sirui Wang
Wei Wu
Tao Gui
Qi Zhang
Xuanjing Huang
AAML
87
14
0
06 Nov 2022
Continuous Prompt Tuning Based Textual Entailment Model for E-commerce Entity Typing
Yibo Wang
Congying Xia
Guan Wang
Philip Yu
58
6
0
04 Nov 2022
A General Purpose Neural Architecture for Geospatial Systems
Nasim Rahaman
Martin Weiss
Frederik Trauble
Francesco Locatello
Alexandre Lacoste
Yoshua Bengio
C. Pal
Li Erran Li
Bernhard Schölkopf
AI4TS
AI4CE
55
6
0
04 Nov 2022
Time-aware Prompting for Text Generation
Shuyang Cao
Lu Wang
70
12
0
03 Nov 2022
Book Cover Synthesis from the Summary
Emdadul Haque
Md. Faraz Kabir Khan
Mohammad Imrul Jubair
Jarin Anjum
Abrar Zahir Niloy
3DV
51
2
0
03 Nov 2022
LMentry: A Language Model Benchmark of Elementary Language Tasks
Avia Efrat
Or Honovich
Omer Levy
105
20
0
03 Nov 2022
Inverse scaling can become U-shaped
Jason W. Wei
Najoung Kim
Yi Tay
Quoc V. Le
LRM
110
64
0
03 Nov 2022
Large Language Models Are Human-Level Prompt Engineers
Yongchao Zhou
Andrei Ioan Muresanu
Ziwen Han
Keiran Paster
Silviu Pitis
Harris Chan
Jimmy Ba
ALM
LLMAG
195
907
0
03 Nov 2022
Latent Prompt Tuning for Text Summarization
Yubo Zhang
Xingxing Zhang
Xun Wang
Si-Qing Chen
Furu Wei
VLM
95
12
0
03 Nov 2022
Eliciting Knowledge from Large Pre-Trained Models for Unsupervised Knowledge-Grounded Conversation
Yanyang Li
Jianqiao Zhao
Michael R. Lyu
Liwei Wang
70
16
0
03 Nov 2022
Using Large Pre-Trained Language Model to Assist FDA in Premarket Medical Device
Zongzhe Xu
LM&MA
MedIm
66
0
0
03 Nov 2022
Open-Vocabulary Argument Role Prediction for Event Extraction
Yizhu Jiao
Sha Li
Yiqing Xie
Ming Zhong
Heng Ji
Jiawei Han
115
17
0
03 Nov 2022
PINTO: Faithful Language Reasoning Using Prompt-Generated Rationales
Peifeng Wang
Aaron Chan
Filip Ilievski
Muhao Chen
Xiang Ren
LRM
ReLM
117
65
0
03 Nov 2022
RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question
Alireza Mohammadshahi
Thomas Scialom
Majid Yazdani
Pouya Yanki
Angela Fan
James Henderson
Marzieh Saeidi
94
20
0
02 Nov 2022
Generative Entity-to-Entity Stance Detection with Knowledge Graph Augmentation
Xinliang Frederick Zhang
Nick Beauchamp
Lu Wang
57
10
0
02 Nov 2022
MPCFormer: fast, performant and private Transformer inference with MPC
Dacheng Li
Rulin Shao
Hongyi Wang
Han Guo
Eric P. Xing
Haotong Zhang
95
87
0
02 Nov 2022
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
Yogesh Balaji
Seungjun Nah
Xun Huang
Arash Vahdat
Jiaming Song
...
Timo Aila
S. Laine
Bryan Catanzaro
Tero Karras
Xuan Li
VLM
MoE
219
832
0
02 Nov 2022
Previous
1
2
3
...
145
146
147
...
197
198
199
Next