1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,877 papers shown
Title
Improving Compositional Generalization with Latent Structure and Data Augmentation
Linlu Qiu
Peter Shaw
Panupong Pasupat
Pawel Krzysztof Nowak
Tal Linzen
Fei Sha
Kristina Toutanova
CoGe
98
57
0
14 Dec 2021
GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval
Kexin Wang
Nandan Thakur
Nils Reimers
Iryna Gurevych
VLM
168
157
0
14 Dec 2021
LMTurk: Few-Shot Learners as Crowdsourcing Workers in a Language-Model-as-a-Service Framework
Mengjie Zhao
Fei Mi
Yasheng Wang
Minglei Li
Xin Jiang
Qun Liu
Hinrich Schütze
RALM
115
11
0
14 Dec 2021
Model Uncertainty-Aware Knowledge Amalgamation for Pre-Trained Language Models
Lei Li
Yankai Lin
Xuancheng Ren
Guangxiang Zhao
Peng Li
Jie Zhou
Xu Sun
MoMe
60
2
0
14 Dec 2021
Towards a Unified Foundation Model: Jointly Pre-Training Transformers on Unpaired Images and Text
Qing Li
Boqing Gong
Huayu Chen
Dan Kondratyuk
Xianzhi Du
Ming-Hsuan Yang
Matthew A. Brown
ViT
49
17
0
14 Dec 2021
GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
Nan Du
Yanping Huang
Andrew M. Dai
Simon Tong
Dmitry Lepikhin
...
Kun Zhang
Quoc V. Le
Yonghui Wu
Zhiwen Chen
Claire Cui
ALM
MoE
275
833
0
13 Dec 2021
VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks
Yi-Lin Sung
Jaemin Cho
Joey Tianyi Zhou
VLM
VPVLM
114
358
0
13 Dec 2021
Step-unrolled Denoising Autoencoders for Text Generation
Nikolay Savinov
Junyoung Chung
Mikolaj Binkowski
Erich Elsen
Aaron van den Oord
DiffM
134
120
0
13 Dec 2021
Automated Evidence Collection for Fake News Detection
Mrinal Rawat
Diptesh Kanojia
83
3
0
13 Dec 2021
ISEEQ: Information Seeking Question Generation using Dynamic Meta-Information Retrieval and Knowledge Graphs
Manas Gaur
Kalpa Gunaratna
Vijay Srinivasan
Hongxia Jin
RALM
73
53
0
13 Dec 2021
Dependency Learning for Legal Judgment Prediction with a Unified Text-to-Text Transformer
Yunyun Huang
Xiaoyu Shen
Chuanyi Li
Jidong Ge
B. Luo
AILaw
77
20
0
13 Dec 2021
Weakly Supervised Text-to-SQL Parsing through Question Decomposition
Tomer Wolfson
Daniel Deutch
Jonathan Berant
ReLM
43
16
0
12 Dec 2021
UPV at TREC Health Misinformation Track 2021 Ranking with SBERT and Quality Estimators
Ipek Baris Schlicht
Angel Felipe Magnossão de Paula
Paolo Rosso
42
8
0
11 Dec 2021
Discourse-Aware Soft Prompting for Text Generation
Marjan Ghazvininejad
Vladimir Karpukhin
Vera Gor
Asli Celikyilmaz
67
6
0
10 Dec 2021
Pruning Pretrained Encoders with a Multitask Objective
Patrick Xia
Richard Shin
65
0
0
10 Dec 2021
VUT: Versatile UI Transformer for Multi-Modal Multi-Task User Interface Modeling
Yang Li
Gang Li
Xin Zhou
Mostafa Dehghani
A. Gritsenko
MLLM
92
36
0
10 Dec 2021
Spinning Language Models: Risks of Propaganda-As-A-Service and Countermeasures
Eugene Bagdasaryan
Vitaly Shmatikov
SILM
AAML
106
84
0
09 Dec 2021
Compositional Generalization for Natural Language Interfaces to Web APIs
Saghar Hosseini
Ahmed Hassan Awadallah
Yu-Chuan Su
66
6
0
09 Dec 2021
Semantic Search as Extractive Paraphrase Span Detection
Jenna Kanerva
Hanna Kitti
Li-Hsin Chang
Teemu Vahtola
Mathias Creutz
Filip Ginter
57
2
0
09 Dec 2021
Multimodal Pre-Training Model for Sequence-based Prediction of Protein-Protein Interaction
Yang Xue
Zijing Liu
Xiaomin Fang
Fan Wang
111
8
0
09 Dec 2021
FLAVA: A Foundational Language And Vision Alignment Model
Amanpreet Singh
Ronghang Hu
Vedanuj Goswami
Guillaume Couairon
Wojciech Galuba
Marcus Rohrbach
Douwe Kiela
CLIP
VLM
151
719
0
08 Dec 2021
Improving language models by retrieving from trillions of tokens
Sebastian Borgeaud
A. Mensch
Jordan Hoffmann
Trevor Cai
Eliza Rutherford
...
Simon Osindero
Karen Simonyan
Jack W. Rae
Erich Elsen
Laurent Sifre
KELM
RALM
303
1,109
0
08 Dec 2021
Ethical and social risks of harm from Language Models
Laura Weidinger
John F. J. Mellor
Maribeth Rauh
Conor Griffin
J. Uesato
...
Lisa Anne Hendricks
William S. Isaac
Sean Legassick
G. Irving
Iason Gabriel
PILM
208
1,045
0
08 Dec 2021
Does Structure Matter? Leveraging Data-to-Text Generation for Answering Complex Information Needs
Hanane Djeddal
Thomas Gerald
Laure Soulier
K. Pinel-Sauvagnat
L. Tamine
33
1
0
08 Dec 2021
VIRT: Improving Representation-based Models for Text Matching through Virtual Interaction
Dan Li
Yang Yang
Hongyin Tang
Jingang Wang
Tong Xu
Wei Wu
Enhong Chen
64
9
0
08 Dec 2021
Bidimensional Leaderboards: Generate and Evaluate Language Hand in Hand
Jungo Kasai
Keisuke Sakaguchi
Ronan Le Bras
Lavinia Dunagan
Jacob Morrison
Alexander R. Fabbri
Yejin Choi
Noah A. Smith
97
40
0
08 Dec 2021
A Transferable Approach for Partitioning Machine Learning Models on Multi-Chip-Modules
Xinfeng Xie
Prakash Prabhu
Ulysse Beaugnon
P. Phothilimthana
Sudip Roy
Azalia Mirhoseini
E. Brevdo
James Laudon
Yanqi Zhou
51
5
0
07 Dec 2021
UCD-CS at TREC 2021 Incident Streams Track
Congcong Wang
David Lillis
44
1
0
07 Dec 2021
Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention
Yichong Xu
Chenguang Zhu
Shuohang Wang
Siqi Sun
Hao Cheng
Xiaodong Liu
Jianfeng Gao
Pengcheng He
Michael Zeng
Xuedong Huang
LRM
326
59
0
06 Dec 2021
Quantifying Adaptability in Pre-trained Language Models with 500 Tasks
Belinda Z. Li
Jane A. Yu
Madian Khabsa
Luke Zettlemoyer
A. Halevy
Jacob Andreas
ELM
89
17
0
06 Dec 2021
UniLog: Deploy One Model and Specialize it for All Log Analysis Tasks
Yichen Zhu
Weibin Meng
Ying Liu
Shenglin Zhang
Tao Han
Shimin Tao
Dan Pei
MoE
89
15
0
06 Dec 2021
General Facial Representation Learning in a Visual-Linguistic Manner
Yinglin Zheng
Hao Yang
Ting Zhang
Jianmin Bao
Dongdong Chen
Yangyu Huang
Lu Yuan
Dong Chen
Ming Zeng
Fang Wen
CVBM
207
176
0
06 Dec 2021
Search and Learn: Improving Semantic Coverage for Data-to-Text Generation
Shailza Jolly
Zi Xuan Zhang
Andreas Dengel
Lili Mou
67
11
0
06 Dec 2021
End-to-end Adaptive Distributed Training on PaddlePaddle
Yulong Ao
Zhihua Wu
Dianhai Yu
Weibao Gong
Zhiqing Kui
...
Yanjun Ma
Tian Wu
Haifeng Wang
Wei Zeng
Chao Yang
118
11
0
06 Dec 2021
JointLK: Joint Reasoning with Language Models and Knowledge Graphs for Commonsense Question Answering
Yueqing Sun
Qi Shi
Le Qi
Yu Zhang
RALM
LRM
89
72
0
06 Dec 2021
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation
Kaustubh D. Dhole
Varun Gangal
Sebastian Gehrmann
Aadesh Gupta
Zhenhao Li
...
Tianbao Xie
Usama Yaseen
Michael A. Yee
Jing Zhang
Yue Zhang
235
88
0
06 Dec 2021
VarCLR: Variable Semantic Representation Pre-training via Contrastive Learning
Qibin Chen
Jeremy Lacomis
Edward J. Schwartz
Graham Neubig
Bogdan Vasilescu
Claire Le Goues
VLM
77
35
0
05 Dec 2021
Representation Learning for Conversational Data using Discourse Mutual Information Maximization
Bishal Santra
Sumegh Roychowdhury
Aishik Mandal
Vasu Gurram
Atharva Naik
Manish Gupta
Pawan Goyal
SSL
89
4
0
04 Dec 2021
Bridging Pre-trained Models and Downstream Tasks for Source Code Understanding
Deze Wang
Zhouyang Jia
Shanshan Li
Yue Yu
Yun Xiong
Wei Dong
Xiangke Liao
103
83
0
04 Dec 2021
Hierarchical Neural Data Synthesis for Semantic Parsing
Wei Yang
Peng Xu
Yanshuai Cao
66
9
0
04 Dec 2021
ALX: Large Scale Matrix Factorization on TPUs
Harsh Mehta
Steffen Rendle
Walid Krichene
Li Zhang
25
6
0
03 Dec 2021
Multilingual training for Software Engineering
Toufique Ahmed
Prem Devanbu
134
76
0
03 Dec 2021
MetaQA: Combining Expert Agents for Multi-Skill Question Answering
Haritz Puerto
Gözde Gül Sahin
Iryna Gurevych
LLMAG
105
21
0
03 Dec 2021
The Influence of Data Pre-processing and Post-processing on Long Document Summarization
Xinwei Du
Kailun Dong
Yuchen Zhang
Yongsheng Li
R. Tsay
34
0
0
03 Dec 2021
PLSUM: Generating PT-BR Wikipedia by Summarizing Multiple Websites
A. Oliveira
A. H. R. Costa
44
2
0
02 Dec 2021
InfoLM: A New Metric to Evaluate Summarization & Data2Text Generation
Pierre Colombo
Chloe Clave
Pablo Piantanida
129
44
0
02 Dec 2021
KPDrop: Improving Absent Keyphrase Generation
Jishnu Ray Chowdhury
Seoyeon Park
Tuhin Kundu
Cornelia Caragea
95
7
0
02 Dec 2021
How not to Lie with a Benchmark: Rearranging NLP Leaderboards
Tatiana Shavrina
Valentin Malykh
ALM
ELM
505
12
0
02 Dec 2021
CO2Sum: Contrastive Learning for Factual-Consistent Abstractive Summarization
Wei Liu
Huanqin Wu
Wenjing Mu
Zhen Li
Tao Chen
Dan Nie
HILM
60
17
0
02 Dec 2021
Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text
Christopher Clark
Jordi Salvador
Dustin Schwenk
Derrick Bonafilia
Mark Yatskar
...
Aaron Sarnat
Hannaneh Hajishirzi
Aniruddha Kembhavi
Oren Etzioni
Ali Farhadi
MLLM
52
5
0
01 Dec 2021