Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,870 papers shown
Title
A Survey on Green Deep Learning
Jingjing Xu
Wangchunshu Zhou
Zhiyi Fu
Hao Zhou
Lei Li
VLM
203
84
0
08 Nov 2021
How does a Pre-Trained Transformer Integrate Contextual Keywords? Application to Humanitarian Computing
Valentin Barrière
Guillaume Jacquet
25
1
0
07 Nov 2021
Grounded Graph Decoding Improves Compositional Generalization in Question Answering
Yu Gai
Paras Jain
Wendi Zhang
Joseph E. Gonzalez
Basel Alomair
Ion Stoica
BDL
OOD
79
8
0
05 Nov 2021
LILA: Language-Informed Latent Actions
Siddharth Karamcheti
Megha Srivastava
Percy Liang
Dorsa Sadigh
LM&Ro
96
32
0
05 Nov 2021
Benchmarking Multimodal AutoML for Tabular Data with Text Fields
Xingjian Shi
Jonas W. Mueller
Nick Erickson
Mu Li
Alexander J. Smola
LMTD
79
31
0
04 Nov 2021
CLUES: Few-Shot Learning Evaluation in Natural Language Understanding
Subhabrata Mukherjee
Xiaodong Liu
Guoqing Zheng
Saghar Hosseini
Hao Cheng
Greg Yang
Christopher Meek
Ahmed Hassan Awadallah
Jianfeng Gao
ELM
70
11
0
04 Nov 2021
An Empirical Study of Training End-to-End Vision-and-Language Transformers
Zi-Yi Dou
Yichong Xu
Zhe Gan
Jianfeng Wang
Shuohang Wang
...
Pengchuan Zhang
Lu Yuan
Nanyun Peng
Zicheng Liu
Michael Zeng
VLM
104
381
0
03 Nov 2021
VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts
Hangbo Bao
Wenhui Wang
Li Dong
Qiang Liu
Owais Khan Mohammed
Kriti Aggarwal
Subhojit Som
Furu Wei
VLM
MLLM
MoE
104
559
0
03 Nov 2021
Automatic Evaluation and Moderation of Open-domain Dialogue Systems
Chen Zhang
João Sedoc
L. F. D’Haro
Rafael E. Banchs
Alexander I. Rudnicky
78
38
0
03 Nov 2021
OpenPrompt: An Open-source Framework for Prompt-learning
Ning Ding
Shengding Hu
Weilin Zhao
Yulin Chen
Zhiyuan Liu
Haitao Zheng
Maosong Sun
VLM
LLMAG
111
299
0
03 Nov 2021
Assessing Effectiveness of Using Internal Signals for Check-Worthy Claim Identification in Unlabeled Data for Automated Fact-Checking
Archita Pathak
Rohini Srihari
HILM
76
1
0
02 Nov 2021
Can Vision Transformers Perform Convolution?
Shanda Li
Xiangning Chen
Di He
Cho-Jui Hsieh
ViT
110
21
0
02 Nov 2021
Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey
Bonan Min
Hayley L Ross
Elior Sulem
Amir Pouran Ben Veyseh
Thien Huu Nguyen
Oscar Sainz
Eneko Agirre
Ilana Heinz
Dan Roth
LM&MA
VLM
AI4CE
197
1,094
0
01 Nov 2021
Evaluating deep transfer learning for whole-brain cognitive decoding
A. Thomas
U. Lindenberger
Wojciech Samek
K. Müller
AI4CE
52
12
0
01 Nov 2021
Template Filling for Controllable Commonsense Reasoning
Dheeraj Rajagopal
Vivek Khetan
Bogdan Sacaleanu
A. Gershman
Andy E. Fano
Eduard H. Hovy
BDL
LRM
60
7
0
31 Oct 2021
Automatic Knowledge Augmentation for Generative Commonsense Reasoning
Jaehyung Seo
Chanjun Park
Sugyeong Eo
Hyeonseok Moon
Heuiseok Lim
ReLM
LRM
43
3
0
30 Oct 2021
Amendable Generation for Dialogue State Tracking
Xin Tian
Liankai Huang
Yingzhan Lin
Siqi Bao
H. He
Yunyi Yang
Hua Wu
Fan Wang
Shuqi Sun
89
35
0
29 Oct 2021
Structure-aware Fine-tuning of Sequence-to-sequence Transformers for Transition-based AMR Parsing
Jiawei Zhou
Tahira Naseem
Ramón Fernández Astudillo
Young-Suk Lee
Radu Florian
Salim Roukos
85
43
0
29 Oct 2021
NxMTransformer: Semi-Structured Sparsification for Natural Language Understanding via ADMM
Connor Holmes
Minjia Zhang
Yuxiong He
Bo Wu
60
20
0
28 Oct 2021
Understanding How Encoder-Decoder Architectures Attend
Kyle Aitken
V. Ramasesh
Yuan Cao
Niru Maheswaranathan
71
17
0
28 Oct 2021
ÚFAL at MultiLexNorm 2021: Improving Multilingual Lexical Normalization by Fine-tuning ByT5
David Samuel
Milan Straka
40
16
0
28 Oct 2021
SQALER: Scaling Question Answering by Decoupling Multi-Hop and Logical Reasoning
Mattia Atzeni
Jasmina Bogojeska
Andreas Loukas
ReLM
LRM
73
15
0
27 Oct 2021
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Sanyuan Chen
Chengyi Wang
Zhengyang Chen
Yu-Huan Wu
Shujie Liu
...
Yao Qian
Jian Wu
Micheal Zeng
Xiangzhan Yu
Furu Wei
SSL
296
1,911
0
26 Oct 2021
s2s-ft: Fine-Tuning Pretrained Transformer Encoders for Sequence-to-Sequence Learning
Hangbo Bao
Li Dong
Wenhui Wang
Nan Yang
Furu Wei
61
11
0
26 Oct 2021
Can Character-based Language Models Improve Downstream Task Performance in Low-Resource and Noisy Language Scenarios?
Arij Riabi
Benoît Sagot
Djamé Seddah
85
15
0
26 Oct 2021
The Efficiency Misnomer
Daoyuan Chen
Liuyi Yao
Dawei Gao
Ashish Vaswani
Yaliang Li
109
103
0
25 Oct 2021
Sentence Punctuation for Collaborative Commentary Generation in Esports Live-Streaming
M. Lutter
Johannes Silberbauer
Xiaoling Ling
Pujana Paliyawan
40
2
0
24 Oct 2021
Fast Model Editing at Scale
E. Mitchell
Charles Lin
Antoine Bosselut
Chelsea Finn
Christopher D. Manning
KELM
363
379
0
21 Oct 2021
SciXGen: A Scientific Paper Dataset for Context-Aware Text Generation
Hong Chen
Hiroya Takamura
Hideki Nakayama
86
20
0
20 Oct 2021
Interpreting Deep Learning Models in Natural Language Processing: A Review
Xiaofei Sun
Diyi Yang
Xiaoya Li
Tianwei Zhang
Yuxian Meng
Han Qiu
Guoyin Wang
Eduard H. Hovy
Jiwei Li
97
46
0
20 Oct 2021
SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training
Ankur Bapna
Yu-An Chung
Na Wu
Anmol Gulati
Ye Jia
J. Clark
Melvin Johnson
Jason Riesa
Alexis Conneau
Yu Zhang
VLM
137
96
0
20 Oct 2021
When in Doubt, Summon the Titans: Efficient Inference with Large Models
A. S. Rawat
Manzil Zaheer
A. Menon
Amr Ahmed
Sanjiv Kumar
38
7
0
19 Oct 2021
GenNI: Human-AI Collaboration for Data-Backed Text Generation
Hendrik Strobelt
J. Kinley
Robert Krueger
Johanna Beyer
Hanspeter Pfister
Alexander M. Rush
87
23
0
19 Oct 2021
DEEPAGÉ: Answering Questions in Portuguese about the Brazilian Environment
Flávio N. Caccao
M. M. José
A. Oliveira
Stefano Spindola
A. H. R. Costa
Fabio Gagliardi Cozman
79
6
0
19 Oct 2021
Permutation invariant graph-to-sequence model for template-free retrosynthesis and reaction prediction
Zhengkai Tu
Connor W. Coley
80
96
0
19 Oct 2021
BERMo: What can BERT learn from ELMo?
Sangamesh Kodge
Kaushik Roy
65
3
0
18 Oct 2021
NormFormer: Improved Transformer Pretraining with Extra Normalization
Sam Shleifer
Jason Weston
Myle Ott
AI4CE
73
76
0
18 Oct 2021
HRFormer: High-Resolution Transformer for Dense Prediction
Yuhui Yuan
Rao Fu
Lang Huang
Weihong Lin
Chao Zhang
Xilin Chen
Jingdong Wang
ViT
142
235
0
18 Oct 2021
Energon: Towards Efficient Acceleration of Transformers Using Dynamic Sparse Attention
Zhe Zhou
Junling Liu
Zhenyu Gu
Guangyu Sun
125
45
0
18 Oct 2021
Ensembling Graph Predictions for AMR Parsing
Hoang Thanh Lam
Gabriele Picco
Yufang Hou
Young-Suk Lee
Lam M. Nguyen
Dzung Phan
V. López
Ramón Fernández Astudillo
GNN
68
26
0
18 Oct 2021
SCENIC: A JAX Library for Computer Vision Research and Beyond
Mostafa Dehghani
A. Gritsenko
Anurag Arnab
Matthias Minderer
Yi Tay
106
68
0
18 Oct 2021
Deep Transfer Learning & Beyond: Transformer Language Models in Information Systems Research
Ross Gruetzemacher
D. Paradice
80
35
0
18 Oct 2021
PAGnol: An Extra-Large French Generative Model
Julien Launay
E. L. Tommasone
B. Pannier
Franccois Boniface
A. Chatelain
Alessandro Cappelli
Iacopo Poli
Djamé Seddah
AILaw
MoE
AI4CE
79
8
0
16 Oct 2021
Hey AI, Can You Solve Complex Tasks by Talking to Agents?
Tushar Khot
Kyle Richardson
Daniel Khashabi
Ashish Sabharwal
RALM
LRM
72
14
0
16 Oct 2021
Sharpness-Aware Minimization Improves Language Model Generalization
Dara Bahri
H. Mobahi
Yi Tay
182
104
0
16 Oct 2021
The Power of Prompt Tuning for Low-Resource Semantic Parsing
Nathan Schucher
Siva Reddy
H. D. Vries
VLM
163
36
0
16 Oct 2021
Multimodal Dialogue Response Generation
Qingfeng Sun
Yujing Wang
Can Xu
Kai Zheng
Yaming Yang
Huang Hu
Fei Xu
Jessica Zhang
Xiubo Geng
Daxin Jiang
104
49
0
16 Oct 2021
PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization
Wen Xiao
Iz Beltagy
Giuseppe Carenini
Arman Cohan
CVBM
140
119
0
16 Oct 2021
Summ^N: A Multi-Stage Summarization Framework for Long Input Dialogues and Documents
Yusen Zhang
Ansong Ni
Ziming Mao
Chen Henry Wu
Chenguang Zhu
Budhaditya Deb
Ahmed Hassan Awadallah
Dragomir R. Radev
Rui Zhang
RALM
102
91
0
16 Oct 2021
A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models
Woojeong Jin
Yu Cheng
Yelong Shen
Weizhu Chen
Xiang Ren
VLM
VPVLM
MLLM
117
138
0
16 Oct 2021
Previous
1
2
3
...
172
173
174
...
196
197
198
Next