Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,891 papers shown
Title
ArgLegalSumm: Improving Abstractive Summarization of Legal Documents with Argument Mining
Mohamed S. Elaraby
Diane Litman
AILaw
ELM
103
33
0
04 Sep 2022
Equivariant Self-Supervision for Musical Tempo Estimation
Elio Quinton
92
9
0
03 Sep 2022
CrossDial: An Entertaining Dialogue Dataset of Chinese Crosstalk
Baizhou Huang
Shikang Du
Xiao-Yi Wan
53
0
0
03 Sep 2022
Exploiting Pretrained Biochemical Language Models for Targeted Drug Design
Gökçe Uludogan
Elif Özkirimli
K. Ülgen
N. Karalı
Arzucan Özgür
60
16
0
02 Sep 2022
Diffusion Models: A Comprehensive Survey of Methods and Applications
Ling Yang
Zhilong Zhang
Yingxia Shao
Shenda Hong
Runsheng Xu
Yue Zhao
Wentao Zhang
Tengjiao Wang
Ming-Hsuan Yang
DiffM
MedIm
488
1,425
0
02 Sep 2022
Transformers are Sample-Efficient World Models
Vincent Micheli
Eloi Alonso
Franccois Fleuret
VLM
OffRL
185
189
0
01 Sep 2022
Negation detection in Dutch clinical texts: an evaluation of rule-based and machine learning methods
Bram van Es
L. Reteig
Sander C Tan
M. Schraagen
Myrthe M. Hemker
S. R. Arends
Miguel Rios
S. Haitjema
50
12
0
01 Sep 2022
Exploring Effective Information Utilization in Multi-Turn Topic-Driven Conversations
Jiatong Li
Bin He
Fei Mi
71
4
0
01 Sep 2022
Orloj: Predictably Serving Unpredictable DNNs
Peifeng Yu
Yuqing Qiu
Xin Jin
Mosharaf Chowdhury
37
1
0
31 Aug 2022
Efficient Methods for Natural Language Processing: A Survey
Marcos Vinícius Treviso
Ji-Ung Lee
Tianchu Ji
Betty van Aken
Qingqing Cao
...
Emma Strubell
Niranjan Balasubramanian
Leon Derczynski
Iryna Gurevych
Roy Schwartz
156
114
0
31 Aug 2022
GRILLBot: An Assistant for Real-World Tasks with Neural Semantic Parsing and Graph-Based Representations
Carlos Gemmell
Iain Mackie
Paul Owoicho
Federico Rossetto
Sophie Fischer
Jeffrey Stephen Dalton
LM&Ro
GNN
59
1
0
31 Aug 2022
EViT: Privacy-Preserving Image Retrieval via Encrypted Vision Transformer in Cloud Computing
Qihua Feng
Peiya Li
Zhixun Lu
Chaozhuo Li
Zefang Wang
Zhiquan Liu
Chunhui Duan
Feiran Huang
ViT
55
13
0
31 Aug 2022
Unified Knowledge Prompt Pre-training for Customer Service Dialogues
Keqing He
Jingang Wang
Chaobo Sun
Wei Wu
72
4
0
31 Aug 2022
Efficient Sparsely Activated Transformers
Salar Latifi
Saurav Muralidharan
M. Garland
MoE
136
2
0
31 Aug 2022
Transformers with Learnable Activation Functions
Haishuo Fang
Ji-Ung Lee
N. Moosavi
Iryna Gurevych
AI4CE
46
8
0
30 Aug 2022
PercentMatch: Percentile-based Dynamic Thresholding for Multi-Label Semi-Supervised Classification
Jun Huang
Alexander Huang
Beatriz C. Guerra
Yen-Yun Yu
59
5
0
30 Aug 2022
On Grounded Planning for Embodied Tasks with Language Models
Bill Yuchen Lin
Chengsong Huang
Qian Liu
Wenda Gu
Sam Sommerer
Xiang Ren
LM&Ro
114
41
0
29 Aug 2022
A Survey on Text-to-SQL Parsing: Concepts, Methods, and Future Directions
Bowen Qin
Binyuan Hui
Lihan Wang
Min Yang
Jinyang Li
...
Rongyu Cao
Jian Sun
Luo Si
Fei Huang
Yongbin Li
LMTD
111
58
0
29 Aug 2022
StoryTrans: Non-Parallel Story Author-Style Transfer with Discourse Representations and Content Enhancing
Xuekai Zhu
Jian Guan
Minlie Huang
Juan Liu
DiffM
77
7
0
29 Aug 2022
Podcast Summary Assessment: A Resource for Evaluating Summary Assessment Methods
Potsawee Manakul
Mark Gales
65
5
0
28 Aug 2022
A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck
Jie Zhou
Qi Zhang
Qin Chen
Liang He
Xuanjing Huang
100
21
0
27 Aug 2022
AutoQGS: Auto-Prompt for Low-Resource Knowledge-based Question Generation from SPARQL
Guanming Xiong
Junwei Bao
Wen Zhao
Youzheng Wu
Xiaodong He
RALM
89
10
0
26 Aug 2022
A Compact Pretraining Approach for Neural Language Models
Shahriar Golchin
Mihai Surdeanu
N. Tavabi
A. Kiapour
VLM
35
1
0
25 Aug 2022
Multimedia Generative Script Learning for Task Planning
Qingyun Wang
Manling Li
Hou Pong Chan
Lifu Huang
Julia Hockenmaier
Girish Chowdhary
Heng Ji
VGen
117
13
0
25 Aug 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
386
2,908
0
25 Aug 2022
FedPrompt: Communication-Efficient and Privacy Preserving Prompt Tuning in Federated Learning
Haodong Zhao
Wei Du
Fang Li
Peixuan Li
Gongshen Liu
FedML
81
74
0
25 Aug 2022
Shortcut Learning of Large Language Models in Natural Language Understanding
Mengnan Du
Fengxiang He
Na Zou
Dacheng Tao
Helen Zhou
KELM
OffRL
136
90
0
25 Aug 2022
Interpreting Song Lyrics with an Audio-Informed Pre-trained Language Model
Yixiao Zhang
Junyan Jiang
Gus Xia
S. Dixon
62
9
0
24 Aug 2022
PEER: A Collaborative Language Model
Timo Schick
Jane Dwivedi-Yu
Zhengbao Jiang
Fabio Petroni
Patrick Lewis
Gautier Izacard
Qingfei You
Christoforos Nalmpantis
Edouard Grave
Sebastian Riedel
ALM
108
97
0
24 Aug 2022
Repair Is Nearly Generation: Multilingual Program Repair with LLMs
Harshit Joshi
J. Cambronero
Sumit Gulwani
Vu Le
Ivan Radicek
Gust Verbruggen
LRM
72
136
0
24 Aug 2022
Diverse Title Generation for Stack Overflow Posts with Multiple Sampling Enhanced Transformer
Fengji Zhang
Jin Liu
Yao Wan
Xiao Yu
Xiao Liu
J. Keung
132
11
0
24 Aug 2022
DPTDR: Deep Prompt Tuning for Dense Passage Retrieval
Zhen-Quan Tang
Benyou Wang
Ting Yao
VLM
63
14
0
24 Aug 2022
K-Order Graph-oriented Transformer with GraAttention for 3D Pose and Shape Estimation
Weixi Zhao
Weiqiang Wang
ViT
3DPC
69
2
0
24 Aug 2022
Efficient Attention-free Video Shift Transformers
Adrian Bulat
Brais Martínez
Georgios Tzimiropoulos
ViT
60
1
0
23 Aug 2022
Learning Better Masking for Better Language Model Pre-training
Dongjie Yang
Zhuosheng Zhang
Hai Zhao
80
15
0
23 Aug 2022
Interpreting Embedding Spaces by Conceptualization
Adi Simhi
Shaul Markovitch
93
7
0
22 Aug 2022
PANDA: Prompt Transfer Meets Knowledge Distillation for Efficient Model Adaptation
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
VLM
CLL
94
44
0
22 Aug 2022
Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization
Pengcheng He
Baolin Peng
Liyang Lu
Song Wang
Jie Mei
...
Chenguang Zhu
Wayne Xiong
Michael Zeng
Jianfeng Gao
Xuedong Huang
117
47
0
21 Aug 2022
Unsupervisedly Prompting AlphaFold2 for Few-Shot Learning of Accurate Folding Landscape and Protein Structure Prediction
Jun Zhang
Sirui Liu
Mengyun Chen
Haotian Chu
Min Wang
...
Yue Yang
Boxin Xue
Lijiang Yang
Yuan Liu
Y. Gao
104
6
0
20 Aug 2022
SPOT: Knowledge-Enhanced Language Representations for Information Extraction
Jiacheng Li
Yannis Katsis
Tyler Baldwin
Ho-Cheol Kim
Andrew Bartko
Julian McAuley
Chun-Nan Hsu
80
17
0
20 Aug 2022
Pretrained Language Encoders are Natural Tagging Frameworks for Aspect Sentiment Triplet Extraction
Yanjie Gou
Yinjie Lei
Lingqiao Liu
Yong Dai
Chun-Yen Shen
Yongqi Tong
ViT
58
0
0
20 Aug 2022
General-to-Specific Transfer Labeling for Domain Adaptable Keyphrase Generation
Rui Meng
Tong Wang
Xingdi Yuan
Yingbo Zhou
Daqing He
70
6
0
20 Aug 2022
Topical: Learning Repository Embeddings from Source Code using Attention
Agathe Lherondelle
Varun Babbar
Yash Satsangi
Fran Silavong
Shaltiel Eloul
Sean J. Moran
59
0
0
19 Aug 2022
End-to-end Clinical Event Extraction from Chinese Electronic Health Record
Wei Feng
Ruocheng Huang
Yun-mei Yu
Hui-nan Sun
Yun Liu
87
0
0
19 Aug 2022
Coarse-to-Fine: Hierarchical Multi-task Learning for Natural Language Understanding
Zhaoye Fei
Yu Tian
Yongkang Wu
Xinyu Zhang
Yutao Zhu
...
Dejiang Kong
Ruofei Lai
Bo Zhao
Zhicheng Dou
Xipeng Qiu
287
1
0
19 Aug 2022
A Survey on Open Information Extraction from Rule-based Model to Large Language Model
Pai Liu
Wenya Gao
Wenjie Dong
Lin Ai
Wen Dong
Songfang Huang
Zongsheng Li
Ehsan Hoque
Julia Hirschberg
Yue Zhang
180
3
0
18 Aug 2022
MulZDG: Multilingual Code-Switching Framework for Zero-shot Dialogue Generation
Yongkang Liu
Shi Feng
Daling Wang
Yifei Zhang
66
8
0
18 Aug 2022
Summarizing Patients Problems from Hospital Progress Notes Using Pre-trained Sequence-to-Sequence Models
Yanjun Gao
Dmitriy Dligach
T. Miller
Dongfang Xu
M. Churpek
Majid Afshar
AI4MH
74
40
0
17 Aug 2022
CommitBART: A Large Pre-trained Model for GitHub Commits
Shangqing Liu
Yanzhou Li
Xiaofei Xie
Yang Liu
VLM
AI4TS
95
20
0
17 Aug 2022
NECE: Narrative Event Chain Extraction Toolkit
Guangxuan Xu
Paulina Toro Isaza
Moshi Li
Akintoye Oloko
Bingsheng Yao
Cassia Sanctos
Aminat Adebeyi
Yufang Hou
Nanyun Peng
Dakuo Wang
82
5
0
17 Aug 2022
Previous
1
2
3
...
153
154
155
...
196
197
198
Next