Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,948 papers shown
Title
Revisiting Pre-training in Audio-Visual Learning
Ruoxuan Feng
Wenke Xia
Di Hu
77
1
0
07 Feb 2023
Learning Translation Quality Evaluation on Low Resource Languages from Large Language Models
Amirkeivan Mohtashami
M. Verzetti
Paul Kishan Rubenstein
67
4
0
07 Feb 2023
Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design
Lyle Regenwetter
Akash Srivastava
Dan Gutfreund
Faez Ahmed
121
30
0
06 Feb 2023
Controllable Lexical Simplification for English
Kim Cheng Sheang
Daniel Ferrés
Horacio Saggion
47
3
0
06 Feb 2023
A Scalable and Efficient Iterative Method for Copying Machine Learning Classifiers
N. Statuto
Irene Unceta
Jordi Nin
O. Pujol
73
0
0
06 Feb 2023
Mixture of Diffusers for scene composition and high resolution image generation
Á. Jiménez
DiffM
73
46
0
05 Feb 2023
Quantized Distributed Training of Large Models with Convergence Guarantees
I. Markov
Adrian Vladu
Qi Guo
Dan Alistarh
MQ
100
11
0
05 Feb 2023
Revisiting Discriminative vs. Generative Classifiers: Theory and Implications
Chenyu Zheng
Guoqiang Wu
Fan Bao
Yue Cao
Chongxuan Li
Jun Zhu
BDL
104
30
0
05 Feb 2023
Measuring The Impact Of Programming Language Distribution
Gabriel Orlanski
Kefan Xiao
Xavier Garcia
Jeffrey Hui
Joshua Howland
J. Malmaud
Jacob Austin
Rishah Singh
Michele Catasta
167
33
0
03 Feb 2023
Revisiting Intermediate Layer Distillation for Compressing Language Models: An Overfitting Perspective
Jongwoo Ko
Seungjoon Park
Minchan Jeong
S. Hong
Euijai Ahn
Duhyeuk Chang
Se-Young Yun
69
6
0
03 Feb 2023
Dreamix: Video Diffusion Models are General Video Editors
Eyal Molad
Eliahu Horwitz
Dani Valevski
Alex Rav-Acha
Yossi Matias
Yael Pritch
Yaniv Leviathan
Yedid Hoshen
DiffM
VGen
133
188
0
02 Feb 2023
Mnemosyne: Learning to Train Transformers with Transformers
Deepali Jain
K. Choromanski
Kumar Avinava Dubey
Sumeet Singh
Vikas Sindhwani
Tingnan Zhang
Jie Tan
OffRL
147
9
0
02 Feb 2023
Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment
Hao Liu
Wilson Yan
Pieter Abbeel
99
25
0
02 Feb 2023
SimMTM: A Simple Pre-Training Framework for Masked Time-Series Modeling
Jiaxiang Dong
Haixu Wu
Haoran Zhang
Li Zhang
Jianmin Wang
Mingsheng Long
AI4TS
144
94
0
02 Feb 2023
Analyzing Leakage of Personally Identifiable Information in Language Models
Nils Lukas
A. Salem
Robert Sim
Shruti Tople
Lukas Wutschitz
Santiago Zanella Béguelin
PILM
203
235
0
01 Feb 2023
KNNs of Semantic Encodings for Rating Prediction
Leo Laugier
Raghuram Vadapalli
Thomas Bonald
Lucas Dixon
31
2
0
01 Feb 2023
An Evaluation of Persian-English Machine Translation Datasets with Transformers
A. Sartipi
Meghdad Dehghan
A. Fatemi
69
3
0
01 Feb 2023
The geometry of hidden representations of large transformer models
L. Valeriani
Diego Doimo
F. Cuturello
Alessandro Laio
A. Ansuini
Alberto Cazzaniga
MILM
102
60
0
01 Feb 2023
Jointist: Simultaneous Improvement of Multi-instrument Transcription and Music Source Separation via Joint Training
K. Cheuk
Keunwoo Choi
Qiuqiang Kong
Bochen Li
Minz Won
Ju-Chiang Wang
Yun-Ning Hung
Dorien Herremans
108
6
0
01 Feb 2023
TAP: Accelerating Large-Scale DNN Training Through Tensor Automatic Parallelisation
Ziji Shi
Le Jiang
Ang Wang
Jie Zhang
Xianyan Jia
Yong Li
Chencan Wu
Jialin Li
Wei Lin
GNN
86
2
0
01 Feb 2023
Program Generation from Diverse Video Demonstrations
Anthony Manchin
Jamie Sherrah
Qi Wu
Anton Van Den Hengel
VGen
36
0
0
01 Feb 2023
Learning Universal Policies via Text-Guided Video Generation
Yilun Du
Mengjiao Yang
Bo Dai
H. Dai
Ofir Nachum
J. Tenenbaum
Dale Schuurmans
Pieter Abbeel
PINN
LM&Ro
142
264
0
31 Jan 2023
PADL: Language-Directed Physics-Based Character Control
Jordan Juravsky
Yunrong Guo
Sanja Fidler
Xue Bin Peng
89
45
0
31 Jan 2023
Do Multi-Document Summarization Models Synthesize?
Jay DeYoung
Stephanie C. Martinez
Iain J. Marshall
Byron C. Wallace
102
8
0
31 Jan 2023
Execution-based Code Generation using Deep Reinforcement Learning
Parshin Shojaee
Aneesh Jain
Sindhu Tipirneni
Chandan K. Reddy
148
58
0
31 Jan 2023
Improving Open-Domain Dialogue Evaluation with a Causal Inference Model
Cat P. Le
Luke Dai
Michael Johnston
Yang Liu
M. Walker
R. Ghanadan
ELM
33
10
0
31 Jan 2023
Finding the Law: Enhancing Statutory Article Retrieval via Graph Neural Networks
Antoine Louis
Gijs van Dijck
Gerasimos Spanakis
AILaw
115
9
0
30 Jan 2023
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models
Rongjie Huang
Jia-Bin Huang
Dongchao Yang
Yi Ren
Luping Liu
Mingze Li
Zhenhui Ye
Jinglin Liu
Xiaoyue Yin
Zhou Zhao
DiffM
243
344
0
30 Jan 2023
HeroNet: A Hybrid Retrieval-Generation Network for Conversational Bots
Jiahao Wang
Yunzhe Xu
Zhiying Tu
Dianhui Chu
45
0
0
29 Jan 2023
Progressive Prompts: Continual Learning for Language Models
Anastasia Razdaibiedina
Yuning Mao
Rui Hou
Madian Khabsa
M. Lewis
Amjad Almahairi
VLM
KELM
CLL
138
142
0
29 Jan 2023
Context-Aware Differential Privacy for Language Modeling
M. H. Dinh
Ferdinando Fioretto
68
2
0
28 Jan 2023
AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning
Han Zhou
Xingchen Wan
Ivan Vulić
Anna Korhonen
87
48
0
28 Jan 2023
Understanding the Effectiveness of Very Large Language Models on Dialog Evaluation
Jessica Huynh
Cathy Jiao
Prakhar Gupta
Shikib Mehri
Payal Bajaj
Vishrav Chaudhary
M. Eskénazi
ELM
LM&MA
73
17
0
27 Jan 2023
Investigating the use of ChatGPT for the scheduling of construction projects
S. Prieto
Eyob Mengiste
Borja García de Soto
31
135
0
27 Jan 2023
Probing Out-of-Distribution Robustness of Language Models with Parameter-Efficient Transfer Learning
Hyunsoo Cho
Choonghyun Park
Junyeop Kim
Sungmin Cho
Kang Min Yoo
Sang-goo Lee
OODD
102
3
0
27 Jan 2023
Learning 6-DoF Fine-grained Grasp Detection Based on Part Affordance Grounding
Yaoxian Song
Penglei Sun
Piaopiao Jin
Yi Ren
Yu Zheng
Zhixu Li
Xiaowen Chu
Yueying Zhang
Tiefeng Li
Jason Gu
215
17
0
27 Jan 2023
MusicLM: Generating Music From Text
A. Agostinelli
Timo I. Denk
Zalan Borsos
Jesse Engel
Mauro Verzetti
...
Adam Roberts
Marco Tagliasacchi
Matthew Sharifi
Neil Zeghidour
Christian Frank
MGen
154
451
0
26 Jan 2023
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature
E. Mitchell
Yoonho Lee
Alexander Khazatsky
Christopher D. Manning
Chelsea Finn
140
633
0
26 Jan 2023
Understanding Finetuning for Factual Knowledge Extraction from Language Models
Mehran Kazemi
Sid Mittal
Deepak Ramachandran
KELM
102
11
0
26 Jan 2023
Simple diffusion: End-to-end diffusion for high resolution images
Emiel Hoogeboom
Jonathan Heek
Tim Salimans
118
268
0
26 Jan 2023
SWING: Balancing Coverage and Faithfulness for Dialogue Summarization
Kung-Hsiang Huang
Siffi Singh
Xiaofei Ma
Wei Xiao
Wei Xiao
Nicholas Dingwall
William Yang Wang
Kathleen McKeown
HILM
70
13
0
25 Jan 2023
One Model for All Domains: Collaborative Domain-Prefix Tuning for Cross-Domain NER
Xiang Chen
Lei Li
Q. Fei
Ningyu Zhang
Chuanqi Tan
Yong Jiang
Fei Huang
Huajun Chen
106
24
0
25 Jan 2023
Multitask Instruction-based Prompting for Fallacy Recognition
Tariq Alhindi
Tuhin Chakrabarty
Elena Musi
Smaranda Muresan
LRM
72
30
0
24 Jan 2023
Optimus-CC: Efficient Large NLP Model Training with 3D Parallelism Aware Communication Compression
Jaeyong Song
Jinkyu Yim
Jaewon Jung
Hongsun Jang
H. Kim
Youngsok Kim
Jinho Lee
GNN
76
28
0
24 Jan 2023
Transformer-Patcher: One Mistake worth One Neuron
Zeyu Huang
Songlin Yang
Xiaofeng Zhang
Jie Zhou
Wenge Rong
Zhang Xiong
KELM
107
179
0
24 Jan 2023
Truveta Mapper: A Zero-shot Ontology Alignment Framework
Mariyam Amir
Murchana Baruah
Mahsa Eslamialishah
Sina Ehsani
Alireza Bahramali
Sadra Naddaf-sh
Saman Zarandioon
91
7
0
24 Jan 2023
Semantic-aware Contrastive Learning for Electroencephalography-to-Text Generation with Curriculum Learning
Xiachong Feng
Xiaocheng Feng
Bing Qin
64
5
0
23 Jan 2023
Summarize the Past to Predict the Future: Natural Language Descriptions of Context Boost Multimodal Object Interaction Anticipation
Razvan-George Pasca
Alexey Gavryushin
Muhammad Hamza
Yen-Ling Kuo
Kaichun Mo
Luc Van Gool
Otmar Hilliges
Xi Wang
171
14
0
22 Jan 2023
Differentially Private Natural Language Models: Recent Advances and Future Directions
Lijie Hu
Ivan Habernal
Lei Shen
Di Wang
AAML
98
19
0
22 Jan 2023
The Backpropagation algorithm for a math student
S. Damadi
Golnaz Moharrer
Mostafa Cham
60
4
0
22 Jan 2023
Previous
1
2
3
...
137
138
139
...
197
198
199
Next