ResearchTrend.AI
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,948 papers shown
Revisiting Pre-training in Audio-Visual Learning
Ruoxuan Feng
Wenke Xia
Di Hu
77
1
0
07 Feb 2023
Learning Translation Quality Evaluation on Low Resource Languages from Large Language Models
Amirkeivan Mohtashami
M. Verzetti
Paul Kishan Rubenstein
67
4
0
07 Feb 2023
Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design
Lyle Regenwetter
Akash Srivastava
Dan Gutfreund
Faez Ahmed
121
30
0
06 Feb 2023
Controllable Lexical Simplification for English
Kim Cheng Sheang
Daniel Ferrés
Horacio Saggion
47
3
0
06 Feb 2023
A Scalable and Efficient Iterative Method for Copying Machine Learning Classifiers
N. Statuto
Irene Unceta
Jordi Nin
O. Pujol
73
0
0
06 Feb 2023
Mixture of Diffusers for scene composition and high resolution image generation
Á. Jiménez
DiffM
73
46
0
05 Feb 2023
Quantized Distributed Training of Large Models with Convergence Guarantees
I. Markov
Adrian Vladu
Qi Guo
Dan Alistarh
MQ
100
11
0
05 Feb 2023
Revisiting Discriminative vs. Generative Classifiers: Theory and Implications
Chenyu Zheng
Guoqiang Wu
Fan Bao
Yue Cao
Chongxuan Li
Jun Zhu
BDL
104
30
0
05 Feb 2023
Measuring The Impact Of Programming Language Distribution
Gabriel Orlanski
Kefan Xiao
Xavier Garcia
Jeffrey Hui
Joshua Howland
J. Malmaud
Jacob Austin
Rishabh Singh
Michele Catasta
167
33
0
03 Feb 2023
Revisiting Intermediate Layer Distillation for Compressing Language Models: An Overfitting Perspective
Jongwoo Ko
Seungjoon Park
Minchan Jeong
S. Hong
Euijai Ahn
Duhyeuk Chang
Se-Young Yun
69
6
0
03 Feb 2023
Dreamix: Video Diffusion Models are General Video Editors
Eyal Molad
Eliahu Horwitz
Dani Valevski
Alex Rav-Acha
Yossi Matias
Yael Pritch
Yaniv Leviathan
Yedid Hoshen
DiffM
VGen
133
188
0
02 Feb 2023
Mnemosyne: Learning to Train Transformers with Transformers
Deepali Jain
K. Choromanski
Kumar Avinava Dubey
Sumeet Singh
Vikas Sindhwani
Tingnan Zhang
Jie Tan
OffRL
147
9
0
02 Feb 2023
Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment
Hao Liu
Wilson Yan
Pieter Abbeel
99
25
0
02 Feb 2023
SimMTM: A Simple Pre-Training Framework for Masked Time-Series Modeling
Jiaxiang Dong
Haixu Wu
Haoran Zhang
Li Zhang
Jianmin Wang
Mingsheng Long
AI4TS
144
94
0
02 Feb 2023
Analyzing Leakage of Personally Identifiable Information in Language Models
Nils Lukas
A. Salem
Robert Sim
Shruti Tople
Lukas Wutschitz
Santiago Zanella Béguelin
PILM
203
235
0
01 Feb 2023
KNNs of Semantic Encodings for Rating Prediction
Leo Laugier
Raghuram Vadapalli
Thomas Bonald
Lucas Dixon
31
2
0
01 Feb 2023
An Evaluation of Persian-English Machine Translation Datasets with Transformers
A. Sartipi
Meghdad Dehghan
A. Fatemi
69
3
0
01 Feb 2023
The geometry of hidden representations of large transformer models
L. Valeriani
Diego Doimo
F. Cuturello
Alessandro Laio
A. Ansuini
Alberto Cazzaniga
MILM
102
60
0
01 Feb 2023
Jointist: Simultaneous Improvement of Multi-instrument Transcription and Music Source Separation via Joint Training
K. Cheuk
Keunwoo Choi
Qiuqiang Kong
Bochen Li
Minz Won
Ju-Chiang Wang
Yun-Ning Hung
Dorien Herremans
108
6
0
01 Feb 2023
TAP: Accelerating Large-Scale DNN Training Through Tensor Automatic Parallelisation
Ziji Shi
Le Jiang
Ang Wang
Jie Zhang
Xianyan Jia
Yong Li
Chencan Wu
Jialin Li
Wei Lin
GNN
86
2
0
01 Feb 2023
Program Generation from Diverse Video Demonstrations
Anthony Manchin
Jamie Sherrah
Qi Wu
Anton Van Den Hengel
VGen
36
0
0
01 Feb 2023
Learning Universal Policies via Text-Guided Video Generation
Yilun Du
Mengjiao Yang
Bo Dai
H. Dai
Ofir Nachum
J. Tenenbaum
Dale Schuurmans
Pieter Abbeel
PINN
LM&Ro
142
264
0
31 Jan 2023
PADL: Language-Directed Physics-Based Character Control
Jordan Juravsky
Yunrong Guo
Sanja Fidler
Xue Bin Peng
89
45
0
31 Jan 2023
Do Multi-Document Summarization Models Synthesize?
Jay DeYoung
Stephanie C. Martinez
Iain J. Marshall
Byron C. Wallace
102
8
0
31 Jan 2023
Execution-based Code Generation using Deep Reinforcement Learning
Parshin Shojaee
Aneesh Jain
Sindhu Tipirneni
Chandan K. Reddy
148
58
0
31 Jan 2023
Improving Open-Domain Dialogue Evaluation with a Causal Inference Model
Cat P. Le
Luke Dai
Michael Johnston
Yang Liu
M. Walker
R. Ghanadan
ELM
33
10
0
31 Jan 2023
Finding the Law: Enhancing Statutory Article Retrieval via Graph Neural Networks
Antoine Louis
Gijs van Dijck
Gerasimos Spanakis
AILaw
115
9
0
30 Jan 2023
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models
Rongjie Huang
Jia-Bin Huang
Dongchao Yang
Yi Ren
Luping Liu
Mingze Li
Zhenhui Ye
Jinglin Liu
Xiaoyue Yin
Zhou Zhao
DiffM
243
344
0
30 Jan 2023
HeroNet: A Hybrid Retrieval-Generation Network for Conversational Bots
Jiahao Wang
Yunzhe Xu
Zhiying Tu
Dianhui Chu
45
0
0
29 Jan 2023
Progressive Prompts: Continual Learning for Language Models
Anastasia Razdaibiedina
Yuning Mao
Rui Hou
Madian Khabsa
M. Lewis
Amjad Almahairi
VLM
KELM
CLL
138
142
0
29 Jan 2023
Context-Aware Differential Privacy for Language Modeling
M. H. Dinh
Ferdinando Fioretto
68
2
0
28 Jan 2023
AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning
Han Zhou
Xingchen Wan
Ivan Vulić
Anna Korhonen
87
48
0
28 Jan 2023
Understanding the Effectiveness of Very Large Language Models on Dialog Evaluation
Jessica Huynh
Cathy Jiao
Prakhar Gupta
Shikib Mehri
Payal Bajaj
Vishrav Chaudhary
M. Eskénazi
ELM
LM&MA
73
17
0
27 Jan 2023
Investigating the use of ChatGPT for the scheduling of construction projects
S. Prieto
Eyob Mengiste
Borja García de Soto
31
135
0
27 Jan 2023
Probing Out-of-Distribution Robustness of Language Models with Parameter-Efficient Transfer Learning
Hyunsoo Cho
Choonghyun Park
Junyeop Kim
Sungmin Cho
Kang Min Yoo
Sang-goo Lee
OODD
102
3
0
27 Jan 2023
Learning 6-DoF Fine-grained Grasp Detection Based on Part Affordance Grounding
Yaoxian Song
Penglei Sun
Piaopiao Jin
Yi Ren
Yu Zheng
Zhixu Li
Xiaowen Chu
Yueying Zhang
Tiefeng Li
Jason Gu
215
17
0
27 Jan 2023
MusicLM: Generating Music From Text
A. Agostinelli
Timo I. Denk
Zalan Borsos
Jesse Engel
Mauro Verzetti
...
Adam Roberts
Marco Tagliasacchi
Matthew Sharifi
Neil Zeghidour
Christian Frank
MGen
154
451
0
26 Jan 2023
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature
E. Mitchell
Yoonho Lee
Alexander Khazatsky
Christopher D. Manning
Chelsea Finn
140
633
0
26 Jan 2023
Understanding Finetuning for Factual Knowledge Extraction from Language Models
Mehran Kazemi
Sid Mittal
Deepak Ramachandran
KELM
102
11
0
26 Jan 2023
Simple diffusion: End-to-end diffusion for high resolution images
Emiel Hoogeboom
Jonathan Heek
Tim Salimans
118
268
0
26 Jan 2023
SWING: Balancing Coverage and Faithfulness for Dialogue Summarization
Kung-Hsiang Huang
Siffi Singh
Xiaofei Ma
Wei Xiao
Wei Xiao
Nicholas Dingwall
William Yang Wang
Kathleen McKeown
HILM
70
13
0
25 Jan 2023
One Model for All Domains: Collaborative Domain-Prefix Tuning for Cross-Domain NER
Xiang Chen
Lei Li
Q. Fei
Ningyu Zhang
Chuanqi Tan
Yong Jiang
Fei Huang
Huajun Chen
106
24
0
25 Jan 2023
Multitask Instruction-based Prompting for Fallacy Recognition
Tariq Alhindi
Tuhin Chakrabarty
Elena Musi
Smaranda Muresan
LRM
72
30
0
24 Jan 2023
Optimus-CC: Efficient Large NLP Model Training with 3D Parallelism Aware Communication Compression
Jaeyong Song
Jinkyu Yim
Jaewon Jung
Hongsun Jang
H. Kim
Youngsok Kim
Jinho Lee
GNN
76
28
0
24 Jan 2023
Transformer-Patcher: One Mistake worth One Neuron
Zeyu Huang
Songlin Yang
Xiaofeng Zhang
Jie Zhou
Wenge Rong
Zhang Xiong
KELM
107
179
0
24 Jan 2023
Truveta Mapper: A Zero-shot Ontology Alignment Framework
Mariyam Amir
Murchana Baruah
Mahsa Eslamialishah
Sina Ehsani
Alireza Bahramali
Sadra Naddaf-sh
Saman Zarandioon
91
7
0
24 Jan 2023
Semantic-aware Contrastive Learning for Electroencephalography-to-Text Generation with Curriculum Learning
Xiachong Feng
Xiaocheng Feng
Bing Qin
64
5
0
23 Jan 2023
Summarize the Past to Predict the Future: Natural Language Descriptions of Context Boost Multimodal Object Interaction Anticipation
Razvan-George Pasca
Alexey Gavryushin
Muhammad Hamza
Yen-Ling Kuo
Kaichun Mo
Luc Van Gool
Otmar Hilliges
Xi Wang
171
14
0
22 Jan 2023
Differentially Private Natural Language Models: Recent Advances and Future Directions
Lijie Hu
Ivan Habernal
Lei Shen
Di Wang
AAML
98
19
0
22 Jan 2023
The Backpropagation algorithm for a math student
S. Damadi
Golnaz Moharrer
Mostafa Cham
60
4
0
22 Jan 2023