arXiv: 1910.10683
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,891 papers shown
Enhancing Retrieval-Augmented Generation: A Study of Best Practices
Siran Li
Linus Stenzel
Carsten Eickhoff
Seyed Ali Bahrainian
RALM 3DV
114
8
0
13 Jan 2025
Breaking Memory Limits: Gradient Wavelet Transform Enhances LLMs Training
Ziqing Wen
Ping Luo
Jun Wang
Xiaoge Deng
Jinping Zou
Kun Yuan
Tao Sun
Dongsheng Li
CLL
43
0
0
13 Jan 2025
FlexQuant: Elastic Quantization Framework for Locally Hosted LLM on Edge Devices
Yuji Chai
Mujin Kwen
David Brooks
Gu-Yeon Wei
MQ
92
3
0
13 Jan 2025
MathReader : Text-to-Speech for Mathematical Documents
Sieun Hyeon
Kyudan Jung
N. Kim
Hyun Gon Ryu
Jaeyoung Do
111
2
0
13 Jan 2025
Enhancing Image Generation Fidelity via Progressive Prompts
Zhen Xiong
Yuqi Li
Chuanguang Yang
Tiao Tan
Zhihong Zhu
Siyuan Li
Yue Ma
84
4
0
13 Jan 2025
Explore the Use of Time Series Foundation Model for Car-Following Behavior Analysis
Luwei Zeng
Runze Yan
AI4TS
84
0
0
13 Jan 2025
Dataset-Agnostic Recommender Systems
Tri Kurniawan Wijaya
Edoardo D'Amico
Xinyang Shao
107
1
0
13 Jan 2025
A Hessian-informed hyperparameter optimization for differential learning rate
Shiyun Xu
Zhiqi Bu
Yiliang Zhang
Ian Barnett
123
1
0
12 Jan 2025
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training
Tianjin Huang
Ziquan Zhu
Gaojie Jin
Lu Liu
Zhangyang Wang
Shiwei Liu
119
6
0
12 Jan 2025
Synthetic Feature Augmentation Improves Generalization Performance of Language Models
Ashok Choudhary
Cornelius Thiels
Hojjat Salehinejad
104
1
0
11 Jan 2025
Using Pre-trained LLMs for Multivariate Time Series Forecasting
Malcolm Wolff
Shenghao Yang
Kari Torkkola
Michael W. Mahoney
AI4TS AIFin
85
2
0
10 Jan 2025
Personalized Language Model Learning on Text Data Without User Identifiers
Yucheng Ding
Yangwenjian Tan
Xiangyu Liu
Chaoyue Niu
Fandong Meng
Jie Zhou
Ning Liu
Fan Wu
Guihai Chen
106
2
0
10 Jan 2025
FlowSep: Language-Queried Sound Separation with Rectified Flow Matching
Yi Yuan
Xubo Liu
Haohe Liu
Mark D. Plumbley
Wenwu Wang
140
9
0
10 Jan 2025
On Creating A Brain-To-Text Decoder
Zenon Lamprou
Yashar Moshfeghi
80
0
0
10 Jan 2025
Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation
Minxing Luo
Zixun Xia
L. Chen
Zhenhang Li
Weichao Zeng
Jinqiao Wang
Wentao Cheng
Yaxing Wang
Yu Zhou
Jian Yang
DiffM
149
1
0
10 Jan 2025
Safeguarding System Prompts for LLMs
Zhifeng Jiang
Zhihua Jin
Guoliang He
AAML SILM
168
2
0
10 Jan 2025
ConSim: Measuring Concept-Based Explanations' Effectiveness with Automated Simulatability
Antonin Poché
Alon Jacovi
Agustin Picard
Victor Boutin
Fanny Jourdan
102
3
0
10 Jan 2025
A Novel Approach to Scalable and Automatic Topic-Controlled Question Generation in Education
Ziqing Li
Mutlu Cukurova
Sahan Bulathwela
92
3
0
10 Jan 2025
From Lazy to Prolific: Tackling Missing Labels in Open Vocabulary Extreme Classification by Positive-Unlabeled Sequence Learning
Ranran Haoran Zhang
Bensu Uçar
Soumik Dey
Hansi Wu
Binbin Li
Rui Zhang
136
4
0
10 Jan 2025
COCOLA: Coherence-Oriented Contrastive Learning of Musical Audio Representations
Ruben Ciranni
Emilian Postolache
Giorgio Mariani
Michele Mancusi
Giorgio Fabbro
Emanuele Rodolà
Luca Cosmo
278
8
0
10 Jan 2025
Time Transfer: On Optimal Learning Rate and Batch Size In The Infinite Data Limit
Oleg Filatov
Jan Ebert
Jiangtao Wang
Stefan Kesselheim
115
4
0
10 Jan 2025
RAPGen: An Approach for Fixing Code Inefficiencies in Zero-Shot
Spandan Garg
Roshanak Zilouchian Moghaddam
Neel Sundaresan
182
11
0
10 Jan 2025
Multi-Task Model Merging via Adaptive Weight Disentanglement
Feng Xiong
Runxi Cheng
Wang Chen
Zhanqiu Zhang
Yiwen Guo
Chun Yuan
Ruifeng Xu
MoMe
209
8
0
10 Jan 2025
TextToucher: Fine-Grained Text-to-Touch Generation
Jiahang Tu
Hao Fu
Fengyu Yang
Hanbin Zhao
Chao Zhang
Hui Qian
VLM DiffM
159
12
0
10 Jan 2025
LogLM: From Task-based to Instruction-based Automated Log Analysis
Yilun Liu
Yuhe Ji
Shimin Tao
Minggui He
Weibin Meng
Shenglin Zhang
Yongqian Sun
Yuming Xie
Boxing Chen
Hao Yang
116
5
0
10 Jan 2025
Audio-Language Datasets of Scenes and Events: A Survey
Gijs Wijngaard
Elia Formisano
Michele Esposito
M. Dumontier
189
3
0
10 Jan 2025
Deriving Coding-Specific Sub-Models from LLMs using Resource-Efficient Pruning
Laura Puccioni
Alireza Farshin
Mariano Scazzariello
Changjie Wang
Marco Chiesa
Dejan Kostic
48
0
0
10 Jan 2025
Prediction-Assisted Online Distributed Deep Learning Workload Scheduling in GPU Clusters
Ziyue Luo
Jia-Wei Liu
Myungjin Lee
Ness B. Shroff
79
0
0
09 Jan 2025
Unlocking In-Context Learning for Natural Datasets Beyond Language Modelling
Jelena Bratulić
Sudhanshu Mittal
David T. Hoffmann
Samuel Böhm
R. Schirrmeister
T. Ball
Christian Rupprecht
Thomas Brox
90
1
0
09 Jan 2025
Towards a scalable AI-driven framework for data-independent Cyber Threat Intelligence Information Extraction
Olga Sorokoletova
Emanuele Antonioni
Giordano Colò
69
0
0
08 Jan 2025
Integrating LLMs with ITS: Recent Advances, Potentials, Challenges, and Future Directions
Doaa Mahmud
Hadeel Hajmohamed
Shamma Almentheri
Shamma Alqaydi
Lameya Aldhaheri
R. A. Khalil
Nasir Saeed
AI4TS
99
12
0
08 Jan 2025
CURing Large Models: Compression via CUR Decomposition
Sanghyeon Park
Soo-Mook Moon
91
1
0
08 Jan 2025
How to Select Pre-Trained Code Models for Reuse? A Learning Perspective
Zhangqian Bi
Yao Wan
Zhaoyang Chu
Yufei Hu
Junyi Zhang
Hongyu Zhang
Guandong Xu
Hai Jin
81
0
0
08 Jan 2025
ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning
Yuzhou Huang
Ziyang Yuan
Quande Liu
Qiulin Wang
Xintao Wang
Ruimao Zhang
Pengfei Wan
Di Zhang
Kun Gai
VGen DiffM
154
16
0
08 Jan 2025
Clinical Insights: A Comprehensive Review of Language Models in Medicine
Nikita Neveditsin
Pawan Lingras
V. Mago
LM&MA
125
5
0
08 Jan 2025
Learning Informative Latent Representation for Quantum State Tomography
Hailan Ma
Zhenhong Sun
Daoyi Dong
Dong Gong
99
1
0
08 Jan 2025
Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
Yifei He
Yuzheng Hu
Yong Lin
Tong Zhang
Han Zhao
FedML MoMe
129
25
0
08 Jan 2025
On the Consideration of AI Openness: Can Good Intent Be Abused?
Yeeun Kim
Eunkyung Choi
Hyunjun Kim
Hongseok Oh
Hyunseo Shin
Wonseok Hwang
SILM
116
1
0
08 Jan 2025
A Diversity-Enhanced Knowledge Distillation Model for Practical Math Word Problem Solving
Yi Zhang
Guangyou Zhou
Zhiwen Xie
Jinjin Ma
Jimmy Xiangji Huang
AIMat
70
4
0
08 Jan 2025
GLiREL -- Generalist Model for Zero-Shot Relation Extraction
Jack Boylan
Chris Hokamp
D. Ghalandari
VLM
85
1
0
06 Jan 2025
Trust Modeling in Counseling Conversations: A Benchmark Study
Aseem Srivastava
Zuhair Hasan Shaik
Tanmoy Chakraborty
Md. Shad Akhtar
82
0
0
06 Jan 2025
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
Rui Xie
Yinhong Liu
Penghao Zhou
Chen Zhao
Jun Zhou
Peng Sun
Zhenru Zhang
Jian Yang
Zhiyong Yang
Ying Tai
VGen DiffM
112
7
0
06 Jan 2025
Foundations of GenIR
Qingyao Ai
Jingtao Zhan
Yang Liu
126
0
0
06 Jan 2025
Visual Large Language Models for Generalized and Specialized Applications
Yifan Li
Zhixin Lai
Wentao Bao
Zhen Tan
Anh Dao
Kewei Sui
Jiayi Shen
Dong Liu
Huan Liu
Yu Kong
VLM
171
15
0
06 Jan 2025
CLIX: Cross-Lingual Explanations of Idiomatic Expressions
Aaron Gluck
Katharina von der Wense
Maria Pacheco
88
1
0
06 Jan 2025
Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation
Zhi Qu
Yiran Wang
Jiannan Mao
Chenchen Ding
Hideki Tanaka
Masao Utiyama
Taro Watanabe
LRM
124
0
0
06 Jan 2025
GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking
Weikang Bian
Zhaoyang Huang
Xiaoyu Shi
Yijin Li
Fu-Yun Wang
Hongsheng Li
3DGS VGen DiffM
126
9
0
05 Jan 2025
Interactive Information Need Prediction with Intent and Context
Kevin Ros
Dhyey Pandya
ChengXiang Zhai
55
0
0
05 Jan 2025
Swift Cross-Dataset Pruning: Enhancing Fine-Tuning Efficiency in Natural Language Understanding
Binh-Nguyen Nguyen
Yang He
116
1
0
05 Jan 2025
Validity Arguments For Constructed Response Scoring Using Generative Artificial Intelligence Applications
Jodi M. Casabianca
Daniel F. McCaffrey
Matthew S. Johnson
Naim Alper
Vladimir Zubenko
66
0
0
04 Jan 2025