ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,843 papers shown
Title
Enhancing Channel-Independent Time Series Forecasting via Cross-Variate Patch Embedding
Enhancing Channel-Independent Time Series Forecasting via Cross-Variate Patch Embedding
Donghwa Shin
Edwin Zhang
AI4TS
78
0
0
19 May 2025
Emergent Specialization: Rare Token Neurons in Language Models
Emergent Specialization: Rare Token Neurons in Language Models
Jing Liu
Haozheng Wang
Yueheng Li
MILMLRM
66
0
0
19 May 2025
JNLP at SemEval-2025 Task 11: Cross-Lingual Multi-Label Emotion Detection Using Generative Models
JNLP at SemEval-2025 Task 11: Cross-Lingual Multi-Label Emotion Detection Using Generative Models
Jieying Xue
Phuong Minh Nguyen
Minh Le Nguyen
Xin Liu
48
0
0
19 May 2025
FinePhys: Fine-grained Human Action Generation by Explicitly Incorporating Physical Laws for Effective Skeletal Guidance
FinePhys: Fine-grained Human Action Generation by Explicitly Incorporating Physical Laws for Effective Skeletal Guidance
Dian Shao
Mingfei Shi
Shengda Xu
Haodong Chen
Yongle Huang
Binglu Wang
3DH
63
0
0
19 May 2025
Unlocking Non-Invasive Brain-to-Text
Unlocking Non-Invasive Brain-to-Text
Dulhan Jayalath
Gilad Landau
Oiwi Parker Jones
86
2
0
19 May 2025
SQLForge: Synthesizing Reliable and Diverse Data to Enhance Text-to-SQL Reasoning in LLMs
SQLForge: Synthesizing Reliable and Diverse Data to Enhance Text-to-SQL Reasoning in LLMs
Yu Guo
Dong Jin
Shenghao Ye
Shuangwu Chen
Jian Yang
Xiaobin Tan
63
0
0
19 May 2025
Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents
Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents
Yunseok Jang
Yeda Song
Sungryull Sohn
Lajanugen Logeswaran
Tiange Luo
Dong-Ki Kim
Kyunghoon Bae
Honglak Lee
VGen
62
0
0
19 May 2025
SounDiT: Geo-Contextual Soundscape-to-Landscape Generation
SounDiT: Geo-Contextual Soundscape-to-Landscape Generation
Junbo Wang
Haofeng Tan
Bowen Liao
Albert Jiang
Teng Fei
Qixing Huang
Zhengzhong Tu
Shan Ye
Yuhao Kang
118
0
0
19 May 2025
Enhancing LLMs for Time Series Forecasting via Structure-Guided Cross-Modal Alignment
Enhancing LLMs for Time Series Forecasting via Structure-Guided Cross-Modal Alignment
Siming Sun
Kai Zhang
Xuejun Jiang
Wenchao Meng
Qinmin Yang
AI4TS
58
0
0
19 May 2025
A Case Study of Cross-Lingual Zero-Shot Generalization for Classical Languages in LLMs
A Case Study of Cross-Lingual Zero-Shot Generalization for Classical Languages in LLMs
V.S.D.S.Mahesh Akavarapu
Hrishikesh Terdalkar
Pramit Bhattacharyya
Shubhangi Agarwal
Vishakha Deulgaonkar
Pralay Manna
Chaitali Dangarikar
Arnab Bhattacharya
84
0
0
19 May 2025
Hyperspectral Image Land Cover Captioning Dataset for Vision Language Models
Hyperspectral Image Land Cover Captioning Dataset for Vision Language Models
Aryan Das
Tanishq Rachamalla
Pravendra Singh
Koushik Biswas
Vinay Kumar Verma
Swalpa Kumar Roy
VLM
76
0
0
18 May 2025
AltLoRA: Towards Better Gradient Approximation in Low-Rank Adaptation with Alternating Projections
AltLoRA: Towards Better Gradient Approximation in Low-Rank Adaptation with Alternating Projections
Xin Yu
Yujia Wang
Jinghui Chen
Lingzhou Xue
95
0
0
18 May 2025
HBO: Hierarchical Balancing Optimization for Fine-Tuning Large Language Models
HBO: Hierarchical Balancing Optimization for Fine-Tuning Large Language Models
Weixuan Wang
Minghao Wu
Barry Haddow
Alexandra Birch
146
0
0
18 May 2025
Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning
Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning
Shaobo Wang
Xiangqi Jin
Ziming Wang
Jinqiao Wang
Jingyun Zhang
...
Zichen Wen
Zhong Li
Zeang Sheng
Xuming Hu
Linfeng Zhang
SyDa
112
3
0
18 May 2025
ChemPile: A 250GB Diverse and Curated Dataset for Chemical Foundation Models
ChemPile: A 250GB Diverse and Curated Dataset for Chemical Foundation Models
Adrian Mirza
Nawaf Alampara
Martiño Ríos-García
Mohamed Abdelalim
Jack Butler
...
Mark Worrall
Adamo Young
Philippe Schwaller
Michael Pieler
Kevin Maik Jablonka
145
0
0
18 May 2025
A Survey of Attacks on Large Language Models
A Survey of Attacks on Large Language Models
Wenrui Xu
Keshab K. Parhi
AAMLELM
82
0
0
18 May 2025
HybridServe: Efficient Serving of Large AI Models with Confidence-Based Cascade Routing
HybridServe: Efficient Serving of Large AI Models with Confidence-Based Cascade Routing
Leyang Xue
Yao Fu
Luo Mai
Mahesh K. Marina
136
0
0
18 May 2025
One-for-All Pruning: A Universal Model for Customized Compression of Large Language Models
One-for-All Pruning: A Universal Model for Customized Compression of Large Language Models
Rongguang Ye
Ming Tang
38
0
0
18 May 2025
Hyperbolic Residual Quantization: Discrete Representations for Data with Latent Hierarchies
Hyperbolic Residual Quantization: Discrete Representations for Data with Latent Hierarchies
Piotr Piękos
Subhradeep Kayal
Alexandros Karatzoglou
86
0
0
18 May 2025
Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection
Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection
Yuwei Zhang
Wenhao Yu
Shangbin Feng
Yifan Zhu
Letian Peng
Jayanth Srinivasa
Gaowen Liu
Jingbo Shang
KELM
73
2
0
18 May 2025
Learning to Highlight Audio by Watching Movies
Learning to Highlight Audio by Watching Movies
Chao Huang
Ruohan Gao
J. M. F. Tsang
Jan Kurcius
Cagdas Bilen
Chenliang Xu
Anurag Kumar
Sanjeel Parekh
VGen
95
1
0
17 May 2025
AI-Driven Automation Can Become the Foundation of Next-Era Science of Science Research
AI-Driven Automation Can Become the Foundation of Next-Era Science of Science Research
Renqi Chen
Haoyang Su
Shixiang Tang
Zhenfei Yin
Qi Wu
Hui Li
Ye Sun
Nanqing Dong
Wanli Ouyang
Philip Torr
AI4CE
54
0
0
17 May 2025
Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks
Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks
Giyeong Oh
Woohyun Cho
Siyeol Kim
Suhwan Choi
Younjae Yu
60
0
0
17 May 2025
Conditioning Matters: Training Diffusion Policies is Faster Than You Think
Conditioning Matters: Training Diffusion Policies is Faster Than You Think
Zibin Dong
Yicheng Liu
Yinchuan Li
Hang Zhao
Haifeng Zhang
114
0
0
16 May 2025
MergeBench: A Benchmark for Merging Domain-Specialized LLMs
MergeBench: A Benchmark for Merging Domain-Specialized LLMs
Yifei He
Siqi Zeng
Yuzheng Hu
Rui Yang
Tong Zhang
Han Zhao
MoMeALM
108
0
0
16 May 2025
SpecEdge: Scalable Edge-Assisted Serving Framework for Interactive LLMs
SpecEdge: Scalable Edge-Assisted Serving Framework for Interactive LLMs
Jinwoo Park
Seunggeun Cho
Dongsu Han
71
0
0
16 May 2025
The Ripple Effect: On Unforeseen Complications of Backdoor Attacks
The Ripple Effect: On Unforeseen Complications of Backdoor Attacks
Rui Zhang
Yun Shen
Hongwei Li
Wenbo Jiang
Hanxiao Chen
Yuan Zhang
Guowen Xu
Yang Zhang
SILMAAML
83
0
0
16 May 2025
GenKnowSub: Improving Modularity and Reusability of LLMs through General Knowledge Subtraction
GenKnowSub: Improving Modularity and Reusability of LLMs through General Knowledge Subtraction
Mohammadtaha Bagherifard
Sahar Rajabi
Ali Edalat
Yadollah Yaghoobzadeh
KELM
69
0
0
16 May 2025
Masking in Multi-hop QA: An Analysis of How Language Models Perform with Context Permutation
Masking in Multi-hop QA: An Analysis of How Language Models Perform with Context Permutation
Wenyu Huang
Pavlos Vougiouklis
Mirella Lapata
Jeff Z. Pan
58
0
0
16 May 2025
Gaussian Weight Sampling for Scalable, Efficient and Stable Pseudo-Quantization Training
Gaussian Weight Sampling for Scalable, Efficient and Stable Pseudo-Quantization Training
Myeonghwan Ahn
Sungjoo Yoo
MQ
98
0
0
16 May 2025
Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline
Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline
Hrishit Madhavi
Jacob Cherian
Yuvraj Khamkar
Dhananjay Bhagat
VLM
109
0
0
16 May 2025
Qronos: Correcting the Past by Shaping the Future... in Post-Training Quantization
Qronos: Correcting the Past by Shaping the Future... in Post-Training Quantization
Shihao Zhang
Haoyu Zhang
Ian Colbert
Rayan Saab
MQ
101
0
0
16 May 2025
From Trade-off to Synergy: A Versatile Symbiotic Watermarking Framework for Large Language Models
From Trade-off to Synergy: A Versatile Symbiotic Watermarking Framework for Large Language Models
Yidan Wang
Yubing Ren
Yanan Cao
Binxing Fang
98
0
0
15 May 2025
Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis
Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis
Bingda Tang
Boyang Zheng
Xichen Pan
Sayak Paul
Saining Xie
78
0
0
15 May 2025
Comparing LLM Text Annotation Skills: A Study on Human Rights Violations in Social Media Data
Comparing LLM Text Annotation Skills: A Study on Human Rights Violations in Social Media Data
Poli A. Nemkova
Solomon Ubani
Mark V. Albert
AILaw
65
0
0
15 May 2025
Multi-Token Prediction Needs Registers
Multi-Token Prediction Needs Registers
Anastasios Gerontopoulos
Spyros Gidaris
N. Komodakis
113
0
0
15 May 2025
Task-Core Memory Management and Consolidation for Long-term Continual Learning
Task-Core Memory Management and Consolidation for Long-term Continual Learning
Tianyu Huai
Jie Zhou
Yuxuan Cai
Qin Chen
Wen Wu
Xingjiao Wu
Xipeng Qiu
Liang He
CLL
94
0
0
15 May 2025
AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges
AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges
Ranjan Sapkota
Konstantinos I. Roumeliotis
Manoj Karkee
AI4TS
141
6
0
15 May 2025
ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention
ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention
Jintian Shao
Hongyi Huang
Hongyi Huang
Beiwen Zhang
ZhiYu Wu
You Shan
MingKai Zheng
154
0
0
15 May 2025
Superposition Yields Robust Neural Scaling
Superposition Yields Robust Neural Scaling
Yizhou Liu
Ziming Liu
Jeff Gore
MILM
124
1
0
15 May 2025
Multilingual Machine Translation with Quantum Encoder Decoder Attention-based Convolutional Variational Circuits
Multilingual Machine Translation with Quantum Encoder Decoder Attention-based Convolutional Variational Circuits
Subrit Dikshit
Ritu Tiwari
Priyank Jain
56
0
0
14 May 2025
Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists?
Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists?
Anthony GX-Chen
Dongyan Lin
Mandana Samiei
Doina Precup
Blake A. Richards
Rob Fergus
Kenneth Marino
CMLLRM
71
1
0
14 May 2025
Variational Prefix Tuning for Diverse and Accurate Code Summarization Using Pre-trained Language Models
Variational Prefix Tuning for Diverse and Accurate Code Summarization Using Pre-trained Language Models
Junda Zhao
Yuliang Song
Eldan Cohen
102
0
0
14 May 2025
A 2D Semantic-Aware Position Encoding for Vision Transformers
A 2D Semantic-Aware Position Encoding for Vision Transformers
Xi Chen
Shiyang Zhou
Muqi Huang
Jiaxu Feng
Yun Xiong
...
Yize Zhang
Huishuai Bao
Sijia Peng
Chong Li
Feng Shi
ViT
70
0
0
14 May 2025
MorphMark: Flexible Adaptive Watermarking for Large Language Models
MorphMark: Flexible Adaptive Watermarking for Large Language Models
Zongqi Wang
Tianle Gu
Baoyuan Wu
Yujiu Yang
WaLM
122
0
0
14 May 2025
From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation
From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation
Yifu Yuan
Haiqin Cui
Yibin Chen
Zibin Dong
Fei Ni
Longxin Kou
Jinyi Liu
Pengyi Li
Yan Zheng
Jianye Hao
145
0
0
13 May 2025
Large Language Models Meet Stance Detection: A Survey of Tasks, Methods, Applications, Challenges and Future Directions
Large Language Models Meet Stance Detection: A Survey of Tasks, Methods, Applications, Challenges and Future Directions
Lata Pangtey
Anukriti Bhatnagar
Shubhi Bansal
Shahid Shafi Dar
Nagendra Kumar
75
0
0
13 May 2025
Lost in Transliteration: Bridging the Script Gap in Neural IR
Lost in Transliteration: Bridging the Script Gap in Neural IR
Andreas Chari
Iadh Ounis
Sean MacAvaney
65
0
0
13 May 2025
RepCali: High Efficient Fine-tuning Via Representation Calibration in Latent Space for Pre-trained Language Models
RepCali: High Efficient Fine-tuning Via Representation Calibration in Latent Space for Pre-trained Language Models
Fujun Zhang
Xiaoying Fan
Xiangdong Su
Guanglai Gao
71
0
0
13 May 2025
Evaluating LLM Metrics Through Real-World Capabilities
Evaluating LLM Metrics Through Real-World Capabilities
Justin K Miller
Wenjia Tang
ELMALM
93
1
0
13 May 2025
Previous
123...789...195196197
Next