1910.10683
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,844 papers shown
Title
ArgHiTZ at ArchEHR-QA 2025: A Two-Step Divide and Conquer Approach to Patient Question Answering for Top Factuality
Adrián Cuadrón
Aimar Sagasti
Maitane Urruela
Iker de la Iglesia
Ane G Domingo-Aldama
Aitziber Atutxa
Josu Goikoetxea
Ander Barrena
19
0
0
15 Jun 2025
Transforming Chatbot Text: A Sequence-to-Sequence Approach
Natesh Reddy
Mark Stamp
DeLMO
SILM
40
0
0
15 Jun 2025
MaskPro: Linear-Space Probabilistic Learning for Strict (N:M)-Sparsity on Large Language Models
Yan Sun
Qixin Zhang
Zhiyuan Yu
Xikun Zhang
Li Shen
Dacheng Tao
21
0
0
15 Jun 2025
Rethinking Hate Speech Detection on Social Media: Can LLMs Replace Traditional Models?
Daman Deep Singh
Ramanuj Bhattacharjee
Abhijnan Chakraborty
23
0
0
15 Jun 2025
INTERPOS: Interaction Rhythm Guided Positional Morphing for Mobile App Recommender Systems
M. H. Maqbool
Moghis Fereidouni
Umar Farooq
A.B. Siddique
H. Foroosh
AI4TS
15
0
0
14 Jun 2025
Exploring Cultural Variations in Moral Judgments with Large Language Models
Hadi Mohammadi
Efthymia Papadopoulou
Yasmeen F.S.S. Meijer
Ayoub Bagheri
22
0
0
14 Jun 2025
Brewing Knowledge in Context: Distillation Perspectives on In-Context Learning
Chengye Li
Haiyun Liu
Yuanxi Li
10
0
0
13 Jun 2025
Prioritizing Alignment Paradigms over Task-Specific Model Customization in Time-Series LLMs
Wei Li
Yunyao Cheng
Xinli Hao
Chaohong Ma
Yuxuan Liang
Bin Yang
Christian S. Jensen
Xiaofeng Meng
AI4TS
21
0
0
13 Jun 2025
Dynamic Double Space Tower
Weikai Sun
Shijie Song
Han Wang
10
0
0
13 Jun 2025
Abstract Sound Fusion with Unconditioned Inversion Model
Jing Liu
EnQi Lian
18
0
0
13 Jun 2025
A Watermark for Auto-Regressive Image Generation Models
Yihan Wu
Xuehao Cui
Ruibo Chen
Georgios Milis
Heng Huang
WIGM
36
0
0
13 Jun 2025
Explaining Recovery Trajectories of Older Adults Post Lower-Limb Fracture Using Modality-wise Multiview Clustering and Large Language Models
Shehroz S. Khan
Ali Abedi
Charlene H. Chu
10
0
0
13 Jun 2025
Auditing Data Provenance in Real-world Text-to-Image Diffusion Models for Privacy and Copyright Protection
Jie Zhu
Leye Wang
13
0
0
13 Jun 2025
Foundation Models in Autonomous Driving: A Survey on Scenario Generation and Scenario Analysis
Yuan Gao
Mattia Piccinini
Yuchen Zhang
Dingrui Wang
Korbinian Moller
...
Steven Peters
Andrea Stocco
Bassam Alrifaee
Marco Pavone
Johannes Betz
19
0
0
13 Jun 2025
SlotPi: Physics-informed Object-centric Reasoning Models
Jian Li
Wan Han
Ning Lin
Yu-Liang Zhan
Ruizhi Chengze
...
Yi-Feng Zhang
Hongsheng Liu
Zidong Wang
Fan Yu
Hao Sun
OCL
LRM
AI4CE
112
0
0
12 Jun 2025
PosterCraft: Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework
Sixiang Chen
Jianyu Lai
Jialin Gao
Tian-Chun Ye
Haoyu Chen
...
Zhaohu Xing
Yeying Jin
Junfeng Luo
Xiaoming Wei
Lei Zhu
DiffM
98
0
0
12 Jun 2025
The Diffusion Duality
Subham S. Sahoo
Justin Deschenaux
Aaron Gokaslan
Guanghan Wang
Justin T Chiu
Volodymyr Kuleshov
DiffM
123
4
0
12 Jun 2025
Pisces: An Auto-regressive Foundation Model for Image Understanding and Generation
Zhiyang Xu
Jiuhai Chen
Zhaojiang Lin
Xichen Pan
Lifu Huang
...
Di Jin
Michihiro Yasunaga
Lili Yu
Xi Lin
Shaoliang Nie
118
1
0
12 Jun 2025
Beyond Random Sampling: Efficient Language Model Pretraining via Curriculum Learning
Yang Zhang
Amr Mohamed
Hadi Abdine
Guokan Shang
Michalis Vazirgiannis
23
0
0
12 Jun 2025
Domain2Vec: Vectorizing Datasets to Find the Optimal Data Mixture without Training
Mozhi Zhang
Howe Tissue
Lu Wang
Xipeng Qiu
120
1
0
12 Jun 2025
Revisiting Transformers with Insights from Image Filtering
Laziz U. Abdullaev
Maksim Tkachenko
Tan M. Nguyen
ViT
121
0
0
12 Jun 2025
Do We Still Need Audio? Rethinking Speaker Diarization with a Text-Based Approach Using Multiple Prediction Models
Peilin Wu
Jinho Choi
21
0
0
12 Jun 2025
Vision Generalist Model: A Survey
Ziyi Wang
Yongming Rao
Shuofeng Sun
Xinrun Liu
Yi Wei
...
Zuyan Liu
Yanbo Wang
Hongmin Liu
Jie Zhou
Jiwen Lu
65
0
0
11 Jun 2025
DIVE into MoE: Diversity-Enhanced Reconstruction of Large Language Models from Dense into Mixture-of-Experts
Yuchen Feng
Bowen Shen
Naibin Gu
Jiaxuan Zhao
Peng Fu
Zheng Lin
Weiping Wang
MoMe
MoE
50
0
0
11 Jun 2025
From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models
Irving Fang
Juexiao Zhang
Shengbang Tong
Chen Feng
LM&Ro
56
1
0
11 Jun 2025
LaMAGIC2: Advanced Circuit Formulations for Language Model-Based Analog Topology Generation
Chen-Chia Chang
Wan-Hsuan Lin
Yikang Shen
Yiran Chen
Xin Zhang
49
0
0
11 Jun 2025
On-the-Fly Adaptive Distillation of Transformer to Dual-State Linear Attention
Yeonju Ro
Zhenyu Zhang
Souvik Kundu
Zhangyang Wang
Aditya Akella
91
0
0
11 Jun 2025
Enhancing Traffic Accident Classifications: Application of NLP Methods for City Safety
Enes Özeren
Alexander Ulbrich
Sascha Filimon
David Rügamer
Andreas Bender
12
0
0
11 Jun 2025
Memorization in Language Models through the Lens of Intrinsic Dimension
Stefan Arnold
PILM
104
0
0
11 Jun 2025
Fine-Grained control over Music Generation with Activation Steering
Dipanshu Panda
Jayden Koshy Joe
Harshith M R
Swathi Narashiman
Pranay Mathur
Anish Veerakumar
Aniruddh Krishna
Keerthiharan A
LLMSV
66
0
0
11 Jun 2025
RoboSwap: A GAN-driven Video Diffusion Framework For Unsupervised Robot Arm Swapping
Yang Bai
Liudi Yang
George Eskandar
Fengyi Shen
Dong Chen
Mohammad Altillawi
Z. Liu
Gitta Kutyniok
VGen
15
0
0
10 Jun 2025
Low-resource domain adaptation while minimizing energy and hardware resource consumption
Hernán Maina
Nicolás Wolovick
Luciana Benotti
25
0
0
10 Jun 2025
CC-RAG: Structured Multi-Hop Reasoning via Theme-Based Causal Graphs
Jash Rajesh Parekh
Pengcheng Jiang
Jiawei Han
LRM
22
0
0
10 Jun 2025
Olica: Efficient Structured Pruning of Large Language Models without Retraining
Jiujun He
Huazhen Lin
24
0
0
10 Jun 2025
RAISE: Enhancing Scientific Reasoning in LLMs via Step-by-Step Retrieval
Minhae Oh
Jeonghye Kim
Nakyung Lee
Donggeon Seo
Taeuk Kim
Jungwoo Lee
ReLM
LRM
24
0
0
10 Jun 2025
Unifying Block-wise PTQ and Distillation-based QAT for Progressive Quantization toward 2-bit Instruction-Tuned LLMs
Jung Hyun Lee
Seungjae Shin
Vinnam Kim
Jaeseong You
An Chen
MQ
23
0
0
10 Jun 2025
Auto-Regressive vs Flow-Matching: a Comparative Study of Modeling Paradigms for Text-to-Music Generation
Or Tal
Felix Kreuk
Yossi Adi
AI4TS
49
0
0
10 Jun 2025
Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers
Zhengyao Lv
Tianlin Pan
Chenyang Si
Zhaoxi Chen
W. Zuo
Ziwei Liu
Kwan-Yee K. Wong
19
0
0
09 Jun 2025
SEED: Enhancing Text-to-SQL Performance and Practical Usability Through Automatic Evidence Generation
Janghyeon Yun
Sang-goo Lee
15
0
0
09 Jun 2025
A Comprehensive Study of Decoder-Only LLMs for Text-to-Image Generation
Andrew Z. Wang
Songwei Ge
Tero Karras
Ming-Yu Liu
Yogesh Balaji
28
0
0
09 Jun 2025
OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation
Jingjing Chang
Yixiao Fang
Peng Xing
Shuhan Wu
Wei Cheng
Rui Wang
Xianfang Zeng
Gang Yu
H. Chen
EGVM
VLM
30
0
0
09 Jun 2025
Cost-Optimal Active AI Model Evaluation
Anastasios Nikolas Angelopoulos
Jacob Eisenstein
Jonathan Berant
Alekh Agarwal
Adam Fisch
17
0
0
09 Jun 2025
ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving
Yongkang Li
Kaixin Xiong
Xiangyu Guo
Fang Li
Sixu Yan
...
Bing Wang
Guang Chen
Hangjun Ye
Wenyu Liu
Xinggang Wang
VLM
40
0
0
09 Jun 2025
Improving Fairness of Large Language Models in Multi-document Summarization
Haoyuan Li
Yusen Zhang
Snigdha Chaturvedi
12
0
0
09 Jun 2025
Info-Coevolution: An Efficient Framework for Data Model Coevolution
Ziheng Qin
Hailun Xu
Wei Chee Yew
Qi Jia
Yang Luo
Kanchan Sarkar
Danhui Guan
Kai Wang
Yang You
22
0
0
09 Jun 2025
Enhancing Watermarking Quality for LLMs via Contextual Generation States Awareness
Peiru Yang
Xintian Li
Wanchun Ni
Jinhua Yin
Huili Wang
Guoshun Nan
Shangguang Wang
Yongfeng Huang
Tao Qi
15
0
0
09 Jun 2025
EgoM2P: Egocentric Multimodal Multitask Pretraining
Gen Li
Yutong Chen
Yiqian Wu
Kaifeng Zhao
Marc Pollefeys
Siyu Tang
EgoV
VLM
31
0
0
09 Jun 2025
Federated In-Context Learning: Iterative Refinement for Improved Answer Quality
Ruhan Wang
Zhiyong Wang
Chengkai Huang
Rui Wang
Tong Yu
Lina Yao
John C. S. Lui
Dongruo Zhou
15
0
0
09 Jun 2025
Plug-in and Fine-tuning: Bridging the Gap between Small Language Models and Large Language Models
Kyeonghyun Kim
Jinhee Jang
Juhwan Choi
Yoonji Lee
Kyohoon Jin
Youngbin Kim
16
0
0
09 Jun 2025
MIRA: Medical Time Series Foundation Model for Real-World Health Data
Hao Li
Bowen Deng
Chang Xu
Zhiyuan Feng
Viktor Schlegel
...
Yizheng Sun
Jingyuan Sun
Kailai Yang
Yiyao Yu
Jiang Bian
AI4TS
OOD
AI4CE
44
0
0
09 Jun 2025