1910.10683
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,870 papers shown
Title
TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models
Mark YU
Wenbo Hu
Jinbo Xing
Ying Shan
VGen
152
12
0
07 Mar 2025
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities
Sreyan Ghosh
Zhifeng Kong
Sonal Kumar
S. Sakshi
Jaehyeon Kim
Ming-Yu Liu
Rafael Valle
Dinesh Manocha
Bryan Catanzaro
MLLM
AuLLM
LRM
126
21
0
06 Mar 2025
FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion
Ziyi Yang
Fanqi Wan
Longguang Zhong
Canbin Huang
Guosheng Liang
Xiaojun Quan
MoMe
140
2
0
06 Mar 2025
Collapse of Dense Retrievers: Short, Early, and Literal Biases Outranking Factual Evidence
Mohsen Fayyaz
Ali Modarressi
Hinrich Schuetze
Nanyun Peng
125
3
0
06 Mar 2025
Malware Detection at the Edge with Lightweight LLMs: A Performance Evaluation
Christian Rondanini
B. Carminati
E. Ferrari
Antonio Gaudiano
Ashish Kundu
112
0
0
06 Mar 2025
Guiding LLMs to Generate High-Fidelity and High-Quality Counterfactual Explanations for Text Classification
Van Bach Nguyen
C. Seifert
Jorg Schlotterer
BDL
129
0
0
06 Mar 2025
Dynamic-KGQA: A Scalable Framework for Generating Adaptive Question Answering Datasets
Preetam Prabhu Srikar Dammu
Himanshu Naidu
Chirag Shah
165
1
0
06 Mar 2025
Collaborative Evaluation of Deepfake Text with Deliberation-Enhancing Dialogue Systems
Jooyoung Lee
Xiaochen Zhu
Georgi Karadzhov
Tom Stafford
Andreas Vlachos
Dongwon Lee
72
0
0
06 Mar 2025
Compositional Translation: A Novel LLM-based Approach for Low-resource Machine Translation
A. Zebaze
Benoît Sagot
Rachel Bawden
130
1
0
06 Mar 2025
Towards Data-Efficient Language Models: A Child-Inspired Approach to Language Learning
Mohammad Amin Ghanizadeh
Mohammad Javad Dousti
82
1
0
06 Mar 2025
Revisiting the Othello World Model Hypothesis
Yifei Yuan
Anders Søgaard
LRM
97
0
0
06 Mar 2025
ValuePilot: A Two-Phase Framework for Value-Driven Decision-Making
Yitong Luo
Hou Hei Lam
Ziang Chen
Zhenliang Zhang
Xue Feng
128
0
0
06 Mar 2025
TPC: Cross-Temporal Prediction Connection for Vision-Language Model Hallucination Reduction
Chao Wang
Weiwei Fu
Yang Zhou
MLLM
VLM
139
0
0
06 Mar 2025
TS-RAG: Retrieval-Augmented Generation based Time Series Foundation Models are Stronger Zero-Shot Forecaster
Kanghui Ning
Zijie Pan
Yu Liu
Yushan Jiang
Junxuan Zhang
Kashif Rasul
Anderson Schneider
Lintao Ma
Yuriy Nevmyvaka
Dongjin Song
AI4TS
VLM
229
3
0
06 Mar 2025
VLA Model-Expert Collaboration for Bi-directional Manipulation Learning
Tian-Yu Xiang
Ao-Qun Jin
Xiao-Hu Zhou
Mei-Jiang Gui
Xiao-Liang Xie
...
Shuang-Yi Wang
Sheng-Bin Duang
Si-Cheng Wang
Zheng Lei
Z. Hou
110
2
0
06 Mar 2025
TimeFound: A Foundation Model for Time Series Forecasting
Congxi Xiao
Jingbo Zhou
Yixiong Xiao
Xinjiang Lu
Le Zhang
Hui Xiong
AI4TS
90
0
0
06 Mar 2025
Underlying Semantic Diffusion for Effective and Efficient In-Context Learning
Zhong Ji
Weilong Cao
Yan Zhang
Yanwei Pang
Jungong Han
Xuelong Li
DiffM
VLM
88
0
0
06 Mar 2025
TGEA: An error-annotated dataset and benchmark tasks for text generation from pretrained language models
Jie He
Bo Peng
Yi-Lun Liao
Qun Liu
Deyi Xiong
109
8
0
06 Mar 2025
Conformal Transformations for Symmetric Power Transformers
Saurabh Kumar
Jacob Buckman
Carles Gelada
Sean Zhang
102
0
0
05 Mar 2025
Addressing Overprescribing Challenges: Fine-Tuning Large Language Models for Medication Recommendation Tasks
Zihao Zhao
Chenxiao Fan
Chongming Gao
Fuli Feng
Xiangnan He
LM&MA
AI4MH
106
1
0
05 Mar 2025
An Optimization Algorithm for Multimodal Data Alignment
Wei Zhang
Xinyu Wang
Lan Yu
S. Li
66
0
0
05 Mar 2025
Targeted Distillation for Sentiment Analysis
Yice Zhang
Guangyu Xie
Jingjie Lin
Jianzhu Bao
Qianlong Wang
Xi Zeng
Ruifeng Xu
87
0
0
05 Mar 2025
When Claims Evolve: Evaluating and Enhancing the Robustness of Embedding Models Against Misinformation Edits
Jabez Magomere
Emanuele La Malfa
Manuel Tonneau
Ashkan Kazemi
Scott A. Hale
KELM
175
1
0
05 Mar 2025
SpiritSight Agent: Advanced GUI Agent with One Look
Zhiyuan Huang
Ziming Cheng
Junting Pan
Zhaohui Hou
Mingjie Zhan
LLMAG
166
4
0
05 Mar 2025
LLM as GNN: Graph Vocabulary Learning for Text-Attributed Graph Foundation Models
Xi Zhu
Haochen Xue
Ziwei Zhao
Wujiang Xu
Jingyuan Huang
Minghao Guo
Qifan Wang
Kaixiong Zhou
Yongfeng Zhang
120
5
0
05 Mar 2025
Developing and Utilizing a Large-Scale Cantonese Dataset for Multi-Tasking in Large Language Models
Jiyue Jiang
Alfred Kar Yin Truong
Yuxiao Chen
Qinghang Bao
Sheng Wang
Pengan Chen
Jinqiao Wang
Dianbo Sui
Yu Li
Chuan Wu
ALM
85
0
0
05 Mar 2025
The Effectiveness of Large Language Models in Transforming Unstructured Text to Standardized Formats
William Brach
Kristián Košťál
Michal Ries
482
0
0
04 Mar 2025
SAGE-Amine: Generative Amine Design with Multi-Property Optimization for Efficient CO2 Capture
Hocheol Lim
Hyein Cho
Jeonghoon Kim
113
0
0
04 Mar 2025
OmniSQL: Synthesizing High-quality Text-to-SQL Data at Scale
Haoyang Li
Shang Wu
Yanling Wang
Xinmei Huang
Jing Zhang
...
Tieying Zhang
Jianjun Chen
Rui Shi
Hong Chen
Cuiping Li
SyDa
159
9
0
04 Mar 2025
CMMLoc: Advancing Text-to-PointCloud Localization with Cauchy-Mixture-Model Based Framework
Yanlong Xu
Haoxuan Qu
Qingbin Liu
Wenxiao Zhang
Xun Yang
409
0
0
04 Mar 2025
Streaming Piano Transcription Based on Consistent Onset and Offset Decoding with Sustain Pedal Detection
Weixing Wei
Jiahao Zhao
Yulun Wu
Kazuyoshi Yoshii
51
0
0
03 Mar 2025
Llama-3.1-Sherkala-8B-Chat: An Open Large Language Model for Kazakh
Fajri Koto
Rituraj Joshi
Nurdaulet Mukhituly
Yanjie Wang
Zhuohan Xie
...
Avraham Sheinin
Natalia Vassilieva
Neha Sengupta
Larry Murray
Preslav Nakov
ALM
KELM
132
0
0
03 Mar 2025
Twenty Years of Personality Computing: Threats, Challenges and Future Directions
Fabio Celli
Aleksandar Kartelj
Miljan Đorđević
Derwin Suhartono
V. Filipovic
Veljko Milutinović
Georgios Spathoulas
Alessandro Vinciarelli
Michal Kosinski
Bruno Lepri
62
1
0
03 Mar 2025
Learning to Generate Long-term Future Narrations Describing Activities of Daily Living
Ramanathan Rajendiran
Debaditya Roy
Basura Fernando
VGen
124
0
0
03 Mar 2025
Scaling Law Phenomena Across Regression Paradigms: Multiple and Kernel Approaches
Yifang Chen
Xuyang Guo
Xiaoyu Li
Yingyu Liang
Zhenmei Shi
Zhao Song
103
3
0
03 Mar 2025
Automated Annotation of Evolving Corpora for Augmenting Longitudinal Network Data: A Framework Integrating Large Language Models and Expert Knowledge
Xiao Liu
Zirui Wu
Jiayi Li
Zhicheng Shao
Xun Pang
Yansong Feng
128
0
0
03 Mar 2025
Hypergraph Foundation Model
Yifan Feng
Shiquan Liu
Xiangmin Han
Shaoyi Du
Zongze Wu
Han Hu
Yue Gao
AI4CE
75
2
0
03 Mar 2025
Generalizable Prompt Learning of CLIP: A Brief Overview
Fangming Cui
Yonggang Zhang
Xuan Wang
Xule Wang
Liang Xiao
VPVLM
VLM
522
1
0
03 Mar 2025
KurTail: Kurtosis-based LLM Quantization
Mohammad Sadegh Akhondzadeh
Aleksandar Bojchevski
E. Eleftheriou
M. Dazzi
MQ
79
0
0
03 Mar 2025
AutoAdvExBench: Benchmarking autonomous exploitation of adversarial example defenses
Nicholas Carlini
Javier Rando
Edoardo Debenedetti
Milad Nasr
F. Tramèr
AAML
ELM
92
3
0
03 Mar 2025
Towards Improved Text-Aligned Codebook Learning: Multi-Hierarchical Codebook-Text Alignment with Long Text
Guotao Liang
Baoquan Zhang
Zhiyuan Wen
Junteng Zhao
Yunming Ye
Kola Ye
Yao He
96
0
0
03 Mar 2025
MiLiC-Eval: Benchmarking Multilingual LLMs for China's Minority Languages
Chen Zhang
Mingxu Tao
Zhiyuan Liao
Yansong Feng
76
0
0
03 Mar 2025
RSQ: Learning from Important Tokens Leads to Better Quantized LLMs
Yi-Lin Sung
Prateek Yadav
Jialu Li
Jaehong Yoon
Joey Tianyi Zhou
MQ
101
1
0
03 Mar 2025
Re-Imagining Multimodal Instruction Tuning: A Representation View
Yiyang Liu
James Liang
Ruixiang Tang
Yugyung Lee
Majid Rabbani
...
Raghuveer M. Rao
Lifu Huang
Dongfang Liu
Qifan Wang
Cheng Han
421
0
0
02 Mar 2025
DUAL: Diversity and Uncertainty Active Learning for Text Summarization
Petros Stylianos Giouroukis
Alexios Gidiotis
Grigorios Tsoumakas
70
1
0
02 Mar 2025
Predictive Data Selection: The Data That Predicts Is the Data That Teaches
Kashun Shum
Yuanmin Huang
Hongjian Zou
Qi Ding
Yixuan Liao
Xiao Chen
Qian Liu
Junxian He
178
4
0
02 Mar 2025
Transformer Meets Twicing: Harnessing Unattended Residual Information
Laziz U. Abdullaev
Tan M. Nguyen
144
3
0
02 Mar 2025
BodyGen: Advancing Towards Efficient Embodiment Co-Design
Haofei Lu
Zhe Wu
Junliang Xing
Jianshu Li
Ruoyu Li
Zhe Li
Yuanchun Shi
72
2
0
01 Mar 2025
Unlocking Efficient, Scalable, and Continual Knowledge Editing with Basis-Level Representation Fine-Tuning
Tianci Liu
R. Li
Yunzhe Qi
Hui Liu
Xianfeng Tang
...
Qingyu Yin
Monica Cheng
Jun Huan
Haoyu Wang
Jing Gao
KELM
98
4
0
01 Mar 2025
CL-MoE: Enhancing Multimodal Large Language Model with Dual Momentum Mixture-of-Experts for Continual Visual Question Answering
Tianyu Huai
Jie Zhou
Xingjiao Wu
Qin Chen
Qingchun Bai
Ze Zhou
Liang He
MoE
122
4
0
01 Mar 2025