1910.10683
Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,851 papers shown
Title
Noise-Robustness Through Noise: Asymmetric LoRA Adaption with Poisoning Expert
Zhaokun Wang
Jinyu Guo
Jingwen Pu
Lingfeng Chen
Hongli Pu
Jie Ou
Libo Qin
Wenhong Tian
AAML
34
0
0
29 May 2025
TRACE: Trajectory-Constrained Concept Erasure in Diffusion Models
Finn Carter
DiffM
79
0
0
29 May 2025
GeNRe: A French Gender-Neutral Rewriting System Using Collective Nouns
Enzo Doyen
Amalia Todirascu
42
0
0
29 May 2025
Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts
Xuweiyi Chen
Wentao Zhou
Aruni RoyChowdhury
Zezhou Cheng
3DPC
57
0
0
29 May 2025
A New Deep-learning-Based Approach For mRNA Optimization: High Fidelity, Computation Efficiency, and Multiple Optimization Factors
Zheng Gong
Ziyi Jiang
Weihao Gao
Deng Zhuo
Lan Ma
33
0
0
29 May 2025
MAP: Revisiting Weight Decomposition for Low-Rank Adaptation
Chongjie Si
Zhiyi Shi
Yadao Wang
Xiaokang Yang
Susanto Rahardja
Wei Shen
59
0
0
29 May 2025
Leave it to the Specialist: Repair Sparse LLMs with Sparse Fine-Tuning via Sparsity Evolution
Q. Xiao
Alan Ansell
Boqian Wu
Lu Yin
Mykola Pechenizkiy
Shiwei Liu
Decebal Constantin Mocanu
32
0
0
29 May 2025
GenIC: An LLM-Based Framework for Instance Completion in Knowledge Graphs
Amel Gader
Alsayed Algergawy
15
0
0
29 May 2025
Identity resolution of software metadata using Large Language Models
Eva Martín del Pico
Josep Lluís Gelpí
Salvador Capella-Gutiérrez
22
0
0
29 May 2025
MoRE: A Mixture of Low-Rank Experts for Adaptive Multi-Task Learning
Dacao Zhang
Kun Zhang
Shimao Chu
Le Wu
Xin Li
Si Wei
MoE
ALM
OffRL
32
0
0
28 May 2025
New Tools are Needed for Tracking Adherence to AI Model Behavioral Use Clauses
Daniel J. McDuff
Tim Korjakow
Kevin Klyman
Danish Contractor
MedIm
31
0
0
28 May 2025
ACE: Exploring Activation Cosine Similarity and Variance for Accurate and Calibration-Efficient LLM Pruning
Zhendong Mi
Zhenglun Kong
Geng Yuan
Shaoyi Huang
51
0
0
28 May 2025
DeepRTL2: A Versatile Model for RTL-Related Tasks
Yi Liu
Hongji Zhang
Yunhao Zhou
Zhengyuan Shi
Changran Xu
Qiang Xu
VLM
15
0
0
28 May 2025
ACE-Step: A Step Towards Music Generation Foundation Model
Junmin Gong
Sean Zhao
Sen Wang
S. Xu
Joe Guo
34
2
0
28 May 2025
Two-Stage Feature Generation with Transformer and Reinforcement Learning
Wanfu Gao
Zengyao Man
Zebin He
Yuhao Tang
Jun Gao
Kunpeng Liu
18
0
0
28 May 2025
Retrieval-Augmented Generation: A Comprehensive Survey of Architectures, Enhancements, and Robustness Frontiers
Chaitanya Sharma
RALM
3DV
25
0
0
28 May 2025
Improving Continual Pre-training Through Seamless Data Packing
Ruicheng Yin
Xuan Gao
Changze Lv
Xiaohua Wang
Xiaoqing Zheng
Xuanjing Huang
31
0
0
28 May 2025
Highly Efficient and Effective LLMs with Multi-Boolean Architectures
Ba-Hien Tran
Van Minh Nguyen
MQ
56
0
0
28 May 2025
Comprehensive Evaluation on Lexical Normalization: Boundary-Aware Approaches for Unsegmented Languages
S. Higashiyama
Masao Utiyama
12
0
0
28 May 2025
ConsRec: Denoising Sequential Recommendation through User-Consistent Preference Modeling
Haidong Xin
Qiushi Xiong
Zhenghao Liu
Sen Mei
Yukun Yan
Shi Yu
Shuo Wang
Yu Gu
Ge Yu
Chenyan Xiong
HAI
53
0
0
28 May 2025
Revisiting Bayesian Model Averaging in the Era of Foundation Models
Mijung Park
UQCV
MoMe
17
0
0
28 May 2025
MEDAL: A Framework for Benchmarking LLMs as Multilingual Open-Domain Chatbots and Dialogue Evaluators
John Mendonça
A. Lavie
Isabel Trancoso
40
0
0
28 May 2025
AlignGen: Boosting Personalized Image Generation with Cross-Modality Prior Alignment
Yiheng Lin
Shifang Zhao
Ting Liu
Xiaochao Qu
Luoqi Liu
Yao Zhao
Yunchao Wei
DiffM
41
0
0
28 May 2025
Unraveling LoRA Interference: Orthogonal Subspaces for Robust Model Merging
Haobo Zhang
Jiayu Zhou
MoMe
48
0
0
28 May 2025
Improving QA Efficiency with DistilBERT: Fine-Tuning and Inference on mobile Intel CPUs
Ngeyen Yinkfu
7
0
0
28 May 2025
From Large AI Models to Agentic AI: A Tutorial on Future Intelligent Communications
Feibo Jiang
Cunhua Pan
Li Dong
Kezhi Wang
O. Dobre
Mérouane Debbah
LLMAG
AI4TS
172
1
0
28 May 2025
From Motion to Behavior: Hierarchical Modeling of Humanoid Generative Behavior Control
Jusheng Zhang
Jinzhou Tang
Sidi Liu
Mingyan Li
Sheng Zhang
Jian Wang
Keze Wang
23
0
0
28 May 2025
Budget-Adaptive Adapter Tuning in Orthogonal Subspaces for Continual Learning in LLMs
Zhiyi Wan
Wanrou Du
Liang Li
Miao Pan
Xiaoqi Qin
CLL
38
0
0
28 May 2025
From Reasoning to Learning: A Survey on Hypothesis Discovery and Rule Learning with Large Language Models
Kaiyu He
Zhiyu Chen
ReLM
LRM
ELM
75
0
0
28 May 2025
DocReRank: Single-Page Hard Negative Query Generation for Training Multi-Modal RAG Rerankers
Navve Wasserman
Oliver Heinimann
Yuval Golbari
Tal Zimbalist
Eli Schwartz
Michal Irani
51
0
0
28 May 2025
ICH-Qwen: A Large Language Model Towards Chinese Intangible Cultural Heritage
Wenhao Ye
Tiansheng Zheng
Yue Qi
Wenhua Zhao
Xiyu Wang
Xue Zhao
Jiacheng He
Yaya Zheng
Dongbo Wang
15
0
0
28 May 2025
In Dialogue with Intelligence: Rethinking Large Language Models as Collective Knowledge
Eleni Vasilaki
KELM
15
0
0
28 May 2025
Automated Essay Scoring Incorporating Annotations from Automated Feedback Systems
Christopher Ormerod
20
0
0
28 May 2025
Explainability of Large Language Models using SMILE: Statistical Model-agnostic Interpretability with Local Explanations
Zeinab Dehghani
Koorosh Aslansefat
Adil Khan
Mohammed Naveed Akram
MILM
LRM
132
0
0
27 May 2025
M-Wanda: Improving One-Shot Pruning for Multilingual LLMs
Rochelle Choenni
Ivan Titov
18
0
0
27 May 2025
Rethinking Information Synthesis in Multimodal Question Answering: A Multi-Agent Perspective
Krishna Singh Rajput
Tejas Anvekar
Chitta Baral
Vivek Gupta
13
0
0
27 May 2025
A Lightweight Multi-Expert Generative Language Model System for Engineering Information and Knowledge Extraction
Bogdan Bogachov
Yaoyao Fiona Zhao
37
0
0
27 May 2025
PartInstruct: Part-level Instruction Following for Fine-grained Robot Manipulation
Yifan Yin
Zhengtao Han
Shivam Aarya
Jianxin Wang
Shuhang Xu
Jiawei Peng
Angtian Wang
Alan Yuille
Tianmin Shu
LM&Ro
32
0
0
27 May 2025
Efficient Large Language Model Inference with Neural Block Linearization
Mete Erdogan
F. Tonin
Volkan Cevher
78
0
0
27 May 2025
ID-Align: RoPE-Conscious Position Remapping for Dynamic High-Resolution Adaptation in Vision-Language Models
Bozhou Li
Wentao Zhang
VLM
29
0
0
27 May 2025
Open-Det: An Efficient Learning Framework for Open-Ended Detection
Guiping Cao
Tao Wang
Wenjian Huang
X. Lan
Jianguo Zhang
D. Jiang
ObjD
VLM
22
0
0
27 May 2025
Emotion-aware Dual Cross-Attentive Neural Network with Label Fusion for Stance Detection in Misinformative Social Media Content
Lata Pangtey
Mohammad Zia Ur Rehman
Prasad Chaudhari
Shubhi Bansal
Nagendra Kumar
28
0
0
27 May 2025
QwT-v2: Practical, Effective and Efficient Post-Training Quantization
Ningyuan Tang
Minghao Fu
Hao Yu
Jianxin Wu
MQ
89
0
0
27 May 2025
DLP: Dynamic Layerwise Pruning in Large Language Models
Yuli Chen
B. Cheng
Jiale Han
Yingying Zhang
Yingting Li
Shuhao Zhang
42
0
0
27 May 2025
Pretrained LLMs Learn Multiple Types of Uncertainty
Roi Cohen
Omri Fahn
Gerard de Melo
39
0
0
27 May 2025
Test-Time Learning for Large Language Models
Jinwu Hu
Zhitian Zhang
Guohao Chen
Xutao Wen
Chao Shuai
Wei Luo
Bin Xiao
Yuanqing Li
Mingkui Tan
55
0
0
27 May 2025
From Directions to Cones: Exploring Multidimensional Representations of Propositional Facts in LLMs
Stanley Yu
Vaidehi Bulusu
Oscar Yasunaga
Clayton Lau
Cole Blondin
Sean O'Brien
Kevin Zhu
Vasu Sharma
54
0
0
27 May 2025
ResSVD: Residual Compensated SVD for Large Language Model Compression
Haolei Bai
Siyong Jian
Tuo Liang
Yu Yin
Huan Wang
46
0
0
26 May 2025
Deconstructing Obfuscation: A four-dimensional framework for evaluating Large Language Models assembly code deobfuscation capabilities
Anton Tkachenko
Dmitrij Suskevic
Benjamin Adolphi
58
0
0
26 May 2025
Learning Extrapolative Sequence Transformations from Markov Chains
Sophia Hager
Aleem Khan
Andrew Wang
Nicholas Andrews
BDL
33
0
0
26 May 2025