ResearchTrend.AI

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,851 papers shown
Title
Noise-Robustness Through Noise: Asymmetric LoRA Adaption with Poisoning Expert
Zhaokun Wang
Jinyu Guo
Jingwen Pu
Lingfeng Chen
Hongli Pu
Jie Ou
Libo Qin
Wenhong Tian
AAML
34
0
0
29 May 2025
TRACE: Trajectory-Constrained Concept Erasure in Diffusion Models
Finn Carter
DiffM
79
0
0
29 May 2025
GeNRe: A French Gender-Neutral Rewriting System Using Collective Nouns
Enzo Doyen
Amalia Todirascu
42
0
0
29 May 2025
Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts
Xuweiyi Chen
Wentao Zhou
Aruni RoyChowdhury
Zezhou Cheng
3DPC
57
0
0
29 May 2025
A New Deep-learning-Based Approach For mRNA Optimization: High Fidelity, Computation Efficiency, and Multiple Optimization Factors
Zheng Gong
Ziyi Jiang
Weihao Gao
Deng Zhuo
Lan Ma
33
0
0
29 May 2025
MAP: Revisiting Weight Decomposition for Low-Rank Adaptation
Chongjie Si
Zhiyi Shi
Yadao Wang
Xiaokang Yang
Susanto Rahardja
Wei Shen
59
0
0
29 May 2025
Leave it to the Specialist: Repair Sparse LLMs with Sparse Fine-Tuning via Sparsity Evolution
Q. Xiao
Alan Ansell
Boqian Wu
Lu Yin
Mykola Pechenizkiy
Shiwei Liu
Decebal Constantin Mocanu
32
0
0
29 May 2025
GenIC: An LLM-Based Framework for Instance Completion in Knowledge Graphs
Amel Gader
Alsayed Algergawy
15
0
0
29 May 2025
Identity resolution of software metadata using Large Language Models
Eva Martín del Pico
Josep Lluís Gelpí
Salvador Capella-Gutiérrez
22
0
0
29 May 2025
MoRE: A Mixture of Low-Rank Experts for Adaptive Multi-Task Learning
Dacao Zhang
Kun Zhang
Shimao Chu
Le Wu
Xin Li
Si Wei
MoE, ALM, OffRL
32
0
0
28 May 2025
New Tools are Needed for Tracking Adherence to AI Model Behavioral Use Clauses
Daniel J. McDuff
Tim Korjakow
Kevin Klyman
Danish Contractor
MedIm
31
0
0
28 May 2025
ACE: Exploring Activation Cosine Similarity and Variance for Accurate and Calibration-Efficient LLM Pruning
Zhendong Mi
Zhenglun Kong
Geng Yuan
Shaoyi Huang
51
0
0
28 May 2025
DeepRTL2: A Versatile Model for RTL-Related Tasks
Yi Liu
Hongji Zhang
Yunhao Zhou
Zhengyuan Shi
Changran Xu
Qiang Xu
VLM
15
0
0
28 May 2025
ACE-Step: A Step Towards Music Generation Foundation Model
Junmin Gong
Sean Zhao
Sen Wang
S. Xu
Joe Guo
34
2
0
28 May 2025
Two-Stage Feature Generation with Transformer and Reinforcement Learning
Wanfu Gao
Zengyao Man
Zebin He
Yuhao Tang
Jun Gao
Kunpeng Liu
18
0
0
28 May 2025
Retrieval-Augmented Generation: A Comprehensive Survey of Architectures, Enhancements, and Robustness Frontiers
Chaitanya Sharma
RALM, 3DV
25
0
0
28 May 2025
Improving Continual Pre-training Through Seamless Data Packing
Ruicheng Yin
Xuan Gao
Changze Lv
Xiaohua Wang
Xiaoqing Zheng
Xuanjing Huang
31
0
0
28 May 2025
Highly Efficient and Effective LLMs with Multi-Boolean Architectures
Ba-Hien Tran
Van Minh Nguyen
MQ
56
0
0
28 May 2025
Comprehensive Evaluation on Lexical Normalization: Boundary-Aware Approaches for Unsegmented Languages
S. Higashiyama
Masao Utiyama
12
0
0
28 May 2025
ConsRec: Denoising Sequential Recommendation through User-Consistent Preference Modeling
Haidong Xin
Qiushi Xiong
Zhenghao Liu
Sen Mei
Yukun Yan
Shi Yu
Shuo Wang
Yu Gu
Ge Yu
Chenyan Xiong
HAI
53
0
0
28 May 2025
Revisiting Bayesian Model Averaging in the Era of Foundation Models
Mijung Park
UQCV, MoMe
17
0
0
28 May 2025
MEDAL: A Framework for Benchmarking LLMs as Multilingual Open-Domain Chatbots and Dialogue Evaluators
John Mendonça
A. Lavie
Isabel Trancoso
40
0
0
28 May 2025
AlignGen: Boosting Personalized Image Generation with Cross-Modality Prior Alignment
Yiheng Lin
Shifang Zhao
Ting Liu
Xiaochao Qu
Luoqi Liu
Yao Zhao
Yunchao Wei
DiffM
41
0
0
28 May 2025
Unraveling LoRA Interference: Orthogonal Subspaces for Robust Model Merging
Haobo Zhang
Jiayu Zhou
MoMe
48
0
0
28 May 2025
Improving QA Efficiency with DistilBERT: Fine-Tuning and Inference on mobile Intel CPUs
Ngeyen Yinkfu
7
0
0
28 May 2025
From Large AI Models to Agentic AI: A Tutorial on Future Intelligent Communications
Feibo Jiang
Cunhua Pan
Li Dong
Kezhi Wang
O. Dobre
Mérouane Debbah
LLMAG, AI4TS
172
1
0
28 May 2025
From Motion to Behavior: Hierarchical Modeling of Humanoid Generative Behavior Control
Jusheng Zhang
Jinzhou Tang
Sidi Liu
Mingyan Li
Sheng Zhang
Jian Wang
Keze Wang
23
0
0
28 May 2025
Budget-Adaptive Adapter Tuning in Orthogonal Subspaces for Continual Learning in LLMs
Zhiyi Wan
Wanrou Du
Liang Li
Miao Pan
Xiaoqi Qin
CLL
38
0
0
28 May 2025
From Reasoning to Learning: A Survey on Hypothesis Discovery and Rule Learning with Large Language Models
Kaiyu He
Zhiyu Chen
ReLM, LRM, ELM
75
0
0
28 May 2025
DocReRank: Single-Page Hard Negative Query Generation for Training Multi-Modal RAG Rerankers
Navve Wasserman
Oliver Heinimann
Yuval Golbari
Tal Zimbalist
Eli Schwartz
Michal Irani
51
0
0
28 May 2025
ICH-Qwen: A Large Language Model Towards Chinese Intangible Cultural Heritage
Wenhao Ye
Tiansheng Zheng
Yue Qi
Wenhua Zhao
Xiyu Wang
Xue Zhao
Jiacheng He
Yaya Zheng
Dongbo Wang
15
0
0
28 May 2025
In Dialogue with Intelligence: Rethinking Large Language Models as Collective Knowledge
Eleni Vasilaki
KELM
15
0
0
28 May 2025
Automated Essay Scoring Incorporating Annotations from Automated Feedback Systems
Christopher Ormerod
20
0
0
28 May 2025
Explainability of Large Language Models using SMILE: Statistical Model-agnostic Interpretability with Local Explanations
Zeinab Dehghani
Koorosh Aslansefat
Adil Khan
Mohammed Naveed Akram
MILM, LRM
132
0
0
27 May 2025
M-Wanda: Improving One-Shot Pruning for Multilingual LLMs
Rochelle Choenni
Ivan Titov
18
0
0
27 May 2025
Rethinking Information Synthesis in Multimodal Question Answering A Multi-Agent Perspective
Krishna Singh Rajput
Tejas Anvekar
Chitta Baral
Vivek Gupta
13
0
0
27 May 2025
A Lightweight Multi-Expert Generative Language Model System for Engineering Information and Knowledge Extraction
Bogdan Bogachov
Yaoyao Fiona Zhao
37
0
0
27 May 2025
PartInstruct: Part-level Instruction Following for Fine-grained Robot Manipulation
Yifan Yin
Zhengtao Han
Shivam Aarya
Jianxin Wang
Shuhang Xu
Jiawei Peng
Angtian Wang
Alan Yuille
Tianmin Shu
LM&Ro
32
0
0
27 May 2025
Efficient Large Language Model Inference with Neural Block Linearization
Mete Erdogan
F. Tonin
Volkan Cevher
78
0
0
27 May 2025
ID-Align: RoPE-Conscious Position Remapping for Dynamic High-Resolution Adaptation in Vision-Language Models
Bozhou Li
Wentao Zhang
VLM
29
0
0
27 May 2025
Open-Det: An Efficient Learning Framework for Open-Ended Detection
Guiping Cao
Tao Wang
Wenjian Huang
X. Lan
Jianguo Zhang
D. Jiang
ObjD, VLM
22
0
0
27 May 2025
Emotion-aware Dual Cross-Attentive Neural Network with Label Fusion for Stance Detection in Misinformative Social Media Content
Lata Pangtey
Mohammad Zia Ur Rehman
Prasad Chaudhari
Shubhi Bansal
Nagendra Kumar
28
0
0
27 May 2025
QwT-v2: Practical, Effective and Efficient Post-Training Quantization
Ningyuan Tang
Minghao Fu
Hao Yu
Jianxin Wu
MQ
89
0
0
27 May 2025
DLP: Dynamic Layerwise Pruning in Large Language Models
Yuli Chen
B. Cheng
Jiale Han
Yingying Zhang
Yingting Li
Shuhao Zhang
42
0
0
27 May 2025
Pretrained LLMs Learn Multiple Types of Uncertainty
Roi Cohen
Omri Fahn
Gerard de Melo
39
0
0
27 May 2025
Test-Time Learning for Large Language Models
Jinwu Hu
Zhitian Zhang
Guohao Chen
Xutao Wen
Chao Shuai
Wei Luo
Bin Xiao
Yuanqing Li
Mingkui Tan
55
0
0
27 May 2025
From Directions to Cones: Exploring Multidimensional Representations of Propositional Facts in LLMs
Stanley Yu
Vaidehi Bulusu
Oscar Yasunaga
Clayton Lau
Cole Blondin
Sean O'Brien
Kevin Zhu
Vasu Sharma
54
0
0
27 May 2025
ResSVD: Residual Compensated SVD for Large Language Model Compression
Haolei Bai
Siyong Jian
Tuo Liang
Yu Yin
Huan Wang
46
0
0
26 May 2025
Deconstructing Obfuscation: A four-dimensional framework for evaluating Large Language Models assembly code deobfuscation capabilities
Anton Tkachenko
Dmitrij Suskevic
Benjamin Adolphi
58
0
0
26 May 2025
Learning Extrapolative Sequence Transformations from Markov Chains
Sophia Hager
Aleem Khan
Andrew Wang
Nicholas Andrews
BDL
33
0
0
26 May 2025