ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,870 papers shown
Title
Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time Series Classification
Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time Series Classification
Vasilii Feofanov
Songkang Wen
Marius Alonso
Romain Ilbert
Hongbo Guo
Malik Tiomoko
Lujia Pan
Jianfeng Zhang
I. Redko
AI4TSVLM
137
4
0
24 Feb 2025
Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home
Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home
Viktor Moskvoretskii
M. Lysyuk
Mikhail Salnikov
Nikolay Ivanov
Sergey Pletenev
Daria Galimzianova
Nikita Krayko
Vasily Konovalov
Irina Nikishina
Alexander Panchenko
RALM
146
7
0
24 Feb 2025
Fed-SB: A Silver Bullet for Extreme Communication Efficiency and Performance in (Private) Federated LoRA Fine-Tuning
Fed-SB: A Silver Bullet for Extreme Communication Efficiency and Performance in (Private) Federated LoRA Fine-Tuning
Raghav Singhal
Kaustubh Ponkshe
Rohit Vartak
Lav R. Varshney
Praneeth Vepakomma
FedML
128
4
0
24 Feb 2025
Corrections Meet Explanations: A Unified Framework for Explainable Grammatical Error Correction
Corrections Meet Explanations: A Unified Framework for Explainable Grammatical Error Correction
Jingheng Ye
Shang Qin
Hai-Tao Zheng
Hai-Tao Zheng
Shen Wang
Qingsong Wen
109
0
0
24 Feb 2025
Textual-to-Visual Iterative Self-Verification for Slide Generation
Textual-to-Visual Iterative Self-Verification for Slide Generation
Yunqing Xu
Xinbei Ma
Jiyang Qiu
Hai Zhao
117
0
0
24 Feb 2025
Entailment-Preserving First-order Logic Representations in Natural Language Entailment
Entailment-Preserving First-order Logic Representations in Natural Language Entailment
Jinu Lee
Qi Liu
Runzhi Ma
Vincent Han
Ziqi Wang
Heng Ji
Julia Hockenmaier
98
1
0
24 Feb 2025
KVCrush: Key value cache size-reduction using similarity in head-behaviour
Gopi Krishna Jha
Sameh Gobriel
Liubov Talamanova
Alexander Kozlov
Nilesh Jain
MQ
78
0
0
24 Feb 2025
Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing
Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing
Qi Le
Enmao Diao
Ziyan Wang
Xinran Wang
Jie Ding
Li Yang
Ali Anwar
117
5
0
24 Feb 2025
Policy Learning with a Natural Language Action Space: A Causal Approach
Policy Learning with a Natural Language Action Space: A Causal Approach
Bohan Zhang
Yixin Wang
Paramveer S. Dhillon
CML
76
0
0
24 Feb 2025
Delta Decompression for MoE-based LLMs Compression
Hao Gu
Wei Li
Lujun Li
Qiyuan Zhu
Mark Lee
Shengjie Sun
Wei Xue
Yike Guo
MoE
132
3
0
24 Feb 2025
In-context learning of evolving data streams with tabular foundational models
In-context learning of evolving data streams with tabular foundational models
Afonso Lourenço
João Gama
Eric P. Xing
Goreti Marreiros
135
0
0
24 Feb 2025
Automatic Input Rewriting Improves Translation with Large Language Models
Automatic Input Rewriting Improves Translation with Large Language Models
Dayeon Ki
Marine Carpuat
89
0
0
23 Feb 2025
Uncovering the Hidden Threat of Text Watermarking from Users with Cross-Lingual Knowledge
Uncovering the Hidden Threat of Text Watermarking from Users with Cross-Lingual Knowledge
Mansour Al Ghanim
Jiaqi Xue
Rochana Prih Hastuti
Mengxin Zheng
Yan Solihin
Qian Lou
WaLM
113
0
0
23 Feb 2025
RapidPen: Fully Automated IP-to-Shell Penetration Testing with LLM-based Agents
RapidPen: Fully Automated IP-to-Shell Penetration Testing with LLM-based Agents
Sho Nakatani
127
3
1
23 Feb 2025
MV-CLAM: Multi-View Molecular Interpretation with Cross-Modal Projection via Language Model
Sumin Ha
Jun Hyeong Kim
Yinhua Piao
Sun Kim
145
1
0
23 Feb 2025
Navigation-GPT: A Robust and Adaptive Framework Utilizing Large Language Models for Navigation Applications
Navigation-GPT: A Robust and Adaptive Framework Utilizing Large Language Models for Navigation Applications
Feng Ma
Xiang Wang
Chen Chen
Xiao-bin Xu
Xin-ping Yan
474
0
0
23 Feb 2025
Fine-Grained Captioning of Long Videos through Scene Graph Consolidation
Fine-Grained Captioning of Long Videos through Scene Graph Consolidation
Sanghyeok Chu
Seonguk Seo
Bohyung Han
114
1
0
23 Feb 2025
NatSGLD: A Dataset with Speech, Gesture, Logic, and Demonstration for Robot Learning in Natural Human-Robot Interaction
NatSGLD: A Dataset with Speech, Gesture, Logic, and Demonstration for Robot Learning in Natural Human-Robot Interaction
Snehesh Shrestha
Yantian Zha
Saketh Banagiri
Ge Gao
Yiannis Aloimonos
Cornelia Fermüller
69
0
0
23 Feb 2025
Recurrent Knowledge Identification and Fusion for Language Model Continual Learning
Recurrent Knowledge Identification and Fusion for Language Model Continual Learning
Yujie Feng
Xujia Wang
Zexin Lu
Shenghong Fu
Guangyuan Shi
Yongxin Xu
Yasha Wang
Philip S. Yu
Xu Chu
Xiao-Ming Wu
CLLKELM
120
1
0
22 Feb 2025
RAG-Enhanced Collaborative LLM Agents for Drug Discovery
RAG-Enhanced Collaborative LLM Agents for Drug Discovery
Namkyeong Lee
Edward De Brouwer
Ehsan Hajiramezanali
Tommaso Biancalani
Chanyoung Park
Gabriele Scalia
168
0
0
22 Feb 2025
BiDeV: Bilateral Defusing Verification for Complex Claim Fact-Checking
BiDeV: Bilateral Defusing Verification for Complex Claim Fact-Checking
Yuxuan Liu
Hongda Sun
Wenya Guo
Xinyan Xiao
Cunli Mao
Zhengtao Yu
Rui Yan
148
3
0
22 Feb 2025
Wrong Answers Can Also Be Useful: PlausibleQA -- A Large-Scale QA Dataset with Answer Plausibility Scores
Wrong Answers Can Also Be Useful: PlausibleQA -- A Large-Scale QA Dataset with Answer Plausibility Scores
Jamshid Mozafari
Abdelrahman Abdallah
Bhawna Piryani
Adam Jatowt
108
0
0
22 Feb 2025
Collaborative Retrieval for Large Language Model-based Conversational Recommender Systems
Collaborative Retrieval for Large Language Model-based Conversational Recommender Systems
Yaochen Zhu
Chao Wan
Harald Steck
Dawen Liang
Yesu Feng
Nathan Kallus
Wenlin Yao
RALM
121
2
0
21 Feb 2025
CSTRL: Context-Driven Sequential Transfer Learning for Abstractive Radiology Report Summarization
CSTRL: Context-Driven Sequential Transfer Learning for Abstractive Radiology Report Summarization
Mst. Fahmida Sultana Naznin
Adnan Ibney Faruq
Mostafa Rifat Tazwar
Md Jobayer
Md. Mehedi Hasan Shawon
Md Rakibul Hasan
MedIm
66
0
0
21 Feb 2025
Directional Gradient Projection for Robust Fine-Tuning of Foundation Models
Directional Gradient Projection for Robust Fine-Tuning of Foundation Models
Chengyue Huang
Junjiao Tian
Brisa Maneechotesuwan
Shivang Chopra
Z. Kira
175
2
0
21 Feb 2025
Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation
Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation
Abdelrahman Abdallah
Bhawna Piryani
Jamshid Mozafari
Mohammed Ali
Adam Jatowt
354
1
0
21 Feb 2025
Dynamic Low-Rank Sparse Adaptation for Large Language Models
Dynamic Low-Rank Sparse Adaptation for Large Language Models
Weizhong Huang
Yuxin Zhang
Xiawu Zheng
Yang Liu
Jing Lin
Yiwu Yao
Rongrong Ji
210
2
0
21 Feb 2025
Comprehensive Analysis of Transparency and Accessibility of ChatGPT, DeepSeek, And other SoTA Large Language Models
Comprehensive Analysis of Transparency and Accessibility of ChatGPT, DeepSeek, And other SoTA Large Language Models
Ranjan Sapkota
Shaina Raza
Manoj Karkee
101
7
0
21 Feb 2025
EAGER-LLM: Enhancing Large Language Models as Recommenders through Exogenous Behavior-Semantic Integration
EAGER-LLM: Enhancing Large Language Models as Recommenders through Exogenous Behavior-Semantic Integration
Minjie Hong
Yan Xia
Ziyi Wang
Jieming Zhu
Ye Wang
...
Xiaoda Yang
Quanyu Dai
Zhenhua Dong
Zhimeng Zhang
Zhou Zhao
83
4
0
21 Feb 2025
Question Answering with Texts and Tables through Deep Reinforcement Learning
Question Answering with Texts and Tables through Deep Reinforcement Learning
M. M. José
Flávio Nakasato Cação
Maria F. Ribeiro
Rafael M. Cheang
Paulo Pirozelli
Fabio Gagliardi Cozman
LMTDRALM
231
0
0
21 Feb 2025
RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers
RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers
Min Zhao
Guande He
Yixiao Chen
Hongzhou Zhu
Chong Li
Jun Zhu
VGen
130
11
0
21 Feb 2025
Straight to Zero: Why Linearly Decaying the Learning Rate to Zero Works Best for LLMs
Straight to Zero: Why Linearly Decaying the Learning Rate to Zero Works Best for LLMs
Shane Bergsma
Nolan Dey
Gurpreet Gosal
Gavia Gray
Daria Soboleva
Joel Hestness
109
8
0
21 Feb 2025
Tight Clusters Make Specialized Experts
Tight Clusters Make Specialized Experts
Stefan K. Nielsen
R. Teo
Laziz U. Abdullaev
Tan M. Nguyen
MoE
142
4
0
21 Feb 2025
A Survey of Model Architectures in Information Retrieval
A Survey of Model Architectures in Information Retrieval
Zhichao Xu
Fengran Mo
Zhiqi Huang
Crystina Zhang
Puxuan Yu
Bei Wang
Jimmy J. Lin
Vivek Srikumar
KELM3DV
187
2
0
21 Feb 2025
Looped ReLU MLPs May Be All You Need as Practical Programmable Computers
Looped ReLU MLPs May Be All You Need as Practical Programmable Computers
Yingyu Liang
Zhizhou Sha
Zhenmei Shi
Zhao Song
Yufa Zhou
181
19
0
21 Feb 2025
Visual Zero-Shot E-Commerce Product Attribute Value Extraction
Visual Zero-Shot E-Commerce Product Attribute Value Extraction
Jiaying Gong
Ming Cheng
Hongda Shen
Pierre-Yves Vandenbussche
Janet Jenq
Hoda Eldardiry
73
1
0
21 Feb 2025
SpinQuant: LLM quantization with learned rotations
SpinQuant: LLM quantization with learned rotations
Zechun Liu
Changsheng Zhao
Igor Fedorov
Bilge Soran
Dhruv Choudhary
Raghuraman Krishnamoorthi
Vikas Chandra
Yuandong Tian
Tijmen Blankevoort
MQ
263
126
0
21 Feb 2025
Protein Large Language Models: A Comprehensive Survey
Protein Large Language Models: A Comprehensive Survey
Yijia Xiao
Wanjia Zhao
Junkai Zhang
Yiqiao Jin
Han Zhang
...
Xiao Luo
Yu Zhang
James Zou
Yizhou Sun
Wei Wang
LM&MAAI4CE
145
4
0
21 Feb 2025
Contrastive Localized Language-Image Pre-Training
Contrastive Localized Language-Image Pre-Training
Hong-You Chen
Zhengfeng Lai
Hao Zhang
Xiang Wang
Marcin Eichner
Keen You
Meng Cao
Bowen Zhang
Yue Yang
Zhe Gan
CLIPVLM
124
10
0
20 Feb 2025
Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation
Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation
Yue Yang
Ajay Patel
Matt Deitke
Tanmay Gupta
Luca Weihs
...
Mark Yatskar
Chris Callison-Burch
Ranjay Krishna
Aniruddha Kembhavi
Christopher Clark
SyDa
207
3
0
20 Feb 2025
Evaluating Large Language Models for Public Health Classification and Extraction Tasks
Evaluating Large Language Models for Public Health Classification and Extraction Tasks
Joshua Harris
Timothy Laurence
Leo Loman
Fan Grayson
Toby Nonnenmacher
...
Hamish Mohammed
Thomas Finnie
Luke Hounsome
Michael Borowitz
Steven Riley
LM&MAAI4MH
148
5
0
20 Feb 2025
FedSpaLLM: Federated Pruning of Large Language Models
FedSpaLLM: Federated Pruning of Large Language Models
Guangji Bai
Yijiang Li
Zilinghan Li
Liang Zhao
Kibaek Kim
FedML
138
6
0
20 Feb 2025
Can a Single Model Master Both Multi-turn Conversations and Tool Use? CoALM: A Unified Conversational Agentic Language Model
Can a Single Model Master Both Multi-turn Conversations and Tool Use? CoALM: A Unified Conversational Agentic Language Model
Emre Can Acikgoz
Jeremiah Greer
Akul Datta
Ze Yang
William Zeng
Oussama Elachqar
Emmanouil Koukoumidis
Dilek Hakkani-Tur
Gokhan Tur
LLMAG
184
3
0
20 Feb 2025
MKE-Coder: Multi-Axial Knowledge with Evidence Verification in ICD Coding for Chinese EMRs
MKE-Coder: Multi-Axial Knowledge with Evidence Verification in ICD Coding for Chinese EMRs
Xinxin You
Xien Liu
Xue Yang
Ziyi Wang
Ji Wu
92
0
0
19 Feb 2025
A Comprehensive Survey on Composed Image Retrieval
A Comprehensive Survey on Composed Image Retrieval
Xuemeng Song
Haoqiang Lin
Haokun Wen
Bohan Hou
Mingzhu Xu
Liqiang Nie
131
3
0
19 Feb 2025
GneissWeb: Preparing High Quality Data for LLMs at Scale
GneissWeb: Preparing High Quality Data for LLMs at Scale
Hajar Emami-Gohari
S. Kadhe
Syed Yousaf Shah. Constantin Adam
Abdulhamid A. Adebayo
Praneet Adusumilli
...
Issei Yoshida
Syed Zawad
Petros Zerfos
Yi Zhou
Bishwaranjan Bhattacharjee
68
1
0
19 Feb 2025
EvoP: Robust LLM Inference via Evolutionary Pruning
EvoP: Robust LLM Inference via Evolutionary Pruning
Shangyu Wu
Hongchao Du
Ying Xiong
Shuai Chen
Tei-Wei Kuo
Nan Guan
Chun Jason Xue
98
1
0
19 Feb 2025
MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures
MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures
Jiayu Qin
Jianchao Tan
Kai Zhang
Xunliang Cai
Wei Wang
75
0
0
19 Feb 2025
Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad Prediction
Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad Prediction
Nils Constantin Hellwig
Jakob Fehle
Udo Kruschwitz
Christian Wolff
AI4MH
143
0
0
18 Feb 2025
Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization
Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization
Shuo Xing
Yuping Wang
Peiran Li
Ruizheng Bai
Yansen Wang
Chan-wei Hu
Chengxuan Qian
Huaxiu Yao
Zhengzhong Tu
185
8
0
18 Feb 2025
Previous
123...192021...196197198
Next