ResearchTrend.AI
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,959 papers shown
SoundLoCD: An Efficient Conditional Discrete Contrastive Latent Diffusion Model for Text-to-Sound Generation
Xinlei Niu
Jing Zhang
Christian J. Walder
Charles Patrick Martin
67
2
0
24 May 2024
Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving
Jianbiao Mei
Yukai Ma
Xuemeng Yang
Licheng Wen
Xinyu Cai
...
Min Dou
Botian Shi
Liang He
Yong-Jin Liu
Yu Qiao
105
15
0
24 May 2024
Organic Data-Driven Approach for Turkish Grammatical Error Correction and LLMs
Asim Ersoy
O. T. Yildiz
61
0
0
24 May 2024
Before Generation, Align it! A Novel and Effective Strategy for Mitigating Hallucinations in Text-to-SQL Generation
Ge Qu
Jinyang Li
Bowen Li
Bowen Qin
Nan Huo
Chenhao Ma
Reynold Cheng
76
30
0
24 May 2024
Machine Unlearning in Large Language Models
Saaketh Koundinya Gundavarapu
Shreya Agarwal
Arushi Arora
Chandana Thimmalapura Jagadeeshaiah
MU
40
0
0
24 May 2024
CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems
Abbas Ghaddar
David Alfonso-Hermelo
Philippe Langlais
Mehdi Rezagholizadeh
Boxing Chen
Prasanna Parthasarathi
83
0
0
24 May 2024
Sparse Spectral Training and Inference on Euclidean and Hyperbolic Neural Networks
Jialin Zhao
Yingtao Zhang
Xinghang Li
Huaping Liu
C. Cannistraci
75
1
0
24 May 2024
AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings
R. Reddy
Omar Attia
Yunyao Li
Heng Ji
Saloni Potdar
66
1
0
23 May 2024
Extracting Prompts by Inverting LLM Outputs
Collin Zhang
John X. Morris
Vitaly Shmatikov
76
22
0
23 May 2024
Linking In-context Learning in Transformers to Human Episodic Memory
Ji-An Li
Corey Y. Zhou
M. Benna
Marcelo G. Mattar
64
4
0
23 May 2024
A Nurse is Blue and Elephant is Rugby: Cross Domain Alignment in Large Language Models Reveal Human-like Patterns
Asaf Yehudai
Taelin Karidi
Gabriel Stanovsky
Ariel Goldstein
Omri Abend
91
1
0
23 May 2024
Bitune: Bidirectional Instruction-Tuning
D. J. Kopiczko
Tijmen Blankevoort
Yuki Markus Asano
50
3
0
23 May 2024
PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression
Vladimir Malinovskii
Denis Mazur
Ivan Ilin
Denis Kuznedelev
Konstantin Burlachenko
Kai Yi
Dan Alistarh
Peter Richtárik
MQ
118
24
0
23 May 2024
Analysis of Atom-level pretraining with Quantum Mechanics (QM) data for Graph Neural Networks Molecular property models
Jose A. Arjona-Medina
Ramil I. Nugmanov
AI4CE
88
2
0
23 May 2024
Small Language Models for Application Interactions: A Case Study
Beibin Li
Yi Zhang
Sébastien Bubeck
Jeevan Pathuri
Ishai Menache
89
4
0
23 May 2024
WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models
Peng Wang
Zexi Li
Ningyu Zhang
Ziwen Xu
Yunzhi Yao
Yong Jiang
Pengjun Xie
Fei Huang
Huajun Chen
KELMCLL
132
34
0
23 May 2024
Multi-turn Reinforcement Learning from Preference Human Feedback
Lior Shani
Aviv Rosenberg
Asaf B. Cassel
Oran Lang
Daniele Calandriello
...
Bilal Piot
Idan Szpektor
Avinatan Hassidim
Yossi Matias
Rémi Munos
104
34
0
23 May 2024
Integer Scale: A Free Lunch for Faster Fine-grained Quantization of LLMs
Qingyuan Li
Ran Meng
Yiduo Li
Bo Zhang
Yifan Lu
Yerui Sun
Lin Ma
Yuchen Xie
MQ
106
0
0
23 May 2024
Mitigating Quantization Errors Due to Activation Spikes in GLU-Based LLMs
Jaewoo Yang
Hayun Kim
Younghoon Kim
95
15
0
23 May 2024
Instruction Tuning With Loss Over Instructions
Zhengyan Shi
Adam X. Yang
Bin Wu
Laurence Aitchison
Emine Yilmaz
Aldo Lipani
ALM
87
23
0
23 May 2024
Advancing Spiking Neural Networks for Sequential Modeling with Central Pattern Generators
Changze Lv
Dongqi Han
Yansen Wang
Xiaoqing Zheng
Xuanjing Huang
Dongsheng Li
57
1
0
23 May 2024
Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration
Yang Zhang
Shixin Yang
Chenjia Bai
Fei Wu
Xiu Li
Zhen Wang
Xuelong Li
LLMAG
119
32
0
23 May 2024
Focus Anywhere for Fine-grained Multi-page Document Understanding
Chenglong Liu
Haoran Wei
Jinyue Chen
Lingyu Kong
Zheng Ge
Zining Zhu
Liang Zhao
Jian‐Yuan Sun
Chunrui Han
Xiangyu Zhang
85
25
0
23 May 2024
EHR-SeqSQL : A Sequential Text-to-SQL Dataset For Interactively Exploring Electronic Health Records
Jaehee Ryu
Seonhee Cho
Gyubok Lee
Edward Choi
120
2
0
23 May 2024
Federated Domain-Specific Knowledge Transfer on Large Language Models Using Synthetic Data
Haoran Li
Xinyuan Zhao
Dadi Guo
Hanlin Gu
Huiping Zhuang
Yuxing Han
Yangqiu Song
Lixin Fan
Qiang Yang
101
1
0
23 May 2024
OAC: Output-adaptive Calibration for Accurate Post-training Quantization
Ali Edalati
Alireza Ghaffari
M. Asgharian
Lu Hou
Boxing Chen
Vahid Partovi Nia
MQ
177
0
0
23 May 2024
SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models
Wei Huang
Haotong Qin
Yangdong Liu
Yawei Li
Qinshuo Liu
Xianglong Liu
Luca Benini
Michele Magno
Shiming Zhang
Xiaojuan Qi
MQ
144
19
0
23 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
355
54
0
23 May 2024
What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions
Sang Keun Choe
Hwijeen Ahn
Juhan Bae
Kewen Zhao
Minsoo Kang
...
Teruko Mitamura
Jeff Schneider
Eduard Hovy
Roger C. Grosse
Eric Xing
TDI
99
44
0
22 May 2024
Dense Connector for MLLMs
Huanjin Yao
Wenhao Wu
Taojiannan Yang
Yuxin Song
Mengxi Zhang
Haocheng Feng
Yifan Sun
Zhiheng Li
Wanli Ouyang
Jingdong Wang
MLLMVLM
102
25
0
22 May 2024
Robust Disaster Assessment from Aerial Imagery Using Text-to-Image Synthetic Data
Tarun Kalluri
Jihyeon Janel Lee
Kihyuk Sohn
Sahil Singla
Manmohan Chandraker
Joseph Z. Xu
Jeremiah Liu
125
1
0
22 May 2024
AltChart: Enhancing VLM-based Chart Summarization Through Multi-Pretext Tasks
Omar Moured
Jiaming Zhang
M. Sarfraz
Rainer Stiefelhagen
65
3
0
22 May 2024
Multi-Scale Feature Fusion Quantum Depthwise Convolutional Neural Networks for Text Classification
Yixiong Chen
Weichuan Fang
81
1
0
22 May 2024
AdpQ: A Zero-shot Calibration Free Adaptive Post Training Quantization Method for LLMs
Alireza Ghaffari
Sharareh Younesian
Vahid Partovi Nia
Boxing Chen
M. Asgharian
MQ
75
0
0
22 May 2024
Efficacy of ByT5 in Multilingual Translation of Biblical Texts for Underrepresented Languages
Corinne Aars
Lauren Adams
Xiaokan Tian
Zhaoyu Wang
Colton Wismer
Jason Wu
Pablo Rivas
Korn Sooksatra
Matthew Fendt
43
0
0
22 May 2024
Equipping Transformer with Random-Access Reading for Long-Context Understanding
Chenghao Yang
Zi Yang
Nan Hua
72
1
0
21 May 2024
Efficient and Interpretable Information Retrieval for Product Question Answering with Heterogeneous Data
Biplob Biswas
R. Ramnath
32
1
0
21 May 2024
ReALLM: A general framework for LLM compression and fine-tuning
Louis Leconte
Lisa Bedin
Van Minh Nguyen
Eric Moulines
MQ
129
1
0
21 May 2024
An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation
Zhiyu Tan
Mengping Yang
Luozheng Qin
Hao Yang
Ye Qian
Qiang-feng Zhou
Cheng Zhang
Hao Li
110
6
0
21 May 2024
Investigating Persuasion Techniques in Arabic: An Empirical Study Leveraging Large Language Models
Abdurahmman Alzahrani
Eyad Babkier
Faisal Yanbaawi
Firas Yanbaawi
Hassan Alhuzali
82
0
0
21 May 2024
FAdam: Adam is a natural gradient optimizer using diagonal empirical Fisher information
Dongseong Hwang
ODL
109
9
0
21 May 2024
C3L: Content Correlated Vision-Language Instruction Tuning Data Generation via Contrastive Learning
Ji Ma
Wei Suo
Peng Wang
Yanning Zhang
VLM
122
0
0
21 May 2024
SPO: Multi-Dimensional Preference Sequential Alignment With Implicit Reward Modeling
Xingzhou Lou
Junge Zhang
Jian Xie
Lifeng Liu
Dong Yan
Kaiqi Huang
96
13
0
21 May 2024
RecGPT: Generative Pre-training for Text-based Recommendation
Hoang Ngo
Dat Quoc Nguyen
LRM
70
5
0
21 May 2024
Spotting AI's Touch: Identifying LLM-Paraphrased Spans in Text
Yafu Li
Zhilin Wang
Leyang Cui
Wei Bi
Shuming Shi
Yue Zhang
DeLMO
117
6
0
21 May 2024
Exploration of Masked and Causal Language Modelling for Text Generation
Nicolo Micheletti
Samuel Belkadi
Lifeng Han
Goran Nenadic
92
8
0
21 May 2024
CustomText: Customized Textual Image Generation using Diffusion Models
Shubham Paliwal
Arushi Jain
Monika Sharma
Vikram Jamwal
Lovekesh Vig
65
1
0
21 May 2024
Diverse and Effective Synthetic Data Generation for Adaptable Zero-Shot Dialogue State Tracking
James D. Finch
Jinho D. Choi
77
0
0
21 May 2024
Octo: An Open-Source Generalist Robot Policy
Octo Model Team
Dibya Ghosh
Homer Walke
Karl Pertsch
Kevin Black
...
Quan Vuong
Ted Xiao
Dorsa Sadigh
Chelsea Finn
Sergey Levine
219
452
0
20 May 2024
CT-Eval: Benchmarking Chinese Text-to-Table Performance in Large Language Models
Haoxiang Shi
Jiaan Wang
Jiarong Xu
Cen Wang
Tetsuya Sakai
LMTD
66
0
0
20 May 2024