ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,891 papers shown
Title
CPTuning: Contrastive Prompt Tuning for Generative Relation Extraction
CPTuning: Contrastive Prompt Tuning for Generative Relation Extraction
Jiaxin Duan
Fengyu Lu
Junfei Liu
95
0
0
04 Jan 2025
Boosting Explainability through Selective Rationalization in Pre-trained Language Models
Libing Yuan
Shuaibo Hu
Kui Yu
Le Wu
LRM
108
0
0
03 Jan 2025
Text2midi: Generating Symbolic Music from Captions
Text2midi: Generating Symbolic Music from Captions
Keshav Bhandari
Abhinaba Roy
Kyra Wang
Geeta Puri
Simon Colton
Dorien Herremans
160
6
0
03 Jan 2025
Mathematical Language Models: A Survey
Mathematical Language Models: A Survey
Wen Liu
Hanglei Hu
Jie Zhou
Yuyang Ding
Junsong Li
...
Mengliang He
Qin Chen
Bo Jiang
Aimin Zhou
Liang He
LRM
235
14
0
03 Jan 2025
PsychAdapter: Adapting LLM Transformers to Reflect Traits, Personality and Mental Health
PsychAdapter: Adapting LLM Transformers to Reflect Traits, Personality and Mental Health
Huy-Hien Vu
Huy Anh Nguyen
Adithya Ganesan
Swanie Juhng
Oscar Kjell
...
Margaret L. Kern
Ryan L. Boyd
L. Ungar
H. Andrew Schwartz
J. Eichstaedt
161
0
0
03 Jan 2025
Global dense vector representations for words or items using shared parameter alternating Tweedie model
Taejoon Kim
Haiyan Wang
62
0
0
03 Jan 2025
Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web
Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web
Hiroki Furuta
Yutaka Matsuo
Aleksandra Faust
Izzeddin Gur
CLL
205
16
0
03 Jan 2025
RealCustom++: Representing Images as Real-Word for Real-Time Customization
RealCustom++: Representing Images as Real-Word for Real-Time Customization
Zhendong Mao
Mengqi Huang
Fei Ding
Mingcong Liu
Qian He
Xiaojun Chang
DiffM
172
6
0
03 Jan 2025
BeliN: A Novel Corpus for Bengali Religious News Headline Generation using Contextual Feature Fusion
Md Osama
Ashim Dey
Kawsar Ahmed
Muhammad Ashad Kabir
150
0
0
03 Jan 2025
Reasoning-Oriented and Analogy-Based Methods for Locating and Editing in Zero-Shot Event-Relational Reasoning
Jingyao Tang
Lishuang Li
Liteng Mi
Haiming Wu
Hongbin Lu
KELM
108
0
0
03 Jan 2025
SOEDiff: Efficient Distillation for Small Object Editing
SOEDiff: Efficient Distillation for Small Object Editing
Yiming Wu
Qihe Pan
Zhen Zhao
Zicheng Wang
Sifan Long
Ronghua Liang
DiffM
181
0
0
03 Jan 2025
Sinhala Transliteration: A Comparative Analysis Between Rule-based and Seq2Seq Approaches
Yomal De Mel
Kasun Wickramasinghe
Nisansa de Silva
Surangika Ranathunga
112
1
0
03 Jan 2025
Text2Data: Low-Resource Data Generation with Textual Control
Text2Data: Low-Resource Data Generation with Textual Control
Shiyu Wang
Yihao Feng
Tian Lan
Ning Yu
Yu Bai
Ran Xu
Han Wang
Caiming Xiong
Siyang Song
DiffM
154
0
0
03 Jan 2025
Efficient support ticket resolution using Knowledge Graphs
Sherwin Varghese
James Tian
57
0
0
03 Jan 2025
Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding
Jiajun Zhu
Peihao Wang
Ruisi Cai
Jason D. Lee
Pan Li
Ziyi Wang
KELM
112
1
0
03 Jan 2025
From Reading to Compressing: Exploring the Multi-document Reader for Prompt Compression
From Reading to Compressing: Exploring the Multi-document Reader for Prompt Compression
Eunseong Choi
Sunkyung Lee
Minjin Choi
June Park
Jongwuk Lee
158
2
0
03 Jan 2025
FED: Fast and Efficient Dataset Deduplication Framework with GPU Acceleration
FED: Fast and Efficient Dataset Deduplication Framework with GPU Acceleration
Youngjun Son
Chaewon Kim
Jaejin Lee
134
0
0
02 Jan 2025
Evaluating Time Series Foundation Models on Noisy Periodic Time Series
Evaluating Time Series Foundation Models on Noisy Periodic Time Series
Syamantak Datta Gupta
AI4TS
81
0
0
01 Jan 2025
Proof Recommendation System for the HOL4 Theorem Prover
Proof Recommendation System for the HOL4 Theorem Prover
Nour Dekhil
Adnan Rashid
Sofiene Tahar
81
1
0
31 Dec 2024
From Generalist to Specialist: A Survey of Large Language Models for Chemistry
From Generalist to Specialist: A Survey of Large Language Models for Chemistry
Yang Han
Ziping Wan
Lu Chen
Kai Yu
Xin Chen
LM&MA
102
3
0
31 Dec 2024
YAD: Leveraging T5 for Improved Automatic Diacritization of Yor\`ub\á Text
YAD: Leveraging T5 for Improved Automatic Diacritization of Yor\`ub\á Text
Akindele Michael Olawole
Jesujoba Oluwadara Alabi
Aderonke Busayo Sakpere
David Ifeoluwa Adelani
79
2
0
31 Dec 2024
GASLITEing the Retrieval: Exploring Vulnerabilities in Dense Embedding-based Search
GASLITEing the Retrieval: Exploring Vulnerabilities in Dense Embedding-based Search
Matan Ben-Tov
Mahmood Sharif
RALM
207
1
0
31 Dec 2024
NetFlowGen: Leveraging Generative Pre-training for Network Traffic Dynamics
NetFlowGen: Leveraging Generative Pre-training for Network Traffic Dynamics
Jiawei Zhou
Woojeong Kim
Zhiying Xu
Alexander M. Rush
Minlan Yu
AI4CE
79
0
0
31 Dec 2024
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Yulei Qin
Yuncheng Yang
Pengcheng Guo
Gang Li
Hang Shao
Yuchen Shi
Zihan Xu
Yun Gu
Ke Li
Xing Sun
ALM
211
13
0
31 Dec 2024
GPT or BERT: why not both?
GPT or BERT: why not both?
Lucas Georges Gabriel Charpentier
David Samuel
156
5
0
31 Dec 2024
SAFE-MEME: Structured Reasoning Framework for Robust Hate Speech Detection in Memes
SAFE-MEME: Structured Reasoning Framework for Robust Hate Speech Detection in Memes
Palash Nandi
Shivam Sharma
Tanmoy Chakraborty
69
1
0
31 Dec 2024
A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine
A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine
Hanguang Xiao
Feizhong Zhou
Xianglong Liu
Tianqi Liu
Zhipeng Li
Xin Liu
Xiaoxuan Huang
AILawLM&MALRM
149
30
0
31 Dec 2024
Generate to Discriminate: Expert Routing for Continual Learning
Generate to Discriminate: Expert Routing for Continual Learning
Yewon Byun
Sanket Vaibhav Mehta
Saurabh Garg
Emma Strubell
Michael Oberst
Bryan Wilder
Zachary Chase Lipton
178
0
0
31 Dec 2024
AfriHG: News headline generation for African Languages
AfriHG: News headline generation for African Languages
Toyib Ogunremi
Serah Akojenu
Anthony Soronnadi
Olubayo Adekanmbi
David Ifeoluwa Adelani
91
1
0
31 Dec 2024
Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment
Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment
Jianfei Zhang
Jun Bai
Yangqiu Song
Yanmeng Wang
Rumei Li
Chenghua Lin
Wenge Rong
152
0
0
31 Dec 2024
AdaRankGrad: Adaptive Gradient-Rank and Moments for Memory-Efficient LLMs Training and Fine-Tuning
AdaRankGrad: Adaptive Gradient-Rank and Moments for Memory-Efficient LLMs Training and Fine-Tuning
Yehonathan Refael
Jonathan Svirsky
Boris Shustin
Wasim Huleihel
Ofir Lindenbaum
101
4
0
31 Dec 2024
Adaptive Batch Size Schedules for Distributed Training of Language Models with Data and Model Parallelism
Adaptive Batch Size Schedules for Distributed Training of Language Models with Data and Model Parallelism
Tim Tsz-Kit Lau
Weijian Li
Chenwei Xu
Han Liu
Mladen Kolar
466
0
0
30 Dec 2024
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization
Chia-Yu Hung
Navonil Majumder
Zhifeng Kong
Ambuj Mehrish
Rafael Valle
Bryan Catanzaro
Soujanya Poria
Bryan Catanzaro
Soujanya Poria
162
10
0
30 Dec 2024
LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning
LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning
Shuguang Chen
Guang Lin
LRM
484
1
0
28 Dec 2024
Context-Aware Deep Learning for Multi Modal Depression Detection
Context-Aware Deep Learning for Multi Modal Depression Detection
Genevieve Lam
Huang Dongyan
Weisi Lin
81
0
0
26 Dec 2024
SILC-EFSA: Self-aware In-context Learning Correction for Entity-level
  Financial Sentiment Analysis
SILC-EFSA: Self-aware In-context Learning Correction for Entity-level Financial Sentiment Analysis
Senbin Zhu
Chenyuan He
Hongde Liu
Pengcheng Dong
Hanjie Zhao
Yuchen Yan
Yuxiang Jia
Hongying Zan
Min Peng
36
0
0
26 Dec 2024
Bridging Interpretability and Robustness Using LIME-Guided Model
  Refinement
Bridging Interpretability and Robustness Using LIME-Guided Model Refinement
Navid Nayyem
Abdullah Rakin
Longwei Wang
AAMLFAtt
110
2
0
25 Dec 2024
Large Language Model guided Deep Reinforcement Learning for Decision
  Making in Autonomous Driving
Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving
Hao Pang
Zhenpo Wang
Guoqiang Li
98
4
0
24 Dec 2024
Segment-Based Attention Masking for GPTs
Segment-Based Attention Masking for GPTs
Shahar Katz
Liran Ringel
Yaniv Romano
Lior Wolf
CLL
71
1
0
24 Dec 2024
AIGT: AI Generative Table Based on Prompt
AIGT: AI Generative Table Based on Prompt
Mingming Zhang
Zhiqing Xiao
Guoshan Lu
Sai Wu
Weiqiang Wang
Xing Fu
Can Yi
Junbo Zhao
LMTDVLM
81
2
0
24 Dec 2024
SlimGPT: Layer-wise Structured Pruning for Large Language Models
SlimGPT: Layer-wise Structured Pruning for Large Language Models
Gui Ling
Ziyang Wang
Yuliang Yan
Qingwen Liu
99
10
0
24 Dec 2024
Generating Completions for Fragmented Broca's Aphasic Sentences Using
  Large Language Models
Generating Completions for Fragmented Broca's Aphasic Sentences Using Large Language Models
Sijbren van Vaals
Yevgen Matusevych
Frank Tsiwah
60
0
0
23 Dec 2024
SBS Figures: Pre-training Figure QA from Stage-by-Stage Synthesized
  Images
SBS Figures: Pre-training Figure QA from Stage-by-Stage Synthesized Images
Risa Shinoda
Kuniaki Saito
Shohei Tanaka
Tosho Hirasawa
Yoshitaka Ushiku
59
1
0
23 Dec 2024
ERUPD -- English to Roman Urdu Parallel Dataset
ERUPD -- English to Roman Urdu Parallel Dataset
Mohammed Furqan
Raahid Bin Khaja
Rayyan Habeeb
73
0
0
23 Dec 2024
A Toolkit for Virtual Reality Data Collection
A Toolkit for Virtual Reality Data Collection
Tim Rolff
Niklas Hypki
Markus Lappe
Frank Steinicke
36
0
0
23 Dec 2024
Multi-Modal Grounded Planning and Efficient Replanning For Learning
  Embodied Agents with A Few Examples
Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few Examples
Taewoong Kim
Byeonghwi Kim
Jonghyun Choi
LLMAGLM&Ro
95
1
0
23 Dec 2024
CharGen: High Accurate Character-Level Visual Text Generation Model with
  MultiModal Encoder
CharGen: High Accurate Character-Level Visual Text Generation Model with MultiModal Encoder
Lichen Ma
Tiezhu Yue
Pei Fu
Yujie Zhong
Kai Zhou
Xiaoming Wei
Jie Hu
DiffM
126
2
0
23 Dec 2024
Investigating Length Issues in Document-level Machine Translation
Investigating Length Issues in Document-level Machine Translation
Ziqian Peng
Rachel Bawden
François Yvon
108
2
0
23 Dec 2024
GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference
GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference
Chao Zeng
Songwei Liu
Shu Yang
Fangmin Chen
Xing Mei
Lean Fu
MQ
133
0
0
23 Dec 2024
DreamOmni: Unified Image Generation and Editing
DreamOmni: Unified Image Generation and Editing
Bin Xia
Yuechen Zhang
Jingyao Li
Chengyao Wang
Yitong Wang
Xinglong Wu
Bei Yu
Jiaya Jia
SyDaMLLM
135
5
0
22 Dec 2024
Previous
123...242526...196197198
Next