ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,948 papers shown
Title
Unlock the Correlation between Supervised Fine-Tuning and Reinforcement
  Learning in Training Code Large Language Models
Unlock the Correlation between Supervised Fine-Tuning and Reinforcement Learning in Training Code Large Language Models
Jie Chen
Xintian Han
Yu Ma
Xun Zhou
Liang Xiang
ALMLRM
76
2
0
14 Jun 2024
A Survey on Large Language Models from General Purpose to Medical
  Applications: Datasets, Methodologies, and Evaluations
A Survey on Large Language Models from General Purpose to Medical Applications: Datasets, Methodologies, and Evaluations
Jinqiang Wang
Huansheng Ning
Yi Peng
Qikai Wei
Daniel Tesfai
Wenwei Mao
Tao Zhu
Runhe Huang
LM&MAAI4MHELM
165
8
0
14 Jun 2024
ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation
ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation
Cheng Yang
Chufan Shi
Yaxin Liu
Bo Shui
Junjie Wang
...
Yuxiang Zhang
Gongye Liu
Xiaomei Nie
Deng Cai
Yujiu Yang
MLLMLRM
115
26
0
14 Jun 2024
Multimodal Large Language Models with Fusion Low Rank Adaptation for
  Device Directed Speech Detection
Multimodal Large Language Models with Fusion Low Rank Adaptation for Device Directed Speech Detection
Shruti Palaskar
Oggi Rudovic
Sameer Dharur
Florian Pesce
G. Krishna
Aswin Sivaraman
Jack Berkowitz
Ahmed Hussen Abdelaziz
Saurabh N. Adya
Ahmed H. Tewfik
VLM
88
0
0
13 Jun 2024
Cross-Modality Program Representation Learning for Electronic Design
  Automation with High-Level Synthesis
Cross-Modality Program Representation Learning for Electronic Design Automation with High-Level Synthesis
Zongyue Qin
Yunsheng Bai
Atefeh Sohrabizadeh
Zijian Ding
Ziniu Hu
Yizhou Sun
Jason Cong
86
2
0
13 Jun 2024
4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities
4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities
Roman Bachmann
Oğuzhan Fatih Kar
David Mizrahi
Ali Garjani
Mingfei Gao
David Griffiths
Jiaming Hu
Afshin Dehghan
Amir Zamir
MoEVLMMLLM
116
18
0
13 Jun 2024
SimGen: Simulator-conditioned Driving Scene Generation
SimGen: Simulator-conditioned Driving Scene Generation
Yunsong Zhou
Michael Simon
Zhenghao Peng
Sicheng Mo
Hongzi Zhu
Minyi Guo
Bolei Zhou
VGen
95
17
0
13 Jun 2024
ProxyLM: Predicting Language Model Performance on Multilingual Tasks via
  Proxy Models
ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy Models
David Anugraha
Genta Indra Winata
Chenyue Li
Patrick Amadeus Irawan
En-Shiun Annie Lee
99
8
0
13 Jun 2024
Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning
Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning
Bahare Fatemi
Mehran Kazemi
Anton Tsitsulin
Karishma Malkan
Jinyeong Yim
John Palowitch
Sungyong Seo
Jonathan J. Halcrow
Bryan Perozzi
LRM
105
39
0
13 Jun 2024
EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal
  Prompts
EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal Prompts
Yucheng Han
Rui Wang
Chi Zhang
Juntao Hu
Pei Cheng
Bin-Bin Fu
Hanwang Zhang
118
6
0
13 Jun 2024
RH-SQL: Refined Schema and Hardness Prompt for Text-to-SQL
RH-SQL: Refined Schema and Hardness Prompt for Text-to-SQL
J. Yi
Guo Chen
Zixiang Shen
72
1
0
13 Jun 2024
Modeling Comparative Logical Relation with Contrastive Learning for Text
  Generation
Modeling Comparative Logical Relation with Contrastive Learning for Text Generation
Yuhao Dan
Junfeng Tian
Jie Zhou
Ming Yan
Ji Zhang
Qin Chen
Liang He
105
0
0
13 Jun 2024
Enhancing Cross-Modal Fine-Tuning with Gradually Intermediate Modality
  Generation
Enhancing Cross-Modal Fine-Tuning with Gradually Intermediate Modality Generation
Lincan Cai
Shuang Li
Wenxuan Ma
Jingxuan Kang
Binhui Xie
Zixun Sun
Chengwei Zhu
MoEMoMe
94
1
0
13 Jun 2024
No perspective, no perception!! Perspective-aware Healthcare Answer
  Summarization
No perspective, no perception!! Perspective-aware Healthcare Answer Summarization
Gauri Naik
Sharad Chandakacherla
S. Yadav
Md. Shad Akhtar
93
11
0
13 Jun 2024
Plan, Generate and Complicate: Improving Low-resource Dialogue State
  Tracking via Easy-to-Difficult Zero-shot Data Augmentation
Plan, Generate and Complicate: Improving Low-resource Dialogue State Tracking via Easy-to-Difficult Zero-shot Data Augmentation
Ming Gu
Yan Yang
87
1
0
13 Jun 2024
Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large
  Language Models
Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large Language Models
Minghao Wu
Thuy-Trang Vu
Zhuang Li
Gholamreza Haffari
77
6
0
13 Jun 2024
Can't Hide Behind the API: Stealing Black-Box Commercial Embedding Models
Can't Hide Behind the API: Stealing Black-Box Commercial Embedding Models
Manveer Singh Tamber
Jasper Xian
Jimmy Lin
MLAUSILM
337
2
0
13 Jun 2024
Language Models are Crossword Solvers
Language Models are Crossword Solvers
Soumadeep Saha
Sutanoya Chakraborty
Saptarshi Saha
Utpal Garain
LRMReLM
125
3
0
13 Jun 2024
mOSCAR: A Large-scale Multilingual and Multimodal Document-level Corpus
mOSCAR: A Large-scale Multilingual and Multimodal Document-level Corpus
Matthieu Futeral
A. Zebaze
Pedro Ortiz Suarez
Julien Abadji
Rémi Lacroix
Cordelia Schmid
Rachel Bawden
Benoît Sagot
171
3
0
13 Jun 2024
Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning
  Framework from Logit Difference
Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference
Jiabao Ji
Yujian Liu
Yang Zhang
Gaowen Liu
Ramana Rao Kompella
Sijia Liu
Shiyu Chang
KELMMU
139
37
0
12 Jun 2024
Words Worth a Thousand Pictures: Measuring and Understanding Perceptual
  Variability in Text-to-Image Generation
Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation
Raphael Tang
Xinyu Crystina Zhang
Lixinyu Xu
Yao Lu
Wenyan Li
Pontus Stenetorp
Jimmy Lin
Ferhan Ture
99
0
0
12 Jun 2024
State Soup: In-Context Skill Learning, Retrieval and Mixing
State Soup: In-Context Skill Learning, Retrieval and Mixing
Maciej Pióro
Maciej Wołczyk
Razvan Pascanu
J. Oswald
João Sacramento
62
1
0
12 Jun 2024
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images
  Interleaved with Text
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Qingyun Li
Zhe Chen
Weiyun Wang
Wenhai Wang
Shenglong Ye
...
Dahua Lin
Yu Qiao
Botian Shi
Conghui He
Jifeng Dai
VLMOffRL
122
27
0
12 Jun 2024
GraphFM: A Comprehensive Benchmark for Graph Foundation Model
GraphFM: A Comprehensive Benchmark for Graph Foundation Model
Yuhao Xu
Xinqi Liu
Keyu Duan
Yi Fang
Yu-Neng Chuang
Daochen Zha
Qiaoyu Tan
AI4CE
59
1
0
12 Jun 2024
GPT4Rec: Graph Prompt Tuning for Streaming Recommendation
GPT4Rec: Graph Prompt Tuning for Streaming Recommendation
Peiyan Zhang
Yuchen Yan
Xi Zhang
Liying Kang
Chaozhuo Li
Feiran Huang
Senzhang Wang
Sunghun Kim
107
9
0
12 Jun 2024
BookSQL: A Large Scale Text-to-SQL Dataset for Accounting Domain
BookSQL: A Large Scale Text-to-SQL Dataset for Accounting Domain
Rahul Kumar
Amar Raja Dibbu
Shrutendra Harsola
Vignesh T. Subrahmaniam
Ashutosh Modi
87
8
0
12 Jun 2024
ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large
  Language Models
ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models
Xiang Meng
Kayhan Behdin
Haoyue Wang
Rahul Mazumder
83
6
0
12 Jun 2024
PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken
  Language Understanding
PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding
Trang Le
Daniel Lazar
Suyoun Kim
Shan Jiang
Duc Le
Adithya Sagar
Aleksandr Livshits
Ahmed Aly
Akshat Shrivastava
76
0
0
12 Jun 2024
Tell Me What's Next: Textual Foresight for Generic UI Representations
Tell Me What's Next: Textual Foresight for Generic UI Representations
Andrea Burns
Kate Saenko
Bryan A. Plummer
LM&RoAI4TS
92
5
0
12 Jun 2024
Prompt-Based Length Controlled Generation with Multiple Control Types
Prompt-Based Length Controlled Generation with Multiple Control Types
Renlong Jie
Xiaojun Meng
Lifeng Shang
Xin Jiang
Qun Liu
91
8
0
12 Jun 2024
GenDistiller: Distilling Pre-trained Language Models based on an
  Autoregressive Generative Model
GenDistiller: Distilling Pre-trained Language Models based on an Autoregressive Generative Model
Yingying Gao
Shilei Zhang
Chao Deng
Junlan Feng
78
0
0
12 Jun 2024
Making Task-Oriented Dialogue Datasets More Natural by Synthetically
  Generating Indirect User Requests
Making Task-Oriented Dialogue Datasets More Natural by Synthetically Generating Indirect User Requests
Amogh Mannekote
Jinseok Nam
Ziming Li
Jian Gao
K. Boyer
Bonnie J. Dorr
96
1
0
12 Jun 2024
Hierarchical Patch Diffusion Models for High-Resolution Video Generation
Hierarchical Patch Diffusion Models for High-Resolution Video Generation
Ivan Skorokhodov
Willi Menapace
Aliaksandr Siarohin
Sergey Tulyakov
VGen
79
10
0
12 Jun 2024
Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL
Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL
Zijin Hong
Zheng Yuan
Qinggang Zhang
Hao Chen
Junnan Dong
Feiran Huang
Xiao Huang
203
74
0
12 Jun 2024
MultiPragEval: Multilingual Pragmatic Evaluation of Large Language
  Models
MultiPragEval: Multilingual Pragmatic Evaluation of Large Language Models
Dojun Park
Jiwoo Lee
Seohyun Park
Hyeyun Jeong
Youngeun Koo
Soonha Hwang
Seonwoo Park
Sungeun Lee
ELM
63
2
0
11 Jun 2024
Simple and Effective Masked Diffusion Language Models
Simple and Effective Masked Diffusion Language Models
Subham Sekhar Sahoo
Marianne Arriola
Yair Schiff
Aaron Gokaslan
Edgar Marroquin
Justin T Chiu
Alexander M. Rush
Volodymyr Kuleshov
DiffM
118
123
0
11 Jun 2024
Paraphrasing in Affirmative Terms Improves Negation Understanding
Paraphrasing in Affirmative Terms Improves Negation Understanding
MohammadHossein Rezaei
Eduardo Blanco
79
2
0
11 Jun 2024
When Linear Attention Meets Autoregressive Decoding: Towards More
  Effective and Efficient Linearized Large Language Models
When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models
Haoran You
Yichao Fu
Zheng Wang
Amir Yazdanbakhsh
Yingyan Celine Lin
133
5
0
11 Jun 2024
BvSP: Broad-view Soft Prompting for Few-Shot Aspect Sentiment Quad
  Prediction
BvSP: Broad-view Soft Prompting for Few-Shot Aspect Sentiment Quad Prediction
Yinhao Bai
Yalan Xie
Xiaoyi Liu
Yuhua Zhao
Zhixin Han
Mengting Hu
Hang Gao
Renhong Cheng
76
4
0
11 Jun 2024
Effectively Compress KV Heads for LLM
Effectively Compress KV Heads for LLM
Hao Yu
Zelan Yang
Shen Li
Shen Li
Jianxin Wu
MQVLM
66
16
0
11 Jun 2024
CAAP: Context-Aware Action Planning Prompting to Solve Computer Tasks
  with Front-End UI Only
CAAP: Context-Aware Action Planning Prompting to Solve Computer Tasks with Front-End UI Only
Junhee Cho
Jihoon Kim
Daseul Bae
Jinho Choo
Youngjune Gwon
Yeong-Dae Kwon
LLMAG
59
1
0
11 Jun 2024
Post-Hoc Answer Attribution for Grounded and Trustworthy Long Document
  Comprehension: Task, Insights, and Challenges
Post-Hoc Answer Attribution for Grounded and Trustworthy Long Document Comprehension: Task, Insights, and Challenges
Abhilasha Sancheti
Koustava Goswami
Balaji Vasan Srinivasan
RALM
97
1
0
11 Jun 2024
SignMusketeers: An Efficient Multi-Stream Approach for Sign Language Translation at Scale
SignMusketeers: An Efficient Multi-Stream Approach for Sign Language Translation at Scale
Shester Gueuwou
Xiaodan Du
G. Shakhnarovich
Karen Livescu
SLR
107
5
0
11 Jun 2024
EAVE: Efficient Product Attribute Value Extraction via Lightweight
  Sparse-layer Interaction
EAVE: Efficient Product Attribute Value Extraction via Lightweight Sparse-layer Interaction
Li Yang
Qifan Wang
Jianfeng Chi
Jiahao Liu
Jingang Wang
Fuli Feng
Zenglin Xu
Yi Fang
Lifu Huang
Dongfang Liu
80
1
0
10 Jun 2024
An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion
  Models: Injecting Disguised Vulnerabilities against Strong Detection
An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong Detection
Shenao Yan
Shen Wang
Yue Duan
Hanbin Hong
Kiho Lee
Doowon Kim
Yuan Hong
AAMLSILM
76
26
0
10 Jun 2024
TRINS: Towards Multimodal Language Models that Can Read
TRINS: Towards Multimodal Language Models that Can Read
Ruiyi Zhang
Yanzhe Zhang
Jian Chen
Yufan Zhou
Jiuxiang Gu
Changyou Chen
Tong Sun
VLM
82
6
0
10 Jun 2024
Autoregressive Model Beats Diffusion: Llama for Scalable Image
  Generation
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
Peize Sun
Yi Jiang
Shoufa Chen
Shilong Zhang
Bingyue Peng
Ping Luo
Zehuan Yuan
VLM
144
301
0
10 Jun 2024
Survey for Landing Generative AI in Social and E-commerce Recsys -- the
  Industry Perspectives
Survey for Landing Generative AI in Social and E-commerce Recsys -- the Industry Perspectives
Da Xu
Danqing Zhang
Guangyu Yang
Bo Yang
Shuyuan Xu
Lingling Zheng
Cindy Liang
45
3
0
10 Jun 2024
Transforming Wearable Data into Health Insights using Large Language
  Model Agents
Transforming Wearable Data into Health Insights using Large Language Model Agents
Mike A. Merrill
Akshay Paruchuri
Naghmeh Rezaei
Geza Kovacs
Javier Perez
...
Shwetak Patel
Jiening Zhan
Tim Althoff
Daniel J. McDuff
Xin Liu
LM&MALLMAGAI4CE
127
12
0
10 Jun 2024
An Improved Empirical Fisher Approximation for Natural Gradient Descent
An Improved Empirical Fisher Approximation for Natural Gradient Descent
Xiaodong Wu
Wenyi Yu
Chao Zhang
Philip Woodland
88
5
0
10 Jun 2024
Previous
123...535455...197198199
Next