ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,937 papers shown
Title
Factorized-Dreamer: Training A High-Quality Video Generator with Limited
  and Low-Quality Data
Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data
Tao Yang
Yangming Shi
Yunwen Huang
Feng Chen
Yin Zheng
Lei Zhang
DiffMVGen
87
0
0
19 Aug 2024
GLIMMER: Incorporating Graph and Lexical Features in Unsupervised
  Multi-Document Summarization
GLIMMER: Incorporating Graph and Lexical Features in Unsupervised Multi-Document Summarization
Ran Liu
Ming Liu
Min Yu
Jianguo Jiang
Gang Li
Dan Zhang
Jingyuan Li
Xiang Meng
Weiqing Huang
48
0
0
19 Aug 2024
Convert and Speak: Zero-shot Accent Conversion with Minimum Supervision
Convert and Speak: Zero-shot Accent Conversion with Minimum Supervision
Zhijun Jia
Huaying Xue
Xiulian Peng
Yan Lu
152
3
0
19 Aug 2024
FFAA: Multimodal Large Language Model based Explainable Open-World Face
  Forgery Analysis Assistant
FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant
Zhengchao Huang
Bin Xia
Zicheng Lin
Zhun Mou
Wenming Yang
CVBM
103
21
0
19 Aug 2024
Instruction-Based Molecular Graph Generation with Unified Text-Graph
  Diffusion Model
Instruction-Based Molecular Graph Generation with Unified Text-Graph Diffusion Model
Yuran Xiang
Haiteng Zhao
Chang Ma
Zhi-Hong Deng
80
1
0
19 Aug 2024
TaSL: Continual Dialog State Tracking via Task Skill Localization and
  Consolidation
TaSL: Continual Dialog State Tracking via Task Skill Localization and Consolidation
Yujie Feng
Xu Chu
Yongxin Xu
Guangyuan Shi
Bo Liu
Xiao-Ming Wu
MoMeCLL
83
9
0
19 Aug 2024
Summarizing long regulatory documents with a multi-step pipeline
Summarizing long regulatory documents with a multi-step pipeline
Mika Sie
Ruby Beek
Michiel Bots
S. Brinkkemper
Albert Gatt
AILawELM
67
3
0
19 Aug 2024
Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models
Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models
Aviv Bick
Kevin Y. Li
Eric P. Xing
J. Zico Kolter
Albert Gu
Mamba
154
32
0
19 Aug 2024
MAPLE: Enhancing Review Generation with Multi-Aspect Prompt LEarning in Explainable Recommendation
MAPLE: Enhancing Review Generation with Multi-Aspect Prompt LEarning in Explainable Recommendation
Ching-Wen Yang
Zhi-Quan Feng
Ying-Jia Lin
Che-Wei Chen
Kun-da Wu
Hao Xu
Jui-Feng Yao
Hung-Yu Kao
LRMVLM
110
0
0
19 Aug 2024
Grammatical Error Feedback: An Implicit Evaluation Approach
Grammatical Error Feedback: An Implicit Evaluation Approach
Stefano Bannò
Kate Knill
Mark Gales
60
0
0
18 Aug 2024
Identifying Speakers and Addressees of Quotations in Novels with Prompt
  Learning
Identifying Speakers and Addressees of Quotations in Novels with Prompt Learning
Yuchen Yan
Hanjie Zhao
Senbin Zhu
Hongde Liu
Zhihong Zhang
Yuxiang Jia
39
0
0
18 Aug 2024
Crossing New Frontiers: Knowledge-Augmented Large Language Model
  Prompting for Zero-Shot Text-Based De Novo Molecule Design
Crossing New Frontiers: Knowledge-Augmented Large Language Model Prompting for Zero-Shot Text-Based De Novo Molecule Design
Sakhinana Sagar Srinivas
Venkataramana Runkana
84
2
0
18 Aug 2024
MergeRepair: An Exploratory Study on Merging Task-Specific Adapters in Code LLMs for Automated Program Repair
MergeRepair: An Exploratory Study on Merging Task-Specific Adapters in Code LLMs for Automated Program Repair
Meghdad Dehghan
Jie JW Wu
Fatemeh H. Fard
Ali Ouni
MoMe
99
2
0
18 Aug 2024
Threshold Filtering Packing for Supervised Fine-Tuning: Training Related Samples within Packs
Threshold Filtering Packing for Supervised Fine-Tuning: Training Related Samples within Packs
Jiancheng Dong
Lei Jiang
Wei Jin
Lu Cheng
110
1
0
18 Aug 2024
CyberPal.AI: Empowering LLMs with Expert-Driven Cybersecurity
  Instructions
CyberPal.AI: Empowering LLMs with Expert-Driven Cybersecurity Instructions
Matan Levi
Yair Alluouche
Daniel Ohayon
Anton Puzanov
83
6
0
17 Aug 2024
ConVerSum: A Contrastive Learning based Approach for Data-Scarce
  Solution of Cross-Lingual Summarization Beyond Direct Equivalents
ConVerSum: A Contrastive Learning based Approach for Data-Scarce Solution of Cross-Lingual Summarization Beyond Direct Equivalents
Sanzana Karim Lora
Rifat Shahriyar
77
0
0
17 Aug 2024
mRNA2vec: mRNA Embedding with Language Model in the 5ÚTR-CDS for mRNA
  Design
mRNA2vec: mRNA Embedding with Language Model in the 5ÚTR-CDS for mRNA Design
Honggen Zhang
Xiangrui Gao
Igor Molybog
Lipeng Lai
50
1
0
16 Aug 2024
PEDAL: Enhancing Greedy Decoding with Large Language Models using
  Diverse Exemplars
PEDAL: Enhancing Greedy Decoding with Large Language Models using Diverse Exemplars
Sumanth Prabhu
89
1
0
16 Aug 2024
SC-Rec: Enhancing Generative Retrieval with Self-Consistent Reranking
  for Sequential Recommendation
SC-Rec: Enhancing Generative Retrieval with Self-Consistent Reranking for Sequential Recommendation
Tongyoung Kim
Soojin Yoon
SeongKu Kang
Jinyoung Yeo
Dongha Lee
RALM
83
4
0
16 Aug 2024
Rubick: Exploiting Job Reconfigurability for Deep Learning Cluster
  Scheduling
Rubick: Exploiting Job Reconfigurability for Deep Learning Cluster Scheduling
Xinyi Zhang
Hanyu Zhao
Wencong Xiao
Xianyan Jia
Fei Xu
Yong Li
Wei Lin
Fangming Liu
66
2
0
16 Aug 2024
Overview of the BioLaySumm 2024 Shared Task on the Lay Summarization of
  Biomedical Research Articles
Overview of the BioLaySumm 2024 Shared Task on the Lay Summarization of Biomedical Research Articles
Tomas Goldsack
Carolina Scarton
Matthew Shardlow
Chenghua Lin
61
36
0
16 Aug 2024
ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large
  Language Models
ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models
Chao Zeng
Songwei Liu
Yusheng Xie
Hong Liu
Xiaojian Wang
Miao Wei
Shu Yang
Fangmin Chen
Xing Mei
MQ
99
8
0
16 Aug 2024
Context-Aware Assistant Selection for Improved Inference Acceleration
  with Large Language Models
Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models
Jerry Huang
Prasanna Parthasarathi
Mehdi Rezagholizadeh
Sarath Chandar
108
2
0
16 Aug 2024
Towards Realistic Synthetic User-Generated Content: A Scaffolding
  Approach to Generating Online Discussions
Towards Realistic Synthetic User-Generated Content: A Scaffolding Approach to Generating Online Discussions
K. Balog
John Palowitch
Barbara Ikica
Filip Radlinski
Hamidreza Alvari
Mehdi Manshadi
SyDa
76
2
0
15 Aug 2024
FactorLLM: Factorizing Knowledge via Mixture of Experts for Large
  Language Models
FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models
Zhongyu Zhao
Menghang Dong
Rongyu Zhang
Wenzhao Zheng
Yunpeng Zhang
Huanrui Yang
Dalong Du
Kurt Keutzer
Shanghang Zhang
104
0
0
15 Aug 2024
Graph Retrieval-Augmented Generation: A Survey
Graph Retrieval-Augmented Generation: A Survey
Boci Peng
Yun Zhu
Yongchao Liu
Xiaohe Bo
Haizhou Shi
Chuntao Hong
Yan Zhang
Siliang Tang
3DV
110
113
0
15 Aug 2024
DataVisT5: A Pre-trained Language Model for Jointly Understanding Text
  and Data Visualization
DataVisT5: A Pre-trained Language Model for Jointly Understanding Text and Data Visualization
Zhuoyue Wan
Yuanfeng Song
Shuaimin Li
Chen Jason Zhang
Raymond Chi-Wing Wong
VLM
83
1
0
14 Aug 2024
Drug Discovery SMILES-to-Pharmacokinetics Diffusion Models with Deep Molecular Understanding
Drug Discovery SMILES-to-Pharmacokinetics Diffusion Models with Deep Molecular Understanding
Bing Hu
Anita Layton
Helen Chen
MedIm
83
2
0
14 Aug 2024
BERT's Conceptual Cartography: Mapping the Landscapes of Meaning
BERT's Conceptual Cartography: Mapping the Landscapes of Meaning
Nina Haket
Ryan Daniels
57
0
0
13 Aug 2024
A Survey on Model MoErging: Recycling and Routing Among Specialized
  Experts for Collaborative Learning
A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning
Prateek Yadav
Colin Raffel
Mohammed Muqeeth
Lucas Caccia
Haokun Liu
Tianlong Chen
Joey Tianyi Zhou
Leshem Choshen
Alessandro Sordoni
MoMe
120
25
0
13 Aug 2024
Large language models can consistently generate high-quality content for
  election disinformation operations
Large language models can consistently generate high-quality content for election disinformation operations
Angus R. Williams
Liam Burke-Moore
Ryan Sze-Yin Chan
Florence E. Enock
Federico Nanni
Tvesha Sippy
Yi-Ling Chung
Evelina Gabasova
Kobi Hackenburg
Jonathan Bright
69
5
0
13 Aug 2024
CROME: Cross-Modal Adapters for Efficient Multimodal LLM
CROME: Cross-Modal Adapters for Efficient Multimodal LLM
Sayna Ebrahimi
Sercan O. Arik
Tejas Nama
Tomas Pfister
81
1
0
13 Aug 2024
Fast-and-Frugal Text-Graph Transformers are Effective Link Predictors
Fast-and-Frugal Text-Graph Transformers are Effective Link Predictors
Andrei Catalin Coman
Christos Theodoropoulos
Marie-Francine Moens
James Henderson
96
0
0
13 Aug 2024
CTISum: A New Benchmark Dataset For Cyber Threat Intelligence Summarization
CTISum: A New Benchmark Dataset For Cyber Threat Intelligence Summarization
Wei Peng
Junmei Ding
Wei Wang
Lei Cui
Wei Cai
Zhiyu Hao
Xiaochun Yun
75
2
0
13 Aug 2024
FastFiD: Improve Inference Efficiency of Open Domain Question Answering
  via Sentence Selection
FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection
Yufei Huang
Xu Han
Maosong Sun
65
0
0
12 Aug 2024
FuxiTranyu: A Multilingual Large Language Model Trained with Balanced
  Data
FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data
Haoran Sun
Renren Jin
Shaoyang Xu
Leiyu Pan
Supryadi
...
Lei Yang
Ling Shi
Juesi Xiao
Shaolin Zhu
Deyi Xiong
98
4
0
12 Aug 2024
Utilize Transformers for translating Wikipedia category names
Utilize Transformers for translating Wikipedia category names
Hoang-Thang Ta
Quoc Thang La
56
0
0
12 Aug 2024
Multi-scale Contrastive Adaptor Learning for Segmenting Anything in
  Underperformed Scenes
Multi-scale Contrastive Adaptor Learning for Segmenting Anything in Underperformed Scenes
Ke Zhou
Zhongwei Qiu
Dongmei Fu
VLM
72
3
0
12 Aug 2024
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
Zhuoyi Yang
Jiayan Teng
Wendi Zheng
Ming Ding
Shiyu Huang
...
Weihan Wang
Yean Cheng
Xiaotao Gu
Yuxiao Dong
Jie Tang
DiffMVGen
333
565
0
12 Aug 2024
SAGA: A Participant-specific Examination of Story Alternatives and Goal
  Applicability for a Deeper Understanding of Complex Events
SAGA: A Participant-specific Examination of Story Alternatives and Goal Applicability for a Deeper Understanding of Complex Events
Sai Vallurupalli
Katrin Erk
Francis Ferraro
60
2
0
11 Aug 2024
Top Pass: Improve Code Generation by Pass@k-Maximized Code Ranking
Top Pass: Improve Code Generation by Pass@k-Maximized Code Ranking
Zhi-Cun Lyu
Xin-Ye Li
Zheng Xie
Ming Li
70
8
0
11 Aug 2024
Efficient Diffusion Transformer with Step-wise Dynamic Attention
  Mediators
Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Yifan Pu
Zhuofan Xia
Jiayi Guo
Dongchen Han
Qixiu Li
...
Ji Li
Yizeng Han
Shiji Song
Gao Huang
Xiu Li
110
12
0
11 Aug 2024
A Psychology-based Unified Dynamic Framework for Curriculum Learning
A Psychology-based Unified Dynamic Framework for Curriculum Learning
Guangyu Meng
Qingkai Zeng
John P. Lalor
Hong-ye Yu
76
0
0
09 Aug 2024
Node Level Graph Autoencoder: Unified Pretraining for Textual Graph
  Learning
Node Level Graph Autoencoder: Unified Pretraining for Textual Graph Learning
Wenbin Hu
Huihao Jing
Qi Hu
Haoran Li
Yangqiu Song
SSLAI4CE
91
0
0
09 Aug 2024
Investigating a Benchmark for Training-set free Evaluation of Linguistic
  Capabilities in Machine Reading Comprehension
Investigating a Benchmark for Training-set free Evaluation of Linguistic Capabilities in Machine Reading Comprehension
Viktor Schlegel
Goran Nenadic
Riza Batista-Navarro
ELM
56
0
0
09 Aug 2024
Towards a Generative Approach for Emotion Detection and Reasoning
Towards a Generative Approach for Emotion Detection and Reasoning
Ankita Bhaumik
T. Strzalkowski
ReLMLRM
84
3
0
09 Aug 2024
MSG-Chart: Multimodal Scene Graph for ChartQA
MSG-Chart: Multimodal Scene Graph for ChartQA
Yue Dai
Soyeon Caren Han
Wei Liu
39
1
0
09 Aug 2024
Arctic-TILT. Business Document Understanding at Sub-Billion Scale
Arctic-TILT. Business Document Understanding at Sub-Billion Scale
Łukasz Borchmann
Michał Pietruszka
Wojciech Ja'skowski
Dawid Jurkiewicz
Piotr Halama
...
Gabriela Nowakowska
Artur Zawłocki
Łukasz Duhr
Paweł Dyda
Michał Turski
VLM
91
1
0
08 Aug 2024
LogogramNLP: Comparing Visual and Textual Representations of Ancient
  Logographic Writing Systems for NLP
LogogramNLP: Comparing Visual and Textual Representations of Ancient Logographic Writing Systems for NLP
Danlu Chen
Freda Shi
Aditi Agarwal
Jacobo Myerston
Taylor Berg-Kirkpatrick
77
2
0
08 Aug 2024
Open-domain Implicit Format Control for Large Language Model Generation
Open-domain Implicit Format Control for Large Language Model Generation
Yiqun Yao
Wenjia Ma
Xuezhi Fang
Xin Jiang
Xiang Li
Xuying Meng
Peng Han
Jing Li
Aixin Sun
Yequan Wang
75
2
0
08 Aug 2024
Previous
123...424344...197198199
Next