Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,935 papers shown
μgat: Improving Single-Page Document Parsing by Providing Multi-Page Context
Fabio Quattrini
Carmine Zaccagnino
Silvia Cascianelli
Laura Righi
Rita Cucchiara
68
1
0
28 Aug 2024
An Investigation of Warning Erroneous Chat Translations in Cross-lingual Communication
Yunmeng Li
Jun Suzuki
Makoto Morishita
Kaori Abe
Kentaro Inui
123
1
0
28 Aug 2024
FedMCP: Parameter-Efficient Federated Learning with Model-Contrastive Personalization
Qianyi Zhao
Chen Qu
Cen Chen
Mingyuan Fan
Yanhao Wang
118
1
0
28 Aug 2024
Meta-Learn Unimodal Signals with Weak Supervision for Multimodal Sentiment Analysis
Sijie Mai
Yu Zhao
Ying Zeng
Jianhua Yao
Haifeng Hu
142
2
0
28 Aug 2024
A Statistical Framework for Data-dependent Retrieval-Augmented Models
Soumya Basu
A. S. Rawat
Manzil Zaheer
RALM
88
0
0
27 Aug 2024
A Survey of Large Language Models for European Languages
Wazir Ali
S. Pyysalo
159
3
0
27 Aug 2024
Evidence-Enhanced Triplet Generation Framework for Hallucination Alleviation in Generative Question Answering
Haowei Du
Huishuai Zhang
Dongyan Zhao
HILM
60
0
0
27 Aug 2024
Alfie: Democratising RGBA Image Generation With No $$$
Fabio Quattrini
Vittorio Pippi
Silvia Cascianelli
Rita Cucchiara
DiffM
93
6
0
27 Aug 2024
Relationships are Complicated! An Analysis of Relationships Between Datasets on the Web
Kate Lin
Tarfah Alrashed
Natasha Noy
54
0
0
26 Aug 2024
GR-MG: Leveraging Partially Annotated Data via Multi-Modal Goal Conditioned Policy
Peiyan Li
Hongtao Wu
Yan Huang
Chilam Cheang
Liang Wang
Tao Kong
VGen
95
13
0
26 Aug 2024
An Evaluation of Explanation Methods for Black-Box Detectors of Machine-Generated Text
Loris Schoenegger
Yuxi Xia
Benjamin Roth
FAtt
68
0
0
26 Aug 2024
SurGen: Text-Guided Diffusion Model for Surgical Video Generation
Joseph Cho
Samuel Schmidgall
C. Zakka
Mrudang Mathur
Dhamanpreet Kaur
R. Shad
W. Hiesinger
VGen MedIm
121
8
0
26 Aug 2024
Step-by-Step Unmasking for Parameter-Efficient Fine-tuning of Large Language Models
Aradhye Agarwal
Suhas Kamasetty Ramesh
Ayan Sengupta
Tanmoy Chakraborty
81
1
0
26 Aug 2024
Prompt-Softbox-Prompt: A free-text Embedding Control for Image Editing
Yitong Yang
Yinglin Wang
Jing Wang
Tian Zhang
DiffM
88
1
0
24 Aug 2024
FLEURS-ASL: Including American Sign Language in Massively Multilingual Multitask Evaluation
Garrett Tanzer
SLR VLM
80
3
0
24 Aug 2024
Utilizing Large Language Models for Named Entity Recognition in Traditional Chinese Medicine against COVID-19 Literature: Comparative Study
Xu Tong
N. Smirnova
Sharmila Upadhyaya
Ran Yu
Jack H. Culbert
Chao Sun
Wolfgang Otto
Philipp Mayr
AI4MH
62
1
0
24 Aug 2024
A Law of Next-Token Prediction in Large Language Models
Hangfeng He
Weijie J. Su
82
7
0
24 Aug 2024
Integrating Multi-Head Convolutional Encoders with Cross-Attention for Improved SPARQL Query Translation
Yi-Hui Chen
Eric Jui-Lin Lu
Kwan-Ho Cheng
76
1
0
24 Aug 2024
Understanding Defects in Generated Codes by Language Models
Ali Mohammadi Esfahani
N. Kahani
S. Ajila
94
1
0
23 Aug 2024
Enhancing Few-Shot Transfer Learning with Optimized Multi-Task Prompt Tuning through Modular Prompt Composition
Ahmad Pouramini
H. Faili
VLM
39
0
0
23 Aug 2024
SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks
Kai-Wei Chang
Haibin Wu
Yu-Kai Wang
Yuan-Kuei Wu
Hua Shen
Wei-Cheng Tseng
Iu-thing Kang
Shang-Wen Li
Hung-yi Lee
93
3
0
23 Aug 2024
VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models
Wentao Wu
Fanghua Hong
Xiao Wang
Chenglong Li
Jin Tang
VLM
93
1
0
23 Aug 2024
Internal and External Knowledge Interactive Refinement Framework for Knowledge-Intensive Question Answering
Haowei Du
Dongyan Zhao
KELM
49
0
0
23 Aug 2024
Memory-Efficient LLM Training with Online Subspace Descent
Kaizhao Liang
Bo Liu
Lizhang Chen
Qiang Liu
75
15
0
23 Aug 2024
Investigating LLM Applications in E-Commerce
Chester Palen-Michel
Ruixiang Wang
Yipeng Zhang
David Yu
Canran Xu
Zhe Wu
73
5
0
23 Aug 2024
xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations
Can Qin
Congying Xia
Krithika Ramakrishnan
Michael S Ryoo
Lifu Tu
...
Silvio Savarese
Juan Carlos Niebles
Zeyuan Chen
Ran Xu
Caiming Xiong
VGen DiffM
145
3
0
22 Aug 2024
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation
Jinheng Xie
Weijia Mao
Zechen Bai
David Junhao Zhang
Weihao Wang
Kevin Qinghong Lin
Yuchao Gu
Zhijie Chen
Zhenheng Yang
Mike Zheng Shou
140
228
0
22 Aug 2024
A Comparative Analysis of Faithfulness Metrics and Humans in Citation Evaluation
Weijia Zhang
Mohammad Aliannejadi
Jiahuan Pei
Yifei Yuan
Jia-Hong Huang
Evangelos Kanoulas
HILM
80
4
0
22 Aug 2024
Enhanced Fine-Tuning of Lightweight Domain-Specific Q&A Model Based on Large Language Models
Shenglin Zhang
Pengtian Zhu
Minghua Ma
Jiagang Wang
Yongqian Sun
...
Jingyu Wang
Qianying Guo
Xiaolei Hua
Lin Zhu
Dan Pei
AI4TS
42
0
0
22 Aug 2024
Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment
Kun Luo
Minghao Qin
Zheng Liu
Shitao Xiao
Jun Zhao
Kang Liu
76
13
0
22 Aug 2024
Time Series Foundation Models and Deep Learning Architectures for Earthquake Temporal and Spatial Nowcasting
Alireza Jafari
Geoffrey Fox
John B. Rundle
A. Donnellan
L. G. Ludwig
AI4TS AI4CE
68
3
0
21 Aug 2024
Against All Odds: Overcoming Typology, Script, and Language Confusion in Multilingual Embedding Inversion Attacks
Yiyi Chen
Russa Biswas
Heather Lent
Johannes Bjerva
AAML
92
5
0
21 Aug 2024
Mixed Sparsity Training: Achieving 4$\times$ FLOP Reduction for Transformer Pretraining
Pihe Hu
Shaolong Li
Longbo Huang
62
0
0
21 Aug 2024
Efficient Detection of Toxic Prompts in Large Language Models
Yi Liu
Junzhe Yu
Huijia Sun
Ling Shi
Gelei Deng
Yuqi Chen
Yang Liu
100
6
0
21 Aug 2024
Differentiating Choices via Commonality for Multiple-Choice Question Answering
Wenqing Deng
Zhe Wang
Kewen Wang
Shirui Pan
Xiaowang Zhang
Zhiyong Feng
67
0
0
21 Aug 2024
LARR: Large Language Model Aided Real-time Scene Recommendation with Semantic Understanding
Zhizhong Wan
Bin Yin
Junjie Xie
Fei Jiang
Xiang Li
Wei Lin
3DV
74
5
0
21 Aug 2024
DocTabQA: Answering Questions from Long Documents Using Tables
Haochen Wang
Kai Hu
Haoyu Dong
Liangcai Gao
RALM LMTD
67
3
0
21 Aug 2024
Applying and Evaluating Large Language Models in Mental Health Care: A Scoping Review of Human-Assessed Generative Tasks
Yining Hua
Hongbin Na
Zehan Li
Fenglin Liu
Xiao Fang
David Clifton
John Torous
ELM LM&MA AI4MH
69
4
0
21 Aug 2024
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model
Chunting Zhou
Lili Yu
Arun Babu
Kushal Tirumala
Michihiro Yasunaga
Leonid Shamis
Jacob Kahn
Xuezhe Ma
Luke Zettlemoyer
Omer Levy
DiffM
130
190
0
20 Aug 2024
MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning
Haoning Wu
Shaocheng Shen
Qiang Hu
Xiaoyun Zhang
Ya Zhang
Yanfeng Wang
114
11
0
20 Aug 2024
To Code, or Not To Code? Exploring Impact of Code in Pre-training
Viraat Aryabumi
Yixuan Su
Raymond Ma
Adrien Morisot
Ivan Zhang
Acyr Locatelli
Marzieh Fadaee
Ahmet Üstün
Sara Hooker
SyDa AI4CE
101
26
0
20 Aug 2024
Soda-Eval: Open-Domain Dialogue Evaluation in the age of LLMs
John Mendonça
Isabel Trancoso
A. Lavie
ALM
81
3
0
20 Aug 2024
Pluto and Charon: A Time and Memory Efficient Collaborative Edge AI Framework for Personal LLMs Fine-Tuning
Bei Ouyang
Shengyuan Ye
Liekang Zeng
Tianyi Qian
Jingyi Li
Xu Chen
115
4
0
20 Aug 2024
MEGen: Generative Backdoor in Large Language Models via Model Editing
Jiyang Qiu
Xinbei Ma
Zhuosheng Zhang
Hai Zhao
AAML KELM SILM
84
5
0
20 Aug 2024
HMoE: Heterogeneous Mixture of Experts for Language Modeling
An Wang
Xingwu Sun
Ruobing Xie
Shuaipeng Li
Jiaqi Zhu
...
J. N. Han
Zhanhui Kang
Di Wang
Naoaki Okazaki
Cheng-zhong Xu
MoE
127
18
0
20 Aug 2024
REInstruct: Building Instruction Data from Unlabeled Corpus
Shu Chen
Xinyan Guan
Yaojie Lu
Hongyu Lin
Xianpei Han
Le Sun
ALM SyDa
54
3
0
20 Aug 2024
LLM-Barber: Block-Aware Rebuilder for Sparsity Mask in One-Shot for Large Language Models
Yupeng Su
Ziyi Guan
Xiaoqun Liu
Tianlai Jin
Dongkuan Wu
G. Chesi
Ngai Wong
Hao Yu
67
2
0
20 Aug 2024
LeCov: Multi-level Testing Criteria for Large Language Models
Xuan Xie
Jiayang Song
Yuheng Huang
Da Song
Fuyuan Zhang
Felix Juefei-Xu
Lei Ma
ELM
94
0
0
20 Aug 2024
Enhancing One-shot Pruned Pre-trained Language Models through Sparse-Dense-Sparse Mechanism
Guanchen Li
Xiandong Zhao
Lian Liu
Zeping Li
Dong Li
Lu Tian
Jie He
Ashish Sirasao
E. Barsoum
VLM
54
1
0
20 Aug 2024
Goldfish: Monolingual Language Models for 350 Languages
Tyler A. Chang
Catherine Arnett
Zhuowen Tu
Benjamin Bergen
LRM
132
10
0
19 Aug 2024