Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,938 papers shown
Title
MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced Reranking and Noise-injected Training
Rivik Setty
Chengjin Xu
Vinay Setty
Jian Guo
87
13
0
31 Jul 2024
Enabling Contextual Soft Moderation on Social Media through Contrastive Textual Deviation
Pujan Paudel
Mohammad Hammas Saeed
Rebecca Auger
Chris Wells
Gianluca Stringhini
118
2
0
30 Jul 2024
Pruning Large Language Models with Semi-Structural Adaptive Sparse Training
Weiyu Huang
Yuezhou Hu
Guohao Jian
Jun Zhu
Jianfei Chen
107
8
0
30 Jul 2024
Machine Unlearning in Generative AI: A Survey
Zheyuan Liu
Guangyao Dou
Zhaoxuan Tan
Yijun Tian
Meng Jiang
MU
109
19
0
30 Jul 2024
Harvesting Textual and Structured Data from the HAL Publication Repository
Francis Kulumba
Wissam Antoun
Guillaume Vimont
Laurent Romary
113
2
0
30 Jul 2024
Mixture of Nested Experts: Adaptive Processing of Visual Tokens
Gagan Jain
Nidhi Hegde
Aditya Kusupati
Arsha Nagrani
Shyamal Buch
Prateek Jain
Anurag Arnab
Sujoy Paul
MoE
111
8
0
29 Jul 2024
Sentiment Analysis of Lithuanian Online Reviews Using Large Language Models
Brigita Vileikytė
M. Lukoševičius
Lukas Stankevicius
89
1
0
29 Jul 2024
Practical and Reproducible Symbolic Music Generation by Large Language Models with Structural Embeddings
Seungyeon Rhyu
Kichang Yang
Sungjun Cho
Jaehyeon Kim
Kyogu Lee
Moontae Lee
109
0
0
29 Jul 2024
Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks
Marco AF Pimentel
Clément Christophe
Tathagata Raha
Prateek Munjal
Praveen K Kanithi
Shadab Khan
ELM
80
3
0
29 Jul 2024
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval
Xin Zhang
Yanzhao Zhang
Dingkun Long
Wen Xie
Ziqi Dai
...
Pengjun Xie
Fei Huang
Meishan Zhang
Wenjie Li
Min Zhang
141
109
0
29 Jul 2024
QAEA-DR: A Unified Text Augmentation Framework for Dense Retrieval
Hongming Tan
Shaoxiong Zhan
Hai Lin
Hai-Tao Zheng
Wai Kin Chan
RALM
116
2
0
29 Jul 2024
Why Misinformation is Created? Detecting them by Integrating Intent Features
Bing Wang
Ximing Li
C. Li
Bo Fu
Songwen Pei
Shengsheng Wang
81
3
0
27 Jul 2024
Optimizing Numerical Estimation and Operational Efficiency in the Legal Domain through Large Language Models
Jia-Hong Huang
Chao-Chun Yang
Yixian Shen
A. M. Pacces
Evangelos Kanoulas
ELM, AILaw
100
6
0
26 Jul 2024
Granularity is crucial when applying differential privacy to text: An investigation for neural machine translation
Doan Nam Long Vu
Timour Igamberdiev
Ivan Habernal
79
0
0
26 Jul 2024
Knowledge Graph Structure as Prompt: Improving Small Language Models Capabilities for Knowledge-based Causal Discovery
Yuni Susanti
Michael Färber
83
3
0
26 Jul 2024
AutoRDF2GML: Facilitating RDF Integration in Graph Machine Learning
Michael Färber
David Lamprecht
Yuni Susanti
AI4CE
87
1
0
26 Jul 2024
Using Large Language Models for the Interpretation of Building Regulations
Stefan Fuchs
Michael Witbrock
J. Dimyadi
Robert Amor
AI4CE, AILaw
56
0
0
26 Jul 2024
Fairness Definitions in Language Models Explained
Thang Viet Doan
Zhibo Chu
Zichong Wang
Wenbin Zhang
ALM
113
10
0
26 Jul 2024
Exploring Bengali Religious Dialect Biases in Large Language Models with Evaluation Perspectives
Azmine Toushik Wasi
Raima Islam
Mst Rafia Islam
Taki Hasan Rafi
Dong-Kyu Chae
131
5
0
25 Jul 2024
Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning
Tianduo Wang
Shichen Li
Wei Lu
LRM, AI4CE
88
20
1
25 Jul 2024
Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic
Fakhraddin Alwajih
Gagan Bhatia
Muhammad Abdul-Mageed
63
7
0
25 Jul 2024
Fine-Tuning Large Language Models for Stock Return Prediction Using Newsflow
Tian Guo
E. Hauptmann
AIFin
87
5
0
25 Jul 2024
Positive Text Reframing under Multi-strategy Optimization
Shutong Jia
Biwei Cao
Qingqing Gao
Jiuxin Cao
Bo Liu
58
1
0
25 Jul 2024
Exploring Description-Augmented Dataless Intent Classification
Ruoyu Hu
Foaad Khosmood
Abbas Edalat
AI4TS
100
0
0
25 Jul 2024
An Efficient Inference Framework for Early-exit Large Language Models
Ruijie Miao
Yihan Yan
Xinshuo Yao
Tong Yang
71
0
0
25 Jul 2024
LoRA-Pro: Are Low-Rank Adapters Properly Optimized?
Zhengbo Wang
Jian Liang
Ran He
Zilei Wang
Tieniu Tan
186
29
0
25 Jul 2024
Learn while Unlearn: An Iterative Unlearning Framework for Generative Language Models
Haoyu Tang
Ye Liu
Xukai Liu
Xukai Liu
Yanghai Zhang
Kai Zhang
Xiaofang Zhou
Enhong Chen
MU
160
3
0
25 Jul 2024
Time Matters: Examine Temporal Effects on Biomedical Language Models
Weisi Liu
Zhe He
Xiaolei Huang
69
6
0
24 Jul 2024
Reporting and Analysing the Environmental Impact of Language Models on the Example of Commonsense Question Answering with External Knowledge
Aida Usmanova
Junbo Huang
Debayan Banerjee
Ricardo Usbeck
66
1
0
24 Jul 2024
Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation
Chak Tou Leong
Hongru Cai
Wenjie Wang
Leigang Qu
Yinwei Wei
Wenjie Li
Liqiang Nie
Tat-Seng Chua
DiffM
69
1
0
24 Jul 2024
Zero-Shot vs. Few-Shot Multi-Speaker TTS Using Pre-trained Czech SpeechT5 Model
Jan Lehecka
Z. Hanzlícek
J. Matousek
Daniel Tihelka
66
0
0
24 Jul 2024
PatchFinder: A Two-Phase Approach to Security Patch Tracing for Disclosed Vulnerabilities in Open-Source Software
Kai-Jing Li
Jian Zhang
Sen Chen
Han Liu
Yang Liu
Yixiang Chen
57
4
0
24 Jul 2024
Train-Attention: Meta-Learning Where to Focus in Continual Knowledge Learning
Yeongbin Seo
Dongha Lee
Jinyoung Yeo
CLL, KELM
163
2
0
24 Jul 2024
Sentiment Reasoning for Healthcare
Khai-Nguyen Nguyen
Khai Le-Duc
Bach Phan Tat
Duy Le
Jerry Ngo
Long Vo-Dang
LRM
116
0
0
24 Jul 2024
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects
Junyi Li
Junfeng Wu
Weizhi Zhao
Song Bai
Xiang Bai
88
3
0
23 Jul 2024
Exploring the Effectiveness and Consistency of Task Selection in Intermediate-Task Transfer Learning
Pin-Jie Lin
Miaoran Zhang
Marius Mosbach
Dietrich Klakow
44
0
0
23 Jul 2024
A Comprehensive Survey of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO and More
Zhichao Wang
Bin Bi
Shiva K. Pentyala
Kiran Ramnath
Sougata Chaudhuri
...
Z. Zhu
Xiang-Bo Mao
S. Asur
Na Cheng
OffRL
97
58
0
23 Jul 2024
UniMEL: A Unified Framework for Multimodal Entity Linking with Large Language Models
Liu Qi
Yongyi He
Lian Defu
Zhi Zheng
Tong Xu
Liu Che
Chen Enhong
MLLM
82
2
0
23 Jul 2024
Finetuning Generative Large Language Models with Discrimination Instructions for Knowledge Graph Completion
Yang Liu
Xiaobin Tian
Zequn Sun
Wei Hu
95
4
0
23 Jul 2024
Promises and Pitfalls of Generative Masked Language Modeling: Theoretical Framework and Practical Guidelines
Yuchen Li
Alexandre Kirchmeyer
Aashay Mehta
Yilong Qin
Boris Dadachev
Kishore Papineni
Sanjiv Kumar
Andrej Risteski
126
2
0
22 Jul 2024
Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight
Ziyuan Huang
Kaixiang Ji
Biao Gong
Zhiwu Qing
Qinglong Zhang
Kecheng Zheng
Jian Wang
Jingdong Chen
Ming Yang
LRM
75
2
0
22 Jul 2024
Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget
Vikash Sehwag
Xianghao Kong
Jingtao Li
Michael Spranger
Lingjuan Lyu
DiffM
90
11
0
22 Jul 2024
FSboard: Over 3 million characters of ASL fingerspelling collected via smartphones
Manfred Georg
Garrett Tanzer
Saad Hassan
Max Shengelia
Esha Uboweja
Sam S. Sepah
Sean Forbes
Thad Starner
57
0
0
22 Jul 2024
MoRSE: Bridging the Gap in Cybersecurity Expertise with Retrieval Augmented Generation
Marco Simoni
Andrea Saracino
Vinod Puthuvath
Mauro Conti
119
4
0
22 Jul 2024
SETTP: Style Extraction and Tunable Inference via Dual-level Transferable Prompt Learning
Chunzhen Jin
Yongfeng Huang
Yaqi Wang
Peng Cao
Osmar Zaiane
VLM
94
1
0
22 Jul 2024
On the Automated Processing of User Feedback
Walid Maalej
V. Biryuk
Jialiang Wei
Fabian Panse
68
6
0
22 Jul 2024
Local All-Pair Correspondence for Point Tracking
Seokju Cho
Jiahui Huang
Jisu Nam
Honggyu An
Seungryong Kim
Joon-Young Lee
107
28
0
22 Jul 2024
Chronologically Accurate Retrieval for Temporal Grounding of Motion-Language Models
Kent Fujiwara
Mikihiro Tanaka
Qing Yu
92
2
0
22 Jul 2024
ALLaM: Large Language Models for Arabic and English
M Saiful Bari
Yazeed Alnumay
Norah A. Alzahrani
Nouf M. Alotaibi
H. A. Alyahya
...
Jeril Kuriakose
Abdalghani Abujabal
Nora Al-Twairesh
Areeb Alowisheq
Haidar Khan
77
17
0
22 Jul 2024
Towards Robust Vision Transformer via Masked Adaptive Ensemble
Fudong Lin
Jiadong Lou
Xu Yuan
Nianfeng Tzeng
ViT, AAML
100
2
0
22 Jul 2024