Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,925 papers shown
RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning
Yinpei Dai
Jayjun Lee
Nima Fazeli
Joyce Chai
71
13
0
23 Sep 2024
Speechworthy Instruction-tuned Language Models
Hyundong Justin Cho
Nicolaas Jedema
Leonardo F. R. Ribeiro
Karishma Sharma
Pedro Szekely
Alessandro Moschitti
Ruben Janssen
Jonathan May
ALM
85
1
0
23 Sep 2024
Multi-modal Generative AI: Multi-modal LLMs, Diffusions and the Unification
X. Wang
Yuwei Zhou
Bin Huang
Hong Chen
Wenwu Zhu
DiffM
158
9
0
23 Sep 2024
Pretraining Data Detection for Large Language Models: A Divergence-based Calibration Method
Weichao Zhang
Ruqing Zhang
Jiafeng Guo
Maarten de Rijke
Yixing Fan
Xueqi Cheng
156
16
0
23 Sep 2024
Can pre-trained language models generate titles for research papers?
Tohida Rehman
Debarshi Kumar Sanyal
S. Chattopadhyay
99
3
0
22 Sep 2024
Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment
Yuxiao Chen
Keqin Li
Wentao Bao
Deep Patel
Yu Kong
Martin Renqiang Min
Dimitris N. Metaxas
DiffM
93
1
0
22 Sep 2024
Work Smarter Not Harder: Simple Imitation Learning with CS-PIBT Outperforms Large Scale Imitation Learning for MAPF
Rishi Veerapaneni
Arthur Jakobsson
Kevin Ren
Samuel Kim
Jiaoyang Li
Maxim Likhachev
76
1
0
22 Sep 2024
Effectively Enhancing Vision Language Large Models by Prompt Augmentation and Caption Utilization
Minyi Zhao
Jie Wang
Zerui Li
Jiyuan Zhang
Zhenbang Sun
Shuigeng Zhou
MLLM, VLM
138
0
0
22 Sep 2024
SAC-KG: Exploiting Large Language Models as Skilled Automatic Constructors for Domain Knowledge Graphs
Hanzhu Chen
Xu Shen
Qitan Lv
Jie Wang
Xiaoqi Ni
Jieping Ye
79
10
0
22 Sep 2024
Unveiling Narrative Reasoning Limits of Large Language Models with Trope in Movie Synopses
Hung-Ting Su
Ya-Ching Hsu
Xudong Lin
Xiang Qian Shi
Yulei Niu
Han-Yuan Hsu
Hung-yi Lee
Winston H. Hsu
LRM
55
1
0
22 Sep 2024
Generalization in birdsong classification: impact of transfer learning methods and dataset characteristics
Burooj Ghani
Vincent J. Kalkman
Bob Planqué
Willem-Pier Vellinga
L. Gill
Dan Stowell
VLM
69
6
0
21 Sep 2024
AMT-APC: Automatic Piano Cover by Fine-Tuning an Automatic Music Transcription Model
Kazuma Komiya
Yoshihisa Fukuhara
60
0
0
21 Sep 2024
FAMOUS: Flexible Accelerator for the Attention Mechanism of Transformer on UltraScale+ FPGAs
Ehsan Kabir
Md. Arafat Kabir
Austin R. J. Downey
Jason D. Bakos
David Andrews
Miaoqing Huang
GNN
66
0
0
21 Sep 2024
One Model is All You Need: ByT5-Sanskrit, a Unified Model for Sanskrit NLP Tasks
Sebastian Nehrdich
Oliver Hellwig
Kurt Keutzer
62
5
0
20 Sep 2024
Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-Tuning
Daniele Rege Cambrin
Giuseppe Gallipoli
Irene Benedetto
Luca Cagliero
Paolo Garza
55
0
0
20 Sep 2024
ShizishanGPT: An Agricultural Large Language Model Integrating Tools and Resources
Shuting Yang
Zehui Liu
Wolfgang Mayer
RALM
56
3
0
20 Sep 2024
Towards Long-Context Time Series Foundation Models
Nina Żukowska
Mononito Goswami
Michał Wiliński
Willa Potosnak
Artur Dubrawski
AI4TS
61
3
0
20 Sep 2024
EMMeTT: Efficient Multimodal Machine Translation Training
Piotr Żelasko
Zhehuai Chen
Mengru Wang
Daniel Galvez
Oleksii Hrinchuk
Shuoyang Ding
Ke Hu
Jagadeesh Balam
Vitaly Lavrukhin
Boris Ginsburg
85
1
0
20 Sep 2024
Imagine yourself: Tuning-Free Personalized Image Generation
Zecheng He
Bo Sun
Felix Juefei-Xu
Haoyu Ma
Ankit Ramchandani
...
Ning Zhang
Peizhao Zhang
Roshan Sumbaly
Peter Vajda
Animesh Sinha
DiffM
102
19
0
20 Sep 2024
Towards LifeSpan Cognitive Systems
Yu Wang
Chi Han
Tongtong Wu
Xiaoxin He
Wangchunshu Zhou
...
Zexue He
Wei Wang
Gholamreza Haffari
Heng Ji
Julian McAuley
KELM, CLL
486
2
0
20 Sep 2024
Exploring Scaling Laws for Local SGD in Large Language Model Training
Qiaozhi He
Xiaomin Zhuang
Zhihua Wu
92
4
0
20 Sep 2024
OATS: Outlier-Aware Pruning Through Sparse and Low Rank Decomposition
Stephen Zhang
Vardan Papyan
VLM
166
3
0
20 Sep 2024
Cross-Domain Content Generation with Domain-Specific Small Language Models
Ankit Maloo
Abhinav Garg
CLL
47
0
0
19 Sep 2024
Exploring Large Language Models for Product Attribute Value Identification
Kassem Sabeh
Mouna Kacimi
Johann Gamper
Robert Litschko
Barbara Plank
75
2
0
19 Sep 2024
Text2Traj2Text: Learning-by-Synthesis Framework for Contextual Captioning of Human Movement Trajectories
Hikaru Asano
Ryo Yonetani
Taiki Sekii
Hiroki Ouchi
105
0
0
19 Sep 2024
Enhancing SLM via ChatGPT and Dataset Augmentation
Tom Pieper
Mohamad Ballout
U. Krumnack
Gunther Heidemann
Kai-Uwe Kühnberger
97
0
0
19 Sep 2024
Efficient Knowledge Distillation: Empowering Small Language Models with Teacher Model Insights
Mohamad Ballout
U. Krumnack
Gunther Heidemann
Kai-Uwe Kühnberger
95
3
0
19 Sep 2024
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning
Xiaotian Han
Yiren Jian
Xuefeng Hu
Haogeng Liu
Yiqi Wang
...
Yuang Ai
Huaibo Huang
Ran He
Zhenheng Yang
Quanzeng You
LRM, AI4CE
62
22
0
19 Sep 2024
LLMR: Knowledge Distillation with a Large Language Model-Induced Reward
Dongheng Li
Yongchang Hao
Lili Mou
114
2
0
19 Sep 2024
From Linguistic Giants to Sensory Maestros: A Survey on Cross-Modal Reasoning with Large Language Models
Shengsheng Qian
Zuyi Zhou
Dizhan Xue
Bing Wang
Changsheng Xu
LRM
154
2
0
19 Sep 2024
Small Language Models are Equation Reasoners
Bumjun Kim
Kunha Lee
Juyeon Kim
Sangam Lee
ReLM, LRM
45
3
0
19 Sep 2024
AudioComposer: Towards Fine-grained Audio Generation with Natural Language Descriptions
Yun Wang
Hangting Chen
Dongchao Yang
Zhiyong Wu
Xixin Wu
DiffM
97
2
0
19 Sep 2024
Tokenization for Molecular Foundation Models
Alexius Wadell
Anoushka Bhutani
Venkatasubramanian Viswanathan
476
1
0
19 Sep 2024
Ethical software requirements from user reviews: A systematic literature review
Aakash Sorathiya
Gouri Ginde
40
2
0
18 Sep 2024
Fine-Tuning a Time Series Foundation Model with Wasserstein Loss
Andrei Chernov
AI4TS
38
0
0
18 Sep 2024
Computational Imaging for Long-Term Prediction of Solar Irradiance
Leron Julian
Haejoon Lee
S. Kar
Aswin C. Sankaranarayanan
74
0
0
18 Sep 2024
FLARE: Fusing Language Models and Collaborative Architectures for Recommender Enhancement
Liam Hebert
Marialena Kyriakidi
Hubert Pham
Krishna Sayana
James Pine
Sukhdeep S. Sodhi
Ambarish Jash
VLM
101
4
0
18 Sep 2024
Augment, Drop & Swap: Improving Diversity in LLM Captions for Efficient Music-Text Representation Learning
Ilaria Manco
Justin Salamon
Oriol Nieto
62
2
0
17 Sep 2024
Enriching Datasets with Demographics through Large Language Models: What's in a Name?
Khaled AlNuaimi
Gautier Marti
Mathieu Ravaut
Abdulla Alketbi
Andreas Henschel
Raed Jaradat
68
1
0
17 Sep 2024
Diversify and Conquer: Diversity-Centric Data Selection with Iterative Refinement
Simon Yu
Liangyu Chen
Sara Ahmadian
Marzieh Fadaee
80
7
0
17 Sep 2024
SOAP: Improving and Stabilizing Shampoo using Adam
Nikhil Vyas
Depen Morwani
Rosie Zhao
Itai Shapira
David Brandfonbrener
Lucas Janson
Sham Kakade
169
38
0
17 Sep 2024
Beyond LoRA: Exploring Efficient Fine-Tuning Techniques for Time Series Foundational Models
Divij Gupta
Anubhav Bhatti
Surajsinh Parmar
AI4TS
87
2
0
17 Sep 2024
Leveraging Distillation Techniques for Document Understanding: A Case Study with FLAN-T5
Marcel Lamott
Muhammad Armaghan Shakir
70
0
0
17 Sep 2024
Evaluating the Impact of Compression Techniques on Task-Specific Performance of Large Language Models
Bishwash Khanal
Jeffery M. Capone
94
1
0
17 Sep 2024
Attention-Seeker: Dynamic Self-Attention Scoring for Unsupervised Keyphrase Extraction
Erwin D. López Z.
Cheng Tang
Atsushi Shimada
48
1
0
17 Sep 2024
Chain-of-Thought Prompting for Speech Translation
Ke Hu
Zhehuai Chen
Chao-Han Huck Yang
Piotr Żelasko
Oleksii Hrinchuk
Vitaly Lavrukhin
Jagadeesh Balam
Boris Ginsburg
LRM
173
9
0
17 Sep 2024
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse
Maojia Song
Shang Hong Sim
Rishabh Bhardwaj
Hai Leong Chieu
Navonil Majumder
Soujanya Poria
132
12
0
17 Sep 2024
Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models
Bingchen Liu
Ehsan Akhgari
Alexander Visheratin
Aleks Kamko
Linmiao Xu
Shivam Shrirao
Joao Souza
Suhail Doshi
Daiqing Li
DiffM, MLLM
111
60
0
16 Sep 2024
FakeMusicCaps: a Dataset for Detection and Attribution of Synthetic Music Generated via Text-to-Music Models
Luca Comanducci
Paolo Bestagini
Stefano Tubaro
69
7
0
16 Sep 2024
Exploring Fine-tuned Generative Models for Keyphrase Selection: A Case Study for Russian
Anna Glazkova
Dmitry A. Morozov
59
1
0
16 Sep 2024