Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,925 papers shown
A Knowledge-Enhanced Disease Diagnosis Method Based on Prompt Learning and BERT Integration
Zhang Zheng
37
0
0
16 Sep 2024
MGSA: Multi-Granularity Graph Structure Attention for Knowledge Graph-to-Text Generation
Shanshan Wang
C. Zhang
Ning Zhang
68
0
0
16 Sep 2024
Trustworthiness in Retrieval-Augmented Generation Systems: A Survey
Yujia Zhou
Yan Liu
Xiaoxi Li
Jiajie Jin
Hongjin Qian
Zheng Liu
Chaozhuo Li
Zhicheng Dou
Tsung-Yi Ho
Philip S. Yu
3DV RALM
116
39
0
16 Sep 2024
Latent Diffusion Models for Controllable RNA Sequence Generation
Kaixuan Huang
Yukang Yang
Kaidi Fu
Yanyi Chu
Le Cong
Mengdi Wang
89
2
0
15 Sep 2024
GP-GPT: Large Language Model for Gene-Phenotype Mapping
Yanjun Lyu
Zihao Wu
Lu Zhang
Jing Zhang
Yiwei Li
...
Rongjie Liu
Chao Huang
Wentao Li
Tianming Liu
Dajiang Zhu
LM&MA
52
4
0
15 Sep 2024
Generalizing Alignment Paradigm of Text-to-Image Generation with Preferences through $f$-divergence Minimization
Haoyuan Sun
Bo Xia
Yongzhe Chang
Xueqian Wang
EGVM
67
6
0
15 Sep 2024
AlpaPICO: Extraction of PICO Frames from Clinical Trial Documents Using LLMs
Madhusudan Ghosh
Shrimon Mukherjee
Asmit Ganguly
Partha Basuchowdhuri
S. Naskar
Debasis Ganguly
99
8
0
15 Sep 2024
ESPnet-EZ: Python-only ESPnet for Easy Fine-tuning and Integration
Masao Someki
Kwanghee Choi
Siddhant Arora
William Chen
Samuele Cornell
Jionghao Han
Yifan Peng
Jiatong Shi
Vaibhav Srivastav
Shinji Watanabe
VLM
109
0
0
14 Sep 2024
Synthetic4Health: Generating Annotated Synthetic Clinical Letters
Libo Ren
Samuel Belkadi
Lifeng Han
Warren Del-Pinto
Goran Nenadic
SyDa
59
2
0
14 Sep 2024
Text Prompt is Not Enough: Sound Event Enhanced Prompt Adapter for Target Style Audio Generation
Chenxu Xiong
Ruibo Fu
Shuchen Shi
Zhengqi Wen
Jianhua Tao
...
Chunyu Qiang
Yuankun Xie
Xin Qi
Guanjun Li
Zizheng Yang
DiffM
82
0
0
14 Sep 2024
Prevailing Research Areas for Music AI in the Era of Foundation Models
Megan Wei
M. Modrzejewski
Aswin Sivaraman
Dorien Herremans
MedIm
94
2
0
14 Sep 2024
A Compressive Memory-based Retrieval Approach for Event Argument Extraction
Wanlong Liu
Enqi Zhang
Li Zhou
DingYi Zeng
Shaohuan Cheng
Chen Zhang
Malu Zhang
Wenyu Chen
RALM
90
0
0
14 Sep 2024
Unleash LLMs Potential for Recommendation by Coordinating Twin-Tower Dynamic Semantic Token Generator
Jun Yin
Zhengxin Zeng
Mingzheng Li
Hao Yan
Chaozhuo Li
...
Denvy Deng
Feng Sun
Qi Zhang
Shirui Pan
Senzhang Wang
55
3
0
14 Sep 2024
ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds
Sreyan Ghosh
Sonal Kumar
Chandra Kiran Reddy Evuru
Oriol Nieto
R. Duraiswami
Dinesh Manocha
VLM
119
6
0
13 Sep 2024
DomURLs_BERT: Pre-trained BERT-based Model for Malicious Domains and URLs Detection and Classification
Abdelkader El Mahdaouy
Salima Lamsiyah
Meryem Janati Idrissi
H. Alami
Zakaria Yartaoui
Ismail Berrada
53
3
0
13 Sep 2024
Knowledge Tagging with Large Language Model based Multi-Agent System
Hang Li
Tianlong Xu
Ethan Chang
Qingsong Wen
LLMAG
50
0
0
12 Sep 2024
Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Hongyu Li
Tianrui Hui
Zihan Ding
Jing Zhang
Bin Ma
Xiaoming Wei
Jizhong Han
Si Liu
DiffM
72
2
0
12 Sep 2024
TheraGen: Therapy for Every Generation
Kartikey Doshi
Jimit Shah
Narendra Shekokar
AI4MH
53
0
0
12 Sep 2024
Towards a graph-based foundation model for network traffic analysis
Louis Van Langendonck
Ismael Castell-Uroz
Pere Barlet-Ros
74
1
0
12 Sep 2024
The CLC-UKET Dataset: Benchmarking Case Outcome Prediction for the UK Employment Tribunal
Huiyuan Xie
Felix Steffek
Joana Ribeiro de Faria
Christine Carter
Jonathan Rutherford
AILaw
65
3
0
12 Sep 2024
Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation
Junsung Lee
Minsoo Kang
Bohyung Han
DiffM VLM
45
3
0
12 Sep 2024
TSELM: Target Speaker Extraction using Discrete Tokens and Language Models
Beilong Tang
Bang Zeng
Ming Li
89
4
0
12 Sep 2024
Controllable Synthetic Clinical Note Generation with Privacy Guarantees
Tal Baumel
Andre Manoel
Daniel Jones
Shize Su
Huseyin A. Inan
Aaron Bornstein
Robert Sim
30
1
0
12 Sep 2024
On the Role of Context in Reading Time Prediction
Andreas Opedal
Eleanor Chodroff
Ryan Cotterell
Ethan Gotlieb Wilcox
107
8
0
12 Sep 2024
Retro-li: Small-Scale Retrieval Augmented Generation Supporting Noisy Similarity Searches and Domain Shift Generalization
Gentiana Rashiti
G. Karunaratne
Mrinmaya Sachan
Abu Sebastian
Abbas Rahimi
RALM
237
0
0
12 Sep 2024
Recent Trends of Multimodal Affective Computing: A Survey from NLP Perspective
Guimin Hu
Yi Xin
Weimin Lyu
Haojian Huang
Chang Sun
Zehan Zhu
Lin Gui
Ruichu Cai
Erik Cambria
Hasti Seifi
105
6
0
11 Sep 2024
PiTe: Pixel-Temporal Alignment for Large Video-Language Model
Yang Liu
Pengxiang Ding
Siteng Huang
Min Zhang
Han Zhao
Donglin Wang
91
7
0
11 Sep 2024
Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Model
Daehee Kim
Deokhyung Kang
Sangwon Ryu
Gary Geunbae Lee
64
1
0
11 Sep 2024
Understanding Knowledge Drift in LLMs through Misinformation
Alina Fastowski
Gjergji Kasneci
KELM
64
2
0
11 Sep 2024
Unveiling Markov Heads in Pretrained Language Models for Offline Reinforcement Learning
Wenhao Zhao
Qiushui Xu
Linjie Xu
Lei Song
Jinyu Wang
Chunlai Zhou
Jiang Bian
87
0
0
11 Sep 2024
DA-MoE: Towards Dynamic Expert Allocation for Mixture-of-Experts Models
Maryam Akhavan Aghdam
Hongpeng Jin
Yanzhao Wu
MoE
65
3
0
10 Sep 2024
Fine-tuning and Prompt Engineering with Cognitive Knowledge Graphs for Scholarly Knowledge Organization
Gollam Rabby
Sören Auer
Jennifer D'Souza
A. Oelen
328
2
0
10 Sep 2024
Extracting Paragraphs from LLM Token Activations
Nicholas Pochinkov
Angelo Benoit
Lovkush Agarwal
Zainab Ali Majid
Lucile Ter-Minassian
79
2
0
10 Sep 2024
Keyword-Aware ASR Error Augmentation for Robust Dialogue State Tracking
Jihyun Lee
Solee Im
Wonjun Lee
Gary Geunbae Lee
75
0
0
10 Sep 2024
RNR: Teaching Large Language Models to Follow Roles and Rules
Kuan-Chieh Wang
Alexander Bukharin
Haoming Jiang
Qingyu Yin
Zhengyang Wang
...
Chao Zhang
Bing Yin
Xian Li
Jianshu Chen
Shiyang Li
ALM
84
2
0
10 Sep 2024
STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning
Jaeseong Lee
seung-won hwang
Aurick Qiao
Daniel F Campos
Z. Yao
Yuxiong He
65
3
0
10 Sep 2024
What is the Role of Small Models in the LLM Era: A Survey
Lihu Chen
Gaël Varoquaux
ALM
257
32
0
10 Sep 2024
DetoxBench: Benchmarking Large Language Models for Multitask Fraud & Abuse Detection
Joymallya Chakraborty
Wei Xia
Anirban Majumder
Dan Ma
Walid Chaabene
Naveed Janvekar
49
3
0
09 Sep 2024
Real-Time Human Action Recognition on Embedded Platforms
Ruiqi Wang
Zichen Wang
Peiqi Gao
Mingzhen Li
Jaehwan Jeong
Yihang Xu
Yejin Lee
Carolyn M. Baum
Lisa Connor
Chenyang Lu
98
3
0
09 Sep 2024
Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling
Georgios Pantazopoulos
Malvina Nikandrou
Alessandro Suglia
Oliver Lemon
Arash Eshghi
Mamba
84
2
0
09 Sep 2024
Improving Pretraining Data Using Perplexity Correlations
Tristan Thrush
Christopher Potts
Tatsunori Hashimoto
112
22
0
09 Sep 2024
Generative Recommender with End-to-End Learnable Item Tokenization
Enze Liu
Bowen Zheng
Cheng Ling
Lantao Hu
Han Li
Wayne Xin Zhao
91
3
0
09 Sep 2024
Expanding Expressivity in Transformer Models with MöbiusAttention
Anna-Maria Halacheva
M. Nayyeri
Steffen Staab
84
1
0
08 Sep 2024
ELMS: Elasticized Large Language Models On Mobile Devices
Wangsong Yin
Rongjie Yi
Daliang Xu
Gang Huang
Mengwei Xu
Xuanzhe Liu
80
6
0
08 Sep 2024
Maximizing Relation Extraction Potential: A Data-Centric Study to Unveil Challenges and Opportunities
Anushka Swarup
Avanti Bhandarkar
Olivia P. Dizon-Paradis
Ronald Wilson
D. Woodard
111
1
0
07 Sep 2024
AnyMatch -- Efficient Zero-Shot Entity Matching with a Small Language Model
Zeyu Zhang
Paul Groth
Iacer Calixto
Sebastian Schelter
94
4
0
06 Sep 2024
Large Language Models in Drug Discovery and Development: From Disease Mechanisms to Clinical Trials
Yizhen Zheng
Huan Yee Koh
M. Yang
Li Li
Lauren T. May
Geoffrey I. Webb
Shirui Pan
George Church
LM&MA
100
13
0
06 Sep 2024
Generating Faithful and Salient Text from Multimodal Data
Tahsina Hashem
Weiqing Wang
Derry Tanti Wijaya
Mohammed Eunus Ali
Yuan-Fang Li
103
0
0
06 Sep 2024
An overview of domain-specific foundation model: key technologies, applications and challenges
Haolong Chen
Hanzhi Chen
Zijian Zhao
Kaifeng Han
Guangxu Zhu
Yichen Zhao
Ying Du
Wei Xu
Qingjiang Shi
ALM VLM
131
5
0
06 Sep 2024
How Does Code Pretraining Affect Language Model Task Performance?
Jackson Petty
Sjoerd van Steenkiste
Tal Linzen
133
13
0
06 Sep 2024