ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,973 papers shown
Title
Encoder-Decoder Framework for Interactive Free Verses with Generation
  with Controllable High-Quality Rhyming
Encoder-Decoder Framework for Interactive Free Verses with Generation with Controllable High-Quality Rhyming
Tommaso Pasini
Alejo López-Ávila
Husam Quteineh
Gerasimos Lampouras
Jinhua Du
Yubing Wang
Ze Li
Yusen Sun
70
0
0
08 May 2024
Critical Infrastructure Protection: Generative AI, Challenges, and
  Opportunities
Critical Infrastructure Protection: Generative AI, Challenges, and Opportunities
Yagmur Yigit
M. Ferrag
Iqbal H. Sarker
Leandros A. Maglaras
Christos Chrysoulas
Naghmeh Moradpoor
Helge Janicke
65
8
0
08 May 2024
APrompt4EM: Augmented Prompt Tuning for Generalized Entity Matching
APrompt4EM: Augmented Prompt Tuning for Generalized Entity Matching
Yikuan Xia
Jiazun Chen
Xinchi Li
Jun Gao
VLM
120
3
0
08 May 2024
THRONE: An Object-based Hallucination Benchmark for the Free-form Generations of Large Vision-Language Models
THRONE: An Object-based Hallucination Benchmark for the Free-form Generations of Large Vision-Language Models
Prannay Kaul
Zhizhong Li
Hao Yang
Yonatan Dukler
Ashwin Swaminathan
C. Taylor
Stefano Soatto
HILM
174
18
0
08 May 2024
Large Language Models for Cyber Security: A Systematic Literature Review
Large Language Models for Cyber Security: A Systematic Literature Review
HanXiang Xu
Shenao Wang
Ningke Li
Kaidi Wang
Yanjie Zhao
Kai Chen
Ting Yu
Yang Liu
Haoyu Wang
139
43
0
08 May 2024
Bridging the Bosphorus: Advancing Turkish Large Language Models through
  Strategies for Low-Resource Language Adaptation and Benchmarking
Bridging the Bosphorus: Advancing Turkish Large Language Models through Strategies for Low-Resource Language Adaptation and Benchmarking
Emre Can Acikgoz
Mete Erdogan
Deniz Yuret
86
8
0
07 May 2024
Understanding the Capabilities and Limitations of Large Language Models
  for Cultural Commonsense
Understanding the Capabilities and Limitations of Large Language Models for Cultural Commonsense
Siqi Shen
Lajanugen Logeswaran
Moontae Lee
Honglak Lee
Soujanya Poria
Rada Mihalcea
AI4MHLRMELM
115
33
0
07 May 2024
Switchable Decision: Dynamic Neural Generation Networks
Switchable Decision: Dynamic Neural Generation Networks
Shujian Zhang
Korawat Tanwisuth
Chengyue Gong
Pengcheng He
Mi Zhou
BDL
77
0
0
07 May 2024
Learning To See But Forgetting To Follow: Visual Instruction Tuning
  Makes LLMs More Prone To Jailbreak Attacks
Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes LLMs More Prone To Jailbreak Attacks
Georgios Pantazopoulos
Amit Parekh
Malvina Nikandrou
Alessandro Suglia
115
5
0
07 May 2024
Mitigating Clickbait: An Approach to Spoiler Generation Using Multitask
  Learning
Mitigating Clickbait: An Approach to Spoiler Generation Using Multitask Learning
Sayantan Pal
Souvik Das
Rohini Srihari
63
1
0
07 May 2024
Sign2GPT: Leveraging Large Language Models for Gloss-Free Sign Language
  Translation
Sign2GPT: Leveraging Large Language Models for Gloss-Free Sign Language Translation
Ryan Wong
Necati Cihan Camgöz
Richard Bowden
SLR
104
26
0
07 May 2024
Evaluating Text Summaries Generated by Large Language Models Using
  OpenAI's GPT
Evaluating Text Summaries Generated by Large Language Models Using OpenAI's GPT
Hassan Shakil
Atqiya Munawara Mahi
Phuoc Nguyen
Zeydy Ortiz
M. Mardini
ELM
62
6
0
07 May 2024
Utilizing GPT to Enhance Text Summarization: A Strategy to Minimize
  Hallucinations
Utilizing GPT to Enhance Text Summarization: A Strategy to Minimize Hallucinations
Hassan Shakil
Zeydy Ortiz
Grant C. Forbes
101
3
0
07 May 2024
Long Context Alignment with Short Instructions and Synthesized Positions
Long Context Alignment with Short Instructions and Synthesized Positions
Wenhao Wu
Yizhong Wang
Yao Fu
Xiang Yue
Dawei Zhu
Sujian Li
SyDa
86
19
0
07 May 2024
KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference
  with Coupled Quantization
KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization
Tianyi Zhang
Jonah Yi
Zhaozhuo Xu
Anshumali Shrivastava
MQ
68
32
0
07 May 2024
FlashBack:Efficient Retrieval-Augmented Language Modeling for Long Context Inference
FlashBack:Efficient Retrieval-Augmented Language Modeling for Long Context Inference
Runheng Liu
Xingchen Xiao
Heyan Huang
Zewen Chi
Zhijing Wu
RALMKELM
89
0
0
07 May 2024
Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore
Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore
Junchao Wu
Runzhe Zhan
Derek F. Wong
Shu Yang
Xuebo Liu
Lidia S. Chao
Min Zhang
DeLMO
125
5
0
07 May 2024
Self-Improving Customer Review Response Generation Based on LLMs
Self-Improving Customer Review Response Generation Based on LLMs
Guy Azov
Tatiana Pelc
Adi Fledel Alon
Gila Kamhi
76
2
0
06 May 2024
Position: Leverage Foundational Models for Black-Box Optimization
Position: Leverage Foundational Models for Black-Box Optimization
Xingyou Song
Yingtao Tian
Robert Tjarko Lange
Chansoo Lee
Yujin Tang
Yutian Chen
99
9
0
06 May 2024
Is Sora a World Simulator? A Comprehensive Survey on General World
  Models and Beyond
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
Zheng Zhu
Xiaofeng Wang
Wangbo Zhao
Chen Min
Nianchen Deng
...
Dawei Zhao
Liang Xiao
Jian-jun Zhao
Jiwen Lu
Guan Huang
VGenLM&Ro
187
48
0
06 May 2024
Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval
Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval
Jiacheng Cheng
Hijung Valentina Shin
Nuno Vasconcelos
Bryan C. Russell
Fabian Caba Heilbron
VLM
70
1
0
06 May 2024
Lory: Fully Differentiable Mixture-of-Experts for Autoregressive
  Language Model Pre-training
Lory: Fully Differentiable Mixture-of-Experts for Autoregressive Language Model Pre-training
Zexuan Zhong
Mengzhou Xia
Danqi Chen
Mike Lewis
MoE
110
19
0
06 May 2024
Parameter-Efficient Fine-Tuning with Discrete Fourier Transform
Parameter-Efficient Fine-Tuning with Discrete Fourier Transform
Ziqi Gao
Qichao Wang
Aochuan Chen
Zijing Liu
Bingzhe Wu
Liang Chen
Jia Li
105
35
0
05 May 2024
SkelCap: Automated Generation of Descriptive Text from Skeleton Keypoint
  Sequences
SkelCap: Automated Generation of Descriptive Text from Skeleton Keypoint Sequences
Ali Emre Keskin
H. Keles
SLR
63
0
0
05 May 2024
Enabling Patient-side Disease Prediction via the Integration of Patient
  Narratives
Enabling Patient-side Disease Prediction via the Integration of Patient Narratives
Zhixiang Su
Yinan Zhang
Jiazheng Jing
Jie Xiao
Zhiqi Shen
42
0
0
05 May 2024
Data-Efficient Molecular Generation with Hierarchical Textual Inversion
Data-Efficient Molecular Generation with Hierarchical Textual Inversion
Seojin Kim
Jaehyun Nam
Sihyun Yu
Younghoon Shin
Jinwoo Shin
137
3
0
05 May 2024
Stochastic RAG: End-to-End Retrieval-Augmented Generation through
  Expected Utility Maximization
Stochastic RAG: End-to-End Retrieval-Augmented Generation through Expected Utility Maximization
Hamed Zamani
Michael Bendersky
120
29
0
05 May 2024
Assessing Adversarial Robustness of Large Language Models: An Empirical
  Study
Assessing Adversarial Robustness of Large Language Models: An Empirical Study
Zeyu Yang
Zhao Meng
Xiaochen Zheng
Roger Wattenhofer
ELMAAML
93
10
0
04 May 2024
Large Language Models estimate fine-grained human color-concept
  associations
Large Language Models estimate fine-grained human color-concept associations
Kushin Mukherjee
Timothy T. Rogers
Karen B. Schloss
VLM
106
4
0
04 May 2024
Overview of the EHRSQL 2024 Shared Task on Reliable Text-to-SQL Modeling
  on Electronic Health Records
Overview of the EHRSQL 2024 Shared Task on Reliable Text-to-SQL Modeling on Electronic Health Records
Gyubok Lee
Sunjun Kweon
Seongsu Bae
Edward Choi
66
2
0
04 May 2024
CALRec: Contrastive Alignment of Generative LLMs For Sequential
  Recommendation
CALRec: Contrastive Alignment of Generative LLMs For Sequential Recommendation
Yaoyiran Li
Xiang Zhai
M. Alzantot
Keyi Yu
Ivan Vulić
Anna Korhonen
Mohamed Hammad
90
16
0
03 May 2024
Vibe-Eval: A hard evaluation suite for measuring progress of multimodal
  language models
Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models
Piotr Padlewski
Max Bain
Matthew Henderson
Zhongkai Zhu
Nishant Relan
...
Che Zheng
Cyprien de Masson dÁutume
Dani Yogatama
Mikel Artetxe
Yi Tay
VLM
152
27
0
03 May 2024
Parameter-Efficient Instruction Tuning of Large Language Models For
  Extreme Financial Numeral Labelling
Parameter-Efficient Instruction Tuning of Large Language Models For Extreme Financial Numeral Labelling
Subhendu Khatuya
Rajdeep Mukherjee
Akash Ghosh
Manjunath Hegde
Koustuv Dasgupta
Niloy Ganguly
Saptarshi Ghosh
Pawan Goyal
74
3
0
03 May 2024
Instruction-Guided Bullet Point Summarization of Long Financial Earnings
  Call Transcripts
Instruction-Guided Bullet Point Summarization of Long Financial Earnings Call Transcripts
Subhendu Khatuya
Koushiki Sinha
Niloy Ganguly
Saptarshi Ghosh
Pawan Goyal
63
4
0
03 May 2024
Hoaxpedia: A Unified Wikipedia Hoax Articles Dataset
Hoaxpedia: A Unified Wikipedia Hoax Articles Dataset
Hsuvas Borkakoty
Luis Espinosa-Anke
91
1
0
03 May 2024
SUKHSANDESH: An Avatar Therapeutic Question Answering Platform for
  Sexual Education in Rural India
SUKHSANDESH: An Avatar Therapeutic Question Answering Platform for Sexual Education in Rural India
Salam Michael Singh
Shubhmoy Kumar Garg
Amitesh Misra
Aaditeshwar Seth
Tanmoy Chakraborty
73
0
0
03 May 2024
A Survey of Time Series Foundation Models: Generalizing Time Series
  Representation with Large Language Model
A Survey of Time Series Foundation Models: Generalizing Time Series Representation with Large Language Model
Weiqi Zhang
Jiexia Ye
Ke Yi
Yongzi Yu
Ziyue Li
Jia Li
Fugee Tsung
AI4TSAI4CE
103
29
0
03 May 2024
Understanding Position Bias Effects on Fairness in Social Multi-Document
  Summarization
Understanding Position Bias Effects on Fairness in Social Multi-Document Summarization
Olubusayo Olabisi
Ameeta Agrawal
92
2
0
03 May 2024
Large Language Models for UAVs: Current State and Pathways to the Future
Large Language Models for UAVs: Current State and Pathways to the Future
Shumaila Javaid
Nasir Saeed
Bin He
102
26
0
02 May 2024
COPAL: Continual Pruning in Large Language Generative Models
COPAL: Continual Pruning in Large Language Generative Models
Srikanth Malla
Joon Hee Choi
Chiho Choi
VLMCLL
92
2
0
02 May 2024
Improving Subject-Driven Image Synthesis with Subject-Agnostic Guidance
Improving Subject-Driven Image Synthesis with Subject-Agnostic Guidance
Kelvin C. K. Chan
Yang Zhao
Xuhui Jia
Ming-Hsuan Yang
Huisheng Wang
123
3
0
02 May 2024
DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines
DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines
Ye Tian
Zhen Jia
Ziyue Luo
Yida Wang
Chuan Wu
AI4CE
61
4
0
02 May 2024
Efficient Data Generation for Source-grounded Information-seeking
  Dialogs: A Use Case for Meeting Transcripts
Efficient Data Generation for Source-grounded Information-seeking Dialogs: A Use Case for Meeting Transcripts
Lotem Golany
Filippo Galgani
Maya Mamo
Nimrod Parasol
Omer Vandsburger
Nadav Bar
Ido Dagan
90
2
0
02 May 2024
Modeling Empathetic Alignment in Conversation
Modeling Empathetic Alignment in Conversation
Jiamin Yang
David Jurgens
72
0
0
02 May 2024
SonicDiffusion: Audio-Driven Image Generation and Editing with
  Pretrained Diffusion Models
SonicDiffusion: Audio-Driven Image Generation and Editing with Pretrained Diffusion Models
Burak Can Biner
Farrin Marouf Sofian
Umur Berkay Karakacs
Duygu Ceylan
Erkut Erdem
Aykut Erdem
77
9
0
01 May 2024
Uncovering Agendas: A Novel French & English Dataset for Agenda
  Detection on Social Media
Uncovering Agendas: A Novel French & English Dataset for Agenda Detection on Social Media
Gregorios A. Katsios
Ning Sa
Ankita Bhaumik
T. Strzalkowski
78
0
0
01 May 2024
When Quantization Affects Confidence of Large Language Models?
When Quantization Affects Confidence of Large Language Models?
Irina Proskurina
Luc Brun
Guillaume Metzler
Julien Velcin
MQ
131
2
0
01 May 2024
Investigating Automatic Scoring and Feedback using Large Language Models
Investigating Automatic Scoring and Feedback using Large Language Models
G. Katuka
Alexander Gain
Yen-Yun Yu
AI4EdALM
66
3
0
01 May 2024
CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions
CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions
Donghee Choi
Mogan Gim
Donghyeon Park
Mujeen Sung
Hyunjae Kim
Jaewoo Kang
Jihun Choi
77
1
0
01 May 2024
Navigating WebAI: Training Agents to Complete Web Tasks with Large
  Language Models and Reinforcement Learning
Navigating WebAI: Training Agents to Complete Web Tasks with Large Language Models and Reinforcement Learning
Lucas-Andrei Thil
Mirela Popa
Gerasimos Spanakis
LLMAG
49
2
0
01 May 2024
Previous
123...616263...198199200
Next