ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,952 papers shown
Title
On the Opportunities and Challenges of Foundation Models for Geospatial
  Artificial Intelligence
On the Opportunities and Challenges of Foundation Models for Geospatial Artificial Intelligence
Gengchen Mai
Weiming Huang
Jin Sun
Suhang Song
Deepak Mishra
...
Yingjie Hu
Chris Cundy
Ziyuan Li
Rui Zhu
Ni Lao
AI4CE
122
134
0
13 Apr 2023
Automated Mapping of CVE Vulnerability Records to MITRE CWE Weaknesses
Automated Mapping of CVE Vulnerability Records to MITRE CWE Weaknesses
Ashraf Haddad
N. Aaraj
Preslav Nakov
Septimiu Fabian Mare
50
6
0
13 Apr 2023
Are LLMs All You Need for Task-Oriented Dialogue?
Are LLMs All You Need for Task-Oriented Dialogue?
Vojtvech Hudevcek
Ondrej Dusek
94
62
0
13 Apr 2023
LeafAI: query generator for clinical cohort discovery rivaling a human
  programmer
LeafAI: query generator for clinical cohort discovery rivaling a human programmer
Nicholas J. Dobbins
Bin Han
Weipeng Zhou
Kristine Lan
H. N. Kim
R. Harrington
Özlem Uzuner
Meliha Yetisgen-Yildiz
90
9
0
13 Apr 2023
Global Prompt Cell: A Portable Control Module for Effective Prompt
  Tuning
Global Prompt Cell: A Portable Control Module for Effective Prompt Tuning
Chi-Liang Liu
Hao Wang
Nuwa Xi
Sendong Zhao
Bing Qin
VLM
79
1
0
12 Apr 2023
Exploring the Use of Foundation Models for Named Entity Recognition and
  Lemmatization Tasks in Slavic Languages
Exploring the Use of Foundation Models for Named Entity Recognition and Lemmatization Tasks in Slavic Languages
Gabriela Pałka
Artur Nowakowski
54
2
0
11 Apr 2023
DISTO: Evaluating Textual Distractors for Multi-Choice Questions using
  Negative Sampling based Approach
DISTO: Evaluating Textual Distractors for Multi-Choice Questions using Negative Sampling based Approach
Bilal Ghanem
Alona Fyshe
64
4
0
10 Apr 2023
FlexMoE: Scaling Large-scale Sparse Pre-trained Model Training via
  Dynamic Device Placement
FlexMoE: Scaling Large-scale Sparse Pre-trained Model Training via Dynamic Device Placement
Xiaonan Nie
Xupeng Miao
Zilong Wang
Zichao Yang
Jilong Xue
Lingxiao Ma
Gang-Ming Cao
Tengjiao Wang
MoE
95
50
0
08 Apr 2023
Similarity search in the blink of an eye with compressed indices
Similarity search in the blink of an eye with compressed indices
Cecilia Aguerrebere
Ishwar Bhati
Mark Hildebrand
Mariano Tepper
Ted Willke
83
30
0
07 Apr 2023
Language Models are Causal Knowledge Extractors for Zero-shot Video
  Question Answering
Language Models are Causal Knowledge Extractors for Zero-shot Video Question Answering
Hung-Ting Su
Yulei Niu
Xudong Lin
Winston H. Hsu
Shih-Fu Chang
VGenELM
112
6
0
07 Apr 2023
Should ChatGPT be Biased? Challenges and Risks of Bias in Large Language
  Models
Should ChatGPT be Biased? Challenges and Risks of Bias in Large Language Models
Emilio Ferrara
SILM
134
264
0
07 Apr 2023
Does Prompt-Tuning Language Model Ensure Privacy?
Does Prompt-Tuning Language Model Ensure Privacy?
Shangyu Xie
Wei Dai
Esha Ghosh
Sambuddha Roy
Dan Schwartz
Kim Laine
SILM
97
4
0
07 Apr 2023
Bridging the Language Gap: Knowledge Injected Multilingual Question
  Answering
Bridging the Language Gap: Knowledge Injected Multilingual Question Answering
Zhichao Duan
Xiuxing Li
Zhengyan Zhang
Zhenyu Li
Ning Liu
Jianyong Wang
72
8
0
06 Apr 2023
ERRA: An Embodied Representation and Reasoning Architecture for
  Long-horizon Language-conditioned Manipulation Tasks
ERRA: An Embodied Representation and Reasoning Architecture for Long-horizon Language-conditioned Manipulation Tasks
Chao Zhao
Shuai Yuan
Chunli Jiang
Junhao Cai
Hongyu Yu
M. Y. Wang
Qifeng Chen
LM&Ro
86
15
0
05 Apr 2023
AToMiC: An Image/Text Retrieval Test Collection to Support Multimedia
  Content Creation
AToMiC: An Image/Text Retrieval Test Collection to Support Multimedia Content Creation
Jheng-Hong Yang
Carlos Lassance
Rafael Sampaio de Rezende
Krishna Srinivasan
Miriam Redi
Stéphane Clinchant
Jimmy J. Lin
91
12
0
04 Apr 2023
LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of
  Large Language Models
LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models
Zhiqiang Hu
Lei Wang
Yihuai Lan
Wanyu Xu
Ee-Peng Lim
Lidong Bing
Xing Xu
Soujanya Poria
Roy Ka-wei Lee
ALM
181
275
0
04 Apr 2023
Mastering Symbolic Operations: Augmenting Language Models with Compiled
  Neural Networks
Mastering Symbolic Operations: Augmenting Language Models with Compiled Neural Networks
Yixuan Weng
Minjun Zhu
Fei Xia
Bin Li
Shizhu He
Kang Liu
Jun Zhao
103
6
0
04 Apr 2023
Unsupervised Improvement of Factual Knowledge in Language Models
Unsupervised Improvement of Factual Knowledge in Language Models
Nafis Sadeq
Byungkyu Kang
Prarit Lamba
Julian McAuley
KELM
85
7
0
04 Apr 2023
One Small Step for Generative AI, One Giant Leap for AGI: A Complete
  Survey on ChatGPT in AIGC Era
One Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC Era
Chaoning Zhang
Chenshuang Zhang
Chenghao Li
Yu Qiao
Sheng Zheng
...
Sung-Ho Bae
Lik-Hang Lee
Pan Hui
In So Kweon
Choong Seon Hong
LM&MAAI4MHLRMELM
106
138
0
04 Apr 2023
Safety Analysis in the Era of Large Language Models: A Case Study of
  STPA using ChatGPT
Safety Analysis in the Era of Large Language Models: A Case Study of STPA using ChatGPT
Yi Qi
Xingyu Zhao
Siddartha Khastgir
Xiaowei Huang
82
17
0
03 Apr 2023
Self-Supervised learning for Neural Architecture Search (NAS)
Self-Supervised learning for Neural Architecture Search (NAS)
Samuel Ducros
SSL
67
1
0
03 Apr 2023
Dialog-to-Actions: Building Task-Oriented Dialogue System via
  Action-Level Generation
Dialog-to-Actions: Building Task-Oriented Dialogue System via Action-Level Generation
Yuncheng Hua
Xiangyu Xi
Zhengsong Jiang
Guanwei Zhang
Chaobo Sun
Guanglu Wan
Wei Ye
LLMAG
53
1
0
03 Apr 2023
Multi-modal Fake News Detection on Social Media via Multi-grained
  Information Fusion
Multi-modal Fake News Detection on Social Media via Multi-grained Information Fusion
Yangming Zhou
Yuzhou Yang
Qichao Ying
Zhenxing Qian
Xinpeng Zhang
69
45
0
03 Apr 2023
Evaluating Large Language Models on a Highly-specialized Topic,
  Radiation Oncology Physics
Evaluating Large Language Models on a Highly-specialized Topic, Radiation Oncology Physics
J. Holmes
Zheng Liu
Hua Zhou
Yuzhen Ding
Terence T. Sio
...
Jonathan B. Ashman
Xiang Li
Tianming Liu
Jiajian Shen
Wen Liu
LM&MAAI4CEELM
94
124
0
01 Apr 2023
CQSumDP: A ChatGPT-Annotated Resource for Query-Focused Abstractive
  Summarization Based on Debatepedia
CQSumDP: A ChatGPT-Annotated Resource for Query-Focused Abstractive Summarization Based on Debatepedia
Md Tahmid Rahman Laskar
Mizanur Rahman
Israt Jahan
Enamul Hoque
J. Huang
86
9
0
31 Mar 2023
Social Honeypot for Humans: Luring People through Self-managed Instagram
  Pages
Social Honeypot for Humans: Luring People through Self-managed Instagram Pages
Sara Bardi
Mauro Conti
Luca Pajola
Pier Paolo Tricomi
58
1
0
31 Mar 2023
Elastic Weight Removal for Faithful and Abstractive Dialogue Generation
Elastic Weight Removal for Faithful and Abstractive Dialogue Generation
Nico Daheim
Nouha Dziri
Mrinmaya Sachan
Iryna Gurevych
Edoardo Ponti
MoMe
108
31
0
30 Mar 2023
WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for
  Audio-Language Multimodal Research
WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research
Xinhao Mei
Chutong Meng
Haohe Liu
Qiuqiang Kong
Tom Ko
Chengqi Zhao
Mark D. Plumbley
Yuexian Zou
Wenwu Wang
184
220
0
30 Mar 2023
QUADRo: Dataset and Models for QUestion-Answer Database Retrieval
QUADRo: Dataset and Models for QUestion-Answer Database Retrieval
S. Campese
Ivano Lauriola
Alessandro Moschitti
60
5
0
30 Mar 2023
InceptionNeXt: When Inception Meets ConvNeXt
InceptionNeXt: When Inception Meets ConvNeXt
Weihao Yu
Pan Zhou
Shuicheng Yan
Xinchao Wang
193
142
0
29 Mar 2023
Zero-Shot Generalizable End-to-End Task-Oriented Dialog System using
  Context Summarization and Domain Schema
Zero-Shot Generalizable End-to-End Task-Oriented Dialog System using Context Summarization and Domain Schema
A. Mosharrof
M. H. Maqbool
A.B. Siddique
VLM
76
5
0
28 Mar 2023
Exploring Natural Language Processing Methods for Interactive Behaviour
  Modelling
Exploring Natural Language Processing Methods for Interactive Behaviour Modelling
Guanhua Zhang
Matteo Bortoletto
Zhiming Hu
Lei Shi
Mihai Bâce
Andreas Bulling
59
3
0
28 Mar 2023
Explicit Planning Helps Language Models in Logical Reasoning
Explicit Planning Helps Language Models in Logical Reasoning
Hongyu Zhao
Kangrui Wang
Mo Yu
Hongyuan Mei
LRMReLM
134
17
0
28 Mar 2023
Anti-DreamBooth: Protecting users from personalized text-to-image
  synthesis
Anti-DreamBooth: Protecting users from personalized text-to-image synthesis
T. Le
Hao Phung
Thuan Hoang Nguyen
Quan Dao
Ngoc N. Tran
Anh Tran
111
100
0
27 Mar 2023
Sigmoid Loss for Language Image Pre-Training
Sigmoid Loss for Language Image Pre-Training
Xiaohua Zhai
Basil Mustafa
Alexander Kolesnikov
Lucas Beyer
CLIPVLM
328
1,208
0
27 Mar 2023
Scaling Pre-trained Language Models to Deeper via Parameter-efficient
  Architecture
Scaling Pre-trained Language Models to Deeper via Parameter-efficient Architecture
Peiyu Liu
Ze-Feng Gao
Yushuo Chen
Wayne Xin Zhao
Ji-Rong Wen
MoE
74
0
0
27 Mar 2023
On the Creativity of Large Language Models
On the Creativity of Large Language Models
Giorgio Franceschelli
Mirco Musolesi
224
60
0
27 Mar 2023
Natural Language Reasoning, A Survey
Natural Language Reasoning, A Survey
Fei Yu
Hongbo Zhang
Prayag Tiwari
Benyou Wang
ReLMLRM
173
64
0
26 Mar 2023
Fine-Tashkeel: Finetuning Byte-Level Models for Accurate Arabic Text
  Diacritization
Fine-Tashkeel: Finetuning Byte-Level Models for Accurate Arabic Text Diacritization
Bashar Al-Rfooh
Gheith A. Abandah
Rami Al-Rfou
54
7
0
25 Mar 2023
Automatic Generation of Multiple-Choice Questions
Automatic Generation of Multiple-Choice Questions
Cheng Zhang
58
7
0
25 Mar 2023
COFFEE: A Contrastive Oracle-Free Framework for Event Extraction
COFFEE: A Contrastive Oracle-Free Framework for Event Extraction
Meiru Zhang
Yixuan Su
Zaiqiao Meng
Z. Fu
Nigel Collier
75
4
0
25 Mar 2023
SPEC: Summary Preference Decomposition for Low-Resource Abstractive
  Summarization
SPEC: Summary Preference Decomposition for Low-Resource Abstractive Summarization
Yi-Syuan Chen
Yun-Zhu Song
Hong-Han Shuai
66
6
0
24 Mar 2023
DreamBooth3D: Subject-Driven Text-to-3D Generation
DreamBooth3D: Subject-Driven Text-to-3D Generation
Amit Raj
S. Kaza
Ben Poole
Michael Niemeyer
Nataniel Ruiz
...
Kfir Aberman
Michael Rubinstein
Jonathan T. Barron
Yuanzhen Li
Varun Jampani
DiffM
148
229
0
23 Mar 2023
Neuro-Symbolic Execution of Generic Source Code
Neuro-Symbolic Execution of Generic Source Code
Yaojie Hu
Jin Tian
NAI
92
0
0
23 Mar 2023
CoBIT: A Contrastive Bi-directional Image-Text Generation Model
CoBIT: A Contrastive Bi-directional Image-Text Generation Model
Haoxuan You
Mandy Guo
Zhecan Wang
Kai-Wei Chang
Jason Baldridge
Jiahui Yu
DiffM
94
13
0
23 Mar 2023
Paraphrasing evades detectors of AI-generated text, but retrieval is an
  effective defense
Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense
Kalpesh Krishna
Yixiao Song
Marzena Karpinska
John Wieting
Mohit Iyyer
DeLMO
121
325
0
23 Mar 2023
GETT-QA: Graph Embedding based T2T Transformer for Knowledge Graph
  Question Answering
GETT-QA: Graph Embedding based T2T Transformer for Knowledge Graph Question Answering
Debayan Banerjee
Pranav Ajit Nair
Ricardo Usbeck
Chris Biemann
94
9
0
23 Mar 2023
Beyond Universal Transformer: block reusing with adaptor in Transformer
  for automatic speech recognition
Beyond Universal Transformer: block reusing with adaptor in Transformer for automatic speech recognition
Haoyu Tang
Zhaoyi Liu
Chang Zeng
Xinfeng Li
67
1
0
23 Mar 2023
Open-Vocabulary Object Detection using Pseudo Caption Labels
Open-Vocabulary Object Detection using Pseudo Caption Labels
Han-Cheol Cho
Won Young Jhoo
Woohyun Kang
Byungseok Roh
VLMObjD
74
20
0
23 Mar 2023
SPeC: A Soft Prompt-Based Calibration on Performance Variability of
  Large Language Model in Clinical Notes Summarization
SPeC: A Soft Prompt-Based Calibration on Performance Variability of Large Language Model in Clinical Notes Summarization
Yu-Neng Chuang
Ruixiang Tang
Xiaoqian Jiang
Helen Zhou
LM&MA
74
21
0
23 Mar 2023
Previous
123...134135136...198199200
Next