Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,955 papers shown
Pre-training Language Model as a Multi-perspective Course Learner
Beiduo Chen
Shaohan Huang
Zi-qiang Zhang
Wu Guo
Zhen-Hua Ling
Haizhen Huang
Furu Wei
Weiwei Deng
Qi Zhang
63
0
0
06 May 2023
TASTY: A Transformer based Approach to Space and Time complexity
K. Moudgalya
Ankit Ramakrishnan
Vamsikrishna Chemudupati
Xinghai Lu
62
3
0
06 May 2023
NorBench -- A Benchmark for Norwegian Language Models
David Samuel
Andrey Kutuzov
Samia Touileb
Erik Velldal
Lilja Øvrelid
Egil Rønningstad
Elina Sigdel
Anna Palatkina
93
25
0
06 May 2023
Stylized Data-to-Text Generation: A Case Study in the E-Commerce Domain
Liqiang Jing
Xuemeng Song
Xuming Lin
Zhongzhou Zhao
Wei Zhou
Liqiang Nie
93
17
0
05 May 2023
VicunaNER: Zero/Few-shot Named Entity Recognition using Vicuna
Shezheng Song
69
13
0
05 May 2023
VideoOFA: Two-Stage Pre-Training for Video-to-Text Generation
Xilun Chen
L. Yu
Wenhan Xiong
Barlas Oğuz
Yashar Mehdad
Wen-tau Yih
VGen
58
3
0
04 May 2023
Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs
Jinyang Li
Binyuan Hui
Ge Qu
Jiaxi Yang
Binhua Li
...
Guoliang Li
Kevin C. C. Chang
Fei Huang
Reynold Cheng
Yongbin Li
LMTD
184
422
0
04 May 2023
Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation
Do Xuan Long
Bowei Zou
Shafiq Joty
Anh Tai Tran
Liangming Pan
Nancy F. Chen
Ai Ti Aw
74
8
0
04 May 2023
Towards Weakly-Supervised Hate Speech Classification Across Datasets
Yiping Jin
Leo Wanner
Vishakha Kadam
A. Shvets
79
5
0
04 May 2023
Black-box Prompt Tuning with Subspace Learning
Yuanhang Zheng
Zhixing Tan
Peng Li
Yang Liu
VLM
153
11
0
04 May 2023
ChatGraph: Interpretable Text Classification by Converting ChatGPT Knowledge to Graphs
Yucheng Shi
Hehuan Ma
Wenliang Zhong
Qiaoyu Tan
Gengchen Mai
Xiang Li
Tianming Liu
Junzhou Huang
AI4MH
75
35
0
03 May 2023
Entity Tracking in Language Models
Najoung Kim
Sebastian Schuster
151
22
0
03 May 2023
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes
Cheng-Yu Hsieh
Chun-Liang Li
Chih-Kuan Yeh
Hootan Nakhost
Yasuhisa Fujii
Alexander Ratner
Ranjay Krishna
Chen-Yu Lee
Tomas Pfister
ALM
371
563
0
03 May 2023
The Benefits of Label-Description Training for Zero-Shot Text Classification
Lingyu Gao
Debanjan Ghosh
Kevin Gimpel
VLM
200
11
0
03 May 2023
GPT-RE: In-context Learning for Relation Extraction using Large Language Models
Zhen Wan
Fei Cheng
Zhuoyuan Mao
Qianying Liu
Haiyue Song
Jiwei Li
Sadao Kurohashi
LRM
117
94
0
03 May 2023
Can Large Language Models Be an Alternative to Human Evaluations?
Cheng-Han Chiang
Hung-yi Lee
ALM, LM&MA
320
634
0
03 May 2023
Generative Meta-Learning for Zero-Shot Relation Triplet Extraction
Wanli Li
T. Qian
Yi Song
Zeyu Zhang
Jiawei Li
Zhuang Chen
Lixin Zou
160
1
0
03 May 2023
Distill or Annotate? Cost-Efficient Fine-Tuning of Compact Models
Junmo Kang
Wei Xu
Alan Ritter
128
15
0
02 May 2023
Unlimiformer: Long-Range Transformers with Unlimited Length Input
Amanda Bertsch
Uri Alon
Graham Neubig
Matthew R. Gormley
RALM
211
130
0
02 May 2023
FreeLM: Fine-Tuning-Free Language Model
Xiang Li
Xin Jiang
Xuying Meng
Aixin Sun
Yequan Wang
84
3
0
02 May 2023
Text-Blueprint: An Interactive Platform for Plan-based Conditional Generation
Fantine Huot
Joshua Maynez
Shashi Narayan
Reinald Kim Amplayo
Kuzman Ganchev
Annie Louis
Anders Sandholm
Dipanjan Das
Mirella Lapata
108
8
0
28 Apr 2023
HQP: A Human-Annotated Dataset for Detecting Online Propaganda
Abdurahman Maarouf
Dominik Bär
Dominique Geissler
Stefan Feuerriegel
80
10
0
28 Apr 2023
Prompt Engineering for Healthcare: Methodologies and Applications
Jiaqi Wang
Enze Shi
Sigang Yu
Zihao Wu
Chong Ma
...
Dajiang Zhu
Yixuan Yuan
Dinggang Shen
Tianming Liu
Shu Zhang
LM&MA
136
116
0
28 Apr 2023
IconShop: Text-Guided Vector Icon Synthesis with Autoregressive Transformers
Rong Wu
Wanchao Su
Kede Ma
Jing Liao
119
41
0
27 Apr 2023
π-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation
Chengyue Wu
Teng Wang
Yixiao Ge
Zeyu Lu
Rui-Zhi Zhou
Ying Shan
Ping Luo
MoMe
145
37
0
27 Apr 2023
mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality
Qinghao Ye
Haiyang Xu
Guohai Xu
Jiabo Ye
Ming Yan
...
Junfeng Tian
Qiang Qi
Ji Zhang
Feiyan Huang
Jingren Zhou
VLM, MLLM
315
957
0
27 Apr 2023
ChatGPT vs State-of-the-Art Models: A Benchmarking Study in Keyphrase Generation Task
Roberto Martínez-Cruz
Alvaro J. López-López
J. Portela
106
23
0
27 Apr 2023
PVP: Pre-trained Visual Parameter-Efficient Tuning
Zhao Song
Ke Yang
Naiyang Guan
Junjie Zhu
Peng Qiao
Qingyong Hu
VPVLM, VLM
71
3
0
26 Apr 2023
Neuro-symbolic Zero-Shot Code Cloning with Cross-Language Intermediate Representation
Krishnam Hasija
Shrishti Pradhan
Manasi Patwardhan
Raveendra Kumar Medicherla
Lovekesh Vig
Ravindra Naik
59
2
0
26 Apr 2023
SCM: Enhancing Large Language Model with Self-Controlled Memory Framework
Bin Wang
Xinnian Liang
Jian Yang
Huijia Huang
Shuangzhi Wu
Peihao Wu
Lu Lu
Zejun Ma
Zhoujun Li
LLMAG, KELM, RALM
150
29
0
26 Apr 2023
TABLET: Learning From Instructions For Tabular Data
Dylan Slack
Sameer Singh
LMTD, ALM, RALM
82
19
0
25 Apr 2023
Hypernymization of named entity-rich captions for grounding-based multi-modal pretraining
Giacomo Nebbia
Adriana Kovashka
110
0
0
25 Apr 2023
Evaluating Inter-Bilingual Semantic Parsing for Indian Languages
Divyanshu Aggarwal
V. Gupta
Anoop Kunchukuttan
96
3
0
25 Apr 2023
Test-Time Adaptation with Perturbation Consistency Learning
Yi Su
Yixin Ji
Juntao Li
Hai Ye
Hao Fei
VLM
77
2
0
25 Apr 2023
Hint-Aug: Drawing Hints from Foundation Vision Transformers Towards Boosted Few-Shot Parameter-Efficient Tuning
Zhongzhi Yu
Shang Wu
Y. Fu
Shunyao Zhang
Yingyan Lin
86
6
0
25 Apr 2023
TIGTEC : Token Importance Guided TExt Counterfactuals
Milan Bhan
Jean-Noel Vittaut
Nicolas Chesneau
Marie-Jeanne Lesot
102
9
0
24 Apr 2023
Benchmarking ChatGPT-4 on ACR Radiation Oncology In-Training (TXIT) Exam and Red Journal Gray Zone Cases: Potentials and Challenges for AI-Assisted Medical Education and Decision Making in Radiation Oncology
Yixing Huang
A. Gomaa
S. Semrau
M. Haderlein
S. Lettmaier
...
L. Distel
Andreas Maier
R. Fietkau
Christoph Bert
F. Putz
ELM, LM&MA, AI4MH
73
9
0
24 Apr 2023
Differentiate ChatGPT-generated and Human-written Medical Texts
Wenxiong Liao
Zheng Liu
Haixing Dai
Shaochen Xu
Zihao Wu
...
Xiaoke Huang
Dajiang Zhu
Hongmin Cai
Tianming Liu
Xiang Li
LM&MA, DeLMO, MedIm, AI4MH
65
60
0
23 Apr 2023
Divide and Prompt: Chain of Thought Prompting for Text-to-SQL
X. Liu
Zhao Tan
ReLM, LRM
103
17
0
23 Apr 2023
ChatABL: Abductive Learning via Natural Language Interaction with ChatGPT
Tianyang Zhong
Yaonai Wei
Li Yang
Zihao Wu
Zheng Liu
...
Xi Jiang
Jun-Feng Han
Dinggang Shen
Tianming Liu
Tuo Zhang
LRM
81
29
0
21 Apr 2023
Evaluating Transformer Language Models on Arithmetic Operations Using Number Decomposition
Matteo Muffo
A. Cocco
Enrico Bertino
ReLM
75
25
0
21 Apr 2023
MPMQA: Multimodal Question Answering on Product Manuals
Liangfu Zhang
Anwen Hu
Jing Zhang
Shuo Hu
Qin Jin
84
10
0
19 Apr 2023
CodeKGC: Code Language Model for Generative Knowledge Graph Construction
Zhen Bi
Jing Chen
Yinuo Jiang
Feiyu Xiong
Wei Guo
Huajun Chen
Ningyu Zhang
68
42
0
18 Apr 2023
On Uncertainty Calibration and Selective Generation in Probabilistic Neural Summarization: A Benchmark Study
Polina Zablotskaia
Du Phan
Joshua Maynez
Shashi Narayan
Jie Jessie Ren
J. Liu
UQLM, UQCV
76
20
0
17 Apr 2023
Learning to Compress Prompts with Gist Tokens
Jesse Mu
Xiang Lisa Li
Noah D. Goodman
VLM
154
228
0
17 Apr 2023
Tool Learning with Foundation Models
Yujia Qin
Shengding Hu
Yankai Lin
Weize Chen
Ning Ding
...
Cheng Yang
Tongshuang Wu
Heng Ji
Zhiyuan Liu
Maosong Sun
150
222
0
17 Apr 2023
Prediction-Oriented Bayesian Active Learning
Freddie Bickford-Smith
Andreas Kirsch
Sebastian Farquhar
Y. Gal
Adam Foster
Tom Rainforth
95
36
0
17 Apr 2023
Towards Better Instruction Following Language Models for Chinese: Investigating the Impact of Training Data and Evaluation
Yunjie Ji
Yan Gong
Yong Deng
Yiping Peng
Qiang Niu
Baochang Ma
Xiangang Li
ALM, ELM
107
25
0
16 Apr 2023
ArguGPT: evaluating, understanding and identifying argumentative essays generated by GPT models
Yikang Liu
Ziyin Zhang
Wanyang Zhang
Shisen Yue
Xiaojing Zhao
Xinyuan Cheng
Yiwen Zhang
Hai Hu
DeLMO
103
55
0
16 Apr 2023
Vax-Culture: A Dataset for Studying Vaccine Discourse on Twitter
M. Zarei
M. Christensen
S. Everts
Majid Komeili
39
1
0
13 Apr 2023