Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,870 papers shown
Title
Thinking about GPT-3 In-Context Learning for Biomedical IE? Think Again
Bernal Jiménez Gutiérrez
Nikolas McNeal
Clay Washington
You Chen
Lang Li
Huan Sun
Yu-Chuan Su
110
158
0
16 Mar 2022
Multilingual Generative Language Models for Zero-Shot Cross-Lingual Event Argument Extraction
Kuan-Hao Huang
I-Hung Hsu
Premkumar Natarajan
Kai-Wei Chang
Nanyun Peng
65
68
0
15 Mar 2022
Hyperdecoders: Instance-specific decoders for multi-task NLP
Hamish Ivison
Matthew E. Peters
AI4CE
113
22
0
15 Mar 2022
Representation Learning for Resource-Constrained Keyphrase Generation
Di Wu
Wasi Uddin Ahmad
Sunipa Dev
Kai-Wei Chang
83
18
0
15 Mar 2022
Things not Written in Text: Exploring Spatial Commonsense from Visual Signals
Xiao Liu
Da Yin
Yansong Feng
Dongyan Zhao
LRM
80
46
0
15 Mar 2022
Modular and Parameter-Efficient Multimodal Fusion with Prompting
Sheng Liang
Mengjie Zhao
Hinrich Schütze
90
45
0
15 Mar 2022
Graph Pre-training for AMR Parsing and Generation
Xuefeng Bai
Yulong Chen
Yue Zhang
SSL
112
103
0
15 Mar 2022
UniSAr: A Unified Structure-Aware Autoregressive Language Model for Text-to-SQL
Longxu Dou
Yan Gao
Mingyang Pan
Dingzirui Wang
Wanxiang Che
Dechen Zhan
Jian-Guang Lou
102
27
0
15 Mar 2022
ViWOZ: A Multi-Domain Task-Oriented Dialogue Systems Dataset For Low-resource Language
Phi Nguyen Van
Tung Cao Hoang
Dũng Nguyễn Mạnh
Q. Minh
Long Tran Quoc
81
3
0
15 Mar 2022
ReACC: A Retrieval-Augmented Code Completion Framework
Shuai Lu
Nan Duan
Hojae Han
Daya Guo
Seung-won Hwang
Alexey Svyatkovskiy
84
149
0
15 Mar 2022
Efficient Long Sequence Encoding via Synchronization
Xiangyang Mou
Mo Yu
Bingsheng Yao
Lifu Huang
61
0
0
15 Mar 2022
Do Language Models Plagiarize?
Jooyoung Lee
Thai Le
Jinghui Chen
Dongwon Lee
103
76
0
15 Mar 2022
Long Document Summarization with Top-down and Bottom-up Inference
Bo Pang
Erik Nijkamp
Wojciech Kryściński
Silvio Savarese
Yingbo Zhou
Caiming Xiong
RALM, BDL
86
56
0
15 Mar 2022
ScienceWorld: Is your Agent Smarter than a 5th Grader?
Ruoyao Wang
Peter Alexander Jansen
Marc-Alexandre Côté
Prithviraj Ammanabrolu
LLMAG, ReLM, LRM
132
129
0
14 Mar 2022
Choose Your QA Model Wisely: A Systematic Study of Generative and Extractive Readers for Question Answering
Man Luo
Kazuma Hashimoto
Semih Yavuz
Zhiwei Liu
Chitta Baral
Yingbo Zhou
74
22
0
14 Mar 2022
Uncertainty Estimation for Language Reward Models
Adam Gleave
G. Irving
UQLM
84
34
0
14 Mar 2022
Switch Trajectory Transformer with Distributional Value Approximation for Multi-Task Reinforcement Learning
Qinjie Lin
Han Liu
B. Sengupta
OffRL
70
12
0
14 Mar 2022
CLIP Models are Few-shot Learners: Empirical Studies on VQA and Visual Entailment
Haoyu Song
Li Dong
Weinan Zhang
Ting Liu
Furu Wei
VLM, CLIP
89
139
0
14 Mar 2022
Disentangled Representation Learning for Text-Video Retrieval
Qiang Wang
Yanhao Zhang
Yun Zheng
Pan Pan
Xiansheng Hua
79
81
0
14 Mar 2022
HIE-SQL: History Information Enhanced Network for Context-Dependent Text-to-SQL Semantic Parsing
Yanzhao Zheng
Haibin Wang
B. Dong
Xingjun Wang
Changshan Li
97
35
0
14 Mar 2022
Uncertainty-Aware Text-to-Program for Question Answering on Structured Electronic Health Records
Daeyoung Kim
Seongsu Bae
S. Kim
Edward Choi
68
6
0
14 Mar 2022
Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models
Ning Ding
Yujia Qin
Guang Yang
Fu Wei
Zonghan Yang
...
Jianfei Chen
Yang Liu
Jie Tang
Juan Li
Maosong Sun
108
205
0
14 Mar 2022
Efficient Language Modeling with Sparse all-MLP
Ping Yu
Mikel Artetxe
Myle Ott
Sam Shleifer
Hongyu Gong
Ves Stoyanov
Xian Li
MoE
88
11
0
14 Mar 2022
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities
Hsiang-Sheng Tsai
Heng-Jui Chang
Wen-Chin Huang
Zili Huang
Kushal Lakhotia
...
Hsuan-Jui Chen
Shang-Wen Li
Shinji Watanabe
Abdel-rahman Mohamed
Hung-yi Lee
91
110
0
14 Mar 2022
Can pre-trained Transformers be used in detecting complex sensitive sentences? -- A Monsanto case study
Roelien C. Timmer
David Liebowitz
Surya Nepal
S. Kanhere
50
8
0
14 Mar 2022
Summarizing a virtual robot's past actions in natural language
Chad DeChant
Daniel Bauer
LM&Ro
62
4
0
13 Mar 2022
Towards Personalized Intelligence at Scale
Yiping Kang
Ashish Mahendra
Christopher Clarke
Lingjia Tang
Jason Mars
70
1
0
13 Mar 2022
Continual Prompt Tuning for Dialog State Tracking
Qi Zhu
Bing Li
Fei Mi
Xiaoyan Zhu
Minlie Huang
CLL, KELM
90
60
0
13 Mar 2022
Masked Autoencoders for Point Cloud Self-supervised Learning
Yatian Pang
Wenxiao Wang
Francis E. H. Tay
Wen Liu
Yonghong Tian
Liuliang Yuan
3DPC, ViT
117
483
0
13 Mar 2022
SummaReranker: A Multi-Task Mixture-of-Experts Re-ranking Framework for Abstractive Summarization
Mathieu Ravaut
Shafiq Joty
Nancy F. Chen
MoE
51
96
0
13 Mar 2022
Chart-to-Text: A Large-Scale Benchmark for Chart Summarization
Shankar Kanthara
Rixie Tiffany Ko Leong
Xiang Lin
Ahmed Masry
Megh Thakkar
Enamul Hoque
Shafiq Joty
110
150
0
12 Mar 2022
Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation
Wenliang Dai
Lu Hou
Lifeng Shang
Xin Jiang
Qun Liu
Pascale Fung
VLM
92
94
0
12 Mar 2022
Block-Recurrent Transformers
DeLesley S. Hutchins
Imanol Schlag
Yuhuai Wu
Ethan Dyer
Behnam Neyshabur
103
100
0
11 Mar 2022
Staged Training for Transformer Language Models
Sheng Shen
Pete Walsh
Kurt Keutzer
Jesse Dodge
Matthew E. Peters
Iz Beltagy
66
37
0
11 Mar 2022
LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrieval
Canwen Xu
Daya Guo
Nan Duan
Julian McAuley
RALM, VLM
81
48
0
11 Mar 2022
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
Mitchell Wortsman
Gabriel Ilharco
S. Gadre
Rebecca Roelofs
Raphael Gontijo-Lopes
...
Hongseok Namkoong
Ali Farhadi
Y. Carmon
Simon Kornblith
Ludwig Schmidt
MoMe
199
1,013
1
10 Mar 2022
IndicNLG Benchmark: Multilingual Datasets for Diverse NLG Tasks in Indic Languages
Aman Kumar
Himani Shrotriya
P. Sahu
Raj Dabre
Ratish Puduppully
Anoop Kunchukuttan
Amogh Mishra
Mitesh M. Khapra
Pratyush Kumar
101
42
0
10 Mar 2022
Compilable Neural Code Generation with Compiler Feedback
Xin Wang
Yasheng Wang
Yao Wan
Fei Mi
Yitong Li
Pingyi Zhou
Jin Liu
Hao Wu
Xin Jiang
Qun Liu
78
69
0
10 Mar 2022
PACTran: PAC-Bayesian Metrics for Estimating the Transferability of Pretrained Models to Classification Tasks
Nan Ding
Xi Chen
Tomer Levinboim
Soravit Changpinyo
Radu Soricut
79
29
0
10 Mar 2022
HealthPrompt: A Zero-shot Learning Paradigm for Clinical Natural Language Processing
Sonish Sivarajkumar
Yanshan Wang
VLM, LM&MA
103
58
0
09 Mar 2022
A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation
Yutong Chen
Fangyun Wei
Xiao Sun
Zhirong Wu
Stephen Lin
SLR
88
104
0
08 Mar 2022
InstructionNER: A Multi-Task Instruction-Based Generative Framework for Few-shot NER
Liwen Wang
Rumei Li
Yang Yan
Yuanmeng Yan
Sirui Wang
Wei Wu
Weiran Xu
77
55
0
08 Mar 2022
HyperPELT: Unified Parameter-Efficient Language Model Tuning for Both Language and Vision-and-Language Tasks
Zhengkun Zhang
Wenya Guo
Xiaojun Meng
Yasheng Wang
Yadao Wang
Xin Jiang
Qun Liu
Zhenglu Yang
80
17
0
08 Mar 2022
UniXcoder: Unified Cross-Modal Pre-training for Code Representation
Daya Guo
Shuai Lu
Nan Duan
Yanlin Wang
Ming Zhou
Jian Yin
96
609
0
08 Mar 2022
IT5: Text-to-text Pretraining for Italian Language Understanding and Generation
Gabriele Sarti
Malvina Nissim
AILaw
101
42
0
07 Mar 2022
Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Greg Yang
J. E. Hu
Igor Babuschkin
Szymon Sidor
Xiaodong Liu
David Farhi
Nick Ryder
J. Pachocki
Weizhu Chen
Jianfeng Gao
130
168
0
07 Mar 2022
What Did You Say? Task-Oriented Dialog Datasets Are Not Conversational!?
Alice Shoshana Jakobovits
Francesco Piccinno
Yasemin Altun
77
3
0
07 Mar 2022
Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models
Shengnan An
Yifei Li
Zeqi Lin
Qian Liu
Bei Chen
Qiang Fu
Weizhu Chen
Nanning Zheng
Jian-Guang Lou
VLM, AAML
93
43
0
07 Mar 2022
ClarET: Pre-training a Correlation-Aware Context-To-Event Transformer for Event-Centric Generation and Classification
Yucheng Zhou
Tao Shen
Xiubo Geng
Guodong Long
Daxin Jiang
112
60
0
04 Mar 2022
SimKGC: Simple Contrastive Knowledge Graph Completion with Pre-trained Language Models
Liang Wang
Wei Zhao
Zhuoyu Wei
Jingming Liu
96
186
0
04 Mar 2022