ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,870 papers shown
Title
Defending Against Backdoor Attacks in Natural Language Generation
Defending Against Backdoor Attacks in Natural Language Generation
Xiaofei Sun
Xiaoya Li
Yuxian Meng
Xiang Ao
Leilei Gan
Jiwei Li
Tianwei Zhang
AAMLSILM
103
52
0
03 Jun 2021
Reordering Examples Helps during Priming-based Few-Shot Learning
Reordering Examples Helps during Priming-based Few-Shot Learning
Sawan Kumar
Partha P. Talukdar
82
58
0
03 Jun 2021
Few-shot Knowledge Graph-to-Text Generation with Pretrained Language
  Models
Few-shot Knowledge Graph-to-Text Generation with Pretrained Language Models
Junyi Li
Tianyi Tang
Wayne Xin Zhao
Zhicheng Wei
N. Yuan
Ji-Rong Wen
79
49
0
03 Jun 2021
Can Generative Pre-trained Language Models Serve as Knowledge Bases for
  Closed-book QA?
Can Generative Pre-trained Language Models Serve as Knowledge Bases for Closed-book QA?
Cunxiang Wang
Pai Liu
Yue Zhang
RALM
104
84
0
03 Jun 2021
Question Answering Over Temporal Knowledge Graphs
Question Answering Over Temporal Knowledge Graphs
Apoorv Saxena
Soumen Chakrabarti
Partha P. Talukdar
AI4MH
118
139
0
03 Jun 2021
Ethical-Advice Taker: Do Language Models Understand Natural Language
  Interventions?
Ethical-Advice Taker: Do Language Models Understand Natural Language Interventions?
Jieyu Zhao
Daniel Khashabi
Tushar Khot
Ashish Sabharwal
Kai-Wei Chang
KELM
87
53
0
02 Jun 2021
multiPRover: Generating Multiple Proofs for Improved Interpretability in
  Rule Reasoning
multiPRover: Generating Multiple Proofs for Improved Interpretability in Rule Reasoning
Swarnadeep Saha
Prateek Yadav
Joey Tianyi Zhou
ReLMLRM
92
26
0
02 Jun 2021
A Unified Generative Framework for Various NER Subtasks
A Unified Generative Framework for Various NER Subtasks
Hang Yan
Tao Gui
Junqi Dai
Qipeng Guo
Zheng Zhang
Xipeng Qiu
92
298
0
02 Jun 2021
COM2SENSE: A Commonsense Reasoning Benchmark with Complementary
  Sentences
COM2SENSE: A Commonsense Reasoning Benchmark with Complementary Sentences
Shikhar Singh
Nuan Wen
Yu Hou
Pegah Alipoormolabashi
Te-Lin Wu
Xuezhe Ma
Nanyun Peng
LRM
98
59
0
02 Jun 2021
Answer Generation for Retrieval-based Question Answering Systems
Answer Generation for Retrieval-based Question Answering Systems
Chao-Chun Hsu
Eric Lind
Luca Soldaini
Alessandro Moschitti
68
26
0
02 Jun 2021
Claim Matching Beyond English to Scale Global Fact-Checking
Claim Matching Beyond English to Scale Global Fact-Checking
Ashkan Kazemi
Kiran Garimella
Devin Gaffney
Scott A. Hale
77
60
0
01 Jun 2021
ConvoSumm: Conversation Summarization Benchmark and Improved Abstractive
  Summarization with Argument Mining
ConvoSumm: Conversation Summarization Benchmark and Improved Abstractive Summarization with Argument Mining
Alexander R. Fabbri
Faiaz Rahman
Imad Rizvi
Borui Wang
Haoran Li
Yashar Mehdad
Dragomir R. Radev
96
65
0
01 Jun 2021
Implicit Representations of Meaning in Neural Language Models
Implicit Representations of Meaning in Neural Language Models
Belinda Z. Li
Maxwell Nye
Jacob Andreas
NAIMILM
67
177
0
01 Jun 2021
CIDER: Commonsense Inference for Dialogue Explanation and Reasoning
CIDER: Commonsense Inference for Dialogue Explanation and Reasoning
Deepanway Ghosal
Pengfei Hong
Siqi Shen
Navonil Majumder
Rada Mihalcea
Soujanya Poria
86
23
0
01 Jun 2021
PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D
  World
PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World
Rowan Zellers
Ari Holtzman
Matthew E. Peters
Roozbeh Mottaghi
Aniruddha Kembhavi
Ali Farhadi
Yejin Choi
108
69
0
01 Jun 2021
HiddenCut: Simple Data Augmentation for Natural Language Understanding
  with Better Generalization
HiddenCut: Simple Data Augmentation for Natural Language Understanding with Better Generalization
Jiaao Chen
Dinghan Shen
Weizhu Chen
Diyi Yang
BDL
74
48
0
31 May 2021
MSG-Transformer: Exchanging Local Spatial Information by Manipulating
  Messenger Tokens
MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens
Jiemin Fang
Lingxi Xie
Xinggang Wang
Xiaopeng Zhang
Wenyu Liu
Qi Tian
ViT
73
77
0
31 May 2021
M6-T: Exploring Sparse Expert Models and Beyond
M6-T: Exploring Sparse Expert Models and Beyond
An Yang
Junyang Lin
Rui Men
Chang Zhou
Le Jiang
...
Dingyang Zhang
Wei Lin
Lin Qu
Jingren Zhou
Hongxia Yang
MoE
122
24
0
31 May 2021
SemEval-2021 Task 4: Reading Comprehension of Abstract Meaning
SemEval-2021 Task 4: Reading Comprehension of Abstract Meaning
Boyuan Zheng
Xiaoyu Yang
Yu-Ping Ruan
Zhen-Hua Ling
Quan Liu
Si Wei
Xiao-Dan Zhu
ELM
44
13
0
31 May 2021
Transfer Learning for Sequence Generation: from Single-source to
  Multi-source
Transfer Learning for Sequence Generation: from Single-source to Multi-source
Xuancheng Huang
Jingfang Xu
Maosong Sun
Yang Liu
57
5
0
31 May 2021
On Compositional Generalization of Neural Machine Translation
On Compositional Generalization of Neural Machine Translation
Yafu Li
Yongjing Yin
Yulong Chen
Yue Zhang
233
46
0
31 May 2021
On the Interplay Between Fine-tuning and Composition in Transformers
On the Interplay Between Fine-tuning and Composition in Transformers
Lang-Chi Yu
Allyson Ettinger
77
14
0
31 May 2021
LEAP: Learnable Pruning for Transformer-based Models
LEAP: Learnable Pruning for Transformer-based Models
Z. Yao
Xiaoxia Wu
Linjian Ma
Sheng Shen
Kurt Keutzer
Michael W. Mahoney
Yuxiong He
60
7
0
30 May 2021
StyTr$^2$: Image Style Transfer with Transformers
StyTr2^22: Image Style Transfer with Transformers
Yingying Deng
Fan Tang
Weiming Dong
Chongyang Ma
Xingjia Pan
Lei Wang
Changsheng Xu
ViT
117
268
0
30 May 2021
Gaze Estimation using Transformer
Gaze Estimation using Transformer
Yihua Cheng
Feng Lu
ViT
80
94
0
30 May 2021
Towards mental time travel: a hierarchical memory for reinforcement
  learning agents
Towards mental time travel: a hierarchical memory for reinforcement learning agents
Andrew Kyle Lampinen
Stephanie C. Y. Chan
Andrea Banino
Felix Hill
92
47
0
28 May 2021
Knowledge Inheritance for Pre-trained Language Models
Knowledge Inheritance for Pre-trained Language Models
Yujia Qin
Yankai Lin
Jing Yi
Jiajie Zhang
Xu Han
...
Yusheng Su
Zhiyuan Liu
Peng Li
Maosong Sun
Jie Zhou
VLM
78
50
0
28 May 2021
Early Exiting with Ensemble Internal Classifiers
Early Exiting with Ensemble Internal Classifiers
Tianxiang Sun
Yunhua Zhou
Xiangyang Liu
Xinyu Zhang
Hao Jiang
Bo Zhao
Xuanjing Huang
Xipeng Qiu
70
31
0
28 May 2021
ByT5: Towards a token-free future with pre-trained byte-to-byte models
ByT5: Towards a token-free future with pre-trained byte-to-byte models
Linting Xue
Aditya Barua
Noah Constant
Rami Al-Rfou
Sharan Narang
Mihir Kale
Adam Roberts
Colin Raffel
141
508
0
28 May 2021
SciFive: a text-to-text transformer model for biomedical literature
SciFive: a text-to-text transformer model for biomedical literature
Long Phan
J. Anibal
H. Tran
Shaurya Chanana
Erol Bahadroglu
Alec Peltekian
G. Altan-Bonnet
MedIm
71
151
0
28 May 2021
Inspecting the concept knowledge graph encoded by modern language models
Inspecting the concept knowledge graph encoded by modern language models
Carlos Aspillaga
Marcelo Mendoza
Alvaro Soto
72
13
0
27 May 2021
Synthetic Data Generation for Grammatical Error Correction with Tagged
  Corruption Models
Synthetic Data Generation for Grammatical Error Correction with Tagged Corruption Models
Felix Stahlberg
Shankar Kumar
SyDa
137
97
0
27 May 2021
Contrastive Fine-tuning Improves Robustness for Neural Rankers
Contrastive Fine-tuning Improves Robustness for Neural Rankers
Xiaofei Ma
Cicero Nogueira dos Santos
Andrew O. Arnold
111
20
0
27 May 2021
Sequence Parallelism: Long Sequence Training from System Perspective
Sequence Parallelism: Long Sequence Training from System Perspective
Shenggui Li
Fuzhao Xue
Chaitanya Baranwal
Yongbin Li
Yang You
92
103
0
26 May 2021
TreeBERT: A Tree-Based Pre-Trained Model for Programming Language
TreeBERT: A Tree-Based Pre-Trained Model for Programming Language
Xue Jiang
Zhuoran Zheng
Chen Lyu
Liang Li
Lei Lyu
85
91
0
26 May 2021
LMMS Reloaded: Transformer-based Sense Embeddings for Disambiguation and
  Beyond
LMMS Reloaded: Transformer-based Sense Embeddings for Disambiguation and Beyond
Daniel Loureiro
A. Jorge
Jose Camacho-Collados
88
26
0
26 May 2021
Super Tickets in Pre-Trained Language Models: From Model Compression to
  Improving Generalization
Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization
Chen Liang
Simiao Zuo
Minshuo Chen
Haoming Jiang
Xiaodong Liu
Pengcheng He
T. Zhao
Weizhu Chen
59
69
0
25 May 2021
Focus Attention: Promoting Faithfulness and Diversity in Summarization
Focus Attention: Promoting Faithfulness and Diversity in Summarization
Rahul Aralikatte
Shashi Narayan
Joshua Maynez
S. Rothe
Ryan T. McDonald
112
46
0
25 May 2021
Estimating Redundancy in Clinical Text
Estimating Redundancy in Clinical Text
Thomas Searle
Zina M. Ibrahim
J. Teo
Richard J. B. Dobson
67
21
0
25 May 2021
PTR: Prompt Tuning with Rules for Text Classification
PTR: Prompt Tuning with Rules for Text Classification
Xu Han
Weilin Zhao
Ning Ding
Zhiyuan Liu
Maosong Sun
VLM
106
532
0
24 May 2021
Pretrained Language Models for Text Generation: A Survey
Pretrained Language Models for Text Generation: A Survey
Junyi Li
Tianyi Tang
Wayne Xin Zhao
Ji-Rong Wen
LM&MAVLMSyDa
106
191
0
21 May 2021
Dynaboard: An Evaluation-As-A-Service Platform for Holistic
  Next-Generation Benchmarking
Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking
Zhiyi Ma
Kawin Ethayarajh
Tristan Thrush
Somya Jain
Ledell Yu Wu
Robin Jia
Christopher Potts
Adina Williams
Douwe Kiela
ELM
115
59
0
21 May 2021
KLUE: Korean Language Understanding Evaluation
KLUE: Korean Language Understanding Evaluation
Sungjoon Park
Jihyung Moon
Sungdong Kim
Won Ik Cho
Jiyoon Han
...
Seonghyun Kim
Lucy Park
Alice Oh
Jung-Woo Ha
Kyunghyun Cho
ELMVLM
121
198
0
20 May 2021
DeepDebug: Fixing Python Bugs Using Stack Traces, Backtranslation, and
  Code Skeletons
DeepDebug: Fixing Python Bugs Using Stack Traces, Backtranslation, and Code Skeletons
Dawn Drain
Colin B. Clement
Guillermo Serrato
Neel Sundaresan
71
31
0
19 May 2021
Exploring Text-to-Text Transformers for English to Hinglish Machine
  Translation with Synthetic Code-Mixing
Exploring Text-to-Text Transformers for English to Hinglish Machine Translation with Synthetic Code-Mixing
Ganesh Jawahar
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
L. Lakshmanan
95
30
0
18 May 2021
CoTexT: Multi-task Learning with Code-Text Transformer
CoTexT: Multi-task Learning with Code-Text Transformer
Long Phan
H. Tran
Daniel Le
Hieu Duy Nguyen
J. Anibal
Alec Peltekian
Yanfang Ye
92
136
0
18 May 2021
Relative Positional Encoding for Transformers with Linear Complexity
Relative Positional Encoding for Transformers with Linear Complexity
Antoine Liutkus
Ondřej Cífka
Shih-Lun Wu
Umut Simsekli
Yi-Hsuan Yang
Gaël Richard
84
48
0
18 May 2021
BookSum: A Collection of Datasets for Long-form Narrative Summarization
BookSum: A Collection of Datasets for Long-form Narrative Summarization
Wojciech Kry'sciñski
Nazneen Rajani
Divyansh Agarwal
Caiming Xiong
Dragomir R. Radev
RALM
116
154
0
18 May 2021
SHARE: a System for Hierarchical Assistive Recipe Editing
SHARE: a System for Hierarchical Assistive Recipe Editing
Shuyang Li
Yufei Li
Jianmo Ni
Julian McAuley
52
20
0
17 May 2021
Pay Attention to MLPs
Pay Attention to MLPs
Hanxiao Liu
Zihang Dai
David R. So
Quoc V. Le
AI4CE
159
669
0
17 May 2021
Previous
123...181182183...196197198
Next