ResearchTrend.AI
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,870 papers shown
Controllable Semantic Parsing via Retrieval Augmentation
Panupong Pasupat
Yuan Zhang
Kelvin Guu
206
48
0
16 Oct 2021
Knowledge Enhanced Pretrained Language Models: A Compreshensive Survey
Xiaokai Wei
Shen Wang
Dejiao Zhang
Parminder Bhatia
Andrew O. Arnold
KELM
93
46
0
16 Oct 2021
Open Domain Question Answering with A Unified Knowledge Interface
Kaixin Ma
Hao Cheng
Xiaodong Liu
Eric Nyberg
Jianfeng Gao
RALM
199
40
0
16 Oct 2021
Evaluating the Faithfulness of Importance Measures in NLP by Recursively Masking Allegedly Important Tokens and Retraining
Andreas Madsen
Nicholas Meade
Vaibhav Adlakha
Siva Reddy
154
37
0
15 Oct 2021
Generated Knowledge Prompting for Commonsense Reasoning
Jiacheng Liu
Alisa Liu
Ximing Lu
Sean Welleck
Peter West
Ronan Le Bras
Yejin Choi
Hannaneh Hajishirzi
KELM, RALM, ReLM, LLMAG, LRM
147
326
0
15 Oct 2021
Detecting Gender Bias in Transformer-based Models: A Case Study on BERT
Bingbing Li
Hongwu Peng
Rajat Sainju
Junhuan Yang
Lei Yang
Yueying Liang
Weiwen Jiang
Binghui Wang
Hang Liu
Caiwen Ding
49
13
0
15 Oct 2021
Towards Transparent Interactive Semantic Parsing via Step-by-Step Correction
Lingbo Mo
A. Lewis
Huan Sun
Michael White
KELM
82
14
0
15 Oct 2021
Control Prefixes for Parameter-Efficient Text Generation
Jordan Clive
Kris Cao
Marek Rei
125
32
0
15 Oct 2021
On Learning the Transformer Kernel
Sankalan Pal Chowdhury
Adamos Solomou
Kumar Avinava Dubey
Mrinmaya Sachan
ViT
131
14
0
15 Oct 2021
DialFact: A Benchmark for Fact-Checking in Dialogue
Prakhar Gupta
Chien-Sheng Wu
Wenhao Liu
Caiming Xiong
HILM
65
63
0
15 Oct 2021
Why don't people use character-level machine translation?
Jindrich Libovický
Helmut Schmid
Alexander Fraser
135
29
0
15 Oct 2021
MixQG: Neural Question Generation with Mixed Answer Types
Lidiya Murakhovs'ka
Chien-Sheng Wu
Philippe Laban
Tong Niu
Wenhao Liu
Caiming Xiong
86
48
0
15 Oct 2021
Rewire-then-Probe: A Contrastive Recipe for Probing Biomedical Knowledge of Pre-trained Language Models
Zaiqiao Meng
Fangyu Liu
Ehsan Shareghi
Yixuan Su
Charlotte Collins
Nigel Collier
93
28
0
15 Oct 2021
DYLE: Dynamic Latent Extraction for Abstractive Long-Input Summarization
Ziming Mao
Chen Henry Wu
Ansong Ni
Yusen Zhang
Rui Zhang
Tao Yu
Budhaditya Deb
Chenguang Zhu
Ahmed Hassan Awadallah
Dragomir R. Radev
99
57
0
15 Oct 2021
Few-Shot Bot: Prompt-Based Learning for Dialogue Systems
Andrea Madotto
Zhaojiang Lin
Genta Indra Winata
Pascale Fung
93
85
0
15 Oct 2021
SPoT: Better Frozen Model Adaptation through Soft Prompt Transfer
Tu Vu
Brian Lester
Noah Constant
Rami Al-Rfou
Daniel Cer
VLM, LRM
223
290
0
15 Oct 2021
Modeling Endorsement for Multi-Document Abstractive Summarization
Logan Lebanoff
Bingqing Wang
Z. Feng
Fei Liu
343
4
0
15 Oct 2021
P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks
Xiao Liu
Kaixuan Ji
Yicheng Fu
Weng Lam Tam
Zhengxiao Du
Zhilin Yang
Jie Tang
VLM
301
863
0
14 Oct 2021
Retrieval-guided Counterfactual Generation for QA
Bhargavi Paranjape
Matthew Lamm
Ian Tenney
94
31
0
14 Oct 2021
Can Machines Learn Morality? The Delphi Experiment
Liwei Jiang
Jena D. Hwang
Chandra Bhagavatula
Ronan Le Bras
Jenny T Liang
...
Yulia Tsvetkov
Oren Etzioni
Maarten Sap
Regina A. Rini
Yejin Choi
FaML
211
122
0
14 Oct 2021
MReD: A Meta-Review Dataset for Structure-Controllable Text Generation
Chenhui Shen
Liying Cheng
Ran Zhou
Lidong Bing
Yang You
Luo Si
175
37
0
14 Oct 2021
Towards More Effective and Economic Sparsely-Activated Model
Hao Jiang
Ke Zhan
Jianwei Qu
Yongkang Wu
Zhaoye Fei
...
Enrui Hu
Yinxia Zhang
Yantao Jia
Fan Yu
Bo Zhao
MoE
229
13
0
14 Oct 2021
Plug-Tagger: A Pluggable Sequence Labeling Framework Using Language Models
Xin Zhou
Ruotian Ma
Tao Gui
Y. Tan
Qi Zhang
Xuanjing Huang
VLM
68
5
0
14 Oct 2021
Solving Aspect Category Sentiment Analysis as a Text Generation Task
Jian Liu
Zhiyang Teng
Leyang Cui
Hanmeng Liu
Yue Zhang
133
72
0
14 Oct 2021
LFPT5: A Unified Framework for Lifelong Few-shot Language Learning Based on Prompt Tuning of T5
Chengwei Qin
Shafiq Joty
CLL
216
104
0
14 Oct 2021
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing
Junyi Ao
Rui Wang
Long Zhou
Chengyi Wang
Shuo Ren
...
Yu Zhang
Zhihua Wei
Yao Qian
Jinyu Li
Furu Wei
162
202
0
14 Oct 2021
Semantically Distributed Robust Optimization for Vision-and-Language Inference
Tejas Gokhale
A. Chaudhary
Pratyay Banerjee
Chitta Baral
Yezhou Yang
126
17
0
14 Oct 2021
bert2BERT: Towards Reusable Pretrained Language Models
Cheng Chen
Yichun Yin
Lifeng Shang
Xin Jiang
Yujia Qin
Fengyu Wang
Zhi Wang
Xiao Chen
Zhiyuan Liu
Qun Liu
VLM
85
64
0
14 Oct 2021
Towards Efficient NLP: A Standard Evaluation and A Strong Baseline
Xiangyang Liu
Tianxiang Sun
Junliang He
Jiawen Wu
Lingling Wu
Xinyu Zhang
Hao Jiang
Bo Zhao
Xuanjing Huang
Xipeng Qiu
ELM
85
47
0
13 Oct 2021
Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits
Julia Kreutzer
David Vilar
Artem Sokolov
99
15
0
13 Oct 2021
ConditionalQA: A Complex Reading Comprehension Dataset with Conditional Answers
Haitian Sun
William W. Cohen
Ruslan Salakhutdinov
84
33
0
13 Oct 2021
Leveraging redundancy in attention with Reuse Transformers
Srinadh Bhojanapalli
Ayan Chakrabarti
Andreas Veit
Michal Lukasik
Himanshu Jain
Frederick Liu
Yin-Wen Chang
Sanjiv Kumar
52
27
0
13 Oct 2021
SGD-X: A Benchmark for Robust Generalization in Schema-Guided Dialogue Systems
Harrison Lee
Raghav Gupta
Abhinav Rastogi
Yuan Cao
Bin Zhang
Yonghui Wu
129
33
0
13 Oct 2021
Mengzi: Towards Lightweight yet Ingenious Pre-trained Models for Chinese
Zhuosheng Zhang
Hanqing Zhang
Keming Chen
Yuhang Guo
Jingyun Hua
Yulong Wang
Ming Zhou
VLM
110
72
0
13 Oct 2021
Truthful AI: Developing and governing AI that does not lie
Owain Evans
Owen Cotton-Barratt
Lukas Finnveden
Adam Bales
Avital Balwit
Peter Wills
Luca Righetti
William Saunders
HILM
302
117
0
13 Oct 2021
MSP: Multi-Stage Prompting for Making Pre-trained Language Models Better Translators
Zhixing Tan
Xiangwen Zhang
Shuo Wang
Yang Liu
VLM, LRM
285
53
0
13 Oct 2021
Dict-BERT: Enhancing Language Model Pre-training with Dictionary
Wenhao Yu
Chenguang Zhu
Yuwei Fang
Donghan Yu
Shuohang Wang
Yichong Xu
Michael Zeng
Meng Jiang
123
65
0
13 Oct 2021
Attention-guided Generative Models for Extractive Question Answering
Peng Xu
Davis Liang
Zhiheng Huang
Bing Xiang
94
18
0
12 Oct 2021
Learning Compact Metrics for MT
Amy Pu
Hyung Won Chung
Ankur P. Parikh
Sebastian Gehrmann
Thibault Sellam
77
101
0
12 Oct 2021
LiST: Lite Prompted Self-training Makes Parameter-Efficient Few-shot Learners
Yaqing Wang
Subhabrata Mukherjee
Xiaodong Liu
Jing Gao
Ahmed Hassan Awadallah
Jianfeng Gao
VLM, BDL
106
11
0
12 Oct 2021
TCube: Domain-Agnostic Neural Time-series Narration
Mandar Sharma
J. Brownstein
Naren Ramakrishnan
AI4TS
41
7
0
11 Oct 2021
Unsupervised Neural Machine Translation with Generative Language Models Only
Jesse Michael Han
Igor Babuschkin
Harrison Edwards
Arvind Neelakantan
Tao Xu
...
Alex Ray
Pranav Shyam
Aditya A. Ramesh
Alec Radford
Ilya Sutskever
117
37
0
11 Oct 2021
Multi-Task Learning for Situated Multi-Domain End-to-End Dialogue Systems
Po-Nien Kung
Chung-Cheng Chang
Tse-Hsuan Yang
H. Hsu
Yu-Jia Liou
Yun-Nung Chen
65
6
0
11 Oct 2021
Pre-trained Language Models in Biomedical Domain: A Systematic Survey
Benyou Wang
Qianqian Xie
Jiahuan Pei
Zhihong Chen
Prayag Tiwari
Zhao Li
Jie Fu
LM&MA, AI4CE
154
172
0
11 Oct 2021
Advances in Multi-turn Dialogue Comprehension: A Survey
Zhuosheng Zhang
Hai Zhao
99
21
0
11 Oct 2021
Language Models As or For Knowledge Bases
Simon Razniewski
Andrew Yates
Nora Kassner
Gerhard Weikum
KELM
80
1
0
10 Oct 2021
Yuan 1.0: Large-Scale Pre-trained Language Model in Zero-Shot and Few-Shot Learning
Shaohua Wu
Xudong Zhao
Tong Yu
Rongguo Zhang
C. Shen
...
Feng Li
Hong Zhu
Jiangang Luo
Liang Xu
Xuanwei Zhang
ALM
69
61
0
10 Oct 2021
Disentangled Sequence to Sequence Learning for Compositional Generalization
Hao Zheng
Mirella Lapata
CoGe, DRL
92
39
0
09 Oct 2021
Learning to Follow Language Instructions with Compositional Policies
Vanya Cohen
Geraud Nangue Tasse
N. Gopalan
Steven D. James
Matthew C. Gombolay
Benjamin Rosman
50
4
0
09 Oct 2021
Generating Disentangled Arguments with Prompts: A Simple Event Extraction Framework that Works
Jinghui Si
Xutan Peng
Chen Li
Haotian Xu
Jianxin Li
109
10
0
09 Oct 2021