ResearchTrend.AI
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,870 papers shown
Title
RoFormer: Enhanced Transformer with Rotary Position Embedding
Jianlin Su
Yu Lu
Shengfeng Pan
Ahmed Murtadha
Bo Wen
Yunfeng Liu
363
2,546
0
20 Apr 2021
M2TR: Multi-modal Multi-scale Transformers for Deepfake Detection
Junke Wang
Zuxuan Wu
Wenhao Ouyang
Xintong Han
Jingjing Chen
Ser-Nam Lim
Yu-Gang Jiang
ViT
179
277
0
20 Apr 2021
Probing Commonsense Explanation in Dialogue Response Generation
Pei Zhou
Pegah Jandaghi
Bill Yuchen Lin
Justin Cho
Jay Pujara
Xiang Ren
LRM
157
17
0
19 Apr 2021
Understanding Chinese Video and Language via Contrastive Multimodal Pre-Training
Chenyi Lei
Shixian Luo
Yong Liu
Wanggui He
Jiamang Wang
Guoxin Wang
Haihong Tang
Chunyan Miao
Houqiang Li
60
42
0
19 Apr 2021
Latent-Optimized Adversarial Neural Transfer for Sarcasm Detection
Xu Guo
Boyang Albert Li
Han Yu
Chunyan Miao
AAML
94
18
0
19 Apr 2021
On the Influence of Masking Policies in Intermediate Pre-training
Qinyuan Ye
Belinda Z. Li
Sinong Wang
Benjamin Bolte
Hao Ma
Wen-tau Yih
Xiang Ren
Madian Khabsa
91
12
0
18 Apr 2021
CrossFit: A Few-shot Learning Challenge for Cross-task Generalization in NLP
Qinyuan Ye
Bill Yuchen Lin
Xiang Ren
301
185
0
18 Apr 2021
Flexible Generation of Natural Language Deductions
Kaj Bostrom
Xinyu Zhao
Swarat Chaudhuri
Greg Durrett
ReLM, LRM
317
33
0
18 Apr 2021
FedNLP: Benchmarking Federated Learning Methods for Natural Language Processing Tasks
Bill Yuchen Lin
Chaoyang He
ZiHang Zeng
Hulin Wang
Yufen Huang
Christophe Dupuy
Rahul Gupta
Mahdi Soltanolkotabi
Xiang Ren
Salman Avestimehr
FedML
88
116
0
18 Apr 2021
Misinfo Reaction Frames: Reasoning about Readers' Reactions to News Headlines
Saadia Gabriel
Skyler Hallinan
Maarten Sap
Pemi Nguyen
Franziska Roesner
Eunsol Choi
Yejin Choi
101
44
0
18 Apr 2021
Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order Sensitivity
Yao Lu
Max Bartolo
Alastair Moore
Sebastian Riedel
Pontus Stenetorp
AILaw, LRM
439
1,200
0
18 Apr 2021
Cross-Task Generalization via Natural Language Crowdsourcing Instructions
Swaroop Mishra
Daniel Khashabi
Chitta Baral
Hannaneh Hajishirzi
LRM
184
756
0
18 Apr 2021
Constrained Language Models Yield Few-Shot Semantic Parsers
Richard Shin
C. H. Lin
Sam Thomson
Charles C. Chen
Subhro Roy
Emmanouil Antonios Platanios
Adam Pauls
Dan Klein
J. Eisner
Benjamin Van Durme
395
206
0
18 Apr 2021
Improving Neural Model Performance through Natural Language Feedback on Their Explanations
Aman Madaan
Niket Tandon
Dheeraj Rajagopal
Yiming Yang
Peter Clark
Keisuke Sakaguchi
Eduard H. Hovy
ReLM, LRM
54
6
0
18 Apr 2021
Case-based Reasoning for Natural Language Queries over Knowledge Bases
Rajarshi Das
Manzil Zaheer
Dung Ngoc Thai
Ameya Godbole
Ethan Perez
Jay Yoon Lee
Lizhen Tan
L. Polymenakos
Andrew McCallum
111
169
0
18 Apr 2021
Documenting Large Webtext Corpora: A Case Study on the Colossal Clean Crawled Corpus
Jesse Dodge
Maarten Sap
Ana Marasović
William Agnew
Gabriel Ilharco
Dirk Groeneveld
Margaret Mitchell
Matt Gardner
AILaw
126
455
0
18 Apr 2021
Generative Context Pair Selection for Multi-hop Question Answering
Dheeru Dua
Cicero Nogueira dos Santos
Patrick Ng
Ben Athiwaratkun
Bing Xiang
Matt Gardner
Sameer Singh
LRM
30
2
0
18 Apr 2021
Can NLI Models Verify QA Systems' Predictions?
Jifan Chen
Eunsol Choi
Greg Durrett
137
54
0
18 Apr 2021
GooAQ: Open Question Answering with Diverse Answer Types
Daniel Khashabi
Amos Ng
Tushar Khot
Ashish Sabharwal
Hannaneh Hajishirzi
Chris Callison-Burch
90
54
0
18 Apr 2021
Extract, Denoise and Enforce: Evaluating and Improving Concept Preservation for Text-to-Text Generation
Yuning Mao
Wenchang Ma
Deren Lei
Jiawei Han
Xiang Ren
100
4
0
18 Apr 2021
A Simple and Effective Positional Encoding for Transformers
Pu-Chin Chen
Henry Tsai
Srinadh Bhojanapalli
Hyung Won Chung
Yin-Wen Chang
Chun-Sung Ferng
120
66
0
18 Apr 2021
MT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs
Zewen Chi
Li Dong
Shuming Ma
Shaohan Huang
Xian-Ling Mao
Heyan Huang
Furu Wei
LRM
123
74
0
18 Apr 2021
The Power of Scale for Parameter-Efficient Prompt Tuning
Brian Lester
Rami Al-Rfou
Noah Constant
VPVLM
682
4,119
0
18 Apr 2021
Worst of Both Worlds: Biases Compound in Pre-trained Vision-and-Language Models
Tejas Srinivasan
Yonatan Bisk
VLM
83
56
0
18 Apr 2021
BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models
Nandan Thakur
Nils Reimers
Andreas Rucklé
Abhishek Srivastava
Iryna Gurevych
VLM
439
1,059
0
17 Apr 2021
Explaining Answers with Entailment Trees
Bhavana Dalvi
Peter Alexander Jansen
Oyvind Tafjord
Zhengnan Xie
Hannah Smith
Leighanna Pipatanangkura
Peter Clark
ReLM, FAtt, LRM
306
186
0
17 Apr 2021
TransVG: End-to-End Visual Grounding with Transformers
Jiajun Deng
Zhengyuan Yang
Tianlang Chen
Wen-gang Zhou
Houqiang Li
ViT
107
348
0
17 Apr 2021
Joint Passage Ranking for Diverse Multi-Answer Retrieval
Sewon Min
Kenton Lee
Ming-Wei Chang
Kristina Toutanova
Hannaneh Hajishirzi
113
42
0
17 Apr 2021
ESTER: A Machine Reading Comprehension Dataset for Event Semantic Relation Reasoning
Rujun Han
I-Hung Hsu
Jiao Sun
J. Baylón
Qiang Ning
Dan Roth
Nanyun Peng
71
46
0
16 Apr 2021
AMMU : A Survey of Transformer-based Biomedical Pretrained Language Models
Katikapalli Subramanyam Kalyan
A. Rajasekharan
S. Sangeetha
LM&MA, MedIm
115
171
0
16 Apr 2021
Learning to Reason for Text Generation from Scientific Tables
N. Moosavi
Andreas Rucklé
Dan Roth
Iryna Gurevych
LMTD, LRM
105
20
0
16 Apr 2021
$Q^{2}$: Evaluating Factual Consistency in Knowledge-Grounded Dialogues via Question Generation and Question Answering
Or Honovich
Leshem Choshen
Roee Aharoni
Ella Neeman
Idan Szpektor
Omri Abend
HILM
104
143
0
16 Apr 2021
Editing Factual Knowledge in Language Models
Nicola De Cao
Wilker Aziz
Ivan Titov
KELM
136
513
0
16 Apr 2021
Back to Square One: Artifact Detection, Training and Commonsense Disentanglement in the Winograd Schema
Yanai Elazar
Hongming Zhang
Yoav Goldberg
Dan Roth
ReLM, LRM
128
44
0
16 Apr 2021
LU-BZU at SemEval-2021 Task 2: Word2Vec and Lemma2Vec performance in Arabic Word-in-Context disambiguation
Moustafa Al-Hajj
Mustafa Jarrar
67
15
0
16 Apr 2021
Generating Bug-Fixes Using Pretrained Transformers
Dawn Drain
Chen Henry Wu
Alexey Svyatkovskiy
Neel Sundaresan
74
51
0
16 Apr 2021
Probing Across Time: What Does RoBERTa Know and When?
Leo Z. Liu
Yizhong Wang
Jungo Kasai
Hannaneh Hajishirzi
Noah A. Smith
KELM
114
88
0
16 Apr 2021
ZeRO-Infinity: Breaking the GPU Memory Wall for Extreme Scale Deep Learning
Samyam Rajbhandari
Olatunji Ruwase
Jeff Rasley
Shaden Smith
Yuxiong He
GNN
101
393
0
16 Apr 2021
Does BERT Pretrained on Clinical Notes Reveal Sensitive Data?
Eric P. Lehman
Sarthak Jain
Karl Pichotta
Yoav Goldberg
Byron C. Wallace
OOD, MIACV
83
121
0
15 Apr 2021
A Survey of Recent Abstract Summarization Techniques
Diyah Puspitaningrum
24
7
0
15 Apr 2021
ExplaGraphs: An Explanation Graph Generation Task for Structured Commonsense Reasoning
Swarnadeep Saha
Prateek Yadav
Lisa Bauer
Joey Tianyi Zhou
LRM
90
59
0
15 Apr 2021
Planning with Learned Entity Prompts for Abstractive Summarization
Shashi Narayan
Yao-Min Zhao
Joshua Maynez
Gonçalo Simões
Vitaly Nikolaev
Ryan T. McDonald
LRM
94
120
0
15 Apr 2021
SummVis: Interactive Visual Analysis of Models, Data, and Evaluation for Text Summarization
Jesse Vig
Wojciech Kryściński
Karan Goel
Nazneen Rajani
69
22
0
15 Apr 2021
Retrieval Augmentation Reduces Hallucination in Conversation
Kurt Shuster
Spencer Poff
Moya Chen
Douwe Kiela
Jason Weston
HILM
127
751
0
15 Apr 2021
Hierarchical Learning for Generation with Long Source Sequences
T. Rohde
Xiaoxia Wu
Yinhan Liu
BDL, VLM
76
56
0
15 Apr 2021
Generating Datasets with Pretrained Language Models
Timo Schick
Hinrich Schütze
173
235
0
15 Apr 2021
Unlocking Compositional Generalization in Pre-trained Models Using Intermediate Representations
Jonathan Herzig
Peter Shaw
Ming-Wei Chang
Kelvin Guu
Panupong Pasupat
Yuan Zhang
AI4CE
76
69
0
15 Apr 2021
XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation
Sebastian Ruder
Noah Constant
Jan A. Botha
Aditya Siddhant
Orhan Firat
...
Pengfei Liu
Junjie Hu
Dan Garrette
Graham Neubig
Melvin Johnson
ELM, AAML, LRM
93
190
0
15 Apr 2021
NT5?! Training T5 to Perform Numerical Reasoning
Peng Yang
Ying Chen
Yuechan Chen
Daniel Cer
AIMat, LRM
75
15
0
15 Apr 2021
Sentence-Permuted Paragraph Generation
Wenhao Yu
Chenguang Zhu
Tong Zhao
Zhichun Guo
Meng Jiang
50
11
0
15 Apr 2021