ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,866 papers shown
Title
A Self-supervised Approach for Semantic Indexing in the Context of
  COVID-19 Pandemic
A Self-supervised Approach for Semantic Indexing in the Context of COVID-19 Pandemic
Nima Ebadi
Peyman Najafirad
OOD
37
2
0
07 Oct 2020
Galileo at SemEval-2020 Task 12: Multi-lingual Learning for Offensive
  Language Identification using Pre-trained Language Models
Galileo at SemEval-2020 Task 12: Multi-lingual Learning for Offensive Language Identification using Pre-trained Language Models
Shuohuan Wang
Jiaxiang Liu
Ouyang Xuan
Yu Sun
68
36
0
07 Oct 2020
TeaForN: Teacher-Forcing with N-grams
TeaForN: Teacher-Forcing with N-grams
Sebastian Goodman
Nan Ding
Radu Soricut
72
19
0
07 Oct 2020
Toward Stance-based Personas for Opinionated Dialogues
Toward Stance-based Personas for Opinionated Dialogues
Thomas Scialom
Serra Sinem Tekiroğlu
Jacopo Staiano
Marco Guerini
79
9
0
07 Oct 2020
Improving the Efficiency of Grammatical Error Correction with Erroneous
  Span Detection and Correction
Improving the Efficiency of Grammatical Error Correction with Erroneous Span Detection and Correction
M. Chen
Tao Ge
Xingxing Zhang
Furu Wei
M. Zhou
87
47
0
07 Oct 2020
PyMT5: multi-mode translation of natural language and Python code with
  transformers
PyMT5: multi-mode translation of natural language and Python code with transformers
Colin B. Clement
Dawn Drain
Jonathan Timcheck
Alexey Svyatkovskiy
Neel Sundaresan
84
156
0
07 Oct 2020
Support-set bottlenecks for video-text representation learning
Support-set bottlenecks for video-text representation learning
Mandela Patrick
Po-Yao (Bernie) Huang
Yuki M. Asano
Florian Metze
Alexander G. Hauptmann
João Henriques
Andrea Vedaldi
103
249
0
06 Oct 2020
A Transformer-based Framework for Multivariate Time Series
  Representation Learning
A Transformer-based Framework for Multivariate Time Series Representation Learning
George Zerveas
Srideepika Jayaraman
Dhaval Patel
A. Bhamidipaty
Carsten Eickhoff
AI4TS
109
947
0
06 Oct 2020
Neural Mask Generator: Learning to Generate Adaptive Word Maskings for
  Language Model Adaptation
Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation
Minki Kang
Moonsu Han
Sung Ju Hwang
OOD
73
18
0
06 Oct 2020
Universal Natural Language Processing with Limited Annotations: Try
  Few-shot Textual Entailment as a Start
Universal Natural Language Processing with Limited Annotations: Try Few-shot Textual Entailment as a Start
Wenpeng Yin
Nazneen Rajani
Dragomir R. Radev
R. Socher
Caiming Xiong
90
69
0
06 Oct 2020
Efficient Meta Lifelong-Learning with Limited Memory
Efficient Meta Lifelong-Learning with Limited Memory
Zirui Wang
Sanket Vaibhav Mehta
Barnabás Póczós
J. Carbonell
CLLKELM
81
76
0
06 Oct 2020
Multi-Fact Correction in Abstractive Text Summarization
Multi-Fact Correction in Abstractive Text Summarization
Yue Dong
Shuohang Wang
Zhe Gan
Yu Cheng
Jackie C.K. Cheung
Jingjing Liu
KELMHILM
112
119
0
06 Oct 2020
Improving Neural Topic Models using Knowledge Distillation
Improving Neural Topic Models using Knowledge Distillation
Alexander Miserlis Hoyle
Pranav Goel
Philip Resnik
86
49
0
05 Oct 2020
Self-training Improves Pre-training for Natural Language Understanding
Self-training Improves Pre-training for Natural Language Understanding
Jingfei Du
Edouard Grave
Beliz Gunel
Vishrav Chaudhary
Onur Çelebi
Michael Auli
Ves Stoyanov
Alexis Conneau
VLMLRMSSL
52
164
0
05 Oct 2020
Local Label Point Correction for Edge Detection of Overlapping Cervical
  Cells
Local Label Point Correction for Edge Detection of Overlapping Cervical Cells
Jiawei Liu
Huijie Fan
Qiang Wang
Wentao Li
Yandong Tang
Danbo Wang
Mingyi Zhou
Li Chen
49
10
0
05 Oct 2020
PMI-Masking: Principled masking of correlated spans
PMI-Masking: Principled masking of correlated spans
Yoav Levine
Barak Lenz
Opher Lieber
Omri Abend
Kevin Leyton-Brown
Moshe Tennenholtz
Y. Shoham
66
73
0
05 Oct 2020
Improving AMR Parsing with Sequence-to-Sequence Pre-training
Improving AMR Parsing with Sequence-to-Sequence Pre-training
Dong Xu
Junhui Li
Muhua Zhu
Min Zhang
Guodong Zhou
AIMat
66
69
0
05 Oct 2020
Effective Unsupervised Domain Adaptation with Adversarially Trained
  Language Models
Effective Unsupervised Domain Adaptation with Adversarially Trained Language Models
Thuy-Trang Vu
Dinh Q. Phung
Gholamreza Haffari
86
25
0
05 Oct 2020
On Losses for Modern Language Models
On Losses for Modern Language Models
Stephane Aroca-Ouellette
Frank Rudzicz
81
32
0
04 Oct 2020
Multi-View Sequence-to-Sequence Models with Conversational Structure for
  Abstractive Dialogue Summarization
Multi-View Sequence-to-Sequence Models with Conversational Structure for Abstractive Dialogue Summarization
Jiaao Chen
Diyi Yang
80
148
0
04 Oct 2020
An Empirical Study on Large-Scale Multi-Label Text Classification
  Including Few and Zero-Shot Labels
An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot Labels
Ilias Chalkidis
Manos Fergadiotis
Sotiris Kotitsas
Prodromos Malakasiotis
Nikolaos Aletras
Ion Androutsopoulos
VLMAI4TS
63
86
0
04 Oct 2020
Data-Efficient Pretraining via Contrastive Self-Supervision
Data-Efficient Pretraining via Contrastive Self-Supervision
Nils Rethmeier
Isabelle Augenstein
103
21
0
02 Oct 2020
LUKE: Deep Contextualized Entity Representations with Entity-aware
  Self-attention
LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention
Ikuya Yamada
Akari Asai
Hiroyuki Shindo
Hideaki Takeda
Yuji Matsumoto
135
676
0
02 Oct 2020
Unsupervised Text Style Transfer with Padded Masked Language Models
Unsupervised Text Style Transfer with Padded Masked Language Models
Eric Malmi
Aliaksei Severyn
S. Rothe
62
12
0
02 Oct 2020
Which *BERT? A Survey Organizing Contextualized Encoders
Which *BERT? A Survey Organizing Contextualized Encoders
Patrick Xia
Shijie Wu
Benjamin Van Durme
62
50
0
02 Oct 2020
STIL -- Simultaneous Slot Filling, Translation, Intent Classification,
  and Language Identification: Initial Results using mBART on MultiATIS++
STIL -- Simultaneous Slot Filling, Translation, Intent Classification, and Language Identification: Initial Results using mBART on MultiATIS++
Jack G. M. FitzGerald
69
13
0
02 Oct 2020
Beyond The Text: Analysis of Privacy Statements through Syntactic and
  Semantic Role Labeling
Beyond The Text: Analysis of Privacy Statements through Syntactic and Semantic Role Labeling
Yan Shvartzshnaider
Ananth Balashankar
Vikas Patidar
Thomas Wies
L. Subramanian
48
4
0
01 Oct 2020
Measuring Systematic Generalization in Neural Proof Generation with
  Transformers
Measuring Systematic Generalization in Neural Proof Generation with Transformers
Nicolas Angelard-Gontier
Koustuv Sinha
Siva Reddy
C. Pal
LRM
106
64
0
30 Sep 2020
Improve Transformer Models with Better Relative Position Embeddings
Improve Transformer Models with Better Relative Position Embeddings
Zhiheng Huang
Davis Liang
Peng Xu
Bing Xiang
ViT
79
132
0
28 Sep 2020
Learning Knowledge Bases with Parameters for Task-Oriented Dialogue
  Systems
Learning Knowledge Bases with Parameters for Task-Oriented Dialogue Systems
Andrea Madotto
Samuel Cahyawijaya
Genta Indra Winata
Yan Xu
Zihan Liu
Zhaojiang Lin
Pascale Fung
116
64
0
28 Sep 2020
Answering Complex Open-Domain Questions with Multi-Hop Dense Retrieval
Answering Complex Open-Domain Questions with Multi-Hop Dense Retrieval
Wenhan Xiong
Xiang Lorraine Li
Srini Iyer
Jingfei Du
Patrick Lewis
...
Yashar Mehdad
Wen-tau Yih
Sebastian Riedel
Douwe Kiela
Barlas Oğuz
79
193
0
27 Sep 2020
KG-BART: Knowledge Graph-Augmented BART for Generative Commonsense
  Reasoning
KG-BART: Knowledge Graph-Augmented BART for Generative Commonsense Reasoning
Ye Liu
Yao Wan
Lifang He
Hao Peng
Philip S. Yu
114
189
0
26 Sep 2020
MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems
MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems
Zhaojiang Lin
Andrea Madotto
Genta Indra Winata
Pascale Fung
81
173
0
25 Sep 2020
RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language
  Models
RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models
Samuel Gehman
Suchin Gururangan
Maarten Sap
Yejin Choi
Noah A. Smith
216
1,224
0
24 Sep 2020
Seq2Edits: Sequence Transduction Using Span-level Edit Operations
Seq2Edits: Sequence Transduction Using Span-level Edit Operations
Felix Stahlberg
Shankar Kumar
BDL
97
84
0
23 Sep 2020
Preserving Integrity in Online Social Networks
Preserving Integrity in Online Social Networks
A. Halevy
Cristian Canton Ferrer
Hao Ma
Umut Ozertem
Patrick Pantel
Marzieh Saeidi
Fabrizio Silvestri
Ves Stoyanov
65
59
0
22 Sep 2020
Constructing interval variables via faceted Rasch measurement and
  multitask deep learning: a hate speech application
Constructing interval variables via faceted Rasch measurement and multitask deep learning: a hate speech application
Chris J. Kennedy
Geoff Bacon
A. Sahn
Claudia von Vacano
73
87
0
22 Sep 2020
An Empirical Study on Neural Keyphrase Generation
An Empirical Study on Neural Keyphrase Generation
Rui Meng
Xingdi Yuan
Tong Wang
Sanqiang Zhao
Adam Trischler
Daqing He
63
42
0
22 Sep 2020
UCD-CS at W-NUT 2020 Shared Task-3: A Text to Text Approach for COVID-19
  Event Extraction on Social Media
UCD-CS at W-NUT 2020 Shared Task-3: A Text to Text Approach for COVID-19 Event Extraction on Social Media
Congcong Wang
David Lillis
64
4
0
21 Sep 2020
VirtualFlow: Decoupling Deep Learning Models from the Underlying
  Hardware
VirtualFlow: Decoupling Deep Learning Models from the Underlying Hardware
Andrew Or
Haoyu Zhang
M. Freedman
73
10
0
20 Sep 2020
Can questions summarize a corpus? Using question generation for
  characterizing COVID-19 research
Can questions summarize a corpus? Using question generation for characterizing COVID-19 research
Gabriela Surita
Rodrigo Nogueira
R. Lotufo
29
7
0
19 Sep 2020
Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning
  in NLP Using Fewer Parameters & Less Data
Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data
Jonathan Pilault
Amine Elhattami
C. Pal
CLLMoE
90
91
0
19 Sep 2020
Self-Supervised Meta-Learning for Few-Shot Natural Language
  Classification Tasks
Self-Supervised Meta-Learning for Few-Shot Natural Language Classification Tasks
Trapit Bansal
Rishikesh Jha
Tsendsuren Munkhdalai
Andrew McCallum
SSLVLM
108
88
0
17 Sep 2020
GraphCodeBERT: Pre-training Code Representations with Data Flow
GraphCodeBERT: Pre-training Code Representations with Data Flow
Daya Guo
Shuo Ren
Shuai Lu
Zhangyin Feng
Duyu Tang
...
Dawn Drain
Neel Sundaresan
Jian Yin
Daxin Jiang
M. Zhou
192
1,160
0
17 Sep 2020
Self-supervised pre-training and contrastive representation learning for
  multiple-choice video QA
Self-supervised pre-training and contrastive representation learning for multiple-choice video QA
Seonhoon Kim
Seohyeong Jeong
Eunbyul Kim
Inho Kang
Nojun Kwak
SSL
123
40
0
17 Sep 2020
GLUCOSE: GeneraLized and COntextualized Story Explanations
GLUCOSE: GeneraLized and COntextualized Story Explanations
N. Mostafazadeh
Aditya Kalyanpur
Lori Moon
David W. Buchanan
Lauren Berkowitz
Or Biran
Jennifer Chu-Carroll
155
121
0
16 Sep 2020
Evaluating representations by the complexity of learning low-loss
  predictors
Evaluating representations by the complexity of learning low-loss predictors
William F. Whitney
M. Song
David Brandfonbrener
Jaan Altosaar
Kyunghyun Cho
76
24
0
15 Sep 2020
Augmented Natural Language for Generative Sequence Labeling
Augmented Natural Language for Generative Sequence Labeling
Ben Athiwaratkun
Cicero Nogueira dos Santos
Jason Krone
Bing Xiang
VLM
73
64
0
15 Sep 2020
BERT-QE: Contextualized Query Expansion for Document Re-ranking
BERT-QE: Contextualized Query Expansion for Document Re-ranking
Zhi Zheng
Kai Hui
Xianpei Han
Xianpei Han
Le Sun
Andrew Yates
68
97
0
15 Sep 2020
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot
  Learners
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
Timo Schick
Hinrich Schütze
177
979
0
15 Sep 2020
Previous
123...191192193...196197198
Next