ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXivPDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 8,733 papers shown
Title
Language Model is All You Need: Natural Language Understanding as
  Question Answering
Language Model is All You Need: Natural Language Understanding as Question Answering
Mahdi Namazifar
Alexandros Papangelis
Gokhan Tur
Dilek Z. Hakkani-Tür
21
47
0
05 Nov 2020
Optimizing Transformer for Low-Resource Neural Machine Translation
Optimizing Transformer for Low-Resource Neural Machine Translation
Ali Araabi
Christof Monz
VLM
35
78
0
04 Nov 2020
Emergent Communication Pretraining for Few-Shot Machine Translation
Emergent Communication Pretraining for Few-Shot Machine Translation
Yaoyiran Li
E. Ponti
Ivan Vulić
Anna Korhonen
25
19
0
02 Nov 2020
ABNIRML: Analyzing the Behavior of Neural IR Models
ABNIRML: Analyzing the Behavior of Neural IR Models
Sean MacAvaney
Sergey Feldman
Nazli Goharian
Doug Downey
Arman Cohan
15
49
0
02 Nov 2020
Automatically Identifying Words That Can Serve as Labels for Few-Shot
  Text Classification
Automatically Identifying Words That Can Serve as Labels for Few-Shot Text Classification
Timo Schick
Helmut Schmid
Hinrich Schütze
VLM
11
206
0
26 Oct 2020
Pre-trained Summarization Distillation
Pre-trained Summarization Distillation
Sam Shleifer
Alexander M. Rush
26
98
0
24 Oct 2020
NeuroLogic Decoding: (Un)supervised Neural Text Generation with
  Predicate Logic Constraints
NeuroLogic Decoding: (Un)supervised Neural Text Generation with Predicate Logic Constraints
Ximing Lu
Peter West
Rowan Zellers
Ronan Le Bras
Chandra Bhagavatula
Yejin Choi
NAI
8
142
0
24 Oct 2020
Learning Contextualized Knowledge Structures for Commonsense Reasoning
Learning Contextualized Knowledge Structures for Commonsense Reasoning
Jun Yan
Mrigank Raman
Aaron Chan
Tianyu Zhang
Ryan Rossi
Handong Zhao
Sungchul Kim
Nedim Lipka
Xiang Ren
231
36
0
24 Oct 2020
Differentiable Open-Ended Commonsense Reasoning
Differentiable Open-Ended Commonsense Reasoning
Bill Yuchen Lin
Haitian Sun
Bhuwan Dhingra
Manzil Zaheer
Xiang Ren
William W. Cohen
ReLM
LRM
13
41
0
24 Oct 2020
CoCo: Controllable Counterfactuals for Evaluating Dialogue State
  Trackers
CoCo: Controllable Counterfactuals for Evaluating Dialogue State Trackers
Shiyang Li
Semih Yavuz
Kazuma Hashimoto
Jia Li
Tong Niu
Nazneen Rajani
Xifeng Yan
Yingbo Zhou
Caiming Xiong
44
62
0
24 Oct 2020
Multilingual Speech Translation with Efficient Finetuning of Pretrained
  Models
Multilingual Speech Translation with Efficient Finetuning of Pretrained Models
Xian Li
Changhan Wang
Yun Tang
C. Tran
Yuqing Tang
J. Pino
Alexei Baevski
Alexis Conneau
Michael Auli
21
6
0
24 Oct 2020
Text Editing by Command
Text Editing by Command
Felix Faltings
Michel Galley
Gerold Hintz
Chris Brockett
Chris Quirk
Jianfeng Gao
Bill Dolan
KELM
147
37
0
24 Oct 2020
Rethinking embedding coupling in pre-trained language models
Rethinking embedding coupling in pre-trained language models
Hyung Won Chung
Thibault Févry
Henry Tsai
Melvin Johnson
Sebastian Ruder
95
142
0
24 Oct 2020
COUGH: A Challenge Dataset and Models for COVID-19 FAQ Retrieval
COUGH: A Challenge Dataset and Models for COVID-19 FAQ Retrieval
Xinliang Frederick Zhang
Heming Sun
Xiang Yue
Simon M. Lin
Huan Sun
RALM
78
17
0
24 Oct 2020
Compositional Generalization and Natural Language Variation: Can a
  Semantic Parsing Approach Handle Both?
Compositional Generalization and Natural Language Variation: Can a Semantic Parsing Approach Handle Both?
Peter Shaw
Ming-Wei Chang
Panupong Pasupat
Kristina Toutanova
CoGe
27
182
0
24 Oct 2020
AQuaMuSe: Automatically Generating Datasets for Query-Based
  Multi-Document Summarization
AQuaMuSe: Automatically Generating Datasets for Query-Based Multi-Document Summarization
Sayali Kulkarni
Sheide Chammas
Wan Zhu
Fei Sha
Eugene Ie
RALM
64
52
0
23 Oct 2020
Knowledge Graph Based Synthetic Corpus Generation for Knowledge-Enhanced
  Language Model Pre-training
Knowledge Graph Based Synthetic Corpus Generation for Knowledge-Enhanced Language Model Pre-training
Oshin Agarwal
Heming Ge
Siamak Shakeri
Rami Al-Rfou
13
38
0
23 Oct 2020
Dynamic Contextualized Word Embeddings
Dynamic Contextualized Word Embeddings
Valentin Hofmann
J. Pierrehumbert
Hinrich Schütze
39
51
0
23 Oct 2020
Synthetic Data Augmentation for Zero-Shot Cross-Lingual Question
  Answering
Synthetic Data Augmentation for Zero-Shot Cross-Lingual Question Answering
Arij Riabi
Thomas Scialom
Rachel Keraron
Benoît Sagot
Djamé Seddah
Jacopo Staiano
142
52
0
23 Oct 2020
Unsupervised Multi-hop Question Answering by Question Generation
Unsupervised Multi-hop Question Answering by Question Generation
Liangming Pan
Wenhu Chen
Wenhan Xiong
Min-Yen Kan
William Yang Wang
34
59
0
23 Oct 2020
Answering Open-Domain Questions of Varying Reasoning Steps from Text
Answering Open-Domain Questions of Varying Reasoning Steps from Text
Peng Qi
Haejun Lee
OghenetegiriTGSido
Christopher D. Manning
KELM
RALM
LRM
191
55
0
23 Oct 2020
Neural Passage Retrieval with Improved Negative Contrast
Neural Passage Retrieval with Improved Negative Contrast
Jing Lu
Gustavo Hernández Ábrego
Ji Ma
Jianmo Ni
Yinfei Yang
23
25
0
23 Oct 2020
Pretraining and Fine-Tuning Strategies for Sentiment Analysis of Latvian
  Tweets
Pretraining and Fine-Tuning Strategies for Sentiment Analysis of Latvian Tweets
Gaurish Thakkar
Marcis Pinnis
60
9
0
23 Oct 2020
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling
  for Natural Language Understanding
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding
Dongling Xiao
Yukun Li
Han Zhang
Yu Sun
Hao Tian
Hua Wu
Haifeng Wang
27
38
0
23 Oct 2020
Language Models are Open Knowledge Graphs
Language Models are Open Knowledge Graphs
Chenguang Wang
Xiao Liu
D. Song
SSL
KELM
26
135
0
22 Oct 2020
Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution
  Data
Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data
Lingkai Kong
Haoming Jiang
Yuchen Zhuang
Jie Lyu
T. Zhao
Chao Zhang
OODD
24
26
0
22 Oct 2020
DuoRAT: Towards Simpler Text-to-SQL Models
DuoRAT: Towards Simpler Text-to-SQL Models
Torsten Scholak
Raymond Li
Dzmitry Bahdanau
H. D. Vries
C. Pal
AI4TS
35
26
0
21 Oct 2020
Open-Domain Frame Semantic Parsing Using Transformers
Open-Domain Frame Semantic Parsing Using Transformers
Aditya Kalyanpur
Or Biran
Tom Breloff
Jennifer Chu-Carroll
Ariel Diertani
Owen Rambow
Mark Sammons
26
18
0
21 Oct 2020
Emformer: Efficient Memory Transformer Based Acoustic Model For Low
  Latency Streaming Speech Recognition
Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition
Yangyang Shi
Yongqiang Wang
Chunyang Wu
Ching-Feng Yeh
Julian Chan
Frank Zhang
Duc Le
M. Seltzer
56
168
0
21 Oct 2020
An Empirical Investigation of Contextualized Number Prediction
An Empirical Investigation of Contextualized Number Prediction
Daniel M. Spokoyny
Taylor Berg-Kirkpatrick
AI4TS
27
34
0
20 Oct 2020
Local Knowledge Powered Conversational Agents
Local Knowledge Powered Conversational Agents
Sashank Santhanam
Ming-Yu Liu
Raul Puri
M. Shoeybi
M. Patwary
Bryan Catanzaro
29
4
0
20 Oct 2020
Neural Language Modeling for Contextualized Temporal Graph Generation
Neural Language Modeling for Contextualized Temporal Graph Generation
Aman Madaan
Yiming Yang
38
20
0
20 Oct 2020
Anti-Distillation: Improving reproducibility of deep networks
Anti-Distillation: Improving reproducibility of deep networks
G. Shamir
Lorenzo Coviello
42
20
0
19 Oct 2020
ChemBERTa: Large-Scale Self-Supervised Pretraining for Molecular
  Property Prediction
ChemBERTa: Large-Scale Self-Supervised Pretraining for Molecular Property Prediction
Seyone Chithrananda
Gabriel Grand
Bharath Ramsundar
AI4CE
37
389
0
19 Oct 2020
Neural Databases
Neural Databases
James Thorne
Majid Yazdani
Marzieh Saeidi
Fabrizio Silvestri
Sebastian Riedel
A. Halevy
NAI
34
9
0
14 Oct 2020
Pretrained Transformers for Text Ranking: BERT and Beyond
Pretrained Transformers for Text Ranking: BERT and Beyond
Jimmy J. Lin
Rodrigo Nogueira
Andrew Yates
VLM
239
611
0
13 Oct 2020
BioMegatron: Larger Biomedical Domain Language Model
BioMegatron: Larger Biomedical Domain Language Model
Hoo-Chang Shin
Yang Zhang
Evelina Bakhturina
Raul Puri
M. Patwary
M. Shoeybi
Raghav Mani
AI4CE
19
144
0
12 Oct 2020
Probing Pretrained Language Models for Lexical Semantics
Probing Pretrained Language Models for Lexical Semantics
Ivan Vulić
E. Ponti
Robert Litschko
Goran Glavas
Anna Korhonen
KELM
28
232
0
12 Oct 2020
Reformulating Unsupervised Style Transfer as Paraphrase Generation
Reformulating Unsupervised Style Transfer as Paraphrase Generation
Kalpesh Krishna
John Wieting
Mohit Iyyer
24
236
0
12 Oct 2020
SMYRF: Efficient Attention using Asymmetric Clustering
SMYRF: Efficient Attention using Asymmetric Clustering
Giannis Daras
Nikita Kitaev
Augustus Odena
A. Dimakis
28
44
0
11 Oct 2020
Style Attuned Pre-training and Parameter Efficient Fine-tuning for
  Spoken Language Understanding
Style Attuned Pre-training and Parameter Efficient Fine-tuning for Spoken Language Understanding
Jin Cao
Jun Wang
Wael Hamza
Kelly Vanee
Shang-Wen Li
19
10
0
09 Oct 2020
Precise Task Formalization Matters in Winograd Schema Evaluations
Precise Task Formalization Matters in Winograd Schema Evaluations
Haokun Liu
William Huang
Dhara Mungra
Samuel R. Bowman
ReLM
22
12
0
08 Oct 2020
Infusing Disease Knowledge into BERT for Health Question Answering,
  Medical Inference and Disease Name Recognition
Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name Recognition
Yun He
Ziwei Zhu
Yin Zhang
Qin Chen
James Caverlee
AI4MH
30
108
0
08 Oct 2020
Uncovering the Limits of Adversarial Training against Norm-Bounded
  Adversarial Examples
Uncovering the Limits of Adversarial Training against Norm-Bounded Adversarial Examples
Sven Gowal
Chongli Qin
J. Uesato
Timothy A. Mann
Pushmeet Kohli
AAML
17
324
0
07 Oct 2020
Toward Stance-based Personas for Opinionated Dialogues
Toward Stance-based Personas for Opinionated Dialogues
Thomas Scialom
Serra Sinem Tekiroğlu
Jacopo Staiano
Marco Guerini
20
9
0
07 Oct 2020
Improving the Efficiency of Grammatical Error Correction with Erroneous
  Span Detection and Correction
Improving the Efficiency of Grammatical Error Correction with Erroneous Span Detection and Correction
M. Chen
Tao Ge
Xingxing Zhang
Furu Wei
M. Zhou
27
46
0
07 Oct 2020
Local Label Point Correction for Edge Detection of Overlapping Cervical
  Cells
Local Label Point Correction for Edge Detection of Overlapping Cervical Cells
Jiawei Liu
Huijie Fan
Qiang Wang
Wentao Li
Yandong Tang
Danbo Wang
Mingyi Zhou
Li Chen
13
9
0
05 Oct 2020
PMI-Masking: Principled masking of correlated spans
PMI-Masking: Principled masking of correlated spans
Yoav Levine
Barak Lenz
Opher Lieber
Omri Abend
Kevin Leyton-Brown
Moshe Tennenholtz
Y. Shoham
14
72
0
05 Oct 2020
Improving AMR Parsing with Sequence-to-Sequence Pre-training
Improving AMR Parsing with Sequence-to-Sequence Pre-training
Dong Xu
Junhui Li
Muhua Zhu
Min Zhang
Guodong Zhou
AIMat
18
68
0
05 Oct 2020
Effective Unsupervised Domain Adaptation with Adversarially Trained
  Language Models
Effective Unsupervised Domain Adaptation with Adversarially Trained Language Models
Thuy-Trang Vu
Dinh Q. Phung
Gholamreza Haffari
14
24
0
05 Oct 2020
Previous
123...171172173174175
Next