ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,851 papers shown
Title
SEAL: Segment-wise Extractive-Abstractive Long-form Text Summarization
SEAL: Segment-wise Extractive-Abstractive Long-form Text Summarization
Yao-Min Zhao
Mohammad Saleh
Peter J. Liu
RALM
100
25
0
18 Jun 2020
CO-Search: COVID-19 Information Retrieval with Semantic Search, Question
  Answering, and Abstractive Summarization
CO-Search: COVID-19 Information Retrieval with Semantic Search, Question Answering, and Abstractive Summarization
A. Esteva
Anuprit Kale
Romain Paulus
Kazuma Hashimoto
Wenpeng Yin
Dragomir R. Radev
R. Socher
107
65
0
17 Jun 2020
Cross-lingual Retrieval for Iterative Self-Supervised Training
Cross-lingual Retrieval for Iterative Self-Supervised Training
C. Tran
Y. Tang
Xian Li
Jiatao Gu
RALM
70
75
0
16 Jun 2020
Modeling Graph Structure via Relative Position for Text Generation from
  Knowledge Graphs
Modeling Graph Structure via Relative Position for Text Generation from Knowledge Graphs
Martin Schmitt
Leonardo F. R. Ribeiro
Philipp Dufter
Iryna Gurevych
Hinrich Schütze
GNN
50
8
0
16 Jun 2020
To Pretrain or Not to Pretrain: Examining the Benefits of Pretraining on
  Resource Rich Tasks
To Pretrain or Not to Pretrain: Examining the Benefits of Pretraining on Resource Rich Tasks
Sinong Wang
Madian Khabsa
Hao Ma
60
26
0
15 Jun 2020
Video Understanding as Machine Translation
Bruno Korbar
Fabio Petroni
Rohit Girdhar
Lorenzo Torresani
SSL
88
29
0
12 Jun 2020
NAS-Bench-NLP: Neural Architecture Search Benchmark for Natural Language
  Processing
NAS-Bench-NLP: Neural Architecture Search Benchmark for Natural Language Processing
Nikita Klyuchnikov
I. Trofimov
Ekaterina Artemova
Mikhail Salnikov
M. Fedorov
Evgeny Burnaev
VLM
112
105
0
12 Jun 2020
Learning the Travelling Salesperson Problem Requires Rethinking
  Generalization
Learning the Travelling Salesperson Problem Requires Rethinking Generalization
Chaitanya K. Joshi
Quentin Cappart
Louis-Martin Rousseau
T. Laurent
256
121
0
12 Jun 2020
A Monolingual Approach to Contextualized Word Embeddings for
  Mid-Resource Languages
A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages
Pedro Ortiz Suarez
Laurent Romary
Benoît Sagot
73
234
0
11 Jun 2020
MC-BERT: Efficient Language Pre-Training via a Meta Controller
MC-BERT: Efficient Language Pre-Training via a Meta Controller
Zhenhui Xu
Linyuan Gong
Guolin Ke
Di He
Shuxin Zheng
Liwei Wang
Jiang Bian
Tie-Yan Liu
BDL
65
18
0
10 Jun 2020
Linformer: Self-Attention with Linear Complexity
Linformer: Self-Attention with Linear Complexity
Sinong Wang
Belinda Z. Li
Madian Khabsa
Han Fang
Hao Ma
229
1,721
0
08 Jun 2020
CycleGT: Unsupervised Graph-to-Text and Text-to-Graph Generation via
  Cycle Training
CycleGT: Unsupervised Graph-to-Text and Text-to-Graph Generation via Cycle Training
Qipeng Guo
Zhijing Jin
Xipeng Qiu
Weinan Zhang
David Wipf
Zheng Zhang
126
61
0
08 Jun 2020
ColdGANs: Taming Language GANs with Cautious Sampling Strategies
ColdGANs: Taming Language GANs with Cautious Sampling Strategies
Thomas Scialom
Paul-Alexis Dray
Sylvain Lamprier
Benjamin Piwowarski
Jacopo Staiano
GANSyDa
68
18
0
08 Jun 2020
BERT Loses Patience: Fast and Robust Inference with Early Exit
BERT Loses Patience: Fast and Robust Inference with Early Exit
Wangchunshu Zhou
Canwen Xu
Tao Ge
Julian McAuley
Ke Xu
Furu Wei
63
343
0
07 Jun 2020
Language Models as Fact Checkers?
Language Models as Fact Checkers?
Nayeon Lee
Belinda Z. Li
Sinong Wang
Wen-tau Yih
Hao Ma
Madian Khabsa
KELMHILMLRM
86
75
0
07 Jun 2020
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
Pengcheng He
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
AAML
181
2,770
0
05 Jun 2020
GMAT: Global Memory Augmentation for Transformers
GMAT: Global Memory Augmentation for Transformers
Ankit Gupta
Jonathan Berant
RALM
81
50
0
05 Jun 2020
Funnel-Transformer: Filtering out Sequential Redundancy for Efficient
  Language Processing
Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing
Zihang Dai
Guokun Lai
Yiming Yang
Quoc V. Le
109
236
0
05 Jun 2020
MLE-guided parameter search for task loss minimization in neural
  sequence modeling
MLE-guided parameter search for task loss minimization in neural sequence modeling
Sean Welleck
Kyunghyun Cho
65
10
0
04 Jun 2020
Automatic Text Summarization of COVID-19 Medical Research Articles using
  BERT and GPT-2
Automatic Text Summarization of COVID-19 Medical Research Articles using BERT and GPT-2
V. Kieuvongngam
Bowen Tan
Yiming Niu
AI4MH
56
96
0
03 Jun 2020
A Survey on Transfer Learning in Natural Language Processing
A Survey on Transfer Learning in Natural Language Processing
Zaid Alyafeai
Maged S. Alshaibani
Irfan Ahmad
91
75
0
31 May 2020
Neural Entity Linking: A Survey of Models Based on Deep Learning
Neural Entity Linking: A Survey of Models Based on Deep Learning
Ozge Sevgili
Artem Shelmanov
Mikhail V. Arkhipov
Alexander Panchenko
Christian Biemann
VLM3DVAI4TS
135
123
0
31 May 2020
Language Models are Few-Shot Learners
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
1.1K
42,651
0
28 May 2020
Syntactic Structure Distillation Pretraining For Bidirectional Encoders
Syntactic Structure Distillation Pretraining For Bidirectional Encoders
A. Kuncoro
Lingpeng Kong
Daniel Fried
Dani Yogatama
Laura Rimell
Chris Dyer
Phil Blunsom
93
34
0
27 May 2020
Predict-then-Decide: A Predictive Approach for Wait or Answer Task in
  Dialogue Systems
Predict-then-Decide: A Predictive Approach for Wait or Answer Task in Dialogue Systems
Zehao Lin
Shaobo Cui
Guodun Li
Xiaoming Kang
Feng Ji
Feng-Lin Li
Zhongzhou Zhao
Haiqing Chen
Yin Zhang
67
2
0
27 May 2020
ParsBERT: Transformer-based Model for Persian Language Understanding
ParsBERT: Transformer-based Model for Persian Language Understanding
Mehrdad Farahani
Mohammad Gharachorloo
Marzieh Farahani
Mohammad Manthouri
91
210
0
26 May 2020
NILE : Natural Language Inference with Faithful Natural Language
  Explanations
NILE : Natural Language Inference with Faithful Natural Language Explanations
Sawan Kumar
Partha P. Talukdar
XAILRM
116
163
0
25 May 2020
Summarizing and Exploring Tabular Data in Conversational Search
Summarizing and Exploring Tabular Data in Conversational Search
Shuo Zhang
Zhuyun Dai
K. Balog
Jamie Callan
RALMLMTD
110
41
0
23 May 2020
Text-to-Text Pre-Training for Data-to-Text Tasks
Text-to-Text Pre-Training for Data-to-Text Tasks
Mihir Kale
Abhinav Rastogi
AI4CE
90
203
0
21 May 2020
BiQGEMM: Matrix Multiplication with Lookup Table For Binary-Coding-based
  Quantized DNNs
BiQGEMM: Matrix Multiplication with Lookup Table For Binary-Coding-based Quantized DNNs
Yongkweon Jeon
Baeseong Park
S. Kwon
Byeongwook Kim
Jeongin Yun
Dongsoo Lee
MQ
63
31
0
20 May 2020
Normalized Attention Without Probability Cage
Normalized Attention Without Probability Cage
Oliver Richter
Roger Wattenhofer
91
21
0
19 May 2020
CERT: Contrastive Self-supervised Learning for Language Understanding
CERT: Contrastive Self-supervised Learning for Language Understanding
Hongchao Fang
Sicheng Wang
Meng Zhou
Jiayuan Ding
P. Xie
ELMSSL
76
345
0
16 May 2020
Movement Pruning: Adaptive Sparsity by Fine-Tuning
Movement Pruning: Adaptive Sparsity by Fine-Tuning
Victor Sanh
Thomas Wolf
Alexander M. Rush
99
488
0
15 May 2020
Machine Reading Comprehension: The Role of Contextualized Language
  Models and Beyond
Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond
Zhuosheng Zhang
Hai Zhao
Rui Wang
115
63
0
13 May 2020
Large Scale Multi-Actor Generative Dialog Modeling
Large Scale Multi-Actor Generative Dialog Modeling
Alex Boyd
Raul Puri
Mohammad Shoeybi
M. Patwary
Bryan Catanzaro
71
23
0
13 May 2020
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
Hao Tian
Can Gao
Xinyan Xiao
Hao Liu
Bolei He
Hua Wu
Haifeng Wang
Feng Wu
68
237
0
12 May 2020
Enabling Language Models to Fill in the Blanks
Enabling Language Models to Fill in the Blanks
Chris Donahue
Mina Lee
Percy Liang
58
198
0
11 May 2020
A Dataset for Statutory Reasoning in Tax Law Entailment and Question
  Answering
A Dataset for Statutory Reasoning in Tax Law Entailment and Question Answering
Nils Holzenberger
Andrew Blair-Stanek
Benjamin Van Durme
ELMAILaw
69
69
0
11 May 2020
Leveraging Monolingual Data with Self-Supervision for Multilingual
  Neural Machine Translation
Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation
Aditya Siddhant
Ankur Bapna
Yuan Cao
Orhan Firat
Mengzhao Chen
Sneha Kudugunta
N. Arivazhagan
Yonghui Wu
86
86
0
11 May 2020
How Context Affects Language Models' Factual Predictions
How Context Affects Language Models' Factual Predictions
Fabio Petroni
Patrick Lewis
Aleksandra Piktus
Tim Rocktaschel
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
76
239
0
10 May 2020
Transformer Based Language Models for Similar Text Retrieval and Ranking
Transformer Based Language Models for Similar Text Retrieval and Ranking
Javed Qadrud-Din
Ashraf Bah Rabiou
Ryan S Walker
Ravindra Soni
M. Gajek
Gabriel Pack
A. Rangaraj
39
5
0
10 May 2020
Measuring the Algorithmic Efficiency of Neural Networks
Measuring the Algorithmic Efficiency of Neural Networks
Danny Hernandez
Tom B. Brown
291
97
0
08 May 2020
JASS: Japanese-specific Sequence to Sequence Pre-training for Neural
  Machine Translation
JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation
Zhuoyuan Mao
Fabien Cromierès
Raj Dabre
Haiyue Song
Sadao Kurohashi
73
4
0
07 May 2020
Multi-Stage Conversational Passage Retrieval: An Approach to Fusing Term
  Importance Estimation and Neural Query Rewriting
Multi-Stage Conversational Passage Retrieval: An Approach to Fusing Term Importance Estimation and Neural Query Rewriting
Sheng-Chieh Lin
Jheng-Hong Yang
Rodrigo Nogueira
Ming-Feng Tsai
Chuan-Ju Wang
Jimmy J. Lin
84
24
0
05 May 2020
Towards QoS-Aware and Resource-Efficient GPU Microservices Based on
  Spatial Multitasking GPUs In Datacenters
Towards QoS-Aware and Resource-Efficient GPU Microservices Based on Spatial Multitasking GPUs In Datacenters
Wei Zhang
Quan Chen
Kaihua Fu
Ningxin Zheng
Zhiyi Huang
Jingwen Leng
Chao Li
Wenli Zheng
Minyi Guo
29
3
0
05 May 2020
Establishing Baselines for Text Classification in Low-Resource Languages
Establishing Baselines for Text Classification in Low-Resource Languages
Jan Christian Blaise Cruz
C. Cheng
96
38
0
05 May 2020
Exploring Controllable Text Generation Techniques
Exploring Controllable Text Generation Techniques
Shrimai Prabhumoye
A. Black
Ruslan Salakhutdinov
AI4CE
210
91
0
04 May 2020
Generating SOAP Notes from Doctor-Patient Conversations Using Modular
  Summarization Techniques
Generating SOAP Notes from Doctor-Patient Conversations Using Modular Summarization Techniques
Kundan Krishna
Sopan Khosla
Jeffrey P. Bigham
Zachary Chase Lipton
91
120
0
04 May 2020
How Can We Accelerate Progress Towards Human-like Linguistic
  Generalization?
How Can We Accelerate Progress Towards Human-like Linguistic Generalization?
Tal Linzen
289
195
0
03 May 2020
Teaching Machine Comprehension with Compositional Explanations
Teaching Machine Comprehension with Compositional Explanations
Qinyuan Ye
Xiao Huang
Elizabeth Boschee
Xiang Ren
LRMReLM
96
34
0
02 May 2020
Previous
123...193194195196197198
Next