ResearchTrend.AI
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
arXiv: 1910.10683

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,870 papers shown
More but Correct: Generating Diversified and Entity-revised Medical Response
Bin Li
Encheng Chen
Hongrui Liu
Yixuan Weng
Bin Sun
Shutao Li
Yongping Bai
Meiling Hu
MedIm
87
12
0
03 Aug 2021
From LSAT: The Progress and Challenges of Complex Reasoning
Siyuan Wang
Zhongkun Liu
Wanjun Zhong
Ming Zhou
Zhongyu Wei
Zhumin Chen
Nan Duan
ELM
85
46
0
02 Aug 2021
Logic-Consistency Text Generation from Semantic Parses
Chang Shu
Yusen Zhang
Xiangyu Dong
Peng Shi
Tao Yu
Rui Zhang
90
34
0
02 Aug 2021
Learning to Look Inside: Augmenting Token-Based Encoders with Character-Level Information
Yuval Pinter
Amanda Stent
Mark Dredze
Jacob Eisenstein
33
7
0
01 Aug 2021
Improving Social Meaning Detection with Pragmatic Masking and Surrogate Fine-Tuning
Chiyu Zhang
Muhammad Abdul-Mageed
ObjD, AI4CE
73
6
0
01 Aug 2021
Perceiver IO: A General Architecture for Structured Inputs & Outputs
Andrew Jaegle
Sebastian Borgeaud
Jean-Baptiste Alayrac
Carl Doersch
Catalin Ionescu
...
Olivier J. Hénaff
M. Botvinick
Andrew Zisserman
Oriol Vinyals
João Carreira
MLLM, VLM, GNN
140
585
0
30 Jul 2021
Automatic Claim Review for Climate Science via Explanation Generation
Shraey Bhatia
Jey Han Lau
Timothy Baldwin
37
5
0
30 Jul 2021
EmailSum: Abstractive Email Thread Summarization
Shiyue Zhang
Asli Celikyilmaz
Jianfeng Gao
Joey Tianyi Zhou
79
38
0
30 Jul 2021
Rethinking and Improving Relative Position Encoding for Vision Transformer
Kan Wu
Houwen Peng
Minghao Chen
Jianlong Fu
Hongyang Chao
ViT
118
339
0
29 Jul 2021
Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition
Xianrui Zheng
Chao Zhang
P. Woodland
34
49
0
29 Jul 2021
Domain-matched Pre-training Tasks for Dense Retrieval
Barlas Oğuz
Kushal Lakhotia
Anchit Gupta
Patrick Lewis
Vladimir Karpukhin
...
Xilun Chen
Sebastian Riedel
Wen-tau Yih
Sonal Gupta
Yashar Mehdad
RALM
87
67
0
28 Jul 2021
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing
Pengfei Liu
Weizhe Yuan
Jinlan Fu
Zhengbao Jiang
Hiroaki Hayashi
Graham Neubig
VLM, SyDa
283
4,044
0
28 Jul 2021
Towards Emotion-Aware Agents For Negotiation Dialogues
Kushal Chawla
Rene Clever
Jaysa Ramirez
Gale M. Lucas
Jonathan Gratch
67
14
0
28 Jul 2021
Neural Rule-Execution Tracking Machine For Transformer-Based Text Generation
Yufei Wang
Can Xu
Huang Hu
Chongyang Tao
Stephen Wan
Mark Dras
Mark Johnson
Daxin Jiang
57
10
0
27 Jul 2021
Exceeding the Limits of Visual-Linguistic Multi-Task Learning
Cameron R. Wolfe
Keld T. Lundgaard
VLM
76
2
0
27 Jul 2021
PiSLTRc: Position-informed Sign Language Transformer with Content-aware Convolution
Pan Xie
Mengyi Zhao
Xiaohui Hu
ViT, SLR
97
35
0
27 Jul 2021
Learning Span-Level Interactions for Aspect Sentiment Triplet Extraction
Lu Xu
Yew Ken Chia
Lidong Bing
155
188
0
26 Jul 2021
Go Wider Instead of Deeper
Fuzhao Xue
Ziji Shi
Futao Wei
Yuxuan Lou
Yong Liu
Yang You
ViT, MoE
93
84
0
25 Jul 2021
Evaluation of contextual embeddings on less-resourced languages
Matej Ulčar
Aleš Žagar
C. S. Armendariz
Andraž Repar
Senja Pollak
Matthew Purver
Marko Robnik-Šikonja
66
11
0
22 Jul 2021
Back-Translated Task Adaptive Pretraining: Improving Accuracy and Robustness on Text Classification
Junghoon Lee
Jounghee Kim
Pilsung Kang
VLM
72
5
0
22 Jul 2021
Spinning Sequence-to-Sequence Models with Meta-Backdoors
Eugene Bagdasaryan
Vitaly Shmatikov
SILM, AAML
86
8
0
22 Jul 2021
Generative Models for Security: Attacks, Defenses, and Opportunities
L. A. Bauer
Vincent Bindschaedler
106
4
0
21 Jul 2021
Memorization in Deep Neural Networks: Does the Loss Function matter?
Deep Patel
P. Sastry
TDI
59
8
0
21 Jul 2021
The Effectiveness of Intermediate-Task Training for Code-Switched Natural Language Understanding
Archiki Prasad
Mohammad Ali Rehan
Shreyasi Pathak
Preethi Jyothi
60
9
0
21 Jul 2021
CausalBERT: Injecting Causal Knowledge Into Pre-trained Models with Minimal Supervision
Zhongyang Li
Xiao Ding
Kuo Liao
Bing Qin
Ting Liu
CML
107
19
0
21 Jul 2021
Guided Generation of Cause and Effect
Zhongyang Li
Xiao Ding
Ting Liu
J. E. Hu
Benjamin Van Durme
238
81
0
21 Jul 2021
Sequence-to-Sequence Piano Transcription with Transformers
Curtis Hawthorne
Ian Simon
Rigel Swavely
Ethan Manilow
Jesse Engel
197
83
0
19 Jul 2021
Generative Pretraining for Paraphrase Evaluation
J. Weston
R. Lenain
U. Meepegama
E. Fristed
AIMat
59
10
0
17 Jul 2021
Learning De-identified Representations of Prosody from Raw Audio
J. Weston
R. Lenain
U. Meepegama
E. Fristed
SSL
66
17
0
17 Jul 2021
Overview and Insights from the SciVer Shared Task on Scientific Claim Verification
David Wadden
Kyle Lo
106
13
0
17 Jul 2021
Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed Language
Peter Jansen
Jordan L. Boyd-Graber
18
0
0
16 Jul 2021
TAPEX: Table Pre-training via Learning a Neural SQL Executor
Qian Liu
Bei Chen
Jiaqi Guo
Morteza Ziyadi
Zeqi Lin
Weizhu Chen
Jian-Guang Lou
LMTD
116
269
0
16 Jul 2021
Internet-Augmented Dialogue Generation
M. Komeili
Kurt Shuster
Jason Weston
RALM
322
291
0
15 Jul 2021
FewCLUE: A Chinese Few-shot Learning Evaluation Benchmark
Liang Xu
Xiaojing Lu
Chenyang Yuan
Xuanwei Zhang
Huilin Xu
...
Guoao Wei
X. Pan
Xin Tian
Libo Qin
Hai Hu
ELM
99
57
0
15 Jul 2021
AutoBERT-Zero: Evolving BERT Backbone from Scratch
Jiahui Gao
Hang Xu
Han Shi
Xiaozhe Ren
Philip L. H. Yu
Xiaodan Liang
Xin Jiang
Zhenguo Li
85
37
0
15 Jul 2021
Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning Skills
Ori Yoran
Alon Talmor
Jonathan Berant
ReLM, LRM
238
54
0
15 Jul 2021
FLEX: Unifying Evaluation for Few-Shot NLP
Jonathan Bragg
Arman Cohan
Kyle Lo
Iz Beltagy
270
108
0
15 Jul 2021
Tailor: Generating and Perturbing Text with Semantic Controls
Alexis Ross
Tongshuang Wu
Hao Peng
Matthew E. Peters
Matt Gardner
197
79
0
15 Jul 2021
Increasing Faithfulness in Knowledge-Grounded Dialogue with Controllable Features
Hannah Rashkin
David Reitter
Gaurav Singh Tomar
Dipanjan Das
241
102
0
14 Jul 2021
HTLM: Hyper-Text Pre-Training and Prompting of Language Models
Armen Aghajanyan
Dmytro Okhonko
M. Lewis
Mandar Joshi
Hu Xu
Gargi Ghosh
Luke Zettlemoyer
VLM, VPVLM, AI4TS, AI4CE
73
76
0
14 Jul 2021
DeepMutants: Training neural bug detectors with contextual mutations
Cedric Richter
Heike Wehrheim
93
3
0
14 Jul 2021
Learning Algebraic Recombination for Compositional Generalization
Chenyao Liu
Shengnan An
Zeqi Lin
Qian Liu
Bei Chen
Jian-Guang Lou
Lijie Wen
Nanning Zheng
Dongmei Zhang
CoGe
249
36
0
14 Jul 2021
Deduplicating Training Data Makes Language Models Better
Katherine Lee
Daphne Ippolito
A. Nystrom
Chiyuan Zhang
Douglas Eck
Chris Callison-Burch
Nicholas Carlini
SyDa
369
638
0
14 Jul 2021
FLAT: An Optimized Dataflow for Mitigating Attention Bottlenecks
Sheng-Chun Kao
Suvinay Subramanian
Gaurav Agrawal
Amir Yazdanbakhsh
T. Krishna
130
64
0
13 Jul 2021
Combiner: Full Attention Transformer with Sparse Computation Cost
Hongyu Ren
H. Dai
Zihang Dai
Mengjiao Yang
J. Leskovec
Dale Schuurmans
Bo Dai
166
80
0
12 Jul 2021
A Survey on Low-Resource Neural Machine Translation
Rui Wang
Xu Tan
Renqian Luo
Tao Qin
Tie-Yan Liu
3DV
98
61
0
09 Jul 2021
Evaluating Large Language Models Trained on Code
Mark Chen
Jerry Tworek
Heewoo Jun
Qiming Yuan
Henrique Pondé
...
Bob McGrew
Dario Amodei
Sam McCandlish
Ilya Sutskever
Wojciech Zaremba
ELM, ALM
268
5,696
0
07 Jul 2021
A Survey on Dialogue Summarization: Recent Advances and New Frontiers
Xiachong Feng
Xiaocheng Feng
Bing Qin
80
102
0
07 Jul 2021
Structured Denoising Diffusion Models in Discrete State-Spaces
Jacob Austin
Daniel D. Johnson
Jonathan Ho
Daniel Tarlow
Rianne van den Berg
DiffM
282
950
0
07 Jul 2021
Neural Natural Language Processing for Unstructured Data in Electronic Health Records: a Review
Irene Li
Jessica Pan
Jeremy Goldwasser
Neha Verma
Wai Pan Wong
...
Matthew Zhang
David Chang
R. Taylor
H. Krumholz
Dragomir R. Radev
BDL
86
159
0
07 Jul 2021