ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,870 papers shown
Title
LOREN: Logic-Regularized Reasoning for Interpretable Fact Verification
LOREN: Logic-Regularized Reasoning for Interpretable Fact Verification
Jiangjie Chen
Qiaoben Bao
Changzhi Sun
Xinbo Zhang
Jiaze Chen
Hao Zhou
Yanghua Xiao
Lei Li
LRM
115
34
0
25 Dec 2020
ProofWriter: Generating Implications, Proofs, and Abductive Statements
  over Natural Language
ProofWriter: Generating Implications, Proofs, and Abductive Statements over Natural Language
Oyvind Tafjord
Bhavana Dalvi
Peter Clark
114
278
0
24 Dec 2020
Bridging Textual and Tabular Data for Cross-Domain Text-to-SQL Semantic
  Parsing
Bridging Textual and Tabular Data for Cross-Domain Text-to-SQL Semantic Parsing
Xi Lin
R. Socher
Caiming Xiong
LMTD
110
210
0
23 Dec 2020
Learning Dense Representations of Phrases at Scale
Learning Dense Representations of Phrases at Scale
Jinhyuk Lee
Mujeen Sung
Jaewoo Kang
Danqi Chen
RALMDMLNAI
72
122
0
23 Dec 2020
A Survey on Visual Transformer
A Survey on Visual Transformer
Kai Han
Yunhe Wang
Hanting Chen
Xinghao Chen
Jianyuan Guo
...
Chunjing Xu
Yixing Xu
Zhaohui Yang
Yiman Zhang
Dacheng Tao
ViT
231
2,278
0
23 Dec 2020
TicketTalk: Toward human-level performance with end-to-end,
  transaction-based dialog systems
TicketTalk: Toward human-level performance with end-to-end, transaction-based dialog systems
Bill Byrne
Karthikeyan K
Saravanan Ganesh
Mihir Kale
78
25
0
23 Dec 2020
Few-Shot Text Generation with Pattern-Exploiting Training
Few-Shot Text Generation with Pattern-Exploiting Training
Timo Schick
Hinrich Schütze
111
148
0
22 Dec 2020
Intrinsic Dimensionality Explains the Effectiveness of Language Model
  Fine-Tuning
Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning
Armen Aghajanyan
Luke Zettlemoyer
Sonal Gupta
110
577
1
22 Dec 2020
RealFormer: Transformer Likes Residual Attention
RealFormer: Transformer Likes Residual Attention
Ruining He
Anirudh Ravula
Bhargav Kanagal
Joshua Ainslie
76
110
0
21 Dec 2020
Learning Contextual Representations for Semantic Parsing with
  Generation-Augmented Pre-Training
Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training
Peng Shi
Patrick Ng
Zhiguo Wang
Henghui Zhu
Alexander Hanbo Li
Jun Wang
Cicero Nogueira dos Santos
Bing Xiang
67
117
0
18 Dec 2020
Toward Transformer-Based Object Detection
Toward Transformer-Based Object Detection
Josh Beal
Eric Kim
Eric Tzeng
Dong Huk Park
Andrew Zhai
Dmitry Kislyuk
ViT
99
215
0
17 Dec 2020
Continual Lifelong Learning in Natural Language Processing: A Survey
Continual Lifelong Learning in Natural Language Processing: A Survey
Magdalena Biesialska
Katarzyna Biesialska
Marta R. Costa-jussá
KELMCLL
97
222
0
17 Dec 2020
*-CFQ: Analyzing the Scalability of Machine Learning on a Compositional
  Task
*-CFQ: Analyzing the Scalability of Machine Learning on a Compositional Task
Dmitry Tsarkov
Tibor Tihon
Nathan Scales
Nikola Momchev
Danila Sinopalnikov
Nathanael Scharli
76
17
0
15 Dec 2020
Learning to Rationalize for Nonmonotonic Reasoning with Distant
  Supervision
Learning to Rationalize for Nonmonotonic Reasoning with Distant Supervision
Faeze Brahman
Vered Shwartz
Rachel Rudinger
Yejin Choi
LRM
98
42
0
14 Dec 2020
Extracting Training Data from Large Language Models
Extracting Training Data from Large Language Models
Nicholas Carlini
Florian Tramèr
Eric Wallace
Matthew Jagielski
Ariel Herbert-Voss
...
Tom B. Brown
Basel Alomair
Ulfar Erlingsson
Alina Oprea
Colin Raffel
MLAUSILM
562
1,964
0
14 Dec 2020
Parameter-Efficient Transfer Learning with Diff Pruning
Parameter-Efficient Transfer Learning with Diff Pruning
Demi Guo
Alexander M. Rush
Yoon Kim
92
406
0
14 Dec 2020
Contrastive Learning with Adversarial Perturbations for Conditional Text
  Generation
Contrastive Learning with Adversarial Perturbations for Conditional Text Generation
Seanie Lee
Dong Bok Lee
Sung Ju Hwang
115
109
0
14 Dec 2020
ParsiNLU: A Suite of Language Understanding Challenges for Persian
ParsiNLU: A Suite of Language Understanding Challenges for Persian
Daniel Khashabi
Arman Cohan
Siamak Shakeri
Pedram Hosseini
Pouya Pezeshkpour
...
Niloofar Safi Samghabadi
Mahsa Shafaei
Saber Sheybani
Ali Tazarv
Yadollah Yaghoobzadeh
69
44
0
11 Dec 2020
Infusing Finetuning with Semantic Dependencies
Infusing Finetuning with Semantic Dependencies
Zhaofeng Wu
Hao Peng
Noah A. Smith
71
37
0
10 Dec 2020
SongMASS: Automatic Song Writing with Pre-training and Alignment
  Constraint
SongMASS: Automatic Song Writing with Pre-training and Alignment Constraint
Zhonghao Sheng
Kaitao Song
Xu Tan
Yi Ren
Wei Ye
Shikun Zhang
Tao Qin
CVBM
75
67
0
09 Dec 2020
Fusing Context Into Knowledge Graph for Commonsense Question Answering
Fusing Context Into Knowledge Graph for Commonsense Question Answering
Yichong Xu
Chenguang Zhu
Ruochen Xu
Yang Liu
Michael Zeng
Xuedong Huang
82
72
0
09 Dec 2020
Distilling Knowledge from Reader to Retriever for Question Answering
Distilling Knowledge from Reader to Retriever for Question Answering
Gautier Izacard
Edouard Grave
RALM
260
267
0
08 Dec 2020
Facts2Story: Controlling Text Generation by Key Facts
Facts2Story: Controlling Text Generation by Key Facts
Eyal Orbach
Yoav Goldberg
62
16
0
08 Dec 2020
Parallel Training of Deep Networks with Local Updates
Parallel Training of Deep Networks with Local Updates
Michael Laskin
Luke Metz
Seth Nabarrao
Mark Saroufim
Badreddine Noune
Carlo Luschi
Jascha Narain Sohl-Dickstein
Pieter Abbeel
FedML
122
27
0
07 Dec 2020
When Do Curricula Work?
When Do Curricula Work?
Xiaoxia Wu
Ethan Dyer
Behnam Neyshabur
94
118
0
05 Dec 2020
Data Boost: Text Data Augmentation Through Reinforcement Learning Guided
  Conditional Generation
Data Boost: Text Data Augmentation Through Reinforcement Learning Guided Conditional Generation
Ruibo Liu
Guangxuan Xu
Chenyan Jia
Weicheng Ma
Lili Wang
Soroush Vosoughi
82
110
0
05 Dec 2020
RPT: Relational Pre-trained Transformer Is Almost All You Need towards
  Democratizing Data Preparation
RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation
Nan Tang
Ju Fan
Fangyi Li
Jianhong Tu
Xiaoyong Du
Guoliang Li
Samuel Madden
M. Ouzzani
93
76
0
04 Dec 2020
WeaQA: Weak Supervision via Captions for Visual Question Answering
WeaQA: Weak Supervision via Captions for Visual Question Answering
Pratyay Banerjee
Tejas Gokhale
Yezhou Yang
Chitta Baral
110
36
0
04 Dec 2020
Sentiment analysis in Bengali via transfer learning using multi-lingual
  BERT
Sentiment analysis in Bengali via transfer learning using multi-lingual BERT
Khondoker Ittehadul Islam
Md. Saiful Islam
Md Ruhul Amin
71
43
0
03 Dec 2020
End-to-End QA on COVID-19: Domain Adaptation with Synthetic Training
End-to-End QA on COVID-19: Domain Adaptation with Synthetic Training
R. Reddy
Bhavani Iyer
Md Arafat Sultan
Rong Zhang
Avirup Sil
Vittorio Castelli
Radu Florian
Salim Roukos
OOD
75
19
0
02 Dec 2020
Learning from others' mistakes: Avoiding dataset biases without modeling
  them
Learning from others' mistakes: Avoiding dataset biases without modeling them
Victor Sanh
Thomas Wolf
Yonatan Belinkov
Alexander M. Rush
96
116
0
02 Dec 2020
How Can We Know When Language Models Know? On the Calibration of
  Language Models for Question Answering
How Can We Know When Language Models Know? On the Calibration of Language Models for Question Answering
Zhengbao Jiang
Jun Araki
Haibo Ding
Graham Neubig
UQCV
67
439
0
02 Dec 2020
An Enhanced Knowledge Injection Model for Commonsense Generation
An Enhanced Knowledge Injection Model for Commonsense Generation
Zhihao Fan
Yeyun Gong
Zhongyu Wei
Siyuan Wang
Ya-Chieh Huang
Jian Jiao
Xuanjing Huang
Nan Duan
Ruofei Zhang
70
28
0
01 Dec 2020
Pre-Trained Image Processing Transformer
Pre-Trained Image Processing Transformer
Hanting Chen
Yunhe Wang
Tianyu Guo
Chang Xu
Yiping Deng
Zhenhua Liu
Siwei Ma
Chunjing Xu
Chao Xu
Wen Gao
VLMViT
171
1,690
0
01 Dec 2020
Modifying Memories in Transformer Models
Modifying Memories in Transformer Models
Chen Zhu
A. S. Rawat
Manzil Zaheer
Srinadh Bhojanapalli
Daliang Li
Felix X. Yu
Sanjiv Kumar
KELM
123
203
0
01 Dec 2020
Inductive Biases for Deep Learning of Higher-Level Cognition
Inductive Biases for Deep Learning of Higher-Level Cognition
Anirudh Goyal
Yoshua Bengio
AI4CE
114
366
0
30 Nov 2020
Transformer Query-Target Knowledge Discovery (TEND): Drug Discovery from
  CORD-19
Transformer Query-Target Knowledge Discovery (TEND): Drug Discovery from CORD-19
Leo K. Tam
Xiaosong Wang
Daguang Xu
MedIm
45
2
0
28 Nov 2020
Braid: Weaving Symbolic and Neural Knowledge into Coherent Logical
  Explanations
Braid: Weaving Symbolic and Neural Knowledge into Coherent Logical Explanations
Aditya Kalyanpur
Tom Breloff
D. Ferrucci
85
18
0
26 Nov 2020
GLGE: A New General Language Generation Evaluation Benchmark
GLGE: A New General Language Generation Evaluation Benchmark
Dayiheng Liu
Yu Yan
Yeyun Gong
Weizhen Qi
Hang Zhang
...
Jiancheng Lv
Ruofei Zhang
Winnie Wu
Ming Zhou
Nan Duan
ELM
111
66
0
24 Nov 2020
EasyTransfer -- A Simple and Scalable Deep Transfer Learning Platform
  for NLP Applications
EasyTransfer -- A Simple and Scalable Deep Transfer Learning Platform for NLP Applications
Minghui Qiu
Peng Li
Chengyu Wang
Hanjie Pan
Yaliang Li
...
Jun Yang
Yaliang Li
Jun Huang
Deng Cai
Wei Lin
VLMSyDa
109
20
0
18 Nov 2020
A Definition and a Test for Human-Level Artificial Intelligence
A Definition and a Test for Human-Level Artificial Intelligence
Deokgun Park
Md Ashaduzzaman Rubel Mondol
Aishwarya Pothula
Mazharul Islam
VLM
56
4
0
18 Nov 2020
Out-of-Task Training for Dialog State Tracking Models
Out-of-Task Training for Dialog State Tracking Models
Michael Heck
Carel van Niekerk
Nurul Lubis
Christian Geishauser
Hsien-Chin Lin
Marco Moresi
Milica Gavsić
41
3
0
18 Nov 2020
Whale: Efficient Giant Model Training over Heterogeneous GPUs
Whale: Efficient Giant Model Training over Heterogeneous GPUs
Xianyan Jia
Le Jiang
Ang Wang
Wencong Xiao
Ziji Shi
...
Lan-yue Chen
Yong Li
Zhen Zheng
Xiaoyong Liu
Wei Lin
90
56
0
18 Nov 2020
Digging Deeper into CRNN Model in Chinese Text Images Recognition
Digging Deeper into CRNN Model in Chinese Text Images Recognition
Kunhong Yu
Yuze Zhang
34
2
0
17 Nov 2020
A Two-Phase Approach for Abstractive Podcast Summarization
A Two-Phase Approach for Abstractive Podcast Summarization
Chujie Zheng
Kunpeng Zhang
Harry J. Wang
Ling Fan
36
11
0
16 Nov 2020
Learning from Task Descriptions
Learning from Task Descriptions
Orion Weller
Nicholas Lourie
Matt Gardner
Matthew E. Peters
113
91
0
16 Nov 2020
Language Models not just for Pre-training: Fast Online Neural Noisy
  Channel Modeling
Language Models not just for Pre-training: Fast Online Neural Noisy Channel Modeling
Shruti Bhosale
Kyra Yee
Sergey Edunov
Michael Auli
85
7
0
13 Nov 2020
Generating Fact Checking Briefs
Generating Fact Checking Briefs
Angela Fan
Aleksandra Piktus
Fabio Petroni
Guillaume Wenzek
Marzieh Saeidi
Andreas Vlachos
Antoine Bordes
Sebastian Riedel
HILM
104
59
0
10 Nov 2020
Multimodal Pretraining for Dense Video Captioning
Multimodal Pretraining for Dense Video Captioning
Gabriel Huang
Bo Pang
Zhenhai Zhu
Clara E. Rivera
Radu Soricut
96
87
0
10 Nov 2020
UmBERTo-MTSA @ AcCompl-It: Improving Complexity and Acceptability
  Prediction with Multi-task Learning on Self-Supervised Annotations
UmBERTo-MTSA @ AcCompl-It: Improving Complexity and Acceptability Prediction with Multi-task Learning on Self-Supervised Annotations
Gabriele Sarti
19
5
0
10 Nov 2020
Previous
123...188189190...196197198
Next