ResearchTrend.AI
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,870 papers shown
Parallelizing Legendre Memory Unit Training
Narsimha Chilkuri
C. Eliasmith
88
39
0
22 Feb 2021
Position Information in Transformers: An Overview
Philipp Dufter
Martin Schmitt
Hinrich Schütze
93
148
0
22 Feb 2021
UniT: Multimodal Multitask Learning with a Unified Transformer
Ronghang Hu
Amanpreet Singh
ViT
106
301
0
22 Feb 2021
Pruning the Index Contents for Memory Efficient Open-Domain QA
Martin Fajcik
Martin Docekal
Karel Ondrej
Pavel Smrz
80
8
0
21 Feb 2021
VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning
Jun Chen
Han Guo
Kai Yi
Boyang Albert Li
Mohamed Elhoseiny
VLM
162
227
0
20 Feb 2021
Evolving Attention with Residual Convolutions
Yujing Wang
Yaming Yang
Jiangang Bai
Mingliang Zhang
Jing Bai
Jiahao Yu
Ce Zhang
Gao Huang
Yunhai Tong
ViT
112
34
0
20 Feb 2021
Formal Language Theory Meets Modern NLP
William Merrill
AI4CE
NAI
112
13
0
19 Feb 2021
Analyzing Curriculum Learning for Sentiment Analysis along Task Difficulty, Pacing and Visualization Axes
Anvesh Rao Vijjini
Kaveri Anuranjana
R. Mamidi
70
3
0
19 Feb 2021
Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer
Rafal Powalski
Łukasz Borchmann
Dawid Jurkiewicz
Tomasz Dwojak
Michal Pietruszka
Gabriela Pałka
ViT
94
160
0
18 Feb 2021
Quiz-Style Question Generation for News Stories
Á. Lelkes
Vinh Q. Tran
Cong Yu
84
42
0
18 Feb 2021
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts
Soravit Changpinyo
P. Sharma
Nan Ding
Radu Soricut
VLM
502
1,143
0
17 Feb 2021
COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining
Yu Meng
Chenyan Xiong
Payal Bajaj
Saurabh Tiwary
Paul N. Bennett
Jiawei Han
Xia Song
182
205
0
16 Feb 2021
Boosting Low-Resource Biomedical QA via Entity-Aware Masking Strategies
Gabriele Pergola
E. Kochkina
Lin Gui
Maria Liakata
Yulan He
145
32
0
16 Feb 2021
CATE: Computation-aware Neural Architecture Encoding with Transformers
Shen Yan
Kaiqiang Song
Z. Feng
Mi Zhang
81
28
0
14 Feb 2021
TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up
Yi Ding
Shiyu Chang
Zhangyang Wang
ViT
151
393
0
14 Feb 2021
Reasoning Over Virtual Knowledge Bases With Open Predicate Relations
Haitian Sun
Pat Verga
Bhuwan Dhingra
Ruslan Salakhutdinov
William W. Cohen
LRM
106
26
0
14 Feb 2021
PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them
Patrick Lewis
Yuxiang Wu
Linqing Liu
Pasquale Minervini
Heinrich Küttler
Aleksandra Piktus
Pontus Stenetorp
Sebastian Riedel
RALM
138
234
0
13 Feb 2021
Proof Artifact Co-training for Theorem Proving with Language Models
Jesse Michael Han
Jason M. Rute
Yuhuai Wu
Edward W. Ayers
Stanislas Polu
AIMat
117
127
0
11 Feb 2021
Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling
Jie Lei
Linjie Li
Luowei Zhou
Zhe Gan
Tamara L. Berg
Joey Tianyi Zhou
Jingjing Liu
CLIP
176
665
0
11 Feb 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
531
3,911
0
11 Feb 2021
Biomedical Question Answering: A Survey of Approaches and Challenges
Qiao Jin
Zheng Yuan
Guangzhi Xiong
Qian Yu
Huaiyuan Ying
Chuanqi Tan
Mosha Chen
Songfang Huang
Xiaozhong Liu
Sheng Yu
108
104
0
10 Feb 2021
Decontextualization: Making Sentences Stand-Alone
Eunsol Choi
J. Palomaki
Matthew Lamm
Tom Kwiatkowski
Dipanjan Das
Michael Collins
65
100
0
09 Feb 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
420
2,075
0
09 Feb 2021
Improving Scene Graph Classification by Exploiting Knowledge from Texts
Sahand Sharifzadeh
Sina Moayed Baharlou
Martin Schmitt
Hinrich Schütze
Volker Tresp
59
19
0
09 Feb 2021
Efficient Retrieval Augmented Generation from Unstructured Knowledge for Task-Oriented Dialog
David Thulke
Nico Daheim
Christian Dugast
Hermann Ney
RALM
76
49
0
09 Feb 2021
Damage detection using in-domain and cross-domain transfer learning
Zaharah Bukhsh
N. Jansen
Aaqib Saeed
63
43
0
07 Feb 2021
Symbolic Behaviour in Artificial Intelligence
Adam Santoro
Andrew Kyle Lampinen
Kory W. Mathewson
Timothy Lillicrap
David Raposo
79
34
0
05 Feb 2021
Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge
Sumithra Bhakthavatsalam
Daniel Khashabi
Tushar Khot
Bhavana Dalvi
Kyle Richardson
Ashish Sabharwal
Carissa Schoenick
Oyvind Tafjord
Peter Clark
RALM
AI4CE
70
66
0
05 Feb 2021
ChainCQG: Flow-Aware Conversational Question Generation
Jing Gu
Mostafa Mirshekari
Zhou Yu
Aaron Sisto
73
35
0
04 Feb 2021
Unifying Vision-and-Language Tasks via Text Generation
Jaemin Cho
Jie Lei
Hao Tan
Joey Tianyi Zhou
MLLM
392
546
0
04 Feb 2021
The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics
Sebastian Gehrmann
Tosin Adewumi
Karmanya Aggarwal
Pawan Sasanka Ammanamanchi
Aremu Anuoluwapo
...
Nishant Subramani
Wei Xu
Diyi Yang
Akhila Yerukola
Jiawei Zhou
VLM
321
285
0
02 Feb 2021
Neural Data Augmentation via Example Extrapolation
Kenton Lee
Kelvin Guu
Luheng He
Timothy Dozat
Hyung Won Chung
76
72
0
02 Feb 2021
Scaling Laws for Transfer
Danny Hernandez
Jared Kaplan
T. Henighan
Sam McCandlish
100
251
0
02 Feb 2021
Measuring and Improving Consistency in Pretrained Language Models
Yanai Elazar
Nora Kassner
Shauli Ravfogel
Abhilasha Ravichander
Eduard H. Hovy
Hinrich Schütze
Yoav Goldberg
HILM
337
371
0
01 Feb 2021
Civil Rephrases Of Toxic Texts With Self-Supervised Transformers
Leo Laugier
John Pavlopoulos
Jeffrey Scott Sorensen
Lucas Dixon
90
48
0
01 Feb 2021
TruthBot: An Automated Conversational Tool for Intent Learning, Curated Information Presenting, and Fake News Alerting
Ankur Gupta
Yash Varun
Prarthana Das
Nithya Muttineni
Parth Srivastava
Hamim Zafar
Tanmoy Chakraborty
Swaprava Nath
39
7
0
31 Jan 2021
VX2TEXT: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs
Xudong Lin
Gedas Bertasius
Jue Wang
Shih-Fu Chang
Devi Parikh
Lorenzo Torresani
VGen
102
67
0
28 Jan 2021
BENDR: using transformers and a contrastive self-supervised learning task to learn from massive amounts of EEG data
Demetres Kostas
Stephane Aroca-Ouellette
Frank Rudzicz
SSL
120
210
0
28 Jan 2021
Explaining Natural Language Processing Classifiers with Occlusion and Language Modeling
David Harbecke
AAML
53
2
0
28 Jan 2021
DRAG: Director-Generator Language Modelling Framework for Non-Parallel Author Stylized Rewriting
Hrituraj Singh
Gaurav Verma
Aparna Garimella
Balaji Vasan Srinivasan
DiffM
37
6
0
28 Jan 2021
VisualMRC: Machine Reading Comprehension on Document Images
Ryota Tanaka
Kyosuke Nishida
Sen Yoshida
101
146
0
27 Jan 2021
Muppet: Massive Multi-task Representations with Pre-Finetuning
Armen Aghajanyan
Anchit Gupta
Akshat Shrivastava
Xilun Chen
Luke Zettlemoyer
Sonal Gupta
100
269
0
26 Jan 2021
Slot Self-Attentive Dialogue State Tracking
Fanghua Ye
Jarana Manotumruksa
Qiang Zhang
Shenghui Li
Emine Yilmaz
136
63
0
22 Jan 2021
Distilling Large Language Models into Tiny and Effective Students using pQRNN
P. Kaliamoorthi
Aditya Siddhant
Edward Li
Melvin Johnson
MQ
60
17
0
21 Jan 2021
Zero-shot Generalization in Dialog State Tracking through Generative Question Answering
Shuyang Li
Jin Cao
Mukund Sridhar
Henghui Zhu
Shang-Wen Li
Wael Hamza
Julian McAuley
BDL
72
46
0
20 Jan 2021
Open-Domain Conversational Search Assistant with Transformers
Rafael Ferreira
Mariana Leite
David Semedo
João Magalhães
41
11
0
20 Jan 2021
ZeRO-Offload: Democratizing Billion-Scale Model Training
Jie Ren
Samyam Rajbhandari
Reza Yazdani Aminabadi
Olatunji Ruwase
Shuangyang Yang
Minjia Zhang
Dong Li
Yuxiong He
MoE
289
434
0
18 Jan 2021
What Makes Good In-Context Examples for GPT-3?
Jiachang Liu
Dinghan Shen
Yizhe Zhang
Bill Dolan
Lawrence Carin
Weizhu Chen
AAML
RALM
400
1,400
0
17 Jan 2021
GENIE: Toward Reproducible and Standardized Human Evaluation for Text Generation
Daniel Khashabi
Gabriel Stanovsky
Jonathan Bragg
Nicholas Lourie
Jungo Kasai
Yejin Choi
Noah A. Smith
Daniel S. Weld
131
21
0
17 Jan 2021
Transformer-Based Models for Question Answering on COVID19
Hillary Ngai
Yoona Park
John Chen
Mahboobeh Parsapoor
OOD
48
21
0
16 Jan 2021