ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,870 papers shown
Title
RefSum: Refactoring Neural Summarization
RefSum: Refactoring Neural Summarization
Yixin Liu
Zi-Yi Dou
Pengfei Liu
65
40
0
15 Apr 2021
Ultra-High Dimensional Sparse Representations with Binarization for
  Efficient Text Retrieval
Ultra-High Dimensional Sparse Representations with Binarization for Efficient Text Retrieval
Kyoung-Rok Jang
Junmo Kang
Giwon Hong
Sung-Hyon Myaeng
Joohee Park
Taewon Yoon
Heecheol Seo
83
20
0
15 Apr 2021
TSDAE: Using Transformer-based Sequential Denoising Auto-Encoder for
  Unsupervised Sentence Embedding Learning
TSDAE: Using Transformer-based Sequential Denoising Auto-Encoder for Unsupervised Sentence Embedding Learning
Kexin Wang
Nils Reimers
Iryna Gurevych
149
189
0
14 Apr 2021
K-PLUG: Knowledge-injected Pre-trained Language Model for Natural
  Language Understanding and Generation in E-Commerce
K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce
Song Xu
Haoran Li
Peng Yuan
Yujia Wang
Youzheng Wu
Xiaodong He
Ying Liu
Bowen Zhou
KELM
91
24
0
14 Apr 2021
Knowledge-driven Answer Generation for Conversational Search
Knowledge-driven Answer Generation for Conversational Search
Mariana Leite
Rafael Ferreira
David Semedo
João Magalhães
RALMKELM
76
1
0
14 Apr 2021
Ask what's missing and what's useful: Improving Clarification Question
  Generation using Global Knowledge
Ask what's missing and what's useful: Improving Clarification Question Generation using Global Knowledge
Bodhisattwa Prasad Majumder
Sudha Rao
Michel Galley
Julian McAuley
61
50
0
14 Apr 2021
Towards BERT-based Automatic ICD Coding: Limitations and Opportunities
Towards BERT-based Automatic ICD Coding: Limitations and Opportunities
Damian Pascual
Sandro Luck
Roger Wattenhofer
MedIm
73
55
0
14 Apr 2021
BERT Embeddings Can Track Context in Conversational Search
BERT Embeddings Can Track Context in Conversational Search
Rafael Ferreira
David Semedo
João Magalhães
AI4TS
47
0
0
13 Apr 2021
MS2: Multi-Document Summarization of Medical Studies
MS2: Multi-Document Summarization of Medical Studies
Jay DeYoung
Iz Beltagy
Madeleine van Zuylen
Bailey Kuehl
Lucy Lu Wang
91
113
0
13 Apr 2021
QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question
  Answering
QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering
Michihiro Yasunaga
Hongyu Ren
Antoine Bosselut
Percy Liang
J. Leskovec
RALMLMTDAI4MHLRM
92
599
0
13 Apr 2021
CLEVR_HYP: A Challenge Dataset and Baselines for Visual Question
  Answering with Hypothetical Actions over Images
CLEVR_HYP: A Challenge Dataset and Baselines for Visual Question Answering with Hypothetical Actions over Images
Shailaja Keyur Sampat
Akshay Kumar
Yezhou Yang
Chitta Baral
78
26
0
13 Apr 2021
Document-Level Event Argument Extraction by Conditional Generation
Document-Level Event Argument Extraction by Conditional Generation
Sha Li
Heng Ji
Jiawei Han
70
305
0
13 Apr 2021
Discourse Probing of Pretrained Language Models
Discourse Probing of Pretrained Language Models
Fajri Koto
Jey Han Lau
Tim Baldwin
78
53
0
13 Apr 2021
Relational World Knowledge Representation in Contextual Language Models:
  A Review
Relational World Knowledge Representation in Contextual Language Models: A Review
Tara Safavi
Danai Koutra
KELM
100
51
0
12 Apr 2021
FUDGE: Controlled Text Generation With Future Discriminators
FUDGE: Controlled Text Generation With Future Discriminators
Kevin Kaichuang Yang
Dan Klein
109
337
0
12 Apr 2021
UTNLP at SemEval-2021 Task 5: A Comparative Analysis of Toxic Span
  Detection using Attention-based, Named Entity Recognition, and Ensemble
  Models
UTNLP at SemEval-2021 Task 5: A Comparative Analysis of Toxic Span Detection using Attention-based, Named Entity Recognition, and Ensemble Models
Alireza Salemi
Nazanin Sabri
Emad Kebriaei
B. Bahrak
A. Shakery
40
3
0
10 Apr 2021
Fool Me Twice: Entailment from Wikipedia Gamification
Fool Me Twice: Entailment from Wikipedia Gamification
Julian Martin Eisenschlos
Bhuwan Dhingra
Jannis Bulian
Benjamin Borschinger
Jordan L. Boyd-Graber
93
48
0
10 Apr 2021
Adapting Language Models for Zero-shot Learning by Meta-tuning on
  Dataset and Prompt Collections
Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections
Ruiqi Zhong
Kristy Lee
Zheng Zhang
Dan Klein
158
173
0
10 Apr 2021
Efficient Large-Scale Language Model Training on GPU Clusters Using
  Megatron-LM
Efficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LM
Deepak Narayanan
Mohammad Shoeybi
Jared Casper
P. LeGresley
M. Patwary
...
Prethvi Kashinkunti
J. Bernauer
Bryan Catanzaro
Amar Phanishayee
Matei A. Zaharia
MoE
197
712
0
09 Apr 2021
KI-BERT: Infusing Knowledge Context for Better Language and Domain
  Understanding
KI-BERT: Infusing Knowledge Context for Better Language and Domain Understanding
Keyur Faldu
A. Sheth
Prashant Kikani
Hemang Akabari
69
28
0
09 Apr 2021
FIBER: Fill-in-the-Blanks as a Challenging Video Understanding
  Evaluation Framework
FIBER: Fill-in-the-Blanks as a Challenging Video Understanding Evaluation Framework
Santiago Castro
Ruoyao Wang
Pingxuan Huang
Ian Stewart
Oana Ignat
Nan Liu
Jonathan C. Stroud
Rada Mihalcea
AIMat
89
11
0
09 Apr 2021
Automatic Generation of Descriptive Titles for Video Clips Using Deep
  Learning
Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning
Soheyla Amirian
Khaled Rasheed
T. Taha
H. Arabnia
VLMVGen
49
23
0
07 Apr 2021
Creativity and Machine Learning: A Survey
Creativity and Machine Learning: A Survey
Giorgio Franceschelli
Mirco Musolesi
VLMAI4CE
129
43
0
06 Apr 2021
CodeTrans: Towards Cracking the Language of Silicon's Code Through
  Self-Supervised Deep Learning and High Performance Computing
CodeTrans: Towards Cracking the Language of Silicon's Code Through Self-Supervised Deep Learning and High Performance Computing
Ahmed Elnaggar
Wei Ding
Llion Jones
Tom Gibbs
Tamas B. Fehér
Christoph Angerer
Silvia Severini
Florian Matthes
B. Rost
72
72
0
06 Apr 2021
What Will it Take to Fix Benchmarking in Natural Language Understanding?
What Will it Take to Fix Benchmarking in Natural Language Understanding?
Samuel R. Bowman
George E. Dahl
ELMALM
76
164
0
05 Apr 2021
ASER: Towards Large-scale Commonsense Knowledge Acquisition via
  Higher-order Selectional Preference over Eventualities
ASER: Towards Large-scale Commonsense Knowledge Acquisition via Higher-order Selectional Preference over Eventualities
Hongming Zhang
Xin Liu
Haojie Pan
Hao Ke
Jiefu Ou
Tianqing Fang
Yangqiu Song
77
48
0
05 Apr 2021
Compressing Visual-linguistic Model via Knowledge Distillation
Compressing Visual-linguistic Model via Knowledge Distillation
Zhiyuan Fang
Jianfeng Wang
Xiaowei Hu
Lijuan Wang
Yezhou Yang
Zicheng Liu
VLM
116
99
0
05 Apr 2021
Inference Time Style Control for Summarization
Inference Time Style Control for Summarization
Shuyang Cao
Lu Wang
AI4TS
77
16
0
05 Apr 2021
MCL@IITK at SemEval-2021 Task 2: Multilingual and Cross-lingual
  Word-in-Context Disambiguation using Augmented Data, Signals, and
  Transformers
MCL@IITK at SemEval-2021 Task 2: Multilingual and Cross-lingual Word-in-Context Disambiguation using Augmented Data, Signals, and Transformers
Rohan Gupta
Jay Mundra
Deepak Mahajan
Ashutosh Modi
43
3
0
04 Apr 2021
IndT5: A Text-to-Text Transformer for 10 Indigenous Languages
IndT5: A Text-to-Text Transformer for 10 Indigenous Languages
El Moatez Billah Nagoudi
Wei-Rui Chen
Muhammad Abdul-Mageed
H. Cavusoglu
85
24
0
04 Apr 2021
Humor@IITK at SemEval-2021 Task 7: Large Language Models for Quantifying
  Humor and Offensiveness
Humor@IITK at SemEval-2021 Task 7: Large Language Models for Quantifying Humor and Offensiveness
Aishwarya Gupta
Avik Pal
Bholeshwar Khurana
Lakshay Tyagi
Ashutosh Modi
53
6
0
02 Apr 2021
Towards General Purpose Vision Systems
Towards General Purpose Vision Systems
Tanmay Gupta
Amita Kamath
Aniruddha Kembhavi
Derek Hoiem
100
53
0
01 Apr 2021
FeTaQA: Free-form Table Question Answering
FeTaQA: Free-form Table Question Answering
Linyong Nan
Chia-Hsuan Hsieh
Ziming Mao
Xi Lin
Neha Verma
...
Isabel Trindade
Renusree Bandaru
Jacob Cunningham
Caiming Xiong
Dragomir R. Radev
LMTD
152
167
0
01 Apr 2021
HAConvGNN: Hierarchical Attention Based Convolutional Graph Neural
  Network for Code Documentation Generation in Jupyter Notebooks
HAConvGNN: Hierarchical Attention Based Convolutional Graph Neural Network for Code Documentation Generation in Jupyter Notebooks
Xuye Liu
Dakuo Wang
A. Wang
Yufang Hou
Lingfei Wu
66
22
0
31 Mar 2021
BASE Layers: Simplifying Training of Large, Sparse Models
BASE Layers: Simplifying Training of Large, Sparse Models
M. Lewis
Shruti Bhosale
Tim Dettmers
Naman Goyal
Luke Zettlemoyer
MoE
208
285
0
30 Mar 2021
Automatic Graph Partitioning for Very Large-scale Deep Learning
Automatic Graph Partitioning for Very Large-scale Deep Learning
Masahiro Tanaka
Kenjiro Taura
T. Hanawa
Kentaro Torisawa
GNNAI4CE
56
21
0
30 Mar 2021
Domain-robust VQA with diverse datasets and methods but no target labels
Domain-robust VQA with diverse datasets and methods but no target labels
Ruotong Wang
Tristan D. Maidment
Ahmad Diab
Adriana Kovashka
R. Hwa
OOD
129
23
0
29 Mar 2021
ViViT: A Video Vision Transformer
ViViT: A Video Vision Transformer
Anurag Arnab
Mostafa Dehghani
G. Heigold
Chen Sun
Mario Lucic
Cordelia Schmid
ViT
240
2,175
0
29 Mar 2021
Generic Attention-model Explainability for Interpreting Bi-Modal and
  Encoder-Decoder Transformers
Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers
Hila Chefer
Shir Gur
Lior Wolf
ViT
100
326
0
29 Mar 2021
Multi-Scale Vision Longformer: A New Vision Transformer for
  High-Resolution Image Encoding
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
Pengchuan Zhang
Xiyang Dai
Jianwei Yang
Bin Xiao
Lu Yuan
Lei Zhang
Jianfeng Gao
ViT
113
337
0
29 Mar 2021
One Network Fits All? Modular versus Monolithic Task Formulations in
  Neural Networks
One Network Fits All? Modular versus Monolithic Task Formulations in Neural Networks
Atish Agarwala
Abhimanyu Das
Brendan Juba
Rina Panigrahy
Vatsal Sharan
Xin Wang
Qiuyi Zhang
MoMe
41
11
0
29 Mar 2021
Alignment of Language Agents
Alignment of Language Agents
Zachary Kenton
Tom Everitt
Laura Weidinger
Iason Gabriel
Vladimir Mikulik
G. Irving
85
166
0
26 Mar 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
517
21,773
0
25 Mar 2021
FastMoE: A Fast Mixture-of-Expert Training System
FastMoE: A Fast Mixture-of-Expert Training System
Jiaao He
J. Qiu
Aohan Zeng
Zhilin Yang
Jidong Zhai
Jie Tang
ALMMoE
109
104
0
24 Mar 2021
Finetuning Pretrained Transformers into RNNs
Finetuning Pretrained Transformers into RNNs
Jungo Kasai
Hao Peng
Yizhe Zhang
Dani Yogatama
Gabriel Ilharco
Nikolaos Pappas
Yi Mao
Weizhu Chen
Noah A. Smith
112
67
0
24 Mar 2021
Czert -- Czech BERT-like Model for Language Representation
Czert -- Czech BERT-like Model for Language Representation
Jakub Sido
O. Pražák
P. Pribán
Jan Pasek
Michal Seják
Miloslav Konopík
73
44
0
24 Mar 2021
UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New
  Multitask Benchmark
UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark
Nicholas Lourie
Ronan Le Bras
Chandra Bhagavatula
Yejin Choi
LRM
105
140
0
24 Mar 2021
NaturalProofs: Mathematical Theorem Proving in Natural Language
NaturalProofs: Mathematical Theorem Proving in Natural Language
Sean Welleck
Jiacheng Liu
Ronan Le Bras
Hannaneh Hajishirzi
Yejin Choi
Kyunghyun Cho
AIMat
91
69
0
24 Mar 2021
The NLP Cookbook: Modern Recipes for Transformer based Deep Learning
  Architectures
The NLP Cookbook: Modern Recipes for Transformer based Deep Learning Architectures
Sushant Singh
A. Mahmood
AI4TS
111
95
0
23 Mar 2021
A Pseudo-Metric between Probability Distributions based on Depth-Trimmed
  Regions
A Pseudo-Metric between Probability Distributions based on Depth-Trimmed Regions
Guillaume Staerman
Pavlo Mozharovskyi
Pierre Colombo
Stéphan Clémenccon
Florence dÁlché-Buc
OOD
573
17
0
23 Mar 2021
Previous
123...184185186...196197198
Next