Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,870 papers shown
Title
RefSum: Refactoring Neural Summarization
Yixin Liu
Zi-Yi Dou
Pengfei Liu
65
40
0
15 Apr 2021
Ultra-High Dimensional Sparse Representations with Binarization for Efficient Text Retrieval
Kyoung-Rok Jang
Junmo Kang
Giwon Hong
Sung-Hyon Myaeng
Joohee Park
Taewon Yoon
Heecheol Seo
83
20
0
15 Apr 2021
TSDAE: Using Transformer-based Sequential Denoising Auto-Encoder for Unsupervised Sentence Embedding Learning
Kexin Wang
Nils Reimers
Iryna Gurevych
149
189
0
14 Apr 2021
K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce
Song Xu
Haoran Li
Peng Yuan
Yujia Wang
Youzheng Wu
Xiaodong He
Ying Liu
Bowen Zhou
KELM
91
24
0
14 Apr 2021
Knowledge-driven Answer Generation for Conversational Search
Mariana Leite
Rafael Ferreira
David Semedo
João Magalhães
RALM
KELM
76
1
0
14 Apr 2021
Ask what's missing and what's useful: Improving Clarification Question Generation using Global Knowledge
Bodhisattwa Prasad Majumder
Sudha Rao
Michel Galley
Julian McAuley
61
50
0
14 Apr 2021
Towards BERT-based Automatic ICD Coding: Limitations and Opportunities
Damian Pascual
Sandro Luck
Roger Wattenhofer
MedIm
73
55
0
14 Apr 2021
BERT Embeddings Can Track Context in Conversational Search
Rafael Ferreira
David Semedo
João Magalhães
AI4TS
47
0
0
13 Apr 2021
MS2: Multi-Document Summarization of Medical Studies
Jay DeYoung
Iz Beltagy
Madeleine van Zuylen
Bailey Kuehl
Lucy Lu Wang
91
113
0
13 Apr 2021
QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering
Michihiro Yasunaga
Hongyu Ren
Antoine Bosselut
Percy Liang
J. Leskovec
RALM
LMTD
AI4MH
LRM
92
599
0
13 Apr 2021
CLEVR_HYP: A Challenge Dataset and Baselines for Visual Question Answering with Hypothetical Actions over Images
Shailaja Keyur Sampat
Akshay Kumar
Yezhou Yang
Chitta Baral
78
26
0
13 Apr 2021
Document-Level Event Argument Extraction by Conditional Generation
Sha Li
Heng Ji
Jiawei Han
70
305
0
13 Apr 2021
Discourse Probing of Pretrained Language Models
Fajri Koto
Jey Han Lau
Tim Baldwin
78
53
0
13 Apr 2021
Relational World Knowledge Representation in Contextual Language Models: A Review
Tara Safavi
Danai Koutra
KELM
100
51
0
12 Apr 2021
FUDGE: Controlled Text Generation With Future Discriminators
Kevin Kaichuang Yang
Dan Klein
109
337
0
12 Apr 2021
UTNLP at SemEval-2021 Task 5: A Comparative Analysis of Toxic Span Detection using Attention-based, Named Entity Recognition, and Ensemble Models
Alireza Salemi
Nazanin Sabri
Emad Kebriaei
B. Bahrak
A. Shakery
40
3
0
10 Apr 2021
Fool Me Twice: Entailment from Wikipedia Gamification
Julian Martin Eisenschlos
Bhuwan Dhingra
Jannis Bulian
Benjamin Borschinger
Jordan L. Boyd-Graber
93
48
0
10 Apr 2021
Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections
Ruiqi Zhong
Kristy Lee
Zheng Zhang
Dan Klein
158
173
0
10 Apr 2021
Efficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LM
Deepak Narayanan
Mohammad Shoeybi
Jared Casper
P. LeGresley
M. Patwary
...
Prethvi Kashinkunti
J. Bernauer
Bryan Catanzaro
Amar Phanishayee
Matei A. Zaharia
MoE
197
712
0
09 Apr 2021
KI-BERT: Infusing Knowledge Context for Better Language and Domain Understanding
Keyur Faldu
A. Sheth
Prashant Kikani
Hemang Akabari
69
28
0
09 Apr 2021
FIBER: Fill-in-the-Blanks as a Challenging Video Understanding Evaluation Framework
Santiago Castro
Ruoyao Wang
Pingxuan Huang
Ian Stewart
Oana Ignat
Nan Liu
Jonathan C. Stroud
Rada Mihalcea
AIMat
89
11
0
09 Apr 2021
Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning
Soheyla Amirian
Khaled Rasheed
T. Taha
H. Arabnia
VLM
VGen
49
23
0
07 Apr 2021
Creativity and Machine Learning: A Survey
Giorgio Franceschelli
Mirco Musolesi
VLM
AI4CE
129
43
0
06 Apr 2021
CodeTrans: Towards Cracking the Language of Silicon's Code Through Self-Supervised Deep Learning and High Performance Computing
Ahmed Elnaggar
Wei Ding
Llion Jones
Tom Gibbs
Tamas B. Fehér
Christoph Angerer
Silvia Severini
Florian Matthes
B. Rost
72
72
0
06 Apr 2021
What Will it Take to Fix Benchmarking in Natural Language Understanding?
Samuel R. Bowman
George E. Dahl
ELM
ALM
76
164
0
05 Apr 2021
ASER: Towards Large-scale Commonsense Knowledge Acquisition via Higher-order Selectional Preference over Eventualities
Hongming Zhang
Xin Liu
Haojie Pan
Hao Ke
Jiefu Ou
Tianqing Fang
Yangqiu Song
77
48
0
05 Apr 2021
Compressing Visual-linguistic Model via Knowledge Distillation
Zhiyuan Fang
Jianfeng Wang
Xiaowei Hu
Lijuan Wang
Yezhou Yang
Zicheng Liu
VLM
116
99
0
05 Apr 2021
Inference Time Style Control for Summarization
Shuyang Cao
Lu Wang
AI4TS
77
16
0
05 Apr 2021
MCL@IITK at SemEval-2021 Task 2: Multilingual and Cross-lingual Word-in-Context Disambiguation using Augmented Data, Signals, and Transformers
Rohan Gupta
Jay Mundra
Deepak Mahajan
Ashutosh Modi
43
3
0
04 Apr 2021
IndT5: A Text-to-Text Transformer for 10 Indigenous Languages
El Moatez Billah Nagoudi
Wei-Rui Chen
Muhammad Abdul-Mageed
H. Cavusoglu
85
24
0
04 Apr 2021
Humor@IITK at SemEval-2021 Task 7: Large Language Models for Quantifying Humor and Offensiveness
Aishwarya Gupta
Avik Pal
Bholeshwar Khurana
Lakshay Tyagi
Ashutosh Modi
53
6
0
02 Apr 2021
Towards General Purpose Vision Systems
Tanmay Gupta
Amita Kamath
Aniruddha Kembhavi
Derek Hoiem
100
53
0
01 Apr 2021
FeTaQA: Free-form Table Question Answering
Linyong Nan
Chia-Hsuan Hsieh
Ziming Mao
Xi Lin
Neha Verma
...
Isabel Trindade
Renusree Bandaru
Jacob Cunningham
Caiming Xiong
Dragomir R. Radev
LMTD
152
167
0
01 Apr 2021
HAConvGNN: Hierarchical Attention Based Convolutional Graph Neural Network for Code Documentation Generation in Jupyter Notebooks
Xuye Liu
Dakuo Wang
A. Wang
Yufang Hou
Lingfei Wu
66
22
0
31 Mar 2021
BASE Layers: Simplifying Training of Large, Sparse Models
M. Lewis
Shruti Bhosale
Tim Dettmers
Naman Goyal
Luke Zettlemoyer
MoE
208
285
0
30 Mar 2021
Automatic Graph Partitioning for Very Large-scale Deep Learning
Masahiro Tanaka
Kenjiro Taura
T. Hanawa
Kentaro Torisawa
GNN
AI4CE
56
21
0
30 Mar 2021
Domain-robust VQA with diverse datasets and methods but no target labels
Ruotong Wang
Tristan D. Maidment
Ahmad Diab
Adriana Kovashka
R. Hwa
OOD
129
23
0
29 Mar 2021
ViViT: A Video Vision Transformer
Anurag Arnab
Mostafa Dehghani
G. Heigold
Chen Sun
Mario Lucic
Cordelia Schmid
ViT
240
2,175
0
29 Mar 2021
Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers
Hila Chefer
Shir Gur
Lior Wolf
ViT
100
326
0
29 Mar 2021
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
Pengchuan Zhang
Xiyang Dai
Jianwei Yang
Bin Xiao
Lu Yuan
Lei Zhang
Jianfeng Gao
ViT
113
337
0
29 Mar 2021
One Network Fits All? Modular versus Monolithic Task Formulations in Neural Networks
Atish Agarwala
Abhimanyu Das
Brendan Juba
Rina Panigrahy
Vatsal Sharan
Xin Wang
Qiuyi Zhang
MoMe
41
11
0
29 Mar 2021
Alignment of Language Agents
Zachary Kenton
Tom Everitt
Laura Weidinger
Iason Gabriel
Vladimir Mikulik
G. Irving
85
166
0
26 Mar 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
517
21,773
0
25 Mar 2021
FastMoE: A Fast Mixture-of-Expert Training System
Jiaao He
J. Qiu
Aohan Zeng
Zhilin Yang
Jidong Zhai
Jie Tang
ALM
MoE
109
104
0
24 Mar 2021
Finetuning Pretrained Transformers into RNNs
Jungo Kasai
Hao Peng
Yizhe Zhang
Dani Yogatama
Gabriel Ilharco
Nikolaos Pappas
Yi Mao
Weizhu Chen
Noah A. Smith
112
67
0
24 Mar 2021
Czert -- Czech BERT-like Model for Language Representation
Jakub Sido
O. Pražák
P. Pribán
Jan Pasek
Michal Seják
Miloslav Konopík
73
44
0
24 Mar 2021
UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark
Nicholas Lourie
Ronan Le Bras
Chandra Bhagavatula
Yejin Choi
LRM
105
140
0
24 Mar 2021
NaturalProofs: Mathematical Theorem Proving in Natural Language
Sean Welleck
Jiacheng Liu
Ronan Le Bras
Hannaneh Hajishirzi
Yejin Choi
Kyunghyun Cho
AIMat
91
69
0
24 Mar 2021
The NLP Cookbook: Modern Recipes for Transformer based Deep Learning Architectures
Sushant Singh
A. Mahmood
AI4TS
111
95
0
23 Mar 2021
A Pseudo-Metric between Probability Distributions based on Depth-Trimmed Regions
Guillaume Staerman
Pavlo Mozharovskyi
Pierre Colombo
Stéphan Clémenccon
Florence dÁlché-Buc
OOD
573
17
0
23 Mar 2021
Previous
1
2
3
...
184
185
186
...
196
197
198
Next