Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,891 papers shown
Title
PromptCast: A New Prompt-based Learning Paradigm for Time Series Forecasting
Hao Xue
Flora D.Salim
AI4TS
144
165
0
20 Sep 2022
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering
Pan Lu
Swaroop Mishra
Tony Xia
Liang Qiu
Kai-Wei Chang
Song-Chun Zhu
Oyvind Tafjord
Peter Clark
Ashwin Kalyan
ELM
ReLM
LRM
299
1,301
0
20 Sep 2022
A Few-shot Approach to Resume Information Extraction via Prompts
Chengguang Gan
Tatsunori Mori
41
10
0
20 Sep 2022
Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models
Zichun Yu
Tianyu Gao
Zhengyan Zhang
Yankai Lin
Zhiyuan Liu
Maosong Sun
Jie Zhou
VLM
LRM
51
1
0
20 Sep 2022
NL2INTERFACE: Interactive Visualization Interface Generation from Natural Language Queries
Yiru Chen
Ryan Li
Austin Mac
Tianbao Xie
Tao Yu
Eugene Wu
73
12
0
19 Sep 2022
Autoregressive Entity Generation for End-to-End Task-Oriented Dialog
Guanhuan Huang
Xiaojun Quan
Qifan Wang
RALM
55
18
0
19 Sep 2022
Automated MeSH Term Suggestion for Effective Query Formulation in Systematic Reviews Literature Search
Shuai Wang
Harrisen Scells
Bevan Koopman
Guido Zuccon
AI4CE
73
18
0
19 Sep 2022
APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations
Katherine Atwell
Sabit Hassan
Malihe Alikhani
89
31
0
17 Sep 2022
SF-DST: Few-Shot Self-Feeding Reading Comprehension Dialogue State Tracking with Auxiliary Task
Jihyun Lee
G. G. Lee
77
2
0
16 Sep 2022
Stateful Memory-Augmented Transformers for Efficient Dialogue Modeling
Qingyang Wu
Zhou Yu
RALM
29
0
0
15 Sep 2022
TwHIN-BERT: A Socially-Enriched Pre-trained Language Model for Multilingual Tweet Representations at Twitter
Xinyang Zhang
Yury Malkov
Omar U. Florez
Serim Park
Brian McWilliams
Jiawei Han
Ahmed El-Kishky
VLM
109
94
0
15 Sep 2022
Unsupervised Opinion Summarization Using Approximate Geodesics
Somnath Basu Roy Chowdhury
Nicholas Monath
Kumar Avinava Dubey
Amr Ahmed
Snigdha Chaturvedi
78
7
0
15 Sep 2022
A Continual Development Methodology for Large-scale Multitask Dynamic ML Systems
Andrea Gesmundo
61
18
0
15 Sep 2022
Knowledge Is Flat: A Seq2Seq Generative Framework for Various Knowledge Graph Completion
Chen Chen
Yufei Wang
Bing Li
Kwok-Yan Lam
70
32
0
15 Sep 2022
Linear Transformations for Cross-lingual Sentiment Analysis
Pavel Přibáň
Jakub Šmíd
Adam Mištera
Pavel Král
79
3
0
15 Sep 2022
Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models
Chen Henry Wu
Saman Motamed
Shaunak Srivastava
Fernando de la Torre
VLM
DiffM
72
35
0
14 Sep 2022
Out of One, Many: Using Language Models to Simulate Human Samples
Lisa P. Argyle
Ethan C. Busby
Nancy Fulda
Joshua R Gubler
Christopher Rytting
David Wingate
SyDa
105
617
0
14 Sep 2022
PaLI: A Jointly-Scaled Multilingual Language-Image Model
Xi Chen
Tianlin Li
Soravit Changpinyo
A. Piergiovanni
Piotr Padlewski
...
Andreas Steiner
A. Angelova
Xiaohua Zhai
N. Houlsby
Radu Soricut
MLLM
VLM
208
742
0
14 Sep 2022
vec2text with Round-Trip Translations
Geoffrey Cideron
Sertan Girgin
Anton Raichuk
Olivier Pietquin
Olivier Bachem
Léonard Hussenot
91
3
0
14 Sep 2022
MUST-VQA: MUltilingual Scene-text VQA
Emanuele Vivoli
Ali Furkan Biten
Andrés Mafla
Dimosthenis Karatzas
Lluís Gómez
113
6
0
14 Sep 2022
SPACE-3: Unified Dialog Model Pre-training for Task-Oriented Dialog Understanding and Generation
Wanwei He
Yinpei Dai
Min Yang
Jian Sun
Fei Huang
Luo Si
Yongbin Li
81
62
0
14 Sep 2022
CoHS-CQG: Context and History Selection for Conversational Question Generation
Do Xuan Long
Bowei Zou
Liangming Pan
Nancy F. Chen
Shafiq Joty
Ai Ti Aw
SLR
96
10
0
14 Sep 2022
Pre-training for Information Retrieval: Are Hyperlinks Fully Explored?
Jiawen Wu
Xinyu Zhang
Yutao Zhu
Zheng Liu
Zikai Guo
Zhaoye Fei
Ruofei Lai
Yongkang Wu
Bo Zhao
Zhicheng Dou
82
5
0
14 Sep 2022
SUN: Exploring Intrinsic Uncertainties in Text-to-SQL Parsers
Bowen Qin
Lihan Wang
Binyuan Hui
Bowen Li
Xiangpeng Wei
Binhua Li
Fei Huang
Luo Si
Min Yang
Yongbin Li
94
9
0
14 Sep 2022
Language Chameleon: Transformation analysis between languages using Cross-lingual Post-training based on Pre-trained language models
Suhyune Son
Chanjun Park
Jungseob Lee
Midan Shim
Chanhee Lee
Yoonna Jang
Jaehyung Seo
Heu-Jeoung Lim
73
0
0
14 Sep 2022
Do Androids Laugh at Electric Sheep? Humor "Understanding" Benchmarks from The New Yorker Caption Contest
Jack Hessel
Ana Marasović
Jena D. Hwang
Lillian Lee
Jeff Da
Rowan Zellers
Robert Mankoff
Yejin Choi
VLM
112
91
0
13 Sep 2022
PANCETTA: Phoneme Aware Neural Completion to Elicit Tongue Twisters Automatically
Sedrick Scott Keh
Steven Y. Feng
Varun Gangal
Malihe Alikhani
Eduard H. Hovy
59
4
0
13 Sep 2022
Don't Judge a Language Model by Its Last Layer: Contrastive Learning with Layer-Wise Attention Pooling
Dongsuk Oh
Yejin Kim
Hodong Lee
Huimin Huang
Heu-Jeoung Lim
90
10
0
13 Sep 2022
Socially Enhanced Situation Awareness from Microblogs using Artificial Intelligence: A Survey
Rabindra Lamsal
Aaron Harwood
M. Read
101
21
0
13 Sep 2022
An Embedding-Based Grocery Search Model at Instacart
Yuqing Xie
Taesik Na
X. Xiao
Saurav Manchanda
Young Rao
Zhihong Xu
Guanghua Shu
Esther Vasiete
Tejaswi Tenneti
Haixun Wang
DML
RALM
58
6
0
12 Sep 2022
PreSTU: Pre-Training for Scene-Text Understanding
Jihyung Kil
Soravit Changpinyo
Xi Chen
Hexiang Hu
Sebastian Goodman
Wei-Lun Chao
Radu Soricut
VLM
191
29
0
12 Sep 2022
Factual and Informative Review Generation for Explainable Recommendation
Zhouhang Xie
Sameer Singh
Julian McAuley
Bodhisattwa Prasad Majumder
106
26
0
12 Sep 2022
CSL: A Large-scale Chinese Scientific Literature Dataset
Yudong Li
Yuqing Zhang
Zhe Zhao
Lin-cheng Shen
Weijie Liu
Weiquan Mao
Hui Zhang
AILaw
202
52
0
12 Sep 2022
Knowledge Base Question Answering: A Semantic Parsing Perspective
Yu Gu
Vardaan Pahuja
Gong Cheng
Yu-Chuan Su
120
29
0
12 Sep 2022
Chain of Explanation: New Prompting Method to Generate Higher Quality Natural Language Explanation for Implicit Hate Speech
Fan Huang
Haewoon Kwak
Jisun An
LRM
103
24
0
11 Sep 2022
Code Compliance Assessment as a Learning Problem
Neela Sawant
Srinivasan H. Sengamedu
65
1
0
10 Sep 2022
Pre-training image-language transformers for open-vocabulary tasks
A. Piergiovanni
Weicheng Kuo
A. Angelova
VLM
ViT
119
10
0
09 Sep 2022
Ranking-Enhanced Unsupervised Sentence Representation Learning
Yeon Seonwoo
Guoyin Wang
Changmin Seo
Sajal Choudhary
Jiwei Li
Xiang Li
Puyang Xu
Sunghyun Park
Alice Oh
SSL
DRL
AI4TS
90
16
0
09 Sep 2022
Pre-Training a Graph Recurrent Network for Language Representation
Yile Wang
Linyi Yang
Zhiyang Teng
M. Zhou
Yue Zhang
GNN
81
1
0
08 Sep 2022
Applying Transformer-based Text Summarization for Keyphrase Generation
Anna Glazkova
Dmitry A. Morozov
61
17
0
08 Sep 2022
Towards explainable evaluation of language models on the semantic similarity of visual concepts
Maria Lymperaiou
George Manoliadis
Orfeas Menis Mastromichalakis
Edmund Dervakos
Giorgos Stamou
AAML
73
5
0
08 Sep 2022
FETA: Towards Specializing Foundation Models for Expert Task Applications
Amit Alfassy
Assaf Arbelle
Oshri Halimi
Sivan Harary
Roei Herzig
...
Christoph Auer
Kate Saenko
Peter W. J. Staar
Rogerio Feris
Leonid Karlinsky
90
20
0
08 Sep 2022
AudioLM: a Language Modeling Approach to Audio Generation
Zalan Borsos
Raphaël Marinier
Damien Vincent
Eugene Kharitonov
Olivier Pietquin
...
Dominik Roblek
O. Teboul
David Grangier
Marco Tagliasacchi
Neil Zeghidour
AuLLM
163
616
0
07 Sep 2022
The BLue Amazon Brain (BLAB): A Modular Architecture of Services about the Brazilian Maritime Territory
Paulo Pirozelli
Ais B. R. Castro
Ana Luiza C. de Oliveira
A. Oliveira
Flávio N. Caccao
...
A. H. R. Costa
A. Brandão
Denis Deratani Mauá
Fabio Gagliardi Cozman
S. M. Peres
64
2
0
06 Sep 2022
Analyzing Transformers in Embedding Space
Guy Dar
Mor Geva
Ankit Gupta
Jonathan Berant
83
93
0
06 Sep 2022
Transfer Learning of Lexical Semantic Families for Argumentative Discourse Units Identification
João Rodrigues
Ruben Branco
António Branco
60
0
0
06 Sep 2022
Mimose: An Input-Aware Checkpointing Planner for Efficient Training on GPU
Jian-He Liao
Mingzhen Li
Qingxiao Sun
Jiwei Hao
F. Yu
...
Ye Tao
Zicheng Zhang
Hailong Yang
Zhongzhi Luan
D. Qian
76
4
0
06 Sep 2022
External Knowledge Selection with Weighted Negative Sampling in Knowledge-grounded Task-oriented Dialogue Systems
Janghoon Han
Joongbo Shin
Hosung Song
Hyunjik Jo
Gyeonghun Kim
Yireun Kim
Stanley Jungkyu Choi
48
4
0
06 Sep 2022
Multi-Figurative Language Generation
Huiyuan Lai
Malvina Nissim
54
1
0
05 Sep 2022
A Review of Sparse Expert Models in Deep Learning
W. Fedus
J. Dean
Barret Zoph
MoE
129
155
0
04 Sep 2022
Previous
1
2
3
...
152
153
154
...
196
197
198
Next