Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,877 papers shown
Title
Consistency of Implicit and Explicit Features Matters for Monocular 3D Object Detection
Qian Ye
L. Jiang
Wang Zhen
Yuyang Du
49
6
0
16 Jul 2022
A No-Code Low-Code Paradigm for Authoring Business Automations Using Natural Language
Michael Desmond
Evelyn Duesterwald
Vatche Isahagian
Vinod Muthusamy
60
4
0
15 Jul 2022
Plex: Towards Reliability using Pretrained Large Model Extensions
Dustin Tran
J. Liu
Michael W. Dusenberry
Du Phan
Mark Collier
...
D. Sculley
Y. Gal
Zoubin Ghahramani
Jasper Snoek
Balaji Lakshminarayanan
VLM
140
126
0
15 Jul 2022
Session-based Cyberbullying Detection in Social Media: A Survey
Peiling Yi
A. Zubiaga
59
54
0
14 Jul 2022
Convolutional Bypasses Are Better Vision Transformer Adapters
Shibo Jie
Zhi-Hong Deng
VPVLM
91
137
0
14 Jul 2022
Language Modelling with Pixels
Phillip Rust
Jonas F. Lotz
Emanuele Bugliarello
Elizabeth Salesky
Miryam de Lhoneux
Desmond Elliott
VLM
107
46
0
14 Jul 2022
Distance Learner: Incorporating Manifold Prior to Model Training
Aditya Chetan
Nipun Kwatra
31
1
0
14 Jul 2022
Neural Data-to-Text Generation Based on Small Datasets: Comparing the Added Value of Two Semi-Supervised Learning Approaches on Top of a Large Language Model
Chris van der Lee
Thiago Castro Ferreira
Chris Emmery
Travis J. Wiltshire
Emiel Krahmer
78
2
0
14 Jul 2022
BERTIN: Efficient Pre-Training of a Spanish Language Model using Perplexity Sampling
Javier de la Rosa
E. G. Ponferrada
Paulo Villegas
Pablo González de Prado Salas
Manu Romero
María Grandury
74
96
0
14 Jul 2022
N-Grammer: Augmenting Transformers with latent n-grams
Aurko Roy
Rohan Anil
Guangda Lai
Benjamin Lee
Jeffrey Zhao
...
Yu
Phuong Dao
Christopher Fifty
Zhiwen Chen
Yonghui Wu
77
8
0
13 Jul 2022
Re2G: Retrieve, Rerank, Generate
Michael R. Glass
Gaetano Rossiello
Md. Faisal Mahbub Chowdhury
Ankita Rajaram Naik
Pengshan Cai
A. Gliozzo
RALM
89
96
0
13 Jul 2022
Does GNN Pretraining Help Molecular Representation?
Ruoxi Sun
Hanjun Dai
Adams Wei Yu
SSL
AI4CE
GNN
65
75
0
13 Jul 2022
DocPrompting: Generating Code by Retrieving the Docs
Shuyan Zhou
Uri Alon
Frank F. Xu
Zhiruo Wang
Zhengbao Jiang
Graham Neubig
LLMAG
106
141
0
13 Jul 2022
PLM-ICD: Automatic ICD Coding with Pretrained Language Models
Chao-Wei Huang
Shang-Chi Tsai
Yun-Nung Chen
90
51
0
12 Jul 2022
Effective Few-Shot Named Entity Linking by Meta-Learning
Xiuxing Li
Zhenyu Li
Zhengyan Zhang
Ning Liu
Haitao Yuan
Wei Zhang
Zhiyuan Liu
Jianyong Wang
OffRL
107
11
0
12 Jul 2022
Towards Neural Numeric-To-Text Generation From Temporal Personal Health Data
Jon Harris
Mohammed J Zaki
AI4TS
65
2
0
11 Jul 2022
Embedding Recycling for Language Models
Jon Saad-Falcon
Amanpreet Singh
Luca Soldaini
Mike DÁrcy
Arman Cohan
Doug Downey
KELM
63
4
0
11 Jul 2022
Exploring Length Generalization in Large Language Models
Cem Anil
Yuhuai Wu
Anders Andreassen
Aitor Lewkowycz
Vedant Misra
V. Ramasesh
Ambrose Slone
Guy Gur-Ari
Ethan Dyer
Behnam Neyshabur
ReLM
LRM
111
170
0
11 Jul 2022
Multi-level Fusion of Wav2vec 2.0 and BERT for Multimodal Emotion Recognition
Zihan Zhao
Yanfeng Wang
Yu Wang
57
34
0
11 Jul 2022
Few-shot training LLMs for project-specific code-summarization
Toufique Ahmed
Prem Devanbu
235
241
0
09 Jul 2022
TalkToModel: Explaining Machine Learning Models with Interactive Natural Language Conversations
Dylan Slack
Satyapriya Krishna
Himabindu Lakkaraju
Sameer Singh
84
84
0
08 Jul 2022
The Harvard USPTO Patent Dataset: A Large-Scale, Well-Structured, and Multi-Purpose Corpus of Patent Applications
Mirac Suzgun
Luke Melas-Kyriazi
Suproteem K. Sarkar
S. Kominers
Stuart M. Shieber
117
29
0
08 Jul 2022
Beyond Transfer Learning: Co-finetuning for Action Localisation
Anurag Arnab
Xuehan Xiong
A. Gritsenko
Rob Romijnders
Josip Djolonga
Mostafa Dehghani
Chen Sun
Mario Lucic
Cordelia Schmid
87
8
0
08 Jul 2022
Meta-Learning the Difference: Preparing Large Language Models for Efficient Adaptation
Zejiang Hou
Julian Salazar
George Polovets
79
15
0
07 Jul 2022
Training Transformers Together
Alexander Borzunov
Max Ryabinin
Tim Dettmers
Quentin Lhoest
Lucile Saulnier
Michael Diskin
Yacine Jernite
Thomas Wolf
ViT
63
10
0
07 Jul 2022
Rethinking the Value of Gazetteer in Chinese Named Entity Recognition
Qianglong Chen
Xiangji Zeng
Jiangang Zhu
Yin Zhang
Bojia Lin
Yang Yang
Daxin Jiang
62
2
0
06 Jul 2022
Transformers are Adaptable Task Planners
Vidhi Jain
Yixin Lin
Eric Undersander
Yonatan Bisk
Akshara Rai
113
24
0
06 Jul 2022
Cross-Lingual QA as a Stepping Stone for Monolingual Open QA in Icelandic
Vésteinn Snæbjarnarson
H. Einarsson
59
6
0
05 Jul 2022
Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation
Bin Li
Yixuan Weng
Ziyu Ma
Bin Sun
Shutao Li
VLM
34
2
0
05 Jul 2022
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning
Hung Le
Yue Wang
Akhilesh Deepak Gotmare
Silvio Savarese
Guosheng Lin
SyDa
ALM
227
273
0
05 Jul 2022
Probing via Prompting
Jiaoda Li
Ryan Cotterell
Mrinmaya Sachan
109
13
0
04 Jul 2022
Masked Autoencoders in 3D Point Cloud Representation Learning
Jincen Jiang
Xuequan Lu
Lizhi Zhao
Richard Dazeley
Meili Wang
3DPC
ViT
144
29
0
04 Jul 2022
An Empirical Survey on Long Document Summarization: Datasets, Models and Metrics
Huan Yee Koh
Jiaxin Ju
Ming Liu
Shirui Pan
149
128
0
03 Jul 2022
Generating Repetitions with Appropriate Repeated Words
Toshiki Kawamoto
Hidetaka Kamigaito
Kotaro Funakoshi
Manabu Okumura
47
3
0
03 Jul 2022
FRAME: Evaluating Rationale-Label Consistency Metrics for Free-Text Rationales
Aaron Chan
Shaoliang Nie
Liang Tan
Xiaochang Peng
Hamed Firooz
Maziar Sanjabi
Xiang Ren
118
10
0
02 Jul 2022
Can Language Models Make Fun? A Case Study in Chinese Comical Crosstalk
Benyou Wang
Xiang Wu
Xiaokang Liu
Jianquan Li
Prayag Tiwari
Qianqian Xie
61
6
0
02 Jul 2022
The Parallelism Tradeoff: Limitations of Log-Precision Transformers
William Merrill
Ashish Sabharwal
108
116
0
02 Jul 2022
Can we learn from developer mistakes? Learning to localize and repair real bugs from real bug fixes
Cedric Richter
Heike Wehrheim
53
8
0
01 Jul 2022
Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset
Peter Henderson
M. Krass
Lucia Zheng
Neel Guha
Christopher D. Manning
Dan Jurafsky
Daniel E. Ho
AILaw
ELM
231
103
0
01 Jul 2022
e-CLIP: Large-Scale Vision-Language Representation Learning in E-commerce
Wonyoung Shin
Jonghun Park
Taekang Woo
Yongwoo Cho
Kwangjin Oh
Hwanjun Song
VLM
125
17
0
01 Jul 2022
Measuring Forgetting of Memorized Training Examples
Matthew Jagielski
Om Thakkar
Florian Tramèr
Daphne Ippolito
Katherine Lee
...
Eric Wallace
Shuang Song
Abhradeep Thakurta
Nicolas Papernot
Chiyuan Zhang
TDI
156
111
0
30 Jun 2022
Forecasting Future World Events with Neural Networks
Andy Zou
Tristan Xiao
Ryan Jia
Joe Kwon
Mantas Mazeika
Richard Li
Dawn Song
Jacob Steinhardt
Owain Evans
Dan Hendrycks
108
27
0
30 Jun 2022
FL-Tuning: Layer Tuning for Feed-Forward Network in Transformer
Jingping Liu
Yuqiu Song
Kui Xue
Hongli Sun
Chao Wang
Lihan Chen
Haiyun Jiang
Jiaqing Liang
Tong Ruan
74
2
0
30 Jun 2022
esCorpius: A Massive Spanish Crawling Corpus
Asier Gutiérrez-Fandiño
David Pérez-Fernández
Jordi Armengol-Estapé
D. Griol
Z. Callejas
102
2
0
30 Jun 2022
BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing
Jason Alan Fries
Leon Weber
Natasha Seelam
Gabriel Altay
Debajyoti Datta
...
Minh Chien Vu
Trishala Neeraj
Jonas Golde
Albert Villanova del Moral
Benjamin Beilharz
LM&MA
151
49
0
30 Jun 2022
Compressing Pre-trained Transformers via Low-Bit NxM Sparsity for Natural Language Understanding
Connor Holmes
Minjia Zhang
Yuxiong He
Bo Wu
59
3
0
30 Jun 2022
GPTs at Factify 2022: Prompt Aided Fact-Verification
Pawan Kumar Sahu
Saksham Aggarwal
Taneesh Gupta
Gyanendra Das
67
1
0
29 Jun 2022
Solving Quantitative Reasoning Problems with Language Models
Aitor Lewkowycz
Anders Andreassen
David Dohan
Ethan Dyer
Henryk Michalewski
...
Theo Gutman-Solo
Yuhuai Wu
Behnam Neyshabur
Guy Gur-Ari
Vedant Misra
ReLM
ELM
LRM
227
865
0
29 Jun 2022
On the Robustness of Dialogue History Representation in Conversational Question Answering: A Comprehensive Study and a New Prompt-based Method
Zorik Gekhman
Nadav Oved
Orgad Keller
Idan Szpektor
Roi Reichart
71
8
0
29 Jun 2022
Trial2Vec: Zero-Shot Clinical Trial Document Similarity Search using Self-Supervision
Zifeng Wang
Jimeng Sun
94
25
0
29 Jun 2022
Previous
1
2
3
...
155
156
157
...
196
197
198
Next