arXiv:1910.10683

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,877 papers shown
Title
Consistency of Implicit and Explicit Features Matters for Monocular 3D Object Detection
Qian Ye
L. Jiang
Wang Zhen
Yuyang Du
49
6
0
16 Jul 2022
A No-Code Low-Code Paradigm for Authoring Business Automations Using Natural Language
Michael Desmond
Evelyn Duesterwald
Vatche Isahagian
Vinod Muthusamy
60
4
0
15 Jul 2022
Plex: Towards Reliability using Pretrained Large Model Extensions
Dustin Tran
J. Liu
Michael W. Dusenberry
Du Phan
Mark Collier
...
D. Sculley
Y. Gal
Zoubin Ghahramani
Jasper Snoek
Balaji Lakshminarayanan
VLM
140
126
0
15 Jul 2022
Session-based Cyberbullying Detection in Social Media: A Survey
Peiling Yi
A. Zubiaga
59
54
0
14 Jul 2022
Convolutional Bypasses Are Better Vision Transformer Adapters
Shibo Jie
Zhi-Hong Deng
VPVLM
91
137
0
14 Jul 2022
Language Modelling with Pixels
Phillip Rust
Jonas F. Lotz
Emanuele Bugliarello
Elizabeth Salesky
Miryam de Lhoneux
Desmond Elliott
VLM
107
46
0
14 Jul 2022
Distance Learner: Incorporating Manifold Prior to Model Training
Aditya Chetan
Nipun Kwatra
31
1
0
14 Jul 2022
Neural Data-to-Text Generation Based on Small Datasets: Comparing the Added Value of Two Semi-Supervised Learning Approaches on Top of a Large Language Model
Chris van der Lee
Thiago Castro Ferreira
Chris Emmery
Travis J. Wiltshire
Emiel Krahmer
78
2
0
14 Jul 2022
BERTIN: Efficient Pre-Training of a Spanish Language Model using Perplexity Sampling
Javier de la Rosa
E. G. Ponferrada
Paulo Villegas
Pablo González de Prado Salas
Manu Romero
María Grandury
74
96
0
14 Jul 2022
N-Grammer: Augmenting Transformers with latent n-grams
Aurko Roy
Rohan Anil
Guangda Lai
Benjamin Lee
Jeffrey Zhao
...
Yu
Phuong Dao
Christopher Fifty
Zhiwen Chen
Yonghui Wu
77
8
0
13 Jul 2022
Re2G: Retrieve, Rerank, Generate
Michael R. Glass
Gaetano Rossiello
Md. Faisal Mahbub Chowdhury
Ankita Rajaram Naik
Pengshan Cai
A. Gliozzo
RALM
89
96
0
13 Jul 2022
Does GNN Pretraining Help Molecular Representation?
Ruoxi Sun
Hanjun Dai
Adams Wei Yu
SSL, AI4CE, GNN
65
75
0
13 Jul 2022
DocPrompting: Generating Code by Retrieving the Docs
Shuyan Zhou
Uri Alon
Frank F. Xu
Zhiruo Wang
Zhengbao Jiang
Graham Neubig
LLMAG
106
141
0
13 Jul 2022
PLM-ICD: Automatic ICD Coding with Pretrained Language Models
Chao-Wei Huang
Shang-Chi Tsai
Yun-Nung Chen
90
51
0
12 Jul 2022
Effective Few-Shot Named Entity Linking by Meta-Learning
Xiuxing Li
Zhenyu Li
Zhengyan Zhang
Ning Liu
Haitao Yuan
Wei Zhang
Zhiyuan Liu
Jianyong Wang
OffRL
107
11
0
12 Jul 2022
Towards Neural Numeric-To-Text Generation From Temporal Personal Health Data
Jon Harris
Mohammed J Zaki
AI4TS
65
2
0
11 Jul 2022
Embedding Recycling for Language Models
Jon Saad-Falcon
Amanpreet Singh
Luca Soldaini
Mike D'Arcy
Arman Cohan
Doug Downey
KELM
63
4
0
11 Jul 2022
Exploring Length Generalization in Large Language Models
Cem Anil
Yuhuai Wu
Anders Andreassen
Aitor Lewkowycz
Vedant Misra
V. Ramasesh
Ambrose Slone
Guy Gur-Ari
Ethan Dyer
Behnam Neyshabur
ReLM, LRM
111
170
0
11 Jul 2022
Multi-level Fusion of Wav2vec 2.0 and BERT for Multimodal Emotion Recognition
Zihan Zhao
Yanfeng Wang
Yu Wang
57
34
0
11 Jul 2022
Few-shot training LLMs for project-specific code-summarization
Toufique Ahmed
Prem Devanbu
235
241
0
09 Jul 2022
TalkToModel: Explaining Machine Learning Models with Interactive Natural Language Conversations
Dylan Slack
Satyapriya Krishna
Himabindu Lakkaraju
Sameer Singh
84
84
0
08 Jul 2022
The Harvard USPTO Patent Dataset: A Large-Scale, Well-Structured, and Multi-Purpose Corpus of Patent Applications
Mirac Suzgun
Luke Melas-Kyriazi
Suproteem K. Sarkar
S. Kominers
Stuart M. Shieber
117
29
0
08 Jul 2022
Beyond Transfer Learning: Co-finetuning for Action Localisation
Anurag Arnab
Xuehan Xiong
A. Gritsenko
Rob Romijnders
Josip Djolonga
Mostafa Dehghani
Chen Sun
Mario Lucic
Cordelia Schmid
87
8
0
08 Jul 2022
Meta-Learning the Difference: Preparing Large Language Models for Efficient Adaptation
Zejiang Hou
Julian Salazar
George Polovets
79
15
0
07 Jul 2022
Training Transformers Together
Alexander Borzunov
Max Ryabinin
Tim Dettmers
Quentin Lhoest
Lucile Saulnier
Michael Diskin
Yacine Jernite
Thomas Wolf
ViT
63
10
0
07 Jul 2022
Rethinking the Value of Gazetteer in Chinese Named Entity Recognition
Qianglong Chen
Xiangji Zeng
Jiangang Zhu
Yin Zhang
Bojia Lin
Yang Yang
Daxin Jiang
62
2
0
06 Jul 2022
Transformers are Adaptable Task Planners
Vidhi Jain
Yixin Lin
Eric Undersander
Yonatan Bisk
Akshara Rai
113
24
0
06 Jul 2022
Cross-Lingual QA as a Stepping Stone for Monolingual Open QA in Icelandic
Vésteinn Snæbjarnarson
H. Einarsson
59
6
0
05 Jul 2022
Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation
Bin Li
Yixuan Weng
Ziyu Ma
Bin Sun
Shutao Li
VLM
34
2
0
05 Jul 2022
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning
Hung Le
Yue Wang
Akhilesh Deepak Gotmare
Silvio Savarese
Guosheng Lin
SyDa, ALM
227
273
0
05 Jul 2022
Probing via Prompting
Jiaoda Li
Ryan Cotterell
Mrinmaya Sachan
109
13
0
04 Jul 2022
Masked Autoencoders in 3D Point Cloud Representation Learning
Jincen Jiang
Xuequan Lu
Lizhi Zhao
Richard Dazeley
Meili Wang
3DPC, ViT
144
29
0
04 Jul 2022
An Empirical Survey on Long Document Summarization: Datasets, Models and Metrics
Huan Yee Koh
Jiaxin Ju
Ming Liu
Shirui Pan
149
128
0
03 Jul 2022
Generating Repetitions with Appropriate Repeated Words
Toshiki Kawamoto
Hidetaka Kamigaito
Kotaro Funakoshi
Manabu Okumura
47
3
0
03 Jul 2022
FRAME: Evaluating Rationale-Label Consistency Metrics for Free-Text Rationales
Aaron Chan
Shaoliang Nie
Liang Tan
Xiaochang Peng
Hamed Firooz
Maziar Sanjabi
Xiang Ren
118
10
0
02 Jul 2022
Can Language Models Make Fun? A Case Study in Chinese Comical Crosstalk
Benyou Wang
Xiang Wu
Xiaokang Liu
Jianquan Li
Prayag Tiwari
Qianqian Xie
61
6
0
02 Jul 2022
The Parallelism Tradeoff: Limitations of Log-Precision Transformers
William Merrill
Ashish Sabharwal
108
116
0
02 Jul 2022
Can we learn from developer mistakes? Learning to localize and repair real bugs from real bug fixes
Cedric Richter
Heike Wehrheim
53
8
0
01 Jul 2022
Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset
Peter Henderson
M. Krass
Lucia Zheng
Neel Guha
Christopher D. Manning
Dan Jurafsky
Daniel E. Ho
AILaw, ELM
231
103
0
01 Jul 2022
e-CLIP: Large-Scale Vision-Language Representation Learning in E-commerce
Wonyoung Shin
Jonghun Park
Taekang Woo
Yongwoo Cho
Kwangjin Oh
Hwanjun Song
VLM
125
17
0
01 Jul 2022
Measuring Forgetting of Memorized Training Examples
Matthew Jagielski
Om Thakkar
Florian Tramèr
Daphne Ippolito
Katherine Lee
...
Eric Wallace
Shuang Song
Abhradeep Thakurta
Nicolas Papernot
Chiyuan Zhang
TDI
156
111
0
30 Jun 2022
Forecasting Future World Events with Neural Networks
Andy Zou
Tristan Xiao
Ryan Jia
Joe Kwon
Mantas Mazeika
Richard Li
Dawn Song
Jacob Steinhardt
Owain Evans
Dan Hendrycks
108
27
0
30 Jun 2022
FL-Tuning: Layer Tuning for Feed-Forward Network in Transformer
Jingping Liu
Yuqiu Song
Kui Xue
Hongli Sun
Chao Wang
Lihan Chen
Haiyun Jiang
Jiaqing Liang
Tong Ruan
74
2
0
30 Jun 2022
esCorpius: A Massive Spanish Crawling Corpus
Asier Gutiérrez-Fandiño
David Pérez-Fernández
Jordi Armengol-Estapé
D. Griol
Z. Callejas
102
2
0
30 Jun 2022
BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing
Jason Alan Fries
Leon Weber
Natasha Seelam
Gabriel Altay
Debajyoti Datta
...
Minh Chien Vu
Trishala Neeraj
Jonas Golde
Albert Villanova del Moral
Benjamin Beilharz
LM&MA
151
49
0
30 Jun 2022
Compressing Pre-trained Transformers via Low-Bit NxM Sparsity for Natural Language Understanding
Connor Holmes
Minjia Zhang
Yuxiong He
Bo Wu
59
3
0
30 Jun 2022
GPTs at Factify 2022: Prompt Aided Fact-Verification
Pawan Kumar Sahu
Saksham Aggarwal
Taneesh Gupta
Gyanendra Das
67
1
0
29 Jun 2022
Solving Quantitative Reasoning Problems with Language Models
Aitor Lewkowycz
Anders Andreassen
David Dohan
Ethan Dyer
Henryk Michalewski
...
Theo Gutman-Solo
Yuhuai Wu
Behnam Neyshabur
Guy Gur-Ari
Vedant Misra
ReLM, ELM, LRM
227
865
0
29 Jun 2022
On the Robustness of Dialogue History Representation in Conversational Question Answering: A Comprehensive Study and a New Prompt-based Method
Zorik Gekhman
Nadav Oved
Orgad Keller
Idan Szpektor
Roi Reichart
71
8
0
29 Jun 2022
Trial2Vec: Zero-Shot Clinical Trial Document Similarity Search using Self-Supervision
Zifeng Wang
Jimeng Sun
94
25
0
29 Jun 2022