ResearchTrend.AI
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,944 papers shown
Title
Efficient Movie Scene Detection using State-Space Transformers
Md. Mohaiminul Islam
Mahmudul Hasan
Kishan Athrey
Tony Braskich
Gedas Bertasius
ViT
68
46
0
29 Dec 2022
Error syntax aware augmentation of feedback comment generation dataset
N. Babakov
M. Lysyuk
Alexander Shvets
Lilya Kazakova
Alexander Panchenko
133
3
0
29 Dec 2022
Reviewing Labels: Label Graph Network with Top-k Prediction Set for Relation Extraction
Bo Li
Wei Ye
Jinglei Zhang
Shikun Zhang
86
14
0
29 Dec 2022
Cramming: Training a Language Model on a Single GPU in One Day
Jonas Geiping
Tom Goldstein
MoE
122
91
0
28 Dec 2022
Exploring Vision Transformers as Diffusion Learners
He Cao
Jianan Wang
Tianhe Ren
Xianbiao Qi
Yihao Chen
Yuan Yao
Lefei Zhang
83
10
0
28 Dec 2022
MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing
Longxu Dou
Yan Gao
Mingyang Pan
Dingzirui Wang
Wanxiang Che
Dechen Zhan
Jian-Guang Lou
63
20
0
27 Dec 2022
TegFormer: Topic-to-Essay Generation with Good Topic Coverage and High Text Coherence
Wang Qi
R. Liu
Y. Zuo
Yong Chen
Dell Zhang
68
0
0
27 Dec 2022
Large Language Models Encode Clinical Knowledge
K. Singhal
Shekoofeh Azizi
T. Tu
S. S. Mahdavi
Jason W. Wei
...
A. Rajkomar
Joelle Barral
Christopher Semturs
Alan Karthikesalingam
Vivek Natarajan
LM&MA ELM AI4MH
355
2,421
0
26 Dec 2022
TextBox 2.0: A Text Generation Library with Pre-trained Language Models
Tianyi Tang
Junyi Li
Zhongfu Chen
Yiwen Hu
Zhuohao Yu
...
Xiaoxue Cheng
Yuhao Wang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
56
8
0
26 Dec 2022
Rank-LIME: Local Model-Agnostic Feature Attribution for Learning to Rank
Tanya Chowdhury
Razieh Rahimi
James Allan
FAtt
62
18
0
24 Dec 2022
When Do Curricula Work in Federated Learning?
Saeed Vahidian
Sreevatsank Kadaveru
Woo-Ram Baek
Weijia Wang
Vyacheslav Kungurtsev
Chong Chen
M. Shah
Bill Lin
FedML
95
11
0
24 Dec 2022
MicroBERT: Effective Training of Low-resource Monolingual BERTs through Parameter Reduction and Multitask Learning
Luke Gessler
Amir Zeldes
93
14
0
23 Dec 2022
OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization
Srinivasan Iyer
Xi Lin
Ramakanth Pasunuru
Todor Mihaylov
Daniel Simig
...
Jeff Wang
Christopher Dewan
Asli Celikyilmaz
Luke Zettlemoyer
Veselin Stoyanov
ALM
208
268
0
22 Dec 2022
Improving Automated Program Repair with Domain Adaptation
Armin Zirak
Hadi Hemmati
71
11
0
21 Dec 2022
Contrastive Distillation Is a Sample-Efficient Self-Supervised Loss Policy for Transfer Learning
Christopher T. Lengerich
Gabriel Synnaeve
Amy Zhang
Hugh Leather
Kurt Shuster
François Charton
Charysse Redwood
SSL OffRL
68
1
0
21 Dec 2022
What do LLMs Know about Financial Markets? A Case Study on Reddit Market Sentiment Analysis
Xiang Deng
Vasilisa Bashlovkina
Feng Han
Simon Baumgartner
Michael Bendersky
78
46
0
21 Dec 2022
Commentary Generation from Data Records of Multiplayer Strategy Esports Game
Zihan Wang
Naoki Yoshinaga
52
0
0
21 Dec 2022
Resolving Indirect Referring Expressions for Entity Selection
Mohammad Javad Hosseini
Filip Radlinski
Silvia Pareti
Annie Louis
64
2
0
21 Dec 2022
SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning
M Saiful Bari
Aston Zhang
Shuai Zheng
Xingjian Shi
Yi Zhu
Shafiq Joty
Mu Li
RALM VLM VPVLM LRM
97
5
0
21 Dec 2022
Language Models as Inductive Reasoners
Zonglin Yang
Li Dong
Xinya Du
Hao Cheng
Min Zhang
Xiaodong Liu
Jianfeng Gao
Furu Wei
ReLM LRM
98
37
0
21 Dec 2022
From Images to Textual Prompts: Zero-shot VQA with Frozen Large Language Models
Jiaxian Guo
Junnan Li
Dongxu Li
A. M. H. Tiong
Boyang Albert Li
Dacheng Tao
Steven C. H. Hoi
VLM MLLM
98
118
0
21 Dec 2022
Generating Multiple-Length Summaries via Reinforcement Learning for Unsupervised Sentence Summarization
Dongmin Hyun
Xiting Wang
Chanyoung Park
Xing Xie
Hwanjo Yu
58
8
0
21 Dec 2022
ZEROTOP: Zero-Shot Task-Oriented Semantic Parsing using Large Language Models
Dheeraj Mekala
Jason Wolfe
Subhro Roy
100
9
0
21 Dec 2022
OpineSum: Entailment-based self-training for abstractive opinion summarization
Annie Louis
Joshua Maynez
103
7
0
21 Dec 2022
SERENGETI: Massively Multilingual Language Models for Africa
Ife Adebara
AbdelRahim Elmadany
Muhammad Abdul-Mageed
Alcides Alcoba Inciarte
78
33
0
21 Dec 2022
ImPaKT: A Dataset for Open-Schema Knowledge Base Construction
Luke Vilnis
Zachary Kenneth Fisher
Bhargav Kanagal
Patrick C. Murray
Sumit Sanghai
79
3
0
21 Dec 2022
Uncontrolled Lexical Exposure Leads to Overestimation of Compositional Generalization in Pretrained Models
Najoung Kim
Tal Linzen
P. Smolensky
106
33
0
21 Dec 2022
How Does Beam Search improve Span-Level Confidence Estimation in Generative Sequence Labeling?
Kazuma Hashimoto
Iftekhar Naim
K. Raman
UQLM
79
2
0
21 Dec 2022
Learning List-Level Domain-Invariant Representations for Ranking
Ruicheng Xian
Honglei Zhuang
Zhen Qin
Hamed Zamani
Jing Lu
Ji Ma
Kai Hui
Han Zhao
Xuanhui Wang
Michael Bendersky
OOD
107
10
0
21 Dec 2022
ORCA: A Challenging Benchmark for Arabic Language Understanding
AbdelRahim Elmadany
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
ELM
111
46
0
21 Dec 2022
JASMINE: Arabic GPT Models for Few-Shot Learning
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
AbdelRahim Elmadany
Alcides Alcoba Inciarte
Md. Tawkat Islam Khondaker
79
9
0
21 Dec 2022
PropSegmEnt: A Large-Scale Corpus for Proposition-Level Segmentation and Entailment Recognition
Sihao Chen
S. Buthpitiya
Alex Fabrikant
Dan Roth
Tal Schuster
69
25
0
21 Dec 2022
Zero-shot Triplet Extraction by Template Infilling
Bosung Kim
Hayate Iso
Nikita Bhutani
Estevam R. Hruschka
Ndapandula Nakashole
Tom Mitchell
ViT
60
10
0
21 Dec 2022
KronA: Parameter Efficient Tuning with Kronecker Adapter
Ali Edalati
Marzieh S. Tahaei
I. Kobyzev
V. Nia
J. Clark
Mehdi Rezagholizadeh
103
104
0
20 Dec 2022
Ontologically Faithful Generation of Non-Player Character Dialogues
Nathaniel Weir
Ryan Thomas
Randolph D'Amore
Kellie Hill
Benjamin Van Durme
Harsh Jhamtani
79
7
0
20 Dec 2022
Character-Aware Models Improve Visual Text Rendering
Rosanne Liu
Daniel H Garrette
Chitwan Saharia
William Chan
Adam Roberts
Sharan Narang
Irina Blok
R. Mical
Mohammad Norouzi
Noah Constant
VLM
124
74
0
20 Dec 2022
DialGuide: Aligning Dialogue Model Behavior with Developer Guidelines
Prakhar Gupta
Yang Liu
Di Jin
Behnam Hedayatnia
Spandana Gella
Sijia Liu
P. Lange
Julia Hirschberg
Dilek Z. Hakkani-Tür
115
5
0
20 Dec 2022
PairReranker: Pairwise Reranking for Natural Language Generation
Dongfu Jiang
Bill Yuchen Lin
Xiang Ren
64
3
0
20 Dec 2022
A Length-Extrapolatable Transformer
Yutao Sun
Li Dong
Barun Patra
Shuming Ma
Shaohan Huang
Alon Benhaim
Vishrav Chaudhary
Xia Song
Furu Wei
122
124
0
20 Dec 2022
A Survey of Deep Learning for Mathematical Reasoning
Pan Lu
Liang Qiu
Wenhao Yu
Sean Welleck
Kai-Wei Chang
ReLM LRM
135
150
0
20 Dec 2022
HYRR: Hybrid Infused Reranking for Passage Retrieval
Jing Lu
Keith B. Hall
Ji Ma
Jianmo Ni
51
6
0
20 Dec 2022
Open Domain Multi-document Summarization: A Comprehensive Study of Model Brittleness under Retrieval
John Giorgi
Luca Soldaini
Bo Wang
Gary D. Bader
Kyle Lo
Lucy Lu Wang
Arman Cohan
97
19
0
20 Dec 2022
SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks
Suwon Shon
Siddhant Arora
Chyi-Jiunn Lin
Ankita Pasad
Felix Wu
Roshan S. Sharma
Wei Wu
Hung-yi Lee
Karen Livescu
Shinji Watanabe
ELM
85
33
0
20 Dec 2022
Transformers Go for the LOLs: Generating (Humourous) Titles from Scientific Abstracts End-to-End
Yanran Chen
Steffen Eger
112
17
0
20 Dec 2022
CausalDialogue: Modeling Utterance-level Causality in Conversations
Yi-Lin Tuan
Alon Albalak
Wenda Xu
Michael Stephen Saxon
Connor Pryor
Lise Getoor
William Yang Wang
CML
74
2
0
20 Dec 2022
When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories
Alex Troy Mallen
Akari Asai
Victor Zhong
Rajarshi Das
Daniel Khashabi
Hannaneh Hajishirzi
RALM HILM KELM
156
611
0
20 Dec 2022
ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models
Jonas Belouadi
Steffen Eger
105
26
0
20 Dec 2022
Little Red Riding Hood Goes Around the Globe: Crosslingual Story Planning and Generation with Large Language Models
E. Razumovskaia
Joshua Maynez
Annie Louis
Mirella Lapata
Shashi Narayan
LRM
71
5
0
20 Dec 2022
SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization
Hyunwoo J. Kim
Jack Hessel
Liwei Jiang
Peter West
Ximing Lu
...
Ronan Le Bras
Malihe Alikhani
Gunhee Kim
Maarten Sap
Yejin Choi
HILM
143
171
0
20 Dec 2022
Go-tuning: Improving Zero-shot Learning Abilities of Smaller Language Models
Jingjing Xu
Qingxiu Dong
Hongyi Liu
Lei Li
ALM LRM
73
1
0
20 Dec 2022