ResearchTrend.AI
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,920 papers shown
Finding Skill Neurons in Pre-trained Transformer-based Language Models
Xiaozhi Wang
Kaiyue Wen
Zhengyan Zhang
Lei Hou
Zhiyuan Liu
Juanzi Li
MILM, MoE
88
52
0
14 Nov 2022
Towards Understanding Omission in Dialogue Summarization
Yicheng Zou
Kaitao Song
Xu Tan
Zhongkai Fu
Qi Zhang
Dongsheng Li
Tao Gui
69
2
0
14 Nov 2022
Controllable Citation Sentence Generation with Language Models
Nianlong Gu
Richard H. R. Hahnloser
58
2
0
14 Nov 2022
FPT: Improving Prompt Tuning Efficiency via Progressive Training
Yufei Huang
Yujia Qin
Huadong Wang
Yichun Yin
Maosong Sun
Zhiyuan Liu
Qun Liu
VLM, LRM
63
6
0
13 Nov 2022
Large-scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation
Yusong Wu
Kai Chen
Tianyu Zhang
Yuchen Hui
Marianna Nezhurina
Taylor Berg-Kirkpatrick
Shlomo Dubnov
CLIP
190
546
0
12 Nov 2022
Few-shot Multimodal Sentiment Analysis based on Multimodal Probabilistic Fusion Prompts
Xiaocui Yang
Shi Feng
Daling Wang
Pengfei Hong
Soujanya Poria
82
23
0
12 Nov 2022
DocuT5: Seq2seq SQL Generation with Table Documentation
E. Soare
Iain Mackie
Jeffrey Stephen Dalton
LMTD
79
2
0
11 Nov 2022
A Survey of Knowledge Enhanced Pre-trained Language Models
Linmei Hu
Zeyi Liu
Ziwang Zhao
Lei Hou
Liqiang Nie
Juanzi Li
KELM, VLM
170
137
0
11 Nov 2022
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Wenhai Wang
Jifeng Dai
Zhe Chen
Zhenhang Huang
Zhiqi Li
...
Tong Lu
Lewei Lu
Hongsheng Li
Xiaogang Wang
Yu Qiao
VLM
180
699
0
10 Nov 2022
DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering
Ella Neeman
Roee Aharoni
Or Honovich
Leshem Choshen
Idan Szpektor
Omri Abend
KELM, CML
105
84
0
10 Nov 2022
Can Transformers Reason in Fragments of Natural Language?
Viktor Schlegel
Kamen V. Pavlov
Ian Pratt-Hartmann
LRM, ReLM
77
7
0
10 Nov 2022
ADEPT: A DEbiasing PrompT Framework
Ke Yang
Charles Yu
Yi R. Fung
Manling Li
Heng Ji
124
27
0
10 Nov 2022
Grammatical Error Correction: A Survey of the State of the Art
Christopher Bryant
Zheng Yuan
Muhammad Reza Qorib
Hannan Cao
Hwee Tou Ng
Ted Briscoe
3DV
91
87
0
09 Nov 2022
Uni-Parser: Unified Semantic Parser for Question Answering on Knowledge Base and Database
Ye Liu
Semih Yavuz
Rui Meng
Dragomir R. Radev
Caiming Xiong
Yingbo Zhou
78
33
0
09 Nov 2022
Large Language Models with Controllable Working Memory
Daliang Li
A. S. Rawat
Manzil Zaheer
Xin Wang
Michal Lukasik
Andreas Veit
Felix X. Yu
Surinder Kumar
KELM
146
171
0
09 Nov 2022
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
BigScience Workshop
Teven Le Scao
Angela Fan
Christopher Akiki
...
Zhongli Xie
Zifan Ye
M. Bras
Younes Belkada
Thomas Wolf
VLM
486
2,401
0
09 Nov 2022
What is Wrong with Language Models that Can Not Tell a Story?
Ivan P. Yamshchikov
Alexey Tikhonov
69
7
0
09 Nov 2022
Syntax-Aware On-the-Fly Code Completion
Wannita Takerngsaksiri
Chakkrit Tantithamthavorn
Yuankui Li
93
19
0
09 Nov 2022
Discovering the Hidden Facts of User-Dispatcher Interactions via Text-based Reporting Systems for Community Safety
Yiren Liu
Ryan D. W. Mayfield
Yun Huang
47
2
0
09 Nov 2022
Active Example Selection for In-Context Learning
Yiming Zhang
Shi Feng
Chenhao Tan
SILM, LRM
114
207
0
08 Nov 2022
nBIIG: A Neural BI Insights Generation System for Table Reporting
Yotam Perlitz
D. Sheinwald
Noam Slonim
Michal Shmueli-Scheuer
31
2
0
08 Nov 2022
Self-conditioned Embedding Diffusion for Text Generation
Robin Strudel
Corentin Tallec
Florent Altché
Yilun Du
Yaroslav Ganin
...
Will Grathwohl
Nikolay Savinov
Sander Dieleman
Laurent Sifre
Rémi Leblond
DiffM
89
88
0
08 Nov 2022
Conciseness: An Overlooked Language Task
Felix Stahlberg
Aashish Kumar
Chris Alberti
Shankar Kumar
47
1
0
08 Nov 2022
COPEN: Probing Conceptual Knowledge in Pre-trained Language Models
Hao Peng
Xiaozhi Wang
Shengding Hu
Hailong Jin
Lei Hou
Juanzi Li
Zhiyuan Liu
Qun Liu
93
25
0
08 Nov 2022
Reranking Overgenerated Responses for End-to-End Task-Oriented Dialogue Systems
Songbo Hu
Ivan Vulić
Fangyu Liu
Anna Korhonen
93
0
0
07 Nov 2022
ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech
Xiaoran Fan
Chao Pang
Tian Yuan
Richard He Bai
Renjie Zheng
...
Junkun Chen
Zeyu Chen
Liang Huang
Yu Sun
Hua Wu
125
0
0
07 Nov 2022
Knowledge Graph Embedding: A Survey from the Perspective of Representation Spaces
Jiahang Cao
Jinyuan Fang
Zaiqiao Meng
Shangsong Liang
108
76
0
07 Nov 2022
NAPG: Non-Autoregressive Program Generation for Hybrid Tabular-Textual Question Answering
Tengxun Zhang
Hongfei Xu
Josef van Genabith
Deyi Xiong
Hongying Zan
AIMat, LRM
65
5
0
07 Nov 2022
Fixing Model Bugs with Natural Language Patches
Shikhar Murty
Christopher D. Manning
Scott M. Lundberg
Marco Tulio Ribeiro
KELM
80
39
0
07 Nov 2022
Contrastive Learning enhanced Author-Style Headline Generation
Hui Liu
Weidong Guo
Yige Chen
Xiangyang Li
56
5
0
07 Nov 2022
Complex Reading Comprehension Through Question Decomposition
Xiao-Yu Guo
Yuan-Fang Li
Gholamreza Haffari
ReLM
74
10
0
07 Nov 2022
On the Domain Adaptation and Generalization of Pretrained Language Models: A Survey
Xu Guo
Han Yu
LM&MA, VLM
145
30
0
06 Nov 2022
Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning
Yu Meng
Martin Michalski
Jiaxin Huang
Yu Zhang
Tarek Abdelzaher
Jiawei Han
VLM
122
49
0
06 Nov 2022
Robust Lottery Tickets for Pre-trained Language Models
Rui Zheng
Rong Bao
Yuhao Zhou
Di Liang
Sirui Wang
Wei Wu
Tao Gui
Qi Zhang
Xuanjing Huang
AAML
87
14
0
06 Nov 2022
Continuous Prompt Tuning Based Textual Entailment Model for E-commerce Entity Typing
Yibo Wang
Congying Xia
Guan Wang
Philip Yu
58
6
0
04 Nov 2022
A General Purpose Neural Architecture for Geospatial Systems
Nasim Rahaman
Martin Weiss
Frederik Trauble
Francesco Locatello
Alexandre Lacoste
Yoshua Bengio
C. Pal
Li Erran Li
Bernhard Schölkopf
AI4TS, AI4CE
55
6
0
04 Nov 2022
Time-aware Prompting for Text Generation
Shuyang Cao
Lu Wang
70
12
0
03 Nov 2022
Book Cover Synthesis from the Summary
Emdadul Haque
Md. Faraz Kabir Khan
Mohammad Imrul Jubair
Jarin Anjum
Abrar Zahir Niloy
3DV
51
2
0
03 Nov 2022
LMentry: A Language Model Benchmark of Elementary Language Tasks
Avia Efrat
Or Honovich
Omer Levy
105
20
0
03 Nov 2022
Inverse scaling can become U-shaped
Jason W. Wei
Najoung Kim
Yi Tay
Quoc V. Le
LRM
110
64
0
03 Nov 2022
Large Language Models Are Human-Level Prompt Engineers
Yongchao Zhou
Andrei Ioan Muresanu
Ziwen Han
Keiran Paster
Silviu Pitis
Harris Chan
Jimmy Ba
ALM, LLMAG
195
907
0
03 Nov 2022
Latent Prompt Tuning for Text Summarization
Yubo Zhang
Xingxing Zhang
Xun Wang
Si-Qing Chen
Furu Wei
VLM
95
12
0
03 Nov 2022
Eliciting Knowledge from Large Pre-Trained Models for Unsupervised Knowledge-Grounded Conversation
Yanyang Li
Jianqiao Zhao
Michael R. Lyu
Liwei Wang
70
16
0
03 Nov 2022
Using Large Pre-Trained Language Model to Assist FDA in Premarket Medical Device
Zongzhe Xu
LM&MA, MedIm
66
0
0
03 Nov 2022
Open-Vocabulary Argument Role Prediction for Event Extraction
Yizhu Jiao
Sha Li
Yiqing Xie
Ming Zhong
Heng Ji
Jiawei Han
115
17
0
03 Nov 2022
PINTO: Faithful Language Reasoning Using Prompt-Generated Rationales
Peifeng Wang
Aaron Chan
Filip Ilievski
Muhao Chen
Xiang Ren
LRM, ReLM
117
65
0
03 Nov 2022
RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question
Alireza Mohammadshahi
Thomas Scialom
Majid Yazdani
Pouya Yanki
Angela Fan
James Henderson
Marzieh Saeidi
94
20
0
02 Nov 2022
Generative Entity-to-Entity Stance Detection with Knowledge Graph Augmentation
Xinliang Frederick Zhang
Nick Beauchamp
Lu Wang
57
10
0
02 Nov 2022
MPCFormer: fast, performant and private Transformer inference with MPC
Dacheng Li
Rulin Shao
Hongyi Wang
Han Guo
Eric P. Xing
Haotong Zhang
95
87
0
02 Nov 2022
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
Yogesh Balaji
Seungjun Nah
Xun Huang
Arash Vahdat
Jiaming Song
...
Timo Aila
S. Laine
Bryan Catanzaro
Tero Karras
Xuan Li
VLM, MoE
219
832
0
02 Nov 2022