Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,944 papers shown
Title
Is GPT-3 a Good Data Annotator?
Bosheng Ding
Chengwei Qin
Linlin Liu
Yew Ken Chia
Shafiq Joty
Boyang Albert Li
Lidong Bing
97
250
0
20 Dec 2022
Socratic Pretraining: Question-Driven Pretraining for Controllable Summarization
Artidoro Pagnoni
Alexander R. Fabbri
Wojciech Kryściński
Chien-Sheng Wu
RALM
118
18
0
20 Dec 2022
Towards Reasoning in Large Language Models: A Survey
Jie Huang
Kevin Chen-Chuan Chang
LM&MA, ELM, LRM
219
645
0
20 Dec 2022
To Adapt or to Annotate: Challenges and Interventions for Domain Adaptation in Open-Domain Question Answering
Dheeru Dua
Emma Strubell
Sameer Singh
Pat Verga
OOD
98
3
0
20 Dec 2022
CoCo: Coherence-Enhanced Machine-Generated Text Detection Under Data Limitation With Contrastive Learning
Xiaoming Liu
Zhaohan Zhang
Yichen Wang
Hang Pu
Y. Lan
Chao Shen
102
41
0
20 Dec 2022
HINT: Hypernetwork Instruction Tuning for Efficient Zero- & Few-Shot Generalisation
Hamish Ivison
Akshita Bhagia
Yizhong Wang
Hannaneh Hajishirzi
Matthew E. Peters
149
20
0
20 Dec 2022
Pre-trained Language Models for Keyphrase Generation: A Thorough Empirical Study
Di Wu
Wasi Uddin Ahmad
Kai-Wei Chang
93
18
0
20 Dec 2022
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator
Jian Yang
Shuming Ma
Li Dong
Shaohan Huang
Haoyang Huang
Yuwei Yin
Dongdong Zhang
Liqun Yang
Furu Wei
Zhoujun Li
SyDa, AI4CE
76
25
0
20 Dec 2022
Do I have the Knowledge to Answer? Investigating Answerability of Knowledge Base Questions
Mayur Patidar
Prayushi Faldu
Avinash Kumar Singh
Lovekesh Vig
Indrajit Bhattacharya
Mausam
ELM
89
5
0
20 Dec 2022
A Survey on Pretrained Language Models for Neural Code Intelligence
Yichen Xu
Yanqiao Zhu
56
17
0
20 Dec 2022
Large Language Models Are Reasoning Teachers
Namgyu Ho
Laura Schmid
Se-Young Yun
ReLM, ELM, LRM
165
351
0
20 Dec 2022
I Cast Detect Thoughts: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons
Pei Zhou
Andrew Zhu
Jennifer Hu
Jay Pujara
Xiang Ren
Chris Callison-Burch
Yejin Choi
Prithviraj Ammanabrolu
86
28
0
20 Dec 2022
When Federated Learning Meets Pre-trained Language Models' Parameter-Efficient Tuning Methods
Zhuo Zhang
Yuanhang Yang
Yong Dai
Zhuang Li
Zenglin Xu
FedML
127
85
0
20 Dec 2022
On the Blind Spots of Model-Based Evaluation Metrics for Text Generation
Tianxing He
Jingyu Zhang
Tianle Wang
Sachin Kumar
Kyunghyun Cho
James R. Glass
Yulia Tsvetkov
156
45
0
20 Dec 2022
When Do Decompositions Help for Machine Reading?
Kangda Wei
Dawn J Lawrie
Benjamin Van Durme
Yunmo Chen
Orion Weller
ReLM
152
3
0
20 Dec 2022
DIONYSUS: A Pre-trained Model for Low-Resource Dialogue Summarization
Yu Li
Baolin Peng
Pengcheng He
Michel Galley
Zhou Yu
Jianfeng Gao
84
8
0
20 Dec 2022
Language Modeling with Latent Situations
Belinda Z. Li
Maxwell Nye
Jacob Andreas
LRM
98
7
0
20 Dec 2022
Enhancing Task Bot Engagement with Synthesized Open-Domain Dialog
Miaoran Li
Baolin Peng
Michel Galley
Jianfeng Gao
Zhu Zhang
101
5
0
20 Dec 2022
On Improving Summarization Factual Consistency from Natural Language Feedback
Yixin Liu
Budhaditya Deb
Milagro Teruel
Aaron L Halfaker
Dragomir R. Radev
Ahmed Hassan Awadallah
HILM
71
38
0
20 Dec 2022
Improving the Robustness of Summarization Models by Detecting and Removing Input Noise
Kundan Krishna
Yao-Min Zhao
Jie Jessie Ren
Balaji Lakshminarayanan
Jiaming Luo
Mohammad Saleh
Peter J. Liu
63
4
0
20 Dec 2022
Tokenization Consistency Matters for Generative Models on Extractive NLP Tasks
Kaiser Sun
Peng Qi
Yuhao Zhang
Lan Liu
William Yang Wang
Zhiheng Huang
85
9
0
19 Dec 2022
Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training
Jing-ling Huang
Zhengxuan Wu
Kyle Mahowald
Christopher Potts
87
14
0
19 Dec 2022
Improved Long-Form Spoken Language Translation with Large Language Models
Arya D. McCarthy
Haotong Zhang
Shankar Kumar
Felix Stahlberg
Axel H. Ng
73
2
0
19 Dec 2022
Synthetic Pre-Training Tasks for Neural Machine Translation
Zexue He
Graeme W. Blackwood
Yikang Shen
Julian McAuley
Rogerio Feris
65
4
0
19 Dec 2022
Dataless Knowledge Fusion by Merging Weights of Language Models
Xisen Jin
Xiang Ren
Daniel Preoţiuc-Pietro
Pengxiang Cheng
FedML, MoMe
122
251
0
19 Dec 2022
Do CoNLL-2003 Named Entity Taggers Still Work Well in 2023?
Shuheng Liu
Alan Ritter
AI4TS
95
13
0
19 Dec 2022
DSI++: Updating Transformer Memory with New Documents
Sanket Vaibhav Mehta
Jai Gupta
Yi Tay
Mostafa Dehghani
Vinh Q. Tran
J. Rao
Marc Najork
Emma Strubell
Donald Metzler
CLL
103
46
0
19 Dec 2022
LENS: A Learnable Evaluation Metric for Text Simplification
Mounica Maddela
Yao Dou
David Heineman
Wei Xu
62
65
0
19 Dec 2022
Position-guided Text Prompt for Vision-Language Pre-training
Alex Jinpeng Wang
Pan Zhou
Mike Zheng Shou
Shuicheng Yan
VLM
77
38
0
19 Dec 2022
Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments
Yu Gu
Xiang Deng
Yu-Chuan Su
LLMAG
125
58
0
19 Dec 2022
Improving Faithfulness of Abstractive Summarization by Controlling Confounding Effect of Irrelevant Sentences
Asish Ghoshal
Arash Einolghozati
A. Arun
Haoran Li
L. Yu
Vera Gor
Yashar Mehdad
Scott Yih
Asli Celikyilmaz
HILM
71
1
0
19 Dec 2022
On Event Individuation for Document-Level Information Extraction
William Gantt
Reno Kriz
Yunmo Chen
Siddharth Vashishtha
Aaron Steven White
71
2
0
19 Dec 2022
Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor
Or Honovich
Thomas Scialom
Omer Levy
Timo Schick
ALM
192
374
0
19 Dec 2022
A Natural Bias for Language Generation Models
Clara Meister
Wojciech Stokowiec
Tiago Pimentel
Lei Yu
Laura Rimell
A. Kuncoro
MILM
89
6
0
19 Dec 2022
Human-in-the-loop Evaluation for Early Misinformation Detection: A Case Study of COVID-19 Treatments
Ethan Mendes
Yang Chen
Wei Xu
Alan Ritter
97
16
0
19 Dec 2022
Multilingual Sequence-to-Sequence Models for Hebrew NLP
Matan Eyal
Hila Noga
Roee Aharoni
Idan Szpektor
Reut Tsarfaty
56
4
0
19 Dec 2022
MatCha: Enhancing Visual Language Pretraining with Math Reasoning and Chart Derendering
Fangyu Liu
Francesco Piccinno
Syrine Krichene
Chenxi Pang
Kenton Lee
Mandar Joshi
Yasemin Altun
Nigel Collier
Julian Martin Eisenschlos
VLM, LRM
61
102
0
19 Dec 2022
Visconde: Multi-document QA with GPT-3 and Neural Reranking
Jayr Pereira
R. Fidalgo
R. Lotufo
Rodrigo Nogueira
BDL, RALM
86
33
0
19 Dec 2022
Query Enhanced Knowledge-Intensive Conversation via Unsupervised Joint Modeling
Mingzhu Cai
Siqi Bao
Xin Tian
H. He
Fan Wang
Hua Wu
73
5
0
19 Dec 2022
Source-Free Domain Adaptation for Question Answering with Masked Self-training
M. Yin
B. Wang
Yue Dong
Charles Ling
OOD
100
4
0
19 Dec 2022
Mu$^{2}$SLAM: Multitask, Multilingual Speech and Language Models
Yong Cheng
Yu Zhang
Melvin Johnson
Wolfgang Macherey
Ankur Bapna
66
8
0
19 Dec 2022
Latent Diffusion for Language Generation
Justin Lovelace
Varsha Kishore
Chao-gang Wan
Eliot Shekhtman
Kilian Q. Weinberger
DiffM
134
82
0
19 Dec 2022
Universal Object Detection with Large Vision Model
Feng-Huei Lin
Wenze Hu
Yaowei Wang
Yonghong Tian
Guangming Lu
Fanglin Chen
Yong-mei Xu
Xiaoyu Wang
VLM, ObjD
100
8
0
19 Dec 2022
Bridging The Gap: Entailment Fused-T5 for Open-retrieval Conversational Machine Reading Comprehension
Xiao Zhang
Heyan Huang
Zewen Chi
Xian-Ling Mao
78
1
0
19 Dec 2022
APOLLO: A Simple Approach for Adaptive Pretraining of Language Models for Logical Reasoning
Soumya Sanyal
Yichong Xu
Shuohang Wang
Ziyi Yang
Reid Pryzant
Wenhao Yu
Chenguang Zhu
Xiang Ren
ReLM, LRM
103
10
0
19 Dec 2022
MIGA: A Unified Multi-task Generation Framework for Conversational Text-to-SQL
Yingwen Fu
Wenjie Ou
Zhou Yu
Yue Lin
75
7
0
19 Dec 2022
PromptBoosting: Black-Box Text Classification with Ten Forward Passes
Bairu Hou
J. O'Connor
Jacob Andreas
Shiyu Chang
Yang Zhang
VLM
57
44
0
19 Dec 2022
Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model
Parishad BehnamGhader
Santiago Miret
Siva Reddy
ReLM, LRM
101
36
0
18 Dec 2022
Neural Rankers for Effective Screening Prioritisation in Medical Systematic Review Literature Search
Shuai Wang
Harrisen Scells
Bevan Koopman
Guido Zuccon
76
24
0
18 Dec 2022
Sentence-level Feedback Generation for English Language Learners: Does Data Augmentation Help?
Shabnam Behzad
Amir Zeldes
Nathan Schneider
58
5
0
18 Dec 2022