ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,870 papers shown
Title
Scalable and Efficient MoE Training for Multitask Multilingual Models
Scalable and Efficient MoE Training for Multitask Multilingual Models
Young Jin Kim
A. A. Awan
Alexandre Muzio
Andres Felipe Cruz Salinas
Liyang Lu
Amr Hendy
Samyam Rajbhandari
Yuxiong He
Hany Awadalla
MoE
148
85
0
22 Sep 2021
RETRONLU: Retrieval Augmented Task-Oriented Semantic Parsing
RETRONLU: Retrieval Augmented Task-Oriented Semantic Parsing
Vivek Gupta
Akshat Shrivastava
Adithya Sagar
Armen Aghajanyan
Denis Savenkov
RALM
87
23
0
21 Sep 2021
Relation-Guided Pre-Training for Open-Domain Question Answering
Relation-Guided Pre-Training for Open-Domain Question Answering
Ziniu Hu
Yizhou Sun
Kai-Wei Chang
RALMOnRL
73
6
0
21 Sep 2021
Knowledge Distillation with Noisy Labels for Natural Language
  Understanding
Knowledge Distillation with Noisy Labels for Natural Language Understanding
Shivendra Bhardwaj
Abbas Ghaddar
Ahmad Rashid
Khalil Bibi
Cheng-huan Li
A. Ghodsi
Philippe Langlais
Mehdi Rezagholizadeh
53
1
0
21 Sep 2021
ConvFiT: Conversational Fine-Tuning of Pretrained Language Models
ConvFiT: Conversational Fine-Tuning of Pretrained Language Models
Ivan Vulić
Pei-hao Su
Sam Coope
D. Gerz
Paweł Budzianowski
I. Casanueva
Nikola Mrkvsić
Tsung-Hsien Wen
100
37
0
21 Sep 2021
A Plug-and-Play Method for Controlled Text Generation
A Plug-and-Play Method for Controlled Text Generation
Damian Pascual
Béni Egressy
Clara Meister
Ryan Cotterell
Roger Wattenhofer
130
94
0
20 Sep 2021
BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese
BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese
Nguyen Luong Tran
Duong Minh Le
Dat Quoc Nguyen
70
55
0
20 Sep 2021
PLATO-XL: Exploring the Large-scale Pre-training of Dialogue Generation
PLATO-XL: Exploring the Large-scale Pre-training of Dialogue Generation
Siqi Bao
H. He
Fan Wang
Hua Wu
Haifeng Wang
...
Xinxian Huang
Xin Tian
Xinchao Xu
Yingzhan Lin
Zhengyu Niu
VLMALM
81
63
0
20 Sep 2021
Towards Zero-Label Language Learning
Towards Zero-Label Language Learning
Zirui Wang
Adams Wei Yu
Orhan Firat
Yuan Cao
SyDa
246
105
0
19 Sep 2021
Multi-Task Learning in Natural Language Processing: An Overview
Multi-Task Learning in Natural Language Processing: An Overview
Shijie Chen
Yu Zhang
Qiang Yang
AIMat
145
113
0
19 Sep 2021
Text Detoxification using Large Pre-trained Neural Models
Text Detoxification using Large Pre-trained Neural Models
David Dale
Anton Voronov
Daryna Dementieva
V. Logacheva
Olga Kozlova
Nikita Semenov
Alexander Panchenko
124
74
0
18 Sep 2021
RnG-KBQA: Generation Augmented Iterative Ranking for Knowledge Base
  Question Answering
RnG-KBQA: Generation Augmented Iterative Ranking for Knowledge Base Question Answering
Xi Ye
Semih Yavuz
Kazuma Hashimoto
Yingbo Zhou
Caiming Xiong
222
148
0
17 Sep 2021
Primer: Searching for Efficient Transformers for Language Modeling
Primer: Searching for Efficient Transformers for Language Modeling
David R. So
Wojciech Mañke
Hanxiao Liu
Zihang Dai
Noam M. Shazeer
Quoc V. Le
VLM
277
156
0
17 Sep 2021
Hierarchy-Aware T5 with Path-Adaptive Mask Mechanism for Hierarchical
  Text Classification
Hierarchy-Aware T5 with Path-Adaptive Mask Mechanism for Hierarchical Text Classification
Wei Huang
Chen Liu
Yihua Zhao
Xinyun Yang
Zhaoming Pan
Zhimin Zhang
Guiquan Liu
41
2
0
17 Sep 2021
Exploring Multitask Learning for Low-Resource AbstractiveSummarization
Exploring Multitask Learning for Low-Resource AbstractiveSummarization
Ahmed Magooda
Mohamed S. Elaraby
Diane Litman
73
11
0
17 Sep 2021
Task-adaptive Pre-training of Language Models with Word Embedding
  Regularization
Task-adaptive Pre-training of Language Models with Word Embedding Regularization
Kosuke Nishida
Kyosuke Nishida
Sen Yoshida
VLM
94
8
0
17 Sep 2021
Language Models as a Knowledge Source for Cognitive Agents
Language Models as a Knowledge Source for Cognitive Agents
R. Wray
James R. Kirk
John E. Laird
57
15
0
17 Sep 2021
Pre-trained Gaussian processes for Bayesian optimization
Pre-trained Gaussian processes for Bayesian optimization
Zehao Wang
George E. Dahl
Kevin Swersky
Chansoo Lee
Zachary Nado
Justin Gilmer
Jasper Snoek
Zoubin Ghahramani
151
46
0
16 Sep 2021
Phrase Retrieval Learns Passage Retrieval, Too
Phrase Retrieval Learns Passage Retrieval, Too
Jinhyuk Lee
Alexander Wettig
Danqi Chen
RALMDML
82
48
0
16 Sep 2021
Does External Knowledge Help Explainable Natural Language Inference?
  Automatic Evaluation vs. Human Ratings
Does External Knowledge Help Explainable Natural Language Inference? Automatic Evaluation vs. Human Ratings
Hendrik Schuff
Hsiu-yu Yang
Heike Adel
Ngoc Thang Vu
ELMReLMLRM
62
13
0
16 Sep 2021
Scaling Laws for Neural Machine Translation
Scaling Laws for Neural Machine Translation
Behrooz Ghorbani
Orhan Firat
Markus Freitag
Ankur Bapna
M. Krikun
Xavier Garcia
Ciprian Chelba
Colin Cherry
90
103
0
16 Sep 2021
Language Models are Few-shot Multilingual Learners
Language Models are Few-shot Multilingual Learners
Genta Indra Winata
Andrea Madotto
Zhaojiang Lin
Rosanne Liu
J. Yosinski
Pascale Fung
ELMLRM
115
138
0
16 Sep 2021
On the Complementarity of Data Selection and Fine Tuning for Domain
  Adaptation
On the Complementarity of Data Selection and Fine Tuning for Domain Adaptation
Dan Iter
David Grangier
92
10
0
15 Sep 2021
Dialogue State Tracking with a Language Model using Schema-Driven
  Prompting
Dialogue State Tracking with a Language Model using Schema-Driven Prompting
Chia-Hsuan Lee
Hao Cheng
Mari Ostendorf
102
132
0
15 Sep 2021
Challenges in Detoxifying Language Models
Challenges in Detoxifying Language Models
Johannes Welbl
Amelia Glaese
J. Uesato
Sumanth Dathathri
John F. J. Mellor
Lisa Anne Hendricks
Kirsty Anderson
Pushmeet Kohli
Ben Coppin
Po-Sen Huang
LM&MA
313
196
0
15 Sep 2021
Topic Transferable Table Question Answering
Topic Transferable Table Question Answering
Saneem A. Chemmengath
Vishwajeet Kumar
Samarth Bharadwaj
Jaydeep Sen
Mustafa Canim
Soumen Chakrabarti
A. Gliozzo
Karthik Sankaranarayanan
OOD
96
11
0
15 Sep 2021
Prefix-to-SQL: Text-to-SQL Generation from Incomplete User Questions
Prefix-to-SQL: Text-to-SQL Generation from Incomplete User Questions
Naihao Deng
Shuaichen Chang
Peng Shi
Tao Yu
Rui Zhang
LMTD
64
4
0
15 Sep 2021
Image Captioning for Effective Use of Language Models in Knowledge-Based
  Visual Question Answering
Image Captioning for Effective Use of Language Models in Knowledge-Based Visual Question Answering
Ander Salaberria
Gorka Azkune
Oier López de Lacalle
Aitor Soroa Etxabe
Eneko Agirre
92
61
0
15 Sep 2021
Improving Text Auto-Completion with Next Phrase Prediction
Improving Text Auto-Completion with Next Phrase Prediction
Dong-Ho Lee
Zhiqiang Hu
Roy Ka-wei Lee
LRM
50
4
0
15 Sep 2021
Attention Is Indeed All You Need: Semantically Attention-Guided Decoding
  for Data-to-Text NLG
Attention Is Indeed All You Need: Semantically Attention-Guided Decoding for Data-to-Text NLG
Juraj Juraska
M. Walker
56
17
0
15 Sep 2021
Summarize-then-Answer: Generating Concise Explanations for Multi-hop
  Reading Comprehension
Summarize-then-Answer: Generating Concise Explanations for Multi-hop Reading Comprehension
Naoya Inoue
H. Trivedi
Steven K. Sinha
Niranjan Balasubramanian
Kentaro Inui
78
16
0
14 Sep 2021
KFCNet: Knowledge Filtering and Contrastive Learning Network for
  Generative Commonsense Reasoning
KFCNet: Knowledge Filtering and Contrastive Learning Network for Generative Commonsense Reasoning
Haonan Li
Yeyun Gong
Jian Jiao
Ruofei Zhang
Timothy Baldwin
Nan Duan
OffRL
93
6
0
14 Sep 2021
Exploring Prompt-based Few-shot Learning for Grounded Dialog Generation
Exploring Prompt-based Few-shot Learning for Grounded Dialog Generation
Chujie Zheng
Minlie Huang
104
44
0
14 Sep 2021
Task-adaptive Pre-training and Self-training are Complementary for
  Natural Language Understanding
Task-adaptive Pre-training and Self-training are Complementary for Natural Language Understanding
Shiyang Li
Semih Yavuz
Wenhu Chen
Xifeng Yan
69
12
0
14 Sep 2021
STraTA: Self-Training with Task Augmentation for Better Few-shot
  Learning
STraTA: Self-Training with Task Augmentation for Better Few-shot Learning
Tu Vu
Minh-Thang Luong
Quoc V. Le
Grady Simon
Mohit Iyyer
176
61
0
13 Sep 2021
SituatedQA: Incorporating Extra-Linguistic Contexts into QA
SituatedQA: Incorporating Extra-Linguistic Contexts into QA
Michael J.Q. Zhang
Eunsol Choi
RALM
87
154
0
13 Sep 2021
Packed Levitated Marker for Entity and Relation Extraction
Packed Levitated Marker for Entity and Relation Extraction
Deming Ye
Yankai Lin
Peng Li
Maosong Sun
212
112
0
13 Sep 2021
Question Answering over Electronic Devices: A New Benchmark Dataset and
  a Multi-Task Learning based QA Framework
Question Answering over Electronic Devices: A New Benchmark Dataset and a Multi-Task Learning based QA Framework
Abhilash Nandy
Soumya Sharma
Shubham Maddhashiya
K. Sachdeva
Pawan Goyal
Niloy Ganguly
68
19
0
13 Sep 2021
Abstract, Rationale, Stance: A Joint Model for Scientific Claim
  Verification
Abstract, Rationale, Stance: A Joint Model for Scientific Claim Verification
Zhiwei Zhang
Jiyi Li
Fumiyo Fukumoto
Yanming Ye
84
28
0
13 Sep 2021
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language
  Understanding and Generation
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation
Yunfan Shao
Zhichao Geng
Yitao Liu
Junqi Dai
Hang Yan
Fei Yang
Li Zhe
Hujun Bao
Xipeng Qiu
MedIm
148
151
0
13 Sep 2021
Contrastive Learning for Context-aware Neural Machine TranslationUsing
  Coreference Information
Contrastive Learning for Context-aware Neural Machine TranslationUsing Coreference Information
Yong-keun Hwang
Hyungu Yun
Kyomin Jung
64
11
0
13 Sep 2021
How to Select One Among All? An Extensive Empirical Study Towards the
  Robustness of Knowledge Distillation in Natural Language Understanding
How to Select One Among All? An Extensive Empirical Study Towards the Robustness of Knowledge Distillation in Natural Language Understanding
Tianda Li
Ahmad Rashid
A. Jafari
Pranav Sharma
A. Ghodsi
Mehdi Rezagholizadeh
AAML
122
5
0
13 Sep 2021
SHAPE: Shifted Absolute Position Embedding for Transformers
SHAPE: Shifted Absolute Position Embedding for Transformers
Shun Kiyono
Sosuke Kobayashi
Jun Suzuki
Kentaro Inui
292
47
0
13 Sep 2021
Good-Enough Example Extrapolation
Good-Enough Example Extrapolation
Jason W. Wei
60
6
0
12 Sep 2021
End-to-End Conversational Search for Online Shopping with Utterance
  Transfer
End-to-End Conversational Search for Online Shopping with Utterance Transfer
Liqiang Xiao
Jun Ma
Xin Luna Dong
Pascual Martínez-Gómez
Nasser Zalmout
Wei Chen
Tong Zhao
Hao He
Yaohui Jin
46
12
0
12 Sep 2021
"Let Your Characters Tell Their Story": A Dataset for Character-Centric
  Narrative Understanding
"Let Your Characters Tell Their Story": A Dataset for Character-Centric Narrative Understanding
Faeze Brahman
Meng Huang
Oyvind Tafjord
Chao Zhao
Mrinmaya Sachan
Snigdha Chaturvedi
77
57
0
12 Sep 2021
Multilingual Translation via Grafting Pre-trained Language Models
Multilingual Translation via Grafting Pre-trained Language Models
Zewei Sun
Mingxuan Wang
Lei Li
AI4CE
240
22
0
11 Sep 2021
Semantic Categorization of Social Knowledge for Commonsense Question
  Answering
Semantic Categorization of Social Knowledge for Commonsense Question Answering
Gengyu Wang
Xiaochen Hou
Diyi Yang
Kathleen McKeown
Jing Huang
VLM
49
3
0
11 Sep 2021
StreamHover: Livestream Transcript Summarization and Annotation
StreamHover: Livestream Transcript Summarization and Annotation
Sangwoo Cho
Franck Dernoncourt
Timothy Jeewun Ganter
Trung Bui
Nedim Lipka
Walter Chang
Hailin Jin
Jonathan Brandt
H. Foroosh
Fei Liu
3DGSAI4TS
75
29
0
11 Sep 2021
PICARD: Parsing Incrementally for Constrained Auto-Regressive Decoding
  from Language Models
PICARD: Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models
Torsten Scholak
Nathan Schucher
Dzmitry Bahdanau
236
396
0
10 Sep 2021
Previous
123...175176177...196197198
Next