ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,903 papers shown
Title
Is MultiWOZ a Solved Task? An Interactive TOD Evaluation Framework with
  User Simulator
Is MultiWOZ a Solved Task? An Interactive TOD Evaluation Framework with User Simulator
Qinyuan Cheng
Linyang Li
Guofeng Quan
Feng Gao
Xiaofeng Mou
Xipeng Qiu
72
13
0
26 Oct 2022
Will we run out of data? Limits of LLM scaling based on human-generated
  data
Will we run out of data? Limits of LLM scaling based on human-generated data
Pablo Villalobos
A. Ho
J. Sevilla
T. Besiroglu
Lennart Heim
Marius Hobbhahn
ALM
102
125
0
26 Oct 2022
Universal Evasion Attacks on Summarization Scoring
Universal Evasion Attacks on Summarization Scoring
Wenchuan Mu
Kwan Hui Lim
AAML
81
1
0
25 Oct 2022
Same Pre-training Loss, Better Downstream: Implicit Bias Matters for
  Language Models
Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models
Hong Liu
Sang Michael Xie
Zhiyuan Li
Tengyu Ma
AI4CE
135
55
0
25 Oct 2022
IELM: An Open Information Extraction Benchmark for Pre-Trained Language
  Models
IELM: An Open Information Extraction Benchmark for Pre-Trained Language Models
Chenguang Wang
Xiao Liu
Dawn Song
VLM
41
2
0
25 Oct 2022
Exploring Mode Connectivity for Pre-trained Language Models
Exploring Mode Connectivity for Pre-trained Language Models
Yujia Qin
Cheng Qian
Jing Yi
Weize Chen
Yankai Lin
Xu Han
Zhiyuan Liu
Maosong Sun
Jie Zhou
97
21
0
25 Oct 2022
A Survey on Artificial Intelligence for Music Generation: Agents,
  Domains and Perspectives
A Survey on Artificial Intelligence for Music Generation: Agents, Domains and Perspectives
Carlos Hernandez-Olivan
Javier Hernandez-Olivan
J. R. Beltrán
MGen
98
7
0
25 Oct 2022
Cloning Ideology and Style using Deep Learning
Cloning Ideology and Style using Deep Learning
Omer Beg
Muhammad Nasir Zafar
Waleed Anjum
44
0
0
25 Oct 2022
Multilingual Relation Classification via Efficient and Effective
  Prompting
Multilingual Relation Classification via Efficient and Effective Prompting
Yuxuan Chen
David Harbecke
Leonhard Hennig
LRM
87
12
0
25 Oct 2022
Parameter-Efficient Legal Domain Adaptation
Parameter-Efficient Legal Domain Adaptation
Jonathan Li
R. Bhambhoria
Xiao-Dan Zhu
ELMAILawALM
86
14
0
25 Oct 2022
Rich Knowledge Sources Bring Complex Knowledge Conflicts: Recalibrating
  Models to Reflect Conflicting Evidence
Rich Knowledge Sources Bring Complex Knowledge Conflicts: Recalibrating Models to Reflect Conflicting Evidence
Hung-Ting Chen
Michael J.Q. Zhang
Eunsol Choi
RALMHILM
141
100
0
25 Oct 2022
Evaluating Parameter Efficient Learning for Generation
Evaluating Parameter Efficient Learning for Generation
Peng Xu
M. Patwary
Shrimai Prabhumoye
Virginia Adams
R. Prenger
Ming-Yu Liu
Nayeon Lee
Mohammad Shoeybi
Bryan Catanzaro
MoE
69
3
0
25 Oct 2022
Help me write a poem: Instruction Tuning as a Vehicle for Collaborative
  Poetry Writing
Help me write a poem: Instruction Tuning as a Vehicle for Collaborative Poetry Writing
Tuhin Chakrabarty
Vishakh Padmakumar
Hengxing He
86
82
0
25 Oct 2022
LANS: Large-scale Arabic News Summarization Corpus
LANS: Large-scale Arabic News Summarization Corpus
Abdulaziz Alhamadani
Xuchao Zhang
Jianfeng He
Chang-Tien Lu
49
2
0
24 Oct 2022
Does Self-Rationalization Improve Robustness to Spurious Correlations?
Does Self-Rationalization Improve Robustness to Spurious Correlations?
Alexis Ross
Matthew E. Peters
Ana Marasović
LRM
104
13
0
24 Oct 2022
ExPUNations: Augmenting Puns with Keywords and Explanations
ExPUNations: Augmenting Puns with Keywords and Explanations
Jiao Sun
Anjali Narayan-Chen
Shereen Oraby
Alessandra Cervone
Tagyoung Chung
Jing Huang
Yang Liu
Nanyun Peng
81
10
0
24 Oct 2022
MetaFormer Baselines for Vision
MetaFormer Baselines for Vision
Weihao Yu
Chenyang Si
Pan Zhou
Mi Luo
Yichen Zhou
Jiashi Feng
Shuicheng Yan
Xinchao Wang
MoE
99
171
0
24 Oct 2022
Different Tunes Played with Equal Skill: Exploring a Unified
  Optimization Subspace for Delta Tuning
Different Tunes Played with Equal Skill: Exploring a Unified Optimization Subspace for Delta Tuning
Jing Yi
Weize Chen
Yujia Qin
Yankai Lin
Ning Ding
Xu Han
Zhiyuan Liu
Maosong Sun
Jie Zhou
113
2
0
24 Oct 2022
ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and
  Effective Text Generation
ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation
Junyi Li
Tianyi Tang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
81
17
0
24 Oct 2022
Structural generalization is hard for sequence-to-sequence models
Structural generalization is hard for sequence-to-sequence models
Yuekun Yao
Alexander Koller
88
22
0
24 Oct 2022
Are Current Task-oriented Dialogue Systems Able to Satisfy Impolite
  Users?
Are Current Task-oriented Dialogue Systems Able to Satisfy Impolite Users?
Zhiqiang Hu
Roy Ka-wei Lee
Nancy F. Chen
57
5
0
24 Oct 2022
TIARA: Multi-grained Retrieval for Robust Question Answering over Large
  Knowledge Bases
TIARA: Multi-grained Retrieval for Robust Question Answering over Large Knowledge Bases
Yiheng Shu
Zhiwei Yu
Yuhan Li
Börje F. Karlsson
Tingting Ma
Yuzhong Qu
Chin-Yew Lin
89
74
0
24 Oct 2022
Event-Centric Question Answering via Contrastive Learning and Invertible
  Event Transformation
Event-Centric Question Answering via Contrastive Learning and Invertible Event Transformation
Junru Lu
Xingwei Tan
Gabriele Pergola
Lin Gui
Yulan He
98
11
0
24 Oct 2022
Retrieval Augmentation for Commonsense Reasoning: A Unified Approach
Retrieval Augmentation for Commonsense Reasoning: A Unified Approach
Wenhao Yu
Chenguang Zhu
Zhihan Zhang
Shuohang Wang
Zhuosheng Zhang
Yuwei Fang
Meng Jiang
LRMReLM
64
19
0
23 Oct 2022
Knowledge Transfer from Answer Ranking to Answer Generation
Knowledge Transfer from Answer Ranking to Answer Generation
Matteo Gabburo
Rik Koncel-Kedziorski
Siddhant Garg
Luca Soldaini
Alessandro Moschitti
59
8
0
23 Oct 2022
Discriminative Language Model as Semantic Consistency Scorer for
  Prompt-based Few-Shot Text Classification
Discriminative Language Model as Semantic Consistency Scorer for Prompt-based Few-Shot Text Classification
Zhipeng Xie
Yahe Li
49
0
0
23 Oct 2022
Generative Knowledge Graph Construction: A Review
Generative Knowledge Graph Construction: A Review
Hongbin Ye
Ningyu Zhang
Hui Chen
Huajun Chen
127
75
0
23 Oct 2022
Towards Generalizable and Robust Text-to-SQL Parsing
Towards Generalizable and Robust Text-to-SQL Parsing
Chang Gao
Bowen Li
Wenxuan Zhang
W. Lam
Binhua Li
Fei Huang
Luo Si
Yongbin Li
123
8
0
23 Oct 2022
Learning to Perform Complex Tasks through Compositional Fine-Tuning of
  Language Models
Learning to Perform Complex Tasks through Compositional Fine-Tuning of Language Models
Victor S. Bursztyn
David Demeter
Doug Downey
Larry Birnbaum
ReLMLRM
83
10
0
23 Oct 2022
Model ensemble instead of prompt fusion: a sample-specific knowledge
  transfer method for few-shot prompt tuning
Model ensemble instead of prompt fusion: a sample-specific knowledge transfer method for few-shot prompt tuning
Xiangyu Peng
Chen Xing
Prafulla Kumar Choubey
Chien-Sheng Wu
Caiming Xiong
VLM
137
12
0
23 Oct 2022
Language Model Pre-Training with Sparse Latent Typing
Language Model Pre-Training with Sparse Latent Typing
Liliang Ren
Zixuan Zhang
H. Wang
Clare R. Voss
Chengxiang Zhai
Heng Ji
98
3
0
23 Oct 2022
The Curious Case of Absolute Position Embeddings
The Curious Case of Absolute Position Embeddings
Koustuv Sinha
Amirhossein Kazemnejad
Siva Reddy
J. Pineau
Dieuwke Hupkes
Adina Williams
135
15
0
23 Oct 2022
Exploring The Landscape of Distributional Robustness for Question
  Answering Models
Exploring The Landscape of Distributional Robustness for Question Answering Models
Anas Awadalla
Mitchell Wortsman
Gabriel Ilharco
Sewon Min
Ian H. Magnusson
Hannaneh Hajishirzi
Ludwig Schmidt
ELMOODKELM
116
21
0
22 Oct 2022
MetaLogic: Logical Reasoning Explanations with Fine-Grained Structure
MetaLogic: Logical Reasoning Explanations with Fine-Grained Structure
Yinya Huang
Hongming Zhang
Ruixin Hong
Xiaodan Liang
Changshui Zhang
Dong Yu
LRM
101
7
0
22 Oct 2022
ECTSum: A New Benchmark Dataset For Bullet Point Summarization of Long
  Earnings Call Transcripts
ECTSum: A New Benchmark Dataset For Bullet Point Summarization of Long Earnings Call Transcripts
Rajdeep Mukherjee
Abhinav Bohra
Akash Banerjee
Soumya Sharma
Manjunath Hegde
...
Shivani Shrivastava
Koustuv Dasgupta
Niloy Ganguly
Saptarshi Ghosh
Pawan Goyal
RALM
114
49
0
22 Oct 2022
The Shared Task on Gender Rewriting
The Shared Task on Gender Rewriting
Bashar Alhafni
Nizar Habash
Houda Bouamor
Ossama Obeid
Sultan Alrowili
...
Mohamed Gabr
Abderrahmane Issam
Abdelrahim Qaddoumi
K. Vijay-Shanker
Mahmoud Zyate
77
2
0
22 Oct 2022
PATS: Sensitivity-aware Noisy Learning for Pretrained Language Models
PATS: Sensitivity-aware Noisy Learning for Pretrained Language Models
Yupeng Zhang
Hongzhi Zhang
Sirui Wang
Wei Wu
Zhoujun Li
AAML
94
1
0
22 Oct 2022
S2WAT: Image Style Transfer via Hierarchical Vision Transformer using
  Strips Window Attention
S2WAT: Image Style Transfer via Hierarchical Vision Transformer using Strips Window Attention
Chi Zhang
Lu Zhou
Lei Wang
Zaiyan Dai
Jun Yang
ViT
132
27
0
22 Oct 2022
ReasTAP: Injecting Table Reasoning Skills During Pre-training via
  Synthetic Reasoning Examples
ReasTAP: Injecting Table Reasoning Skills During Pre-training via Synthetic Reasoning Examples
Yilun Zhao
Linyong Nan
Zhenting Qi
Rui Zhang
Dragomir R. Radev
ReLMLMTDLRM
113
39
0
22 Oct 2022
P$^3$LM: Probabilistically Permuted Prophet Language Modeling for
  Generative Pre-Training
P3^33LM: Probabilistically Permuted Prophet Language Modeling for Generative Pre-Training
Junwei Bao
Yifan Wang
Jiangyong Ying
Yeyun Gong
Jing Zhao
Youzheng Wu
Xiaodong He
70
1
0
22 Oct 2022
Open-domain Question Answering via Chain of Reasoning over Heterogeneous
  Knowledge
Open-domain Question Answering via Chain of Reasoning over Heterogeneous Knowledge
Kaixin Ma
Hao Cheng
Xiaodong Liu
Eric Nyberg
Jianfeng Gao
LRM
215
15
0
22 Oct 2022
A Dataset for Plain Language Adaptation of Biomedical Abstracts
A Dataset for Plain Language Adaptation of Biomedical Abstracts
Kush Attal
Brian D. Ondov
Dina Demner-Fushman
77
26
0
21 Oct 2022
Augmenting Multi-Turn Text-to-SQL Datasets with Self-Play
Augmenting Multi-Turn Text-to-SQL Datasets with Self-Play
Qi Liu
Zihuiwen Ye
Tao Yu
Phil Blunsom
Linfeng Song
76
11
0
21 Oct 2022
Decoding a Neural Retriever's Latent Space for Query Suggestion
Decoding a Neural Retriever's Latent Space for Query Suggestion
Leonard Adolphs
Michelle Chen Huebscher
Christian Buck
Sertan Girgin
Olivier Bachem
Massimiliano Ciaramita
Thomas Hofmann
RALM
82
8
0
21 Oct 2022
LittleBird: Efficient Faster & Longer Transformer for Question Answering
LittleBird: Efficient Faster & Longer Transformer for Question Answering
Minchul Lee
Kijong Han
M. Shin
VLM
113
6
0
21 Oct 2022
Is Encoder-Decoder Redundant for Neural Machine Translation?
Is Encoder-Decoder Redundant for Neural Machine Translation?
Yingbo Gao
Christian Herold
Zijian Yang
Hermann Ney
76
4
0
21 Oct 2022
InforMask: Unsupervised Informative Masking for Language Model
  Pretraining
InforMask: Unsupervised Informative Masking for Language Model Pretraining
Nafis Sadeq
Canwen Xu
Julian McAuley
100
13
0
21 Oct 2022
Augmentation with Projection: Towards an Effective and Efficient Data
  Augmentation Paradigm for Distillation
Augmentation with Projection: Towards an Effective and Efficient Data Augmentation Paradigm for Distillation
Ziqi Wang
Yuexin Wu
Frederick Liu
Daogao Liu
Le Hou
Hongkun Yu
Jing Li
Heng Ji
88
5
0
21 Oct 2022
Metric-guided Distillation: Distilling Knowledge from the Metric to
  Ranker and Retriever for Generative Commonsense Reasoning
Metric-guided Distillation: Distilling Knowledge from the Metric to Ranker and Retriever for Generative Commonsense Reasoning
Xingwei He
Yeyun Gong
Alex Jin
Weizhen Qi
Hang Zhang
Jian Jiao
Bartuer Zhou
Biao Cheng
Sm Yiu
Nan Duan
62
11
0
21 Oct 2022
Efficiently Tuned Parameters are Task Embeddings
Efficiently Tuned Parameters are Task Embeddings
Wangchunshu Zhou
Canwen Xu
Julian McAuley
58
8
0
21 Oct 2022
Previous
123...147148149...197198199
Next