ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,891 papers shown
Title
LambdaKG: A Library for Pre-trained Language Model-Based Knowledge Graph
  Embeddings
LambdaKG: A Library for Pre-trained Language Model-Based Knowledge Graph Embeddings
Xin Xie
Zhoubo Li
Xiaohan Wang
Feiyu Xiong
Ningyu Zhang
113
11
0
01 Oct 2022
Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple
  Tasks
Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks
Zhenhailong Wang
Xiaoman Pan
Dian Yu
Dong Yu
Jianshu Chen
Heng Ji
VLM
109
10
0
01 Oct 2022
DecAF: Joint Decoding of Answers and Logical Forms for Question
  Answering over Knowledge Bases
DecAF: Joint Decoding of Answers and Logical Forms for Question Answering over Knowledge Bases
Donghan Yu
Shenmin Zhang
Patrick Ng
Henghui Zhu
Alexander Hanbo Li
Jun Wang
Yiqun Hu
William Wang
Zhiguo Wang
Bing Xiang
252
87
0
30 Sep 2022
Calibrating Sequence likelihood Improves Conditional Language Generation
Calibrating Sequence likelihood Improves Conditional Language Generation
Yao-Min Zhao
Misha Khalman
Rishabh Joshi
Shashi Narayan
Mohammad Saleh
Peter J. Liu
UQLM
111
135
0
30 Sep 2022
Out-of-Distribution Detection and Selective Generation for Conditional
  Language Models
Out-of-Distribution Detection and Selective Generation for Conditional Language Models
Jie Jessie Ren
Jiaming Luo
Yao-Min Zhao
Kundan Krishna
Mohammad Saleh
Balaji Lakshminarayanan
Peter J. Liu
OODD
129
114
0
30 Sep 2022
Zero-Shot Retrieval with Search Agents and Hybrid Environments
Zero-Shot Retrieval with Search Agents and Hybrid Environments
Michelle Chen Huebscher
Christian Buck
Massimiliano Ciaramita
S. Rothe
137
9
0
30 Sep 2022
AudioGen: Textually Guided Audio Generation
AudioGen: Textually Guided Audio Generation
Felix Kreuk
Gabriel Synnaeve
Adam Polyak
Uriel Singer
Alexandre Défossez
Jade Copet
Devi Parikh
Yaniv Taigman
Yossi Adi
DiffM
139
309
0
30 Sep 2022
Construction and Applications of Billion-Scale Pre-Trained Multimodal
  Business Knowledge Graph
Construction and Applications of Billion-Scale Pre-Trained Multimodal Business Knowledge Graph
Shumin Deng
Chengming Wang
Zhoubo Li
Ningyu Zhang
Zelin Dai
...
Mosha Chen
Jiaoyan Chen
Jeff Z. Pan
Bryan Hooi
Huajun Chen
VLM
117
22
0
30 Sep 2022
What Makes Pre-trained Language Models Better Zero-shot Learners?
What Makes Pre-trained Language Models Better Zero-shot Learners?
Jinghui Lu
Dongsheng Zhu
Weidong Han
Rui Zhao
Brian Mac Namee
Fei Tan
101
24
0
30 Sep 2022
Learning by Distilling Context
Learning by Distilling Context
Charles Burton Snell
Dan Klein
Ruiqi Zhong
ReLMLRM
233
48
0
30 Sep 2022
PART: Pre-trained Authorship Representation Transformer
PART: Pre-trained Authorship Representation Transformer
Javier Huertas-Tato
Álvaro Huertas-García
Alejandro Martín
137
9
0
30 Sep 2022
ConceptNet infused DialoGPT for Underlying Commonsense Understanding and
  Reasoning in Dialogue Response Generation
ConceptNet infused DialoGPT for Underlying Commonsense Understanding and Reasoning in Dialogue Response Generation
Ye Liu
Wolfgang Maier
Wolfgang Minker
Stefan Ultes
89
2
0
29 Sep 2022
DreamFusion: Text-to-3D using 2D Diffusion
DreamFusion: Text-to-3D using 2D Diffusion
Ben Poole
Ajay Jain
Jonathan T. Barron
B. Mildenhall
259
2,445
0
29 Sep 2022
DR.BENCH: Diagnostic Reasoning Benchmark for Clinical Natural Language
  Processing
DR.BENCH: Diagnostic Reasoning Benchmark for Clinical Natural Language Processing
Yanjun Gao
Dmitriy Dligach
Timothy A. Miller
John R. Caskey
Brihat Sharma
M. Churpek
Majid Afshar
ELMLRM
99
21
0
29 Sep 2022
Generate-and-Retrieve: use your predictions to improve retrieval for
  semantic parsing
Generate-and-Retrieve: use your predictions to improve retrieval for semantic parsing
Yury Zemlyanskiy
Michiel de Jong
Joshua Ainslie
Panupong Pasupat
Peter Shaw
Linlu Qiu
Sumit Sanghai
Fei Sha
RALM
193
20
0
29 Sep 2022
A Multiagent Framework for the Asynchronous and Collaborative Extension
  of Multitask ML Systems
A Multiagent Framework for the Asynchronous and Collaborative Extension of Multitask ML Systems
Andrea Gesmundo
104
2
0
29 Sep 2022
GROOT: Corrective Reward Optimization for Generative Sequential Labeling
GROOT: Corrective Reward Optimization for Generative Sequential Labeling
Kazuma Hashimoto
K. Raman
VLM
93
1
0
29 Sep 2022
Dynamic Prompt Learning via Policy Gradient for Semi-structured
  Mathematical Reasoning
Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning
Pan Lu
Liang Qiu
Kai-Wei Chang
Ying Nian Wu
Song-Chun Zhu
Tanmay Rajpurohit
Peter Clark
Ashwin Kalyan
ReLMLRM
209
300
0
29 Sep 2022
Bidirectional Language Models Are Also Few-shot Learners
Bidirectional Language Models Are Also Few-shot Learners
Ajay Patel
Bryan Li
Mohammad Sadegh Rasooli
Noah Constant
Colin Raffel
Chris Callison-Burch
LRM
140
47
0
29 Sep 2022
Re-Imagen: Retrieval-Augmented Text-to-Image Generator
Re-Imagen: Retrieval-Augmented Text-to-Image Generator
Wenhu Chen
Hexiang Hu
Chitwan Saharia
William W. Cohen
VLM
215
178
0
29 Sep 2022
Downstream Datasets Make Surprisingly Good Pretraining Corpora
Downstream Datasets Make Surprisingly Good Pretraining Corpora
Kundan Krishna
Saurabh Garg
Jeffrey P. Bigham
Zachary Chase Lipton
108
33
0
28 Sep 2022
Clinical Language Understanding Evaluation (CLUE)
Clinical Language Understanding Evaluation (CLUE)
Travis R. Goodwin
Dina Demner-Fushman
ELMLM&MA
28
1
0
28 Sep 2022
FiD-Light: Efficient and Effective Retrieval-Augmented Text Generation
FiD-Light: Efficient and Effective Retrieval-Augmented Text Generation
Sebastian Hofstatter
Jiecao Chen
K. Raman
Hamed Zamani
RALM
98
82
0
28 Sep 2022
TVLT: Textless Vision-Language Transformer
TVLT: Textless Vision-Language Transformer
Zineng Tang
Jaemin Cho
Yixin Nie
Joey Tianyi Zhou
VLM
137
31
0
28 Sep 2022
Towards Explaining Autonomy with Verbalised Decision Tree States
Towards Explaining Autonomy with Verbalised Decision Tree States
K. Gavriilidis
A. Munafò
Helen F. Hastie
Conlan Cesar
M. Defilippo
M. Benjamin
40
2
0
28 Sep 2022
YATO: Yet Another deep learning based Text analysis Open toolkit
YATO: Yet Another deep learning based Text analysis Open toolkit
Zeqiang Wang
Yile Wang
Jiageng Wu
Zhiyang Teng
Jie Yang
100
3
0
28 Sep 2022
Using contradictions improves question answering systems
Using contradictions improves question answering systems
Étienne Fortier-Dubois
Domenic Rosati
95
0
0
28 Sep 2022
Information Extraction and Human-Robot Dialogue towards Real-life Tasks:
  A Baseline Study with the MobileCS Dataset
Information Extraction and Human-Robot Dialogue towards Real-life Tasks: A Baseline Study with the MobileCS Dataset
Hong Liu
Hao Peng
Zhijian Ou
Juan-Zi Li
Yi Huang
Junlan Feng
89
7
0
27 Sep 2022
EditEval: An Instruction-Based Benchmark for Text Improvements
EditEval: An Instruction-Based Benchmark for Text Improvements
Jane Dwivedi-Yu
Timo Schick
Zhengbao Jiang
Maria Lomeli
Patrick Lewis
Gautier Izacard
Edouard Grave
Sebastian Riedel
Fabio Petroni
104
28
0
27 Sep 2022
WikiDes: A Wikipedia-Based Dataset for Generating Short Descriptions
  from Paragraphs
WikiDes: A Wikipedia-Based Dataset for Generating Short Descriptions from Paragraphs
Hoang Thang Ta
Abu Bakar Siddiqur Rahman
Navonil Majumder
Amir Hussain
Lotfollah Najjar
N. Howard
Soujanya Poria
Alexander Gelbukh
79
11
0
27 Sep 2022
News Summarization and Evaluation in the Era of GPT-3
News Summarization and Evaluation in the Era of GPT-3
Tanya Goyal
Junyi Jessy Li
Greg Durrett
ELM
128
411
0
26 Sep 2022
Re-contextualizing Fairness in NLP: The Case of India
Re-contextualizing Fairness in NLP: The Case of India
Shaily Bhatt
Sunipa Dev
Partha P. Talukdar
Shachi Dave
Vinodkumar Prabhakaran
110
61
0
25 Sep 2022
Application of Deep Learning in Generating Structured Radiology Reports:
  A Transformer-Based Technique
Application of Deep Learning in Generating Structured Radiology Reports: A Transformer-Based Technique
Seyed Ali Reza Moezzi
Abdolrahman Ghaedi
M. Rahmanian
Seyedeh Zahra Mousavi
A. Sami
MedIm
33
9
0
25 Sep 2022
Can Transformer Models Effectively Detect Software Aspects in
  StackOverflow Discussion?
Can Transformer Models Effectively Detect Software Aspects in StackOverflow Discussion?
Nibir Mandal
Tashreef Muhammad
G. M. Shahariar
95
1
0
24 Sep 2022
Towards Explainable 3D Grounded Visual Question Answering: A New
  Benchmark and Strong Baseline
Towards Explainable 3D Grounded Visual Question Answering: A New Benchmark and Strong Baseline
Lichen Zhao
Daigang Cai
Jing Zhang
Lu Sheng
Dong Xu
Ruizhi Zheng
Yinjie Zhao
Lipeng Wang
Xibo Fan
61
27
0
24 Sep 2022
Multiple-Choice Question Generation: Towards an Automated Assessment
  Framework
Multiple-Choice Question Generation: Towards an Automated Assessment Framework
Vatsal Raina
Mark Gales
AI4EdELM
75
35
0
23 Sep 2022
Promptagator: Few-shot Dense Retrieval From 8 Examples
Promptagator: Few-shot Dense Retrieval From 8 Examples
Zhuyun Dai
Vincent Zhao
Ji Ma
Yi Luan
Jianmo Ni
Jing Lu
A. Bakalov
Kelvin Guu
Keith B. Hall
Ming-Wei Chang
RALM
112
243
0
23 Sep 2022
ET5: A Novel End-to-end Framework for Conversational Machine Reading
  Comprehension
ET5: A Novel End-to-end Framework for Conversational Machine Reading Comprehension
Xiao Zhang
Heyan Huang
Zewen Chi
Xian-Ling Mao
LRM
84
2
0
23 Sep 2022
Towards Faithful Model Explanation in NLP: A Survey
Towards Faithful Model Explanation in NLP: A Survey
Qing Lyu
Marianna Apidianaki
Chris Callison-Burch
XAI
237
121
0
22 Sep 2022
Learning Disentangled Representations for Natural Language Definitions
Learning Disentangled Representations for Natural Language Definitions
Danilo S. Carvalho
Giangiacomo Mercatali
Yingji Zhang
André Freitas
CoGeDRL
93
8
0
22 Sep 2022
Selecting Better Samples from Pre-trained LLMs: A Case Study on Question
  Generation
Selecting Better Samples from Pre-trained LLMs: A Case Study on Question Generation
Xingdi Yuan
Tong Wang
Yen-Hsiang Wang
Emery Fine
Rania Abdelghani
Pauline Lucas
Hélene Sauzéon
Pierre-Yves Oudeyer
94
30
0
22 Sep 2022
Implementing and Experimenting with Diffusion Models for Text-to-Image
  Generation
Implementing and Experimenting with Diffusion Models for Text-to-Image Generation
Robin Zbinden
42
3
0
22 Sep 2022
INFINITY: A Simple Yet Effective Unsupervised Framework for Graph-Text
  Mutual Conversion
INFINITY: A Simple Yet Effective Unsupervised Framework for Graph-Text Mutual Conversion
Yi Xu
Luoyi Fu
Zhouhan Lin
Jiexing Qi
Xinbing Wang
77
3
0
22 Sep 2022
Mega: Moving Average Equipped Gated Attention
Mega: Moving Average Equipped Gated Attention
Xuezhe Ma
Chunting Zhou
Xiang Kong
Junxian He
Liangke Gui
Graham Neubig
Jonathan May
Luke Zettlemoyer
143
185
0
21 Sep 2022
Summarization Programs: Interpretable Abstractive Summarization with
  Neural Modular Trees
Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees
Swarnadeep Saha
Shiyue Zhang
Peter Hase
Joey Tianyi Zhou
106
20
0
21 Sep 2022
T5QL: Taming language models for SQL generation
T5QL: Taming language models for SQL generation
Samuel Arcadinho
David Oliveira Aparício
Hugo Veiga
António Alegria
83
6
0
21 Sep 2022
Extreme Multi-Domain, Multi-Task Learning With Unified Text-to-Text
  Transfer Transformers
Extreme Multi-Domain, Multi-Task Learning With Unified Text-to-Text Transfer Transformers
Adebayo Oshingbesan
Courage Ekoh
Germann Atakpa
Yonah Byaruagaba
27
0
0
21 Sep 2022
Generate rather than Retrieve: Large Language Models are Strong Context
  Generators
Generate rather than Retrieve: Large Language Models are Strong Context Generators
Wenhao Yu
Dan Iter
Shuohang Wang
Yichong Xu
Mingxuan Ju
Soumya Sanyal
Chenguang Zhu
Michael Zeng
Meng Jiang
RALMAIMat
365
341
0
21 Sep 2022
Adapting Pretrained Text-to-Text Models for Long Text Sequences
Adapting Pretrained Text-to-Text Models for Long Text Sequences
Wenhan Xiong
Anchit Gupta
Shubham Toshniwal
Yashar Mehdad
Wen-tau Yih
RALMVLM
120
31
0
21 Sep 2022
Dynamic Relevance Graph Network for Knowledge-Aware Question Answering
Dynamic Relevance Graph Network for Knowledge-Aware Question Answering
Chen Zheng
Parisa Kordjamshidi
42
6
0
20 Sep 2022
Previous
123...151152153...196197198
Next