ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,903 papers shown
Title
Natural Language Deduction with Incomplete Information
Natural Language Deduction with Incomplete Information
Zayne Sprague
Kaj Bostrom
Swarat Chaudhuri
Greg Durrett
LRM
96
17
0
01 Nov 2022
T5lephone: Bridging Speech and Text Self-supervised Models for Spoken
  Language Understanding via Phoneme level T5
T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
Chan-Jan Hsu
Ho-Lam Chung
Hung-yi Lee
Yu Tsao
118
6
0
01 Nov 2022
VarMAE: Pre-training of Variational Masked Autoencoder for
  Domain-adaptive Language Understanding
VarMAE: Pre-training of Variational Masked Autoencoder for Domain-adaptive Language Understanding
Dou Hu
Xiaolong Hou
Xiyang Du
Mengyuan Zhou
Lian-Xin Jiang
Yang Mo
Xiaofeng Shi
99
13
0
01 Nov 2022
A General Search-based Framework for Generating Textual Counterfactual
  Explanations
A General Search-based Framework for Generating Textual Counterfactual Explanations
Daniel Gilo
Shaul Markovitch
LRM
92
0
0
01 Nov 2022
CONDAQA: A Contrastive Reading Comprehension Dataset for Reasoning about
  Negation
CONDAQA: A Contrastive Reading Comprehension Dataset for Reasoning about Negation
Abhilasha Ravichander
Matt Gardner
Ana Marasović
112
35
0
01 Nov 2022
FRSUM: Towards Faithful Abstractive Summarization via Enhancing Factual
  Robustness
FRSUM: Towards Faithful Abstractive Summarization via Enhancing Factual Robustness
Wenhao Wu
Wei Li
Jiachen Liu
Xinyan Xiao
Ziqiang Cao
Sujian Li
Hua Wu
HILM
77
11
0
01 Nov 2022
Training Vision-Language Models with Less Bimodal Supervision
Training Vision-Language Models with Less Bimodal Supervision
Elad Segal
Ben Bogin
Jonathan Berant
VLM
53
2
0
01 Nov 2022
A Close Look into the Calibration of Pre-trained Language Models
A Close Look into the Calibration of Pre-trained Language Models
Yangyi Chen
Lifan Yuan
Ganqu Cui
Zhiyuan Liu
Heng Ji
153
53
0
31 Oct 2022
Where to start? Analyzing the potential value of intermediate models
Where to start? Analyzing the potential value of intermediate models
Leshem Choshen
Elad Venezian
Shachar Don-Yehiya
Noam Slonim
Yoav Katz
MoMe
102
27
0
31 Oct 2022
Generating Sequences by Learning to Self-Correct
Generating Sequences by Learning to Self-Correct
Sean Welleck
Ximing Lu
Peter West
Faeze Brahman
T. Shen
Daniel Khashabi
Yejin Choi
LRM
111
238
0
31 Oct 2022
Zero-Shot Text Classification with Self-Training
Zero-Shot Text Classification with Self-Training
Ariel Gera
Alon Halfon
Eyal Shnarch
Yotam Perlitz
L. Ein-Dor
Noam Slonim
VLM
76
62
0
31 Oct 2022
AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning
Yaqing Wang
Sahaj Agarwal
Subhabrata Mukherjee
Xiaodong Liu
Jing Gao
Ahmed Hassan Awadallah
Jianfeng Gao
MoE
109
136
0
31 Oct 2022
SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for
  Text Generation and Modular Control
SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control
Xiaochuang Han
Sachin Kumar
Yulia Tsvetkov
167
91
0
31 Oct 2022
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained
  Transformers
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
Elias Frantar
Saleh Ashkboos
Torsten Hoefler
Dan Alistarh
MQ
206
1,015
0
31 Oct 2022
Effective Cross-Task Transfer Learning for Explainable Natural Language
  Inference with T5
Effective Cross-Task Transfer Learning for Explainable Natural Language Inference with T5
Irina Bigoulaeva
Rachneet Sachdeva
Harish Tayyar Madabushi
Aline Villavicencio
Iryna Gurevych
LRM
91
5
0
31 Oct 2022
When Language Model Meets Private Library
When Language Model Meets Private Library
Daoguang Zan
Bei Chen
Zeqi Lin
Bei Guan
Yongji Wang
Jian-Guang Lou
ALM
134
74
0
31 Oct 2022
RLET: A Reinforcement Learning Based Approach for Explainable QA with
  Entailment Trees
RLET: A Reinforcement Learning Based Approach for Explainable QA with Entailment Trees
Tengxiao Liu
Qipeng Guo
Xiangkun Hu
Yue Zhang
Xipeng Qiu
Zheng Zhang
LRM
75
14
0
31 Oct 2022
GPS: Genetic Prompt Search for Efficient Few-shot Learning
GPS: Genetic Prompt Search for Efficient Few-shot Learning
Hanwei Xu
Yujun Chen
Yulun Du
Nan Shao
Yanggang Wang
Haiyu Li
Zhilin Yang
VLM
63
31
0
31 Oct 2022
CodeEditor: Learning to Edit Source Code with Pre-trained Models
CodeEditor: Learning to Edit Source Code with Pre-trained Models
Jia Li
Ge Li
Zhuo Li
Zhi Jin
Xing Hu
Kechi Zhang
Zhiyi Fu
KELM
79
28
0
31 Oct 2022
Learning to Decompose: Hypothetical Question Decomposition Based on
  Comparable Texts
Learning to Decompose: Hypothetical Question Decomposition Based on Comparable Texts
Ben Zhou
Kyle Richardson
Xiaodong Yu
Dan Roth
ReLM
101
22
0
30 Oct 2022
Generate, Discriminate and Contrast: A Semi-Supervised Sentence
  Representation Learning Framework
Generate, Discriminate and Contrast: A Semi-Supervised Sentence Representation Learning Framework
Yiming Chen
Yan Zhang
Bin Wang
Zuozhu Liu
Haizhou Li
62
26
0
30 Oct 2022
Fine-Grained Emotional Paraphrasing along Emotion Gradients
Fine-Grained Emotional Paraphrasing along Emotion Gradients
Justin J Xie
54
1
0
30 Oct 2022
How Far are We from Robust Long Abstractive Summarization?
How Far are We from Robust Long Abstractive Summarization?
Huan Yee Koh
Jiaxin Ju
He Zhang
Ming Liu
Shirui Pan
HILM
113
40
0
30 Oct 2022
Beyond Prompting: Making Pre-trained Language Models Better Zero-shot
  Learners by Clustering Representations
Beyond Prompting: Making Pre-trained Language Models Better Zero-shot Learners by Clustering Representations
Yu Fei
Ping Nie
Zhao Meng
Roger Wattenhofer
Mrinmaya Sachan
VLM
100
20
0
29 Oct 2022
Exploiting prompt learning with pre-trained language models for
  Alzheimer's Disease detection
Exploiting prompt learning with pre-trained language models for Alzheimer's Disease detection
Yi Wang
Jiajun Deng
Tianzi Wang
Bo Zheng
Shoukang Hu
Xunying Liu
Helen M. Meng
97
16
0
29 Oct 2022
Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language
  Models
Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models
Xiaoman Pan
Wenlin Yao
Hongming Zhang
Dian Yu
Dong Yu
Jianshu Chen
KELM
298
25
0
28 Oct 2022
Just-DREAM-about-it: Figurative Language Understanding with DREAM-FLUTE
Just-DREAM-about-it: Figurative Language Understanding with DREAM-FLUTE
Yuling Gu
Yao Fu
Valentina Pyatkin
Ian H. Magnusson
Bhavana Dalvi
Peter Clark
255
8
0
28 Oct 2022
BRATsynthetic: Text De-identification using a Markov Chain Replacement
  Strategy for Surrogate Personal Identifying Information
BRATsynthetic: Text De-identification using a Markov Chain Replacement Strategy for Surrogate Personal Identifying Information
J. D. Osborne
Tobias O'Leary
A. Nadimpalli
S. Aly
Richard Kennedy
23
1
0
28 Oct 2022
DORE: Document Ordered Relation Extraction based on Generative Framework
DORE: Document Ordered Relation Extraction based on Generative Framework
Qipeng Guo
Yuqing Yang
Hang Yan
Xipeng Qiu
Zheng Zhang
127
7
0
28 Oct 2022
UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal
  Guidance
UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal Guidance
Wei Li
Xue Xu
Xinyan Xiao
Jiacheng Liu
Hu Yang
...
Zhanpeng Wang
Zhifan Feng
Qiaoqiao She
Yajuan Lyu
Hua Wu
232
30
0
28 Oct 2022
QUILL: Query Intent with Large Language Models using Retrieval
  Augmentation and Multi-stage Distillation
QUILL: Query Intent with Large Language Models using Retrieval Augmentation and Multi-stage Distillation
Krishna Srinivasan
K. Raman
Anupam Samanta
Ling-Yen Liao
L. Bertelli
Michael Bendersky
RALMLRM
81
20
0
27 Oct 2022
Improving abstractive summarization with energy-based re-ranking
Improving abstractive summarization with energy-based re-ranking
Diogo Pernes
Afonso Mendes
André F. T. Martins
78
6
0
27 Oct 2022
Terminology-aware Medical Dialogue Generation
Terminology-aware Medical Dialogue Generation
Chen Tang
Hongbo Zhang
Tyler Loakman
Chenghua Lin
Frank Guerin
LM&MAMedIm
60
13
0
27 Oct 2022
What Language Model to Train if You Have One Million GPU Hours?
What Language Model to Train if You Have One Million GPU Hours?
Teven Le Scao
Thomas Wang
Daniel Hesslow
Lucile Saulnier
Stas Bekman
...
Lintang Sutawika
Jaesung Tae
Zheng-Xin Yong
Julien Launay
Iz Beltagy
MoEAI4CE
320
109
0
27 Oct 2022
Dial2vec: Self-Guided Contrastive Learning of Unsupervised Dialogue
  Embeddings
Dial2vec: Self-Guided Contrastive Learning of Unsupervised Dialogue Embeddings
Che Liu
Rui Wang
Junfeng Jiang
Yongbin Li
Fei Huang
SSL
115
9
0
27 Oct 2022
Towards Language-driven Scientific AI
Towards Language-driven Scientific AI
José Manuél Gómez-Pérez
54
0
0
27 Oct 2022
Explaining the Explainers in Graph Neural Networks: a Comparative Study
Explaining the Explainers in Graph Neural Networks: a Comparative Study
Antonio Longa
Steve Azzolin
G. Santin
G. Cencetti
Pietro Lio
Bruno Lepri
Andrea Passerini
111
31
0
27 Oct 2022
Can language models handle recursively nested grammatical structures? A
  case study on comparing models and humans
Can language models handle recursively nested grammatical structures? A case study on comparing models and humans
Andrew Kyle Lampinen
ReLMELM
125
36
0
27 Oct 2022
COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with
  Contrastive and Distributionally Robust Learning
COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning
Yue Yu
Chenyan Xiong
Si Sun
Chao Zhang
Arnold Overwijk
VLMOOD
148
22
0
27 Oct 2022
Dictionary-Assisted Supervised Contrastive Learning
Dictionary-Assisted Supervised Contrastive Learning
Patrick Y. Wu
Richard Bonneau
Joshua A. Tucker
Jonathan Nagler
CLIP
66
0
0
27 Oct 2022
Open-vocabulary Semantic Segmentation with Frozen Vision-Language Models
Open-vocabulary Semantic Segmentation with Frozen Vision-Language Models
Chaofan Ma
Yu-Hao Yang
Yanfeng Wang
Ya Zhang
Weidi Xie
VLM
79
48
0
27 Oct 2022
arXivEdits: Understanding the Human Revision Process in Scientific
  Writing
arXivEdits: Understanding the Human Revision Process in Scientific Writing
Chao Jiang
Wei Xu
Samuel Stevens
72
23
0
26 Oct 2022
Privately Fine-Tuning Large Language Models with Differential Privacy
Privately Fine-Tuning Large Language Models with Differential Privacy
R. Behnia
Mohammadreza Ebrahimi
Jason L. Pacheco
B. Padmanabhan
127
51
0
26 Oct 2022
Generalization Differences between End-to-End and Neuro-Symbolic
  Vision-Language Reasoning Systems
Generalization Differences between End-to-End and Neuro-Symbolic Vision-Language Reasoning Systems
Wang Zhu
Jesse Thomason
Robin Jia
VLMOODNAILRM
55
6
0
26 Oct 2022
Broken Neural Scaling Laws
Broken Neural Scaling Laws
Ethan Caballero
Kshitij Gupta
Irina Rish
David M. Krueger
148
76
0
26 Oct 2022
ProVe: A Pipeline for Automated Provenance Verification of Knowledge
  Graphs against Textual Sources
ProVe: A Pipeline for Automated Provenance Verification of Knowledge Graphs against Textual Sources
Gabriel Amaral
Odinaldo Rodrigues
Elena Simperl
67
3
0
26 Oct 2022
Autoregressive Structured Prediction with Language Models
Autoregressive Structured Prediction with Language Models
Tianyu Liu
Yuchen Eleanor Jiang
Nicholas Monath
Ryan Cotterell
Mrinmaya Sachan
87
53
0
26 Oct 2022
MOCHA: A Multi-Task Training Approach for Coherent Text Generation from
  Cognitive Perspective
MOCHA: A Multi-Task Training Approach for Coherent Text Generation from Cognitive Perspective
Zhe Hu
Hou Pong Chan
Lifu Huang
99
8
0
26 Oct 2022
Analyzing Multi-Task Learning for Abstractive Text Summarization
Analyzing Multi-Task Learning for Abstractive Text Summarization
Frederic Kirstein
Jan Philip Wahle
Terry Ruas
Bela Gipp
81
4
0
26 Oct 2022
Uncertainty Sentence Sampling by Virtual Adversarial Perturbation
Uncertainty Sentence Sampling by Virtual Adversarial Perturbation
Han Zhang
Zhen Zhang
Hongfei Jiang
Yang Song
40
0
0
26 Oct 2022
Previous
123...146147148...197198199
Next