ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,948 papers shown
Title
The Prompt Artists
The Prompt Artists
Minsuk Chang
Stefania Druga
Alexander J. Fiannaca
P. Vergani
Chinmay Kulkarni
Carrie J. Cai
Michael Terry
61
66
0
22 Mar 2023
Improving Transformer Performance for French Clinical Notes Classification Using Mixture of Experts on a Limited Dataset
Improving Transformer Performance for French Clinical Notes Classification Using Mixture of Experts on a Limited Dataset
Thanh-Dung Le
P. Jouvet
R. Noumeir
MoEMedIm
144
5
0
22 Mar 2023
Improving Content Retrievability in Search with Controllable Query
  Generation
Improving Content Retrievability in Search with Controllable Query Generation
Gustavo Penha
Enrico Palumbo
Maryam Aziz
Alice Wang
Hugues Bouchard
81
11
0
21 Mar 2023
Transformers in Speech Processing: A Survey
Transformers in Speech Processing: A Survey
S. Latif
Aun Zaidi
Heriberto Cuayáhuitl
Fahad Shamshad
Moazzam Shoukat
Muhammad Usama
Junaid Qadir
179
48
0
21 Mar 2023
TWINS: A Fine-Tuning Framework for Improved Transferability of
  Adversarial Robustness and Generalization
TWINS: A Fine-Tuning Framework for Improved Transferability of Adversarial Robustness and Generalization
Ziquan Liu
Yi Tian Xu
Xiangyang Ji
Antoni B. Chan
AAML
58
18
0
20 Mar 2023
DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4
DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4
Zheng-Long Liu
Yue Huang
Xiao-Xing Yu
Lu Zhang
Zihao Wu
...
Dinggang Shen
Quanzheng Li
Tianming Liu
Dajiang Zhu
Xiang Li
LM&MAMedIm
129
179
0
20 Mar 2023
Towards Reliable Neural Machine Translation with Consistency-Aware
  Meta-Learning
Towards Reliable Neural Machine Translation with Consistency-Aware Meta-Learning
Rongxiang Weng
Qiang Wang
Wensen Cheng
Changfeng Zhu
Min Zhang
74
2
0
20 Mar 2023
Retrieving Multimodal Information for Augmented Generation: A Survey
Retrieving Multimodal Information for Augmented Generation: A Survey
Ruochen Zhao
Hailin Chen
Weishi Wang
Fangkai Jiao
Do Xuan Long
...
Bosheng Ding
Xiaobao Guo
Minzhi Li
Xingxuan Li
Shafiq Joty
133
89
0
20 Mar 2023
Revisiting Automatic Question Summarization Evaluation in the Biomedical
  Domain
Revisiting Automatic Question Summarization Evaluation in the Biomedical Domain
Hongyi Yuan
Yaoyun Zhang
Fei Huang
Songfang Huang
80
1
0
18 Mar 2023
Dual-path Adaptation from Image to Video Transformers
Dual-path Adaptation from Image to Video Transformers
Jungin Park
Jiyoung Lee
Kwanghoon Sohn
ViT
87
38
0
17 Mar 2023
TypeT5: Seq2seq Type Inference using Static Analysis
TypeT5: Seq2seq Type Inference using Static Analysis
Jiayi Wei
Greg Durrett
Işıl Dillig
81
20
0
16 Mar 2023
Unified Visual Relationship Detection with Vision and Language Models
Unified Visual Relationship Detection with Vision and Language Models
Long Zhao
Liangzhe Yuan
Boqing Gong
Huayu Chen
Florian Schroff
Ming-Hsuan Yang
Hartwig Adam
Ting Liu
ObjD
93
9
0
16 Mar 2023
Cognitive Semantic Communication Systems Driven by Knowledge Graph:
  Principle, Implementation, and Performance Evaluation
Cognitive Semantic Communication Systems Driven by Knowledge Graph: Principle, Implementation, and Performance Evaluation
Fuhui Zhou
Yihao Li
Ming Xu
Lu Yuan
Qihui Wu
R. Hu
N. Al-Dhahir
93
21
0
15 Mar 2023
Progress Note Understanding -- Assessment and Plan Reasoning: Overview
  of the 2022 N2C2 Track 3 Shared Task
Progress Note Understanding -- Assessment and Plan Reasoning: Overview of the 2022 N2C2 Track 3 Shared Task
Yanjun Gao
Dmitriy Dligach
Timothy A. Miller
M. Churpek
Özlem Uzuner
Majid Afshar
65
5
0
14 Mar 2023
Revisit Parameter-Efficient Transfer Learning: A Two-Stage Paradigm
Revisit Parameter-Efficient Transfer Learning: A Two-Stage Paradigm
Hengyuan Zhao
Hao Luo
Yuyang Zhao
Pichao Wang
F. Wang
Mike Zheng Shou
78
5
0
14 Mar 2023
Input-length-shortening and text generation via attention values
Input-length-shortening and text generation via attention values
Necset Ozkan Tan
A. Peng
Joshua Bensemann
Qiming Bao
Tim Hartill
M. Gahegan
Michael Witbrock
84
1
0
14 Mar 2023
Diffusion Models in NLP: A Survey
Diffusion Models in NLP: A Survey
Yuansong Zhu
Yu Zhao
DiffMVLMMedIm
95
23
0
14 Mar 2023
Architext: Language-Driven Generative Architecture Design
Architext: Language-Driven Generative Architecture Design
Theodoros Galanos
Antonios Liapis
Georgios N. Yannakakis
VLMAI4CE
80
6
0
13 Mar 2023
Proactive Prioritization of App Issues via Contrastive Learning
Proactive Prioritization of App Issues via Contrastive Learning
Moghis Fereidouni
A. Mosharrof
Umar Farooq
A.B. Siddique
81
6
0
12 Mar 2023
Xformer: Hybrid X-Shaped Transformer for Image Denoising
Xformer: Hybrid X-Shaped Transformer for Image Denoising
Jiale Zhang
Yulun Zhang
Jinjin Gu
Jiahua Dong
Lingyu Kong
Xiaokang Yang
ViT
91
29
0
11 Mar 2023
AUTODIAL: Efficient Asynchronous Task-Oriented Dialogue Model
AUTODIAL: Efficient Asynchronous Task-Oriented Dialogue Model
Prajjwal Bhargava
P. Amini
Shahin Shayandeh
Chinnadhurai Sankar
34
0
0
10 Mar 2023
An Overview on Language Models: Recent Developments and Outlook
An Overview on Language Models: Recent Developments and Outlook
Chengwei Wei
Yun Cheng Wang
Bin Wang
C.-C. Jay Kuo
95
47
0
10 Mar 2023
Planning with Large Language Models for Code Generation
Planning with Large Language Models for Code Generation
Shun Zhang
Zhenfang Chen
Songlin Yang
Mingyu Ding
J. Tenenbaum
Chuang Gan
107
163
0
09 Mar 2023
Greener yet Powerful: Taming Large Code Generation Models with
  Quantization
Greener yet Powerful: Taming Large Code Generation Models with Quantization
Xiaokai Wei
Sujan Kumar Gonugondla
W. Ahmad
Shiqi Wang
Baishakhi Ray
...
Ben Athiwaratkun
Mingyue Shang
M. K. Ramanathan
Parminder Bhatia
Bing Xiang
MQ
61
6
0
09 Mar 2023
disco: a toolkit for Distributional Control of Generative Models
disco: a toolkit for Distributional Control of Generative Models
Germán Kruszewski
Jos Rozen
Marc Dymetman
72
4
0
08 Mar 2023
A Categorical Framework of General Intelligence
A Categorical Framework of General Intelligence
Yang Yuan
87
2
0
08 Mar 2023
Class Cardinality Comparison as a Fermi Problem
Class Cardinality Comparison as a Fermi Problem
Tuan-Phong Nguyen
Simon Razniewski
Gerhard Weikum
60
2
0
08 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of
  Generative AI from GAN to ChatGPT
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
120
554
0
07 Mar 2023
CroCoSum: A Benchmark Dataset for Cross-Lingual Code-Switched
  Summarization
CroCoSum: A Benchmark Dataset for Cross-Lingual Code-Switched Summarization
Ruochen Zhang
Carsten Eickhoff
117
6
0
07 Mar 2023
A Challenging Benchmark for Low-Resource Learning
A Challenging Benchmark for Low-Resource Learning
Yudong Wang
Chang Ma
Qingxiu Dong
Lingpeng Kong
Jingjing Xu
93
4
0
07 Mar 2023
Exploring the Feasibility of ChatGPT for Event Extraction
Exploring the Feasibility of ChatGPT for Event Extraction
Jun Gao
Huan Zhao
Changlong Yu
Ruifeng Xu
83
115
0
07 Mar 2023
Graph Neural Networks in Vision-Language Image Understanding: A Survey
Graph Neural Networks in Vision-Language Image Understanding: A Survey
Henry Senior
Greg Slabaugh
Shanxin Yuan
Luca Rossi
GNN
97
21
0
07 Mar 2023
Multimodal Prompting with Missing Modalities for Visual Recognition
Multimodal Prompting with Missing Modalities for Visual Recognition
Yi-Lun Lee
Yi-Hsuan Tsai
Wei-Chen Chiu
Chen-Yu Lee
VPVLM
116
104
0
06 Mar 2023
Enhancing Activity Prediction Models in Drug Discovery with the Ability
  to Understand Human Language
Enhancing Activity Prediction Models in Drug Discovery with the Ability to Understand Human Language
Philipp Seidl
Andreu Vall
Sepp Hochreiter
Günter Klambauer
149
42
0
06 Mar 2023
xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code
  Understanding, Generation, Translation and Retrieval
xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval
Mohammad Abdullah Matin Khan
M Saiful Bari
Xuan Long Do
Weishi Wang
Md. Rizwan Parvez
Shafiq Joty
ALMELM
124
23
0
06 Mar 2023
UniHCP: A Unified Model for Human-Centric Perceptions
UniHCP: A Unified Model for Human-Centric Perceptions
Yuanzheng Ci
Yizhou Wang
Meilin Chen
Shixiang Tang
Lei Bai
Feng Zhu
Rui Zhao
F. Yu
Donglian Qi
Wanli Ouyang
145
52
0
06 Mar 2023
The Contribution of Knowledge in Visiolinguistic Learning: A Survey on
  Tasks and Challenges
The Contribution of Knowledge in Visiolinguistic Learning: A Survey on Tasks and Challenges
Maria Lymperaiou
Giorgos Stamou
VLM
101
4
0
04 Mar 2023
DiTTO: A Feature Representation Imitation Approach for Improving
  Cross-Lingual Transfer
DiTTO: A Feature Representation Imitation Approach for Improving Cross-Lingual Transfer
Shanu Kumar
Abbaraju Soujanya
Sandipan Dandapat
Sunayana Sitaram
Monojit Choudhury
VLM
80
1
0
04 Mar 2023
Investigating the Translation Performance of a Large Multilingual
  Language Model: the Case of BLOOM
Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM
Rachel Bawden
François Yvon
VLMLRM
90
65
0
03 Mar 2023
Multi-task neural networks by learned contextual inputs
Multi-task neural networks by learned contextual inputs
Anders T. Sandnes
B. Grimstad
O. Kolbjørnsen
51
2
0
01 Mar 2023
CoProver: A Recommender System for Proof Construction
CoProver: A Recommender System for Proof Construction
Eric Yeh
Briland Hitaj
S. Owre
Maena Quemener
N. Shankar
118
5
0
01 Mar 2023
AdaSAM: Boosting Sharpness-Aware Minimization with Adaptive Learning
  Rate and Momentum for Training Deep Neural Networks
AdaSAM: Boosting Sharpness-Aware Minimization with Adaptive Learning Rate and Momentum for Training Deep Neural Networks
Hao Sun
Li Shen
Qihuang Zhong
Liang Ding
Shi-Yong Chen
Jingwei Sun
Jing Li
Guangzhong Sun
Dacheng Tao
98
34
0
01 Mar 2023
N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses
  and Constrained Decoding Space
N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space
Rao Ma
Mark Gales
Kate Knill
Mengjie Qian
103
33
0
01 Mar 2023
GLM-Dialog: Noise-tolerant Pre-training for Knowledge-grounded Dialogue
  Generation
GLM-Dialog: Noise-tolerant Pre-training for Knowledge-grounded Dialogue Generation
Jing Zhang
Yanling Wang
Daniel Zhang-Li
Jifan Yu
Zijun Yao
...
Xiaohan Zhang
Nianyi Lin
Sunrui Lu
Juan Li
Jie Tang
81
19
0
28 Feb 2023
Are Character-level Translations Worth the Wait? Comparing ByT5 and mT5
  for Machine Translation
Are Character-level Translations Worth the Wait? Comparing ByT5 and mT5 for Machine Translation
Lukas Edman
Gabriele Sarti
Antonio Toral
Gertjan van Noord
Arianna Bisazza
80
14
0
28 Feb 2023
The ROOTS Search Tool: Data Transparency for LLMs
The ROOTS Search Tool: Data Transparency for LLMs
Aleksandra Piktus
Christopher Akiki
Paulo Villegas
Hugo Laurenccon
Gérard Dupont
A. Luccioni
Yacine Jernite
Anna Rogers
VLM
106
29
0
27 Feb 2023
Full Stack Optimization of Transformer Inference: a Survey
Full Stack Optimization of Transformer Inference: a Survey
Sehoon Kim
Coleman Hooper
Thanakul Wattanawong
Minwoo Kang
Ruohan Yan
...
Qijing Huang
Kurt Keutzer
Michael W. Mahoney
Y. Shao
A. Gholami
MQ
167
106
0
27 Feb 2023
Systematic Rectification of Language Models via Dead-end Analysis
Systematic Rectification of Language Models via Dead-end Analysis
Mengyao Cao
Mehdi Fatemi
Jackie C.K. Cheung
Samira Shabanian
KELM
75
16
0
27 Feb 2023
Make Every Example Count: On the Stability and Utility of Self-Influence
  for Learning from Noisy NLP Datasets
Make Every Example Count: On the Stability and Utility of Self-Influence for Learning from Noisy NLP Datasets
Irina Bejan
Artem Sokolov
Katja Filippova
TDI
130
11
0
27 Feb 2023
Prompt-based Learning for Text Readability Assessment
Prompt-based Learning for Text Readability Assessment
Bruce W. Lee
J. Lee
VLM
67
13
0
25 Feb 2023
Previous
123...135136137...197198199
Next