ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,924 papers shown
Title
Enhancing Multi-modal and Multi-hop Question Answering via Structured
  Knowledge and Unified Retrieval-Generation
Enhancing Multi-modal and Multi-hop Question Answering via Structured Knowledge and Unified Retrieval-Generation
Qian Yang
Qian Chen
Wen Wang
Baotian Hu
Min Zhang
103
27
0
16 Dec 2022
MURMUR: Modular Multi-Step Reasoning for Semi-Structured Data-to-Text
  Generation
MURMUR: Modular Multi-Step Reasoning for Semi-Structured Data-to-Text Generation
Swarnadeep Saha
Xinyan Velocity Yu
Joey Tianyi Zhou
Ramakanth Pasunuru
Asli Celikyilmaz
ReLMLRM
59
11
0
16 Dec 2022
Dense Feature Memory Augmented Transformers for COVID-19 Vaccination
  Search Classification
Dense Feature Memory Augmented Transformers for COVID-19 Vaccination Search Classification
Jai Gupta
Yi Tay
C. Kamath
Vinh Q. Tran
Donald Metzler
S. Bavadekar
Mimi Sun
E. Gabrilovich
MedIm
40
0
0
16 Dec 2022
Teaching Small Language Models to Reason
Teaching Small Language Models to Reason
Lucie Charlotte Magister
Jonathan Mallinson
Jakub Adamek
Eric Malmi
Aliaksei Severyn
LRMAI4CEReLM
239
267
0
16 Dec 2022
Decoder Tuning: Efficient Language Understanding as Decoding
Decoder Tuning: Efficient Language Understanding as Decoding
Ganqu Cui
Wentao Li
Ning Ding
Longtao Huang
Zhiyuan Liu
Maosong Sun
78
6
0
16 Dec 2022
Lessons learned from the evaluation of Spanish Language Models
Lessons learned from the evaluation of Spanish Language Models
Rodrigo Agerri
Eneko Agirre
ELM
93
15
0
16 Dec 2022
FewFedWeight: Few-shot Federated Learning Framework across Multiple NLP
  Tasks
FewFedWeight: Few-shot Federated Learning Framework across Multiple NLP Tasks
Weilong Dong
Xinwei Wu
Junzhuo Li
Shuangzhi Wu
Chao Bian
Deyi Xiong
FedML
106
6
0
16 Dec 2022
Convolution-enhanced Evolving Attention Networks
Convolution-enhanced Evolving Attention Networks
Yujing Wang
Yaming Yang
Zhuowan Li
Jiangang Bai
Mingliang Zhang
Xiangtai Li
Jiahao Yu
Ce Zhang
Gao Huang
Yu Tong
ViT
102
6
0
16 Dec 2022
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image
  Transformers Help 3D Representation Learning?
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?
Runpei Dong
Zekun Qi
Linfeng Zhang
Junbo Zhang
Jian‐Yuan Sun
Zheng Ge
Li Yi
Kaisheng Ma
ViT3DPC
115
92
0
16 Dec 2022
Saved You A Click: Automatically Answering Clickbait Titles
Saved You A Click: Automatically Answering Clickbait Titles
Andrey Kurenkov
TA Mentor
Yian Zhang
O. Johnson
46
5
0
15 Dec 2022
FiDO: Fusion-in-Decoder optimized for stronger performance and faster
  inference
FiDO: Fusion-in-Decoder optimized for stronger performance and faster inference
Michiel de Jong
Yury Zemlyanskiy
Joshua Ainslie
Nicholas FitzGerald
Sumit Sanghai
Fei Sha
William W. Cohen
VLM
75
36
0
15 Dec 2022
Efficient Long Sequence Modeling via State Space Augmented Transformer
Efficient Long Sequence Modeling via State Space Augmented Transformer
Simiao Zuo
Xiaodong Liu
Jian Jiao
Denis Xavier Charles
Eren Manavoglu
Tuo Zhao
Jianfeng Gao
175
37
0
15 Dec 2022
CLIPPO: Image-and-Language Understanding from Pixels Only
CLIPPO: Image-and-Language Understanding from Pixels Only
Michael Tschannen
Basil Mustafa
N. Houlsby
CLIPVLM
104
49
0
15 Dec 2022
Attributed Question Answering: Evaluation and Modeling for Attributed
  Large Language Models
Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models
Bernd Bohnet
Vinh Q. Tran
Pat Verga
Roee Aharoni
D. Andor
...
Michael Collins
Dipanjan Das
Donald Metzler
Slav Petrov
Kellie Webster
115
65
0
15 Dec 2022
Visually-augmented pretrained language models for NLP tasks without
  images
Visually-augmented pretrained language models for NLP tasks without images
Hangyu Guo
Kun Zhou
Wayne Xin Zhao
Qinyu Zhang
Ji-Rong Wen
VLM
56
10
0
15 Dec 2022
The Effects of In-domain Corpus Size on pre-training BERT
The Effects of In-domain Corpus Size on pre-training BERT
Chris Sanchez
Zheyu Zhang
AI4CE
28
4
0
15 Dec 2022
MASTER: Multi-task Pre-trained Bottlenecked Masked Autoencoders are
  Better Dense Retrievers
MASTER: Multi-task Pre-trained Bottlenecked Masked Autoencoders are Better Dense Retrievers
Kun Zhou
Xiao Liu
Yeyun Gong
Wayne Xin Zhao
Daxin Jiang
Nan Duan
Ji-Rong Wen
105
16
0
15 Dec 2022
Summary-Oriented Vision Modeling for Multimodal Abstractive
  Summarization
Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization
Yunlong Liang
Fandong Meng
Jinan Xu
Jiaan Wang
Jinan Xu
Jie Zhou
103
22
0
15 Dec 2022
Build-a-Bot: Teaching Conversational AI Using a Transformer-Based Intent
  Recognition and Question Answering Architecture
Build-a-Bot: Teaching Conversational AI Using a Transformer-Based Intent Recognition and Question Answering Architecture
Kate Pearce
Sharifa Alghowinem
C. Breazeal
75
19
0
14 Dec 2022
Leveraging Natural Language Processing to Augment Structured Social
  Determinants of Health Data in the Electronic Health Record
Leveraging Natural Language Processing to Augment Structured Social Determinants of Health Data in the Electronic Health Record
K. Lybarger
Nicholas J. Dobbins
Ritche Long
Angad Singh
Patrick Wedgeworth
Özlem Ozuner
Meliha Yetisgen-Yildiz
49
25
0
14 Dec 2022
Efficient Self-supervised Learning with Contextualized Target
  Representations for Vision, Speech and Language
Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language
Alexei Baevski
Arun Babu
Wei-Ning Hsu
Michael Auli
VLMSSL
129
97
0
14 Dec 2022
MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End
  Language Modeling
MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End Language Modeling
Nathan Godey
Roman Castagné
Eric Villemonte de la Clergerie
Benoît Sagot
47
3
0
14 Dec 2022
APOLLO: An Optimized Training Approach for Long-form Numerical Reasoning
APOLLO: An Optimized Training Approach for Long-form Numerical Reasoning
Jiashuo Sun
Hang Zhang
Chen Lin
Nan Duan
Yeyun Gong
Jian Guo
AIMatRALM
78
6
0
14 Dec 2022
Mitigating Negative Style Transfer in Hybrid Dialogue System
Mitigating Negative Style Transfer in Hybrid Dialogue System
Shimin Li
Qinyuan Cheng
Linyang Li
Xipeng Qiu
101
1
0
14 Dec 2022
Reproducible scaling laws for contrastive language-image learning
Reproducible scaling laws for contrastive language-image learning
Mehdi Cherti
Romain Beaumont
Ross Wightman
Mitchell Wortsman
Gabriel Ilharco
Cade Gordon
Christoph Schuhmann
Ludwig Schmidt
J. Jitsev
VLMCLIP
141
825
0
14 Dec 2022
Explainability of Text Processing and Retrieval Methods: A Critical
  Survey
Explainability of Text Processing and Retrieval Methods: A Critical Survey
Sourav Saha
Debapriyo Majumdar
Mandar Mitra
96
5
0
14 Dec 2022
SMSMix: Sense-Maintained Sentence Mixup for Word Sense Disambiguation
SMSMix: Sense-Maintained Sentence Mixup for Word Sense Disambiguation
Hee Suk Yoon
Eunseop Yoon
John Harvill
Sunjae Yoon
M. Hasegawa-Johnson
Chang D. Yoo
65
4
0
14 Dec 2022
Pre-trained Language Models Can be Fully Zero-Shot Learners
Pre-trained Language Models Can be Fully Zero-Shot Learners
Xuandong Zhao
Siqi Ouyang
Zhiguo Yu
Ming-li Wu
Lei Li
VLMLRM
103
34
0
14 Dec 2022
Paraphrase Identification with Deep Learning: A Review of Datasets and
  Methods
Paraphrase Identification with Deep Learning: A Review of Datasets and Methods
Chao Zhou
Cheng Qiu
Daniel Ernesto Acuna
127
26
0
13 Dec 2022
A fine-grained comparison of pragmatic language understanding in humans
  and language models
A fine-grained comparison of pragmatic language understanding in humans and language models
Jennifer Hu
Sammy Floyd
Olessia Jouravlev
Evelina Fedorenko
E. Gibson
73
63
0
13 Dec 2022
Diverse Demonstrations Improve In-context Compositional Generalization
Diverse Demonstrations Improve In-context Compositional Generalization
Itay Levy
Ben Bogin
Jonathan Berant
111
146
0
13 Dec 2022
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for
  Programming Languages
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages
Yekun Chai
Shuohuan Wang
Chao Pang
Yu Sun
Hao Tian
Hua Wu
93
38
0
13 Dec 2022
Do Text-to-Text Multi-Task Learners Suffer from Task Conflict?
Do Text-to-Text Multi-Task Learners Suffer from Task Conflict?
David Mueller
Nicholas Andrews
Mark Dredze
109
7
0
13 Dec 2022
TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models
  of Different Modalities
TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities
Zhe Zhao
Yudong Li
Cheng-An Hou
Jing-xin Zhao
Rong Tian
...
Xingwu Sun
Zhanhui Kang
Xiaoyong Du
Linlin Shen
Kimmo Yan
VLM
109
24
0
13 Dec 2022
In Defense of Cross-Encoders for Zero-Shot Retrieval
In Defense of Cross-Encoders for Zero-Shot Retrieval
G. Rosa
L. Bonifacio
Vitor Jeronymo
Hugo Queiroz Abonizio
Marzieh Fadaee
R. Lotufo
Rodrigo Nogueira
66
18
0
12 Dec 2022
Real-World Compositional Generalization with Disentangled
  Sequence-to-Sequence Learning
Real-World Compositional Generalization with Disentangled Sequence-to-Sequence Learning
Hao Zheng
Mirella Lapata
OODCoGeDRL
64
5
0
12 Dec 2022
Improving Generalization of Pre-trained Language Models via Stochastic
  Weight Averaging
Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging
Peng Lu
I. Kobyzev
Mehdi Rezagholizadeh
Ahmad Rashid
A. Ghodsi
Philippe Langlais
MoMe
108
11
0
12 Dec 2022
BigText-QA: Question Answering over a Large-Scale Hybrid Knowledge Graph
BigText-QA: Question Answering over a Large-Scale Hybrid Knowledge Graph
Jingjing Xu
M. Biryukov
Martin Theobald
V. Venugopal
69
0
0
12 Dec 2022
Collaborating Heterogeneous Natural Language Processing Tasks via
  Federated Learning
Collaborating Heterogeneous Natural Language Processing Tasks via Federated Learning
Chenhe Dong
Yuexiang Xie
Bolin Ding
Ying Shen
Yaliang Li
FedML
62
5
0
12 Dec 2022
Information-Theoretic Text Hallucination Reduction for Video-grounded
  Dialogue
Information-Theoretic Text Hallucination Reduction for Video-grounded Dialogue
Sunjae Yoon
Eunseop Yoon
Hee Suk Yoon
Junyeong Kim
Changdong Yoo
70
20
0
12 Dec 2022
Momentum Contrastive Pre-training for Question Answering
Momentum Contrastive Pre-training for Question Answering
Minda Hu
Muzhi Li
Yasheng Wang
Irwin King
94
3
0
12 Dec 2022
Searching for Effective Multilingual Fine-Tuning Methods: A Case Study
  in Summarization
Searching for Effective Multilingual Fine-Tuning Methods: A Case Study in Summarization
Yiwei Qin
Graham Neubig
Pengfei Liu
72
4
0
12 Dec 2022
T5Score: Discriminative Fine-tuning of Generative Evaluation Metrics
T5Score: Discriminative Fine-tuning of Generative Evaluation Metrics
Yiwei Qin
Weizhe Yuan
Graham Neubig
Pengfei Liu
70
23
0
12 Dec 2022
Implementing Deep Learning-Based Approaches for Article Summarization in
  Indian Languages
Implementing Deep Learning-Based Approaches for Article Summarization in Indian Languages
Rahul Tangsali
Aabha Pingle
Aditya Vyawahare
Isha Joshi
Raviraj Joshi
90
7
0
12 Dec 2022
ResFed: Communication Efficient Federated Learning by Transmitting Deep
  Compressed Residuals
ResFed: Communication Efficient Federated Learning by Transmitting Deep Compressed Residuals
Rui Song
Liguo Zhou
Lingjuan Lyu
Andreas Festag
Alois Knoll
FedML
81
5
0
11 Dec 2022
MORTY: Structured Summarization for Targeted Information Extraction from
  Scholarly Articles
MORTY: Structured Summarization for Targeted Information Extraction from Scholarly Articles
M. Y. Jaradeh
M. Stocker
Sören Auer
67
1
0
11 Dec 2022
Topic-Aware Response Generation in Task-Oriented Dialogue with
  Unstructured Knowledge Access
Topic-Aware Response Generation in Task-Oriented Dialogue with Unstructured Knowledge Access
Yue Feng
Gerasimos Lampouras
Ignacio Iacobacci
56
4
0
10 Dec 2022
Position Embedding Needs an Independent Layer Normalization
Position Embedding Needs an Independent Layer Normalization
Runyi Yu
Zhennan Wang
Yinhuai Wang
Kehan Li
Yian Zhao
Jian Zhang
Guoli Song
Jie Chen
103
1
0
10 Dec 2022
REVEAL: Retrieval-Augmented Visual-Language Pre-Training with
  Multi-Source Multimodal Knowledge Memory
REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory
Ziniu Hu
Ahmet Iscen
Chen Sun
Zirui Wang
Kai-Wei Chang
Yizhou Sun
Cordelia Schmid
David A. Ross
Alireza Fathi
RALMVLM
105
96
0
10 Dec 2022
SMILE: Scaling Mixture-of-Experts with Efficient Bi-level Routing
SMILE: Scaling Mixture-of-Experts with Efficient Bi-level Routing
Chaoyang He
Shuai Zheng
Aston Zhang
George Karypis
Trishul Chilimbi
Mahdi Soltanolkotabi
Salman Avestimehr
MoE
50
1
0
10 Dec 2022
Previous
123...141142143...197198199
Next