ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,877 papers shown
Title
Improving Compositional Generalization with Latent Structure and Data
  Augmentation
Improving Compositional Generalization with Latent Structure and Data Augmentation
Linlu Qiu
Peter Shaw
Panupong Pasupat
Pawel Krzysztof Nowak
Tal Linzen
Fei Sha
Kristina Toutanova
CoGe
98
57
0
14 Dec 2021
GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of
  Dense Retrieval
GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval
Kexin Wang
Nandan Thakur
Nils Reimers
Iryna Gurevych
VLM
168
157
0
14 Dec 2021
LMTurk: Few-Shot Learners as Crowdsourcing Workers in a
  Language-Model-as-a-Service Framework
LMTurk: Few-Shot Learners as Crowdsourcing Workers in a Language-Model-as-a-Service Framework
Mengjie Zhao
Fei Mi
Yasheng Wang
Minglei Li
Xin Jiang
Qun Liu
Hinrich Schütze
RALM
115
11
0
14 Dec 2021
Model Uncertainty-Aware Knowledge Amalgamation for Pre-Trained Language
  Models
Model Uncertainty-Aware Knowledge Amalgamation for Pre-Trained Language Models
Lei Li
Yankai Lin
Xuancheng Ren
Guangxiang Zhao
Peng Li
Jie Zhou
Xu Sun
MoMe
60
2
0
14 Dec 2021
Towards a Unified Foundation Model: Jointly Pre-Training Transformers on
  Unpaired Images and Text
Towards a Unified Foundation Model: Jointly Pre-Training Transformers on Unpaired Images and Text
Qing Li
Boqing Gong
Huayu Chen
Dan Kondratyuk
Xianzhi Du
Ming-Hsuan Yang
Matthew A. Brown
ViT
49
17
0
14 Dec 2021
GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
Nan Du
Yanping Huang
Andrew M. Dai
Simon Tong
Dmitry Lepikhin
...
Kun Zhang
Quoc V. Le
Yonghui Wu
Zhiwen Chen
Claire Cui
ALMMoE
275
833
0
13 Dec 2021
VL-Adapter: Parameter-Efficient Transfer Learning for
  Vision-and-Language Tasks
VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks
Yi-Lin Sung
Jaemin Cho
Joey Tianyi Zhou
VLMVPVLM
114
358
0
13 Dec 2021
Step-unrolled Denoising Autoencoders for Text Generation
Step-unrolled Denoising Autoencoders for Text Generation
Nikolay Savinov
Junyoung Chung
Mikolaj Binkowski
Erich Elsen
Aaron van den Oord
DiffM
134
120
0
13 Dec 2021
Automated Evidence Collection for Fake News Detection
Automated Evidence Collection for Fake News Detection
Mrinal Rawat
Diptesh Kanojia
83
3
0
13 Dec 2021
ISEEQ: Information Seeking Question Generation using Dynamic
  Meta-Information Retrieval and Knowledge Graphs
ISEEQ: Information Seeking Question Generation using Dynamic Meta-Information Retrieval and Knowledge Graphs
Manas Gaur
Kalpa Gunaratna
Vijay Srinivasan
Hongxia Jin
RALM
73
53
0
13 Dec 2021
Dependency Learning for Legal Judgment Prediction with a Unified
  Text-to-Text Transformer
Dependency Learning for Legal Judgment Prediction with a Unified Text-to-Text Transformer
Yunyun Huang
Xiaoyu Shen
Chuanyi Li
Jidong Ge
B. Luo
AILaw
77
20
0
13 Dec 2021
Weakly Supervised Text-to-SQL Parsing through Question Decomposition
Weakly Supervised Text-to-SQL Parsing through Question Decomposition
Tomer Wolfson
Daniel Deutch
Jonathan Berant
ReLM
43
16
0
12 Dec 2021
UPV at TREC Health Misinformation Track 2021 Ranking with SBERT and
  Quality Estimators
UPV at TREC Health Misinformation Track 2021 Ranking with SBERT and Quality Estimators
Ipek Baris Schlicht
Angel Felipe Magnossão de Paula
Paolo Rosso
42
8
0
11 Dec 2021
Discourse-Aware Soft Prompting for Text Generation
Discourse-Aware Soft Prompting for Text Generation
Marjan Ghazvininejad
Vladimir Karpukhin
Vera Gor
Asli Celikyilmaz
67
6
0
10 Dec 2021
Pruning Pretrained Encoders with a Multitask Objective
Pruning Pretrained Encoders with a Multitask Objective
Patrick Xia
Richard Shin
65
0
0
10 Dec 2021
VUT: Versatile UI Transformer for Multi-Modal Multi-Task User Interface
  Modeling
VUT: Versatile UI Transformer for Multi-Modal Multi-Task User Interface Modeling
Yang Li
Gang Li
Xin Zhou
Mostafa Dehghani
A. Gritsenko
MLLM
92
36
0
10 Dec 2021
Spinning Language Models: Risks of Propaganda-As-A-Service and
  Countermeasures
Spinning Language Models: Risks of Propaganda-As-A-Service and Countermeasures
Eugene Bagdasaryan
Vitaly Shmatikov
SILMAAML
106
84
0
09 Dec 2021
Compositional Generalization for Natural Language Interfaces to Web APIs
Compositional Generalization for Natural Language Interfaces to Web APIs
Saghar Hosseini
Ahmed Hassan Awadallah
Yu-Chuan Su
66
6
0
09 Dec 2021
Semantic Search as Extractive Paraphrase Span Detection
Semantic Search as Extractive Paraphrase Span Detection
Jenna Kanerva
Hanna Kitti
Li-Hsin Chang
Teemu Vahtola
Mathias Creutz
Filip Ginter
57
2
0
09 Dec 2021
Multimodal Pre-Training Model for Sequence-based Prediction of
  Protein-Protein Interaction
Multimodal Pre-Training Model for Sequence-based Prediction of Protein-Protein Interaction
Yang Xue
Zijing Liu
Xiaomin Fang
Fan Wang
111
8
0
09 Dec 2021
FLAVA: A Foundational Language And Vision Alignment Model
FLAVA: A Foundational Language And Vision Alignment Model
Amanpreet Singh
Ronghang Hu
Vedanuj Goswami
Guillaume Couairon
Wojciech Galuba
Marcus Rohrbach
Douwe Kiela
CLIPVLM
151
719
0
08 Dec 2021
Improving language models by retrieving from trillions of tokens
Improving language models by retrieving from trillions of tokens
Sebastian Borgeaud
A. Mensch
Jordan Hoffmann
Trevor Cai
Eliza Rutherford
...
Simon Osindero
Karen Simonyan
Jack W. Rae
Erich Elsen
Laurent Sifre
KELMRALM
303
1,109
0
08 Dec 2021
Ethical and social risks of harm from Language Models
Ethical and social risks of harm from Language Models
Laura Weidinger
John F. J. Mellor
Maribeth Rauh
Conor Griffin
J. Uesato
...
Lisa Anne Hendricks
William S. Isaac
Sean Legassick
G. Irving
Iason Gabriel
PILM
208
1,045
0
08 Dec 2021
Does Structure Matter? Leveraging Data-to-Text Generation for Answering
  Complex Information Needs
Does Structure Matter? Leveraging Data-to-Text Generation for Answering Complex Information Needs
Hanane Djeddal
Thomas Gerald
Laure Soulier
K. Pinel-Sauvagnat
L. Tamine
33
1
0
08 Dec 2021
VIRT: Improving Representation-based Models for Text Matching through
  Virtual Interaction
VIRT: Improving Representation-based Models for Text Matching through Virtual Interaction
Dan Li
Yang Yang
Hongyin Tang
Jingang Wang
Tong Xu
Wei Wu
Enhong Chen
64
9
0
08 Dec 2021
Bidimensional Leaderboards: Generate and Evaluate Language Hand in Hand
Bidimensional Leaderboards: Generate and Evaluate Language Hand in Hand
Jungo Kasai
Keisuke Sakaguchi
Ronan Le Bras
Lavinia Dunagan
Jacob Morrison
Alexander R. Fabbri
Yejin Choi
Noah A. Smith
97
40
0
08 Dec 2021
A Transferable Approach for Partitioning Machine Learning Models on
  Multi-Chip-Modules
A Transferable Approach for Partitioning Machine Learning Models on Multi-Chip-Modules
Xinfeng Xie
Prakash Prabhu
Ulysse Beaugnon
P. Phothilimthana
Sudip Roy
Azalia Mirhoseini
E. Brevdo
James Laudon
Yanqi Zhou
51
5
0
07 Dec 2021
UCD-CS at TREC 2021 Incident Streams Track
UCD-CS at TREC 2021 Incident Streams Track
Congcong Wang
David Lillis
44
1
0
07 Dec 2021
Human Parity on CommonsenseQA: Augmenting Self-Attention with External
  Attention
Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention
Yichong Xu
Chenguang Zhu
Shuohang Wang
Siqi Sun
Hao Cheng
Xiaodong Liu
Jianfeng Gao
Pengcheng He
Michael Zeng
Xuedong Huang
LRM
326
59
0
06 Dec 2021
Quantifying Adaptability in Pre-trained Language Models with 500 Tasks
Quantifying Adaptability in Pre-trained Language Models with 500 Tasks
Belinda Z. Li
Jane A. Yu
Madian Khabsa
Luke Zettlemoyer
A. Halevy
Jacob Andreas
ELM
89
17
0
06 Dec 2021
UniLog: Deploy One Model and Specialize it for All Log Analysis Tasks
UniLog: Deploy One Model and Specialize it for All Log Analysis Tasks
Yichen Zhu
Weibin Meng
Ying Liu
Shenglin Zhang
Tao Han
Shimin Tao
Dan Pei
MoE
89
15
0
06 Dec 2021
General Facial Representation Learning in a Visual-Linguistic Manner
General Facial Representation Learning in a Visual-Linguistic Manner
Yinglin Zheng
Hao Yang
Ting Zhang
Jianmin Bao
Dongdong Chen
Yangyu Huang
Lu Yuan
Dong Chen
Ming Zeng
Fang Wen
CVBM
207
176
0
06 Dec 2021
Search and Learn: Improving Semantic Coverage for Data-to-Text
  Generation
Search and Learn: Improving Semantic Coverage for Data-to-Text Generation
Shailza Jolly
Zi Xuan Zhang
Andreas Dengel
Lili Mou
67
11
0
06 Dec 2021
End-to-end Adaptive Distributed Training on PaddlePaddle
End-to-end Adaptive Distributed Training on PaddlePaddle
Yulong Ao
Zhihua Wu
Dianhai Yu
Weibao Gong
Zhiqing Kui
...
Yanjun Ma
Tian Wu
Haifeng Wang
Wei Zeng
Chao Yang
118
11
0
06 Dec 2021
JointLK: Joint Reasoning with Language Models and Knowledge Graphs for
  Commonsense Question Answering
JointLK: Joint Reasoning with Language Models and Knowledge Graphs for Commonsense Question Answering
Yueqing Sun
Qi Shi
Le Qi
Yu Zhang
RALMLRM
89
72
0
06 Dec 2021
NL-Augmenter: A Framework for Task-Sensitive Natural Language
  Augmentation
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation
Kaustubh D. Dhole
Varun Gangal
Sebastian Gehrmann
Aadesh Gupta
Zhenhao Li
...
Tianbao Xie
Usama Yaseen
Michael A. Yee
Jing Zhang
Yue Zhang
235
88
0
06 Dec 2021
VarCLR: Variable Semantic Representation Pre-training via Contrastive
  Learning
VarCLR: Variable Semantic Representation Pre-training via Contrastive Learning
Qibin Chen
Jeremy Lacomis
Edward J. Schwartz
Graham Neubig
Bogdan Vasilescu
Claire Le Goues
VLM
77
35
0
05 Dec 2021
Representation Learning for Conversational Data using Discourse Mutual
  Information Maximization
Representation Learning for Conversational Data using Discourse Mutual Information Maximization
Bishal Santra
Sumegh Roychowdhury
Aishik Mandal
Vasu Gurram
Atharva Naik
Manish Gupta
Pawan Goyal
SSL
89
4
0
04 Dec 2021
Bridging Pre-trained Models and Downstream Tasks for Source Code
  Understanding
Bridging Pre-trained Models and Downstream Tasks for Source Code Understanding
Deze Wang
Zhouyang Jia
Shanshan Li
Yue Yu
Yun Xiong
Wei Dong
Xiangke Liao
103
83
0
04 Dec 2021
Hierarchical Neural Data Synthesis for Semantic Parsing
Hierarchical Neural Data Synthesis for Semantic Parsing
Wei Yang
Peng Xu
Yanshuai Cao
66
9
0
04 Dec 2021
ALX: Large Scale Matrix Factorization on TPUs
ALX: Large Scale Matrix Factorization on TPUs
Harsh Mehta
Steffen Rendle
Walid Krichene
Li Zhang
25
6
0
03 Dec 2021
Multilingual training for Software Engineering
Multilingual training for Software Engineering
Toufique Ahmed
Prem Devanbu
134
76
0
03 Dec 2021
MetaQA: Combining Expert Agents for Multi-Skill Question Answering
MetaQA: Combining Expert Agents for Multi-Skill Question Answering
Haritz Puerto
Gözde Gül Sahin
Iryna Gurevych
LLMAG
105
21
0
03 Dec 2021
The Influence of Data Pre-processing and Post-processing on Long
  Document Summarization
The Influence of Data Pre-processing and Post-processing on Long Document Summarization
Xinwei Du
Kailun Dong
Yuchen Zhang
Yongsheng Li
R. Tsay
34
0
0
03 Dec 2021
PLSUM: Generating PT-BR Wikipedia by Summarizing Multiple Websites
PLSUM: Generating PT-BR Wikipedia by Summarizing Multiple Websites
A. Oliveira
A. H. R. Costa
44
2
0
02 Dec 2021
InfoLM: A New Metric to Evaluate Summarization & Data2Text Generation
InfoLM: A New Metric to Evaluate Summarization & Data2Text Generation
Pierre Colombo
Chloe Clave
Pablo Piantanida
129
44
0
02 Dec 2021
KPDrop: Improving Absent Keyphrase Generation
KPDrop: Improving Absent Keyphrase Generation
Jishnu Ray Chowdhury
Seoyeon Park
Tuhin Kundu
Cornelia Caragea
95
7
0
02 Dec 2021
How not to Lie with a Benchmark: Rearranging NLP Leaderboards
How not to Lie with a Benchmark: Rearranging NLP Leaderboards
Tatiana Shavrina
Valentin Malykh
ALMELM
505
12
0
02 Dec 2021
CO2Sum:Contrastive Learning for Factual-Consistent Abstractive
  Summarization
CO2Sum:Contrastive Learning for Factual-Consistent Abstractive Summarization
Wei Liu
Huanqin Wu
Wenjing Mu
Zhen Li
Tao Chen
Dan Nie
HILM
60
17
0
02 Dec 2021
Iconary: A Pictionary-Based Game for Testing Multimodal Communication
  with Drawings and Text
Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text
Christopher Clark
Jordi Salvador
Dustin Schwenk
Derrick Bonafilia
Mark Yatskar
...
Aaron Sarnat
Hannaneh Hajishirzi
Aniruddha Kembhavi
Oren Etzioni
Ali Farhadi
MLLM
52
5
0
01 Dec 2021
Previous
123...170171172...196197198
Next