ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,877 papers shown
Title
A Robustly Optimized Long Text to Math Models for Numerical Reasoning On
  FinQA
A Robustly Optimized Long Text to Math Models for Numerical Reasoning On FinQA
Renhui Zhang
Youwei Zhang
Yao Yu
AIMat
31
1
0
29 Jun 2022
Test2Vec: An Execution Trace Embedding for Test Case Prioritization
Test2Vec: An Execution Trace Embedding for Test Case Prioritization
E. Jabbar
Soheila Zangeneh
Hadi Hemmati
R. Feldt
89
5
0
28 Jun 2022
Proton: Probing Schema Linking Information from Pre-trained Language
  Models for Text-to-SQL Parsing
Proton: Probing Schema Linking Information from Pre-trained Language Models for Text-to-SQL Parsing
Lihan Wang
Bowen Qin
Binyuan Hui
Bowen Li
Min Yang
Bailin Wang
Binhua Li
Fei Huang
Luo Si
Yongbin Li
135
44
0
28 Jun 2022
Joint Generator-Ranker Learning for Natural Language Generation
Joint Generator-Ranker Learning for Natural Language Generation
Weizhou Shen
Yeyun Gong
Yelong Shen
Song Wang
Xiaojun Quan
Nan Duan
Weizhu Chen
107
5
0
28 Jun 2022
Materials Transformers Language Models for Generative Materials Design:
  a benchmark study
Materials Transformers Language Models for Generative Materials Design: a benchmark study
Nihang Fu
Lai Wei
Yuqi Song
Qinyang Li
Rui Xin
Sadman Sadeed Omee
Rongzhi Dong
Edirisuriya M Dilanga Siriwardane
Jianjun Hu
50
2
0
27 Jun 2022
Long Range Language Modeling via Gated State Spaces
Long Range Language Modeling via Gated State Spaces
Harsh Mehta
Ankit Gupta
Ashok Cutkosky
Behnam Neyshabur
Mamba
143
243
0
27 Jun 2022
Adversarial Self-Attention for Language Understanding
Adversarial Self-Attention for Language Understanding
Hongqiu Wu
Ruixue Ding
Hai Zhao
Pengjun Xie
Fei Huang
Min Zhang
81
12
0
25 Jun 2022
MVP: Multi-task Supervised Pre-training for Natural Language Generation
MVP: Multi-task Supervised Pre-training for Natural Language Generation
Tianyi Tang
Junyi Li
Wayne Xin Zhao
Ji-Rong Wen
120
24
0
24 Jun 2022
Unified BERT for Few-shot Natural Language Understanding
Unified BERT for Few-shot Natural Language Understanding
Junyu Lu
Ping Yang
Ruyi Gan
Jing Yang
Jiaxing Zhang
67
2
0
24 Jun 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online
  Videos
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Bowen Baker
Ilge Akkaya
Peter Zhokhov
Joost Huizinga
Jie Tang
Adrien Ecoffet
Brandon Houghton
Raul Sampedro
Jeff Clune
OffRL
159
304
0
23 Jun 2022
AST-Probe: Recovering abstract syntax trees from hidden representations
  of pre-trained language models
AST-Probe: Recovering abstract syntax trees from hidden representations of pre-trained language models
José Antonio Hernández López
Martin Weyssow
Jesús Sánchez Cuadrado
H. Sahraoui
57
23
0
23 Jun 2022
GODEL: Large-Scale Pre-Training for Goal-Directed Dialog
GODEL: Large-Scale Pre-Training for Goal-Directed Dialog
Baolin Peng
Michel Galley
Pengcheng He
Chris Brockett
Lars Liden
E. Nouri
Zhou Yu
Bill Dolan
Jianfeng Gao
VLM
95
75
0
22 Jun 2022
Multi-LexSum: Real-World Summaries of Civil Rights Lawsuits at Multiple
  Granularities
Multi-LexSum: Real-World Summaries of Civil Rights Lawsuits at Multiple Granularities
Zejiang Shen
Kyle Lo
L. Yu
N. Dahlberg
Margo Schlanger
Doug Downey
ELMAILaw
119
48
0
22 Jun 2022
Jointist: Joint Learning for Multi-instrument Transcription and Its
  Applications
Jointist: Joint Learning for Multi-instrument Transcription and Its Applications
K. Cheuk
Keunwoo Choi
Qiuqiang Kong
Bochen Li
Minz Won
Amy Hung
Ju-Chiang Wang
Dorien Herremans
91
7
0
22 Jun 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
252
1,134
0
22 Jun 2022
Questions Are All You Need to Train a Dense Passage Retriever
Questions Are All You Need to Train a Dense Passage Retriever
Devendra Singh Sachan
M. Lewis
Dani Yogatama
Luke Zettlemoyer
J. Pineau
Manzil Zaheer
RALM
134
57
0
21 Jun 2022
Insights into Pre-training via Simpler Synthetic Tasks
Insights into Pre-training via Simpler Synthetic Tasks
Yuhuai Wu
Felix Li
Percy Liang
AIMat
92
21
0
21 Jun 2022
Bridging the Gap Between Indexing and Retrieval for Differentiable
  Search Index with Query Generation
Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation
Shengyao Zhuang
Houxing Ren
Linjun Shou
Jian Pei
Ming Gong
Guido Zuccon
Daxin Jiang
106
68
0
21 Jun 2022
Supervision-Guided Codebooks for Masked Prediction in Speech
  Pre-training
Supervision-Guided Codebooks for Masked Prediction in Speech Pre-training
Chengyi Wang
Yiming Wang
Yu Wu
Sanyuan Chen
Jinyu Li
Shujie Liu
Furu Wei
SSL
95
20
0
21 Jun 2022
Automatic Controllable Product Copywriting for E-Commerce
Automatic Controllable Product Copywriting for E-Commerce
Xiaojie Guo
Qingkai Zeng
Meng Jiang
Yun Xiao
Bo Long
Lingfei Wu
45
10
0
21 Jun 2022
Fewer Errors, but More Stereotypes? The Effect of Model Size on Gender
  Bias
Fewer Errors, but More Stereotypes? The Effect of Model Size on Gender Bias
Yarden Tal
Inbal Magar
Roy Schwartz
77
36
0
20 Jun 2022
Studying the role of named entities for content preservation in text
  style transfer
Studying the role of named entities for content preservation in text style transfer
N. Babakov
David Dale
V. Logacheva
I. Krotova
Alexander Panchenko
50
2
0
20 Jun 2022
Towards Unified Conversational Recommender Systems via
  Knowledge-Enhanced Prompt Learning
Towards Unified Conversational Recommender Systems via Knowledge-Enhanced Prompt Learning
Xiaolei Wang
Kun Zhou
Ji-Rong Wen
Wayne Xin Zhao
HAIAI4TS
73
137
0
19 Jun 2022
Automatic Summarization of Russian Texts: Comparison of Extractive and
  Abstractive Methods
Automatic Summarization of Russian Texts: Comparison of Extractive and Abstractive Methods
Valeriya Goloviznina
Evgeny Kotelnikov
41
4
0
18 Jun 2022
Self-Supervised Learning for Videos: A Survey
Self-Supervised Learning for Videos: A Survey
Madeline Chantry Schiappa
Yogesh S Rawat
M. Shah
SSL
128
136
0
18 Jun 2022
CLiMB: A Continual Learning Benchmark for Vision-and-Language Tasks
CLiMB: A Continual Learning Benchmark for Vision-and-Language Tasks
Tejas Srinivasan
Ting-Yun Chang
Leticia Pinto-Alva
Georgios Chochlakis
Mohammad Rostami
Jesse Thomason
VLMCLL
103
76
0
18 Jun 2022
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Jiasen Lu
Christopher Clark
Rowan Zellers
Roozbeh Mottaghi
Aniruddha Kembhavi
ObjDVLMMLLM
171
412
0
17 Jun 2022
Zero-Shot Video Question Answering via Frozen Bidirectional Language
  Models
Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
Antoine Yang
Antoine Miech
Josef Sivic
Ivan Laptev
Cordelia Schmid
149
239
0
16 Jun 2022
Multimodal Dialogue State Tracking
Multimodal Dialogue State Tracking
Hung Le
Nancy F. Chen
Guosheng Lin
70
9
0
16 Jun 2022
TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained
  Language Models
TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models
A. Davody
David Ifeoluwa Adelani
Thomas Kleinbauer
Dietrich Klakow
75
4
0
15 Jun 2022
Alexa Teacher Model: Pretraining and Distilling Multi-Billion-Parameter
  Encoders for Natural Language Understanding Systems
Alexa Teacher Model: Pretraining and Distilling Multi-Billion-Parameter Encoders for Natural Language Understanding Systems
Jack G. M. FitzGerald
Shankar Ananthakrishnan
Konstantine Arkoudas
Davide Bernardi
Abhishek Bhagia
...
Pan Wei
Haiyang Yu
Shuai Zheng
Gokhan Tur
Premkumar Natarajan
ELM
46
30
0
15 Jun 2022
DIRECTOR: Generator-Classifiers For Supervised Language Modeling
DIRECTOR: Generator-Classifiers For Supervised Language Modeling
Kushal Arora
Kurt Shuster
Sainbayar Sukhbaatar
Jason Weston
VLM
98
41
0
15 Jun 2022
Emergent Abilities of Large Language Models
Emergent Abilities of Large Language Models
Jason W. Wei
Yi Tay
Rishi Bommasani
Colin Raffel
Barret Zoph
...
Tatsunori Hashimoto
Oriol Vinyals
Percy Liang
J. Dean
W. Fedus
ELMReLMLRM
322
2,524
0
15 Jun 2022
A Unified Sequence Interface for Vision Tasks
A Unified Sequence Interface for Vision Tasks
Ting-Li Chen
Saurabh Saxena
Lala Li
Nayeon Lee
David J. Fleet
Geoffrey E. Hinton
VLMMLLM
81
152
0
15 Jun 2022
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Zi-Yi Dou
Aishwarya Kamath
Zhe Gan
Pengchuan Zhang
Jianfeng Wang
...
Ce Liu
Yann LeCun
Nanyun Peng
Jianfeng Gao
Lijuan Wang
VLMObjD
115
129
0
15 Jun 2022
NatGen: Generative pre-training by "Naturalizing" source code
NatGen: Generative pre-training by "Naturalizing" source code
Saikat Chakraborty
Toufique Ahmed
Yangruibo Ding
Prem Devanbu
Baishakhi Ray
AI4CE
116
118
0
15 Jun 2022
Forecasting of depth and ego-motion with transformers and
  self-supervision
Forecasting of depth and ego-motion with transformers and self-supervision
Houssem-eddine Boulahbal
A. Voicila
Andrew I. Comport
ViTMDE
71
3
0
15 Jun 2022
An Extractive-and-Abstractive Framework for Source Code Summarization
An Extractive-and-Abstractive Framework for Source Code Summarization
Weisong Sun
Chunrong Fang
Yuchen Chen
Quanjun Zhang
Guanhong Tao
Tingxu Han
Yifei Ge
Yudu You
Bin Luo
95
33
0
15 Jun 2022
LAVENDER: Unifying Video-Language Understanding as Masked Language
  Modeling
LAVENDER: Unifying Video-Language Understanding as Masked Language Modeling
Linjie Li
Zhe Gan
Kevin Qinghong Lin
Chung-Ching Lin
Zicheng Liu
Ce Liu
Lijuan Wang
MLLMVLM
90
84
0
14 Jun 2022
SBERT studies Meaning Representations: Decomposing Sentence Embeddings
  into Explainable Semantic Features
SBERT studies Meaning Representations: Decomposing Sentence Embeddings into Explainable Semantic Features
Juri Opitz
Anette Frank
98
37
0
14 Jun 2022
FETILDA: An Effective Framework For Fin-tuned Embeddings For Long
  Financial Text Documents
FETILDA: An Effective Framework For Fin-tuned Embeddings For Long Financial Text Documents
BolunNamirXia
Vipula Rawte
Mohammed J Zaki
Aparna Gupta
AI4TS
18
1
0
14 Jun 2022
TransVG++: End-to-End Visual Grounding with Language Conditioned Vision
  Transformer
TransVG++: End-to-End Visual Grounding with Language Conditioned Vision Transformer
Jiajun Deng
Zhengyuan Yang
Daqing Liu
Tianlang Chen
Wen-gang Zhou
Yanyong Zhang
Houqiang Li
Wanli Ouyang
ViT
107
57
0
14 Jun 2022
CHQ-Summ: A Dataset for Consumer Healthcare Question Summarization
CHQ-Summ: A Dataset for Consumer Healthcare Question Summarization
S. Yadav
D. Gupta
Dina Demner-Fushman
82
16
0
14 Jun 2022
LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning
  Tasks
LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks
Tuan Dinh
Yuchen Zeng
Ruisu Zhang
Ziqian Lin
Michael Gira
Shashank Rajput
Jy-yong Sohn
Dimitris Papailiopoulos
Kangwook Lee
LMTD
174
140
0
14 Jun 2022
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer
  Learning
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning
Yi-Lin Sung
Jaemin Cho
Joey Tianyi Zhou
VLM
99
246
0
13 Jun 2022
Memory-Based Model Editing at Scale
Memory-Based Model Editing at Scale
E. Mitchell
Charles Lin
Antoine Bosselut
Christopher D. Manning
Chelsea Finn
KELM
116
362
0
13 Jun 2022
Modern Distributed Data-Parallel Large-Scale Pre-training Strategies For
  NLP models
Modern Distributed Data-Parallel Large-Scale Pre-training Strategies For NLP models
Haoli Bai
MoE
143
5
0
13 Jun 2022
Language Models are General-Purpose Interfaces
Language Models are General-Purpose Interfaces
Y. Hao
Haoyu Song
Li Dong
Shaohan Huang
Zewen Chi
Wenhui Wang
Shuming Ma
Furu Wei
MLLM
78
102
0
13 Jun 2022
JiuZhang: A Chinese Pre-trained Language Model for Mathematical Problem
  Understanding
JiuZhang: A Chinese Pre-trained Language Model for Mathematical Problem Understanding
Wayne Xin Zhao
Kun Zhou
Zheng Gong
Beichen Zhang
Yuanhang Zhou
Jing Sha
Zhigang Chen
Shijin Wang
Cong Liu
Ji-Rong Wen
83
19
0
13 Jun 2022
Grounding in social media: An approach to building a chit-chat dialogue
  model
Grounding in social media: An approach to building a chit-chat dialogue model
Ritvik Choudhary
Daisuke Kawahara
65
5
0
12 Jun 2022
Previous
123...156157158...196197198
Next