ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,920 papers shown
Title
Exploring the Efficacy of Pre-trained Checkpoints in Text-to-Music
  Generation Task
Exploring the Efficacy of Pre-trained Checkpoints in Text-to-Music Generation Task
Shangda Wu
Maosong Sun
79
20
0
21 Nov 2022
Diffusion-Based Scene Graph to Image Generation with Masked Contrastive
  Pre-Training
Diffusion-Based Scene Graph to Image Generation with Masked Contrastive Pre-Training
Ling Yang
Zhilin Huang
Yang Song
Shenda Hong
Ge Li
Wentao Zhang
Tengjiao Wang
Guohao Li
Ming-Hsuan Yang
104
57
0
21 Nov 2022
VER: Unifying Verbalizing Entities and Relations
VER: Unifying Verbalizing Entities and Relations
Jie Huang
Kevin Chen-Chuan Chang
105
1
0
20 Nov 2022
Aging with GRACE: Lifelong Model Editing with Discrete Key-Value
  Adaptors
Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors
Thomas Hartvigsen
S. Sankaranarayanan
Hamid Palangi
Yoon Kim
Marzyeh Ghassemi
KELM
155
177
0
20 Nov 2022
A survey on knowledge-enhanced multimodal learning
A survey on knowledge-enhanced multimodal learning
Maria Lymperaiou
Giorgos Stamou
172
15
0
19 Nov 2022
Knowledge Graph Generation From Text
Knowledge Graph Generation From Text
Igor Melnyk
Pierre Dognin
Payel Das
81
25
0
18 Nov 2022
Visual Programming: Compositional visual reasoning without training
Visual Programming: Compositional visual reasoning without training
Tanmay Gupta
Aniruddha Kembhavi
ReLMVLMLRM
183
440
0
18 Nov 2022
GENIUS: Sketch-based Language Model Pre-training via Extreme and
  Selective Masking for Text Generation and Augmentation
GENIUS: Sketch-based Language Model Pre-training via Extreme and Selective Masking for Text Generation and Augmentation
Biyang Guo
Yeyun Gong
Yelong Shen
Songqiao Han
Hailiang Huang
Nan Duan
Weizhu Chen
VLM
91
19
0
18 Nov 2022
Towards Explaining Subjective Ground of Individuals on Social Media
Towards Explaining Subjective Ground of Individuals on Social Media
Younghun Lee
Dan Goldwasser
63
1
0
18 Nov 2022
Random-LTD: Random and Layerwise Token Dropping Brings Efficient
  Training for Large-scale Transformers
Random-LTD: Random and Layerwise Token Dropping Brings Efficient Training for Large-scale Transformers
Z. Yao
Xiaoxia Wu
Conglong Li
Connor Holmes
Minjia Zhang
Cheng-rong Li
Yuxiong He
87
12
0
17 Nov 2022
Summarizing Community-based Question-Answer Pairs
Summarizing Community-based Question-Answer Pairs
Ting-Yao Hsu
Yoshihiko Suhara
Xiaolan Wang
52
5
0
17 Nov 2022
VeLO: Training Versatile Learned Optimizers by Scaling Up
VeLO: Training Versatile Learned Optimizers by Scaling Up
Luke Metz
James Harrison
C. Freeman
Amil Merchant
Lucas Beyer
...
Naman Agrawal
Ben Poole
Igor Mordatch
Adam Roberts
Jascha Narain Sohl-Dickstein
140
60
0
17 Nov 2022
Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion
  Models
Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models
Simon Alexanderson
Rajmund Nagy
Jonas Beskow
G. Henter
DiffMVGen
81
174
0
17 Nov 2022
Cross-Modal Adapter for Text-Video Retrieval
Cross-Modal Adapter for Text-Video Retrieval
Haojun Jiang
Jianke Zhang
Rui Huang
Chunjiang Ge
Zanlin Ni
Jiwen Lu
Jie Zhou
S. Song
Gao Huang
136
38
0
17 Nov 2022
Ignore Previous Prompt: Attack Techniques For Language Models
Ignore Previous Prompt: Attack Techniques For Language Models
Fábio Perez
Ian Ribeiro
SILM
106
452
0
17 Nov 2022
Data-Efficient Autoregressive Document Retrieval for Fact Verification
Data-Efficient Autoregressive Document Retrieval for Fact Verification
James Thorne
RALM
70
7
0
17 Nov 2022
Self-Training with Purpose Preserving Augmentation Improves Few-shot
  Generative Dialogue State Tracking
Self-Training with Purpose Preserving Augmentation Improves Few-shot Generative Dialogue State Tracking
Jihyun Lee
C. Lee
Yunsu Kim
G. G. Lee
71
0
0
17 Nov 2022
CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal
  Pre-trained Knowledge
CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge
Linli Yao
Wei Chen
Qin Jin
VLM
121
11
0
17 Nov 2022
Artificial Disfluency Detection, Uh No, Disfluency Generation for the
  Masses
Artificial Disfluency Detection, Uh No, Disfluency Generation for the Masses
T. Passali
T. Mavropoulos
Grigorios Tsoumakas
G. Meditskos
S. Vrochidis
61
1
0
16 Nov 2022
Unified Question Answering in Slovene
Unified Question Answering in Slovene
Katja Logar
Marko Robnik-Šikonja
41
0
0
16 Nov 2022
Galactica: A Large Language Model for Science
Galactica: A Large Language Model for Science
Ross Taylor
Marcin Kardas
Guillem Cucurull
Thomas Scialom
Anthony Hartshorn
Elvis Saravia
Andrew Poulton
Viktor Kerkez
Robert Stojnic
ELMReLM
131
786
0
16 Nov 2022
GAMMT: Generative Ambiguity Modeling Using Multiple Transformers
GAMMT: Generative Ambiguity Modeling Using Multiple Transformers
Xingcheng Xu
81
0
0
16 Nov 2022
Toward expanding the scope of radiology report summarization to multiple
  anatomies and modalities
Toward expanding the scope of radiology report summarization to multiple anatomies and modalities
Zhihong Chen
M. Varma
Xiang Wan
C. Langlotz
Jean-Benoit Delbrouck
57
19
0
15 Nov 2022
On the Compositional Generalization Gap of In-Context Learning
On the Compositional Generalization Gap of In-Context Learning
Arian Hosseini
Ankit Vani
Dzmitry Bahdanau
Alessandro Sordoni
Rameswar Panda
79
25
0
15 Nov 2022
ED-FAITH: Evaluating Dialogue Summarization on Faithfulness
ED-FAITH: Evaluating Dialogue Summarization on Faithfulness
Sicong Huang
Asli Celikyilmaz
Haoran Li
HILM
56
4
0
15 Nov 2022
PromptCap: Prompt-Guided Task-Aware Image Captioning
PromptCap: Prompt-Guided Task-Aware Image Captioning
Yushi Hu
Hang Hua
Zhengyuan Yang
Weijia Shi
Noah A. Smith
Jiebo Luo
121
106
0
15 Nov 2022
Evaluating the Factual Consistency of Large Language Models Through News
  Summarization
Evaluating the Factual Consistency of Large Language Models Through News Summarization
Derek Tam
Anisha Mascarenhas
Shiyue Zhang
Sarah Kwan
Joey Tianyi Zhou
Colin Raffel
HILM
97
106
0
15 Nov 2022
Large Language Models Struggle to Learn Long-Tail Knowledge
Large Language Models Struggle to Learn Long-Tail Knowledge
Nikhil Kandpal
H. Deng
Adam Roberts
Eric Wallace
Colin Raffel
RALMKELM
166
420
0
15 Nov 2022
Empowering Language Models with Knowledge Graph Reasoning for Question
  Answering
Empowering Language Models with Knowledge Graph Reasoning for Question Answering
Ziniu Hu
Yichong Xu
Wenhao Yu
Shuohang Wang
Ziyi Yang
Chenguang Zhu
Kai-Wei Chang
Yizhou Sun
KELMRALMLRM
102
26
0
15 Nov 2022
Versatile Diffusion: Text, Images and Variations All in One Diffusion
  Model
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model
Xingqian Xu
Zhangyang Wang
Eric Zhang
Kai Wang
Humphrey Shi
DiffM
186
198
0
15 Nov 2022
An FNet based Auto Encoder for Long Sequence News Story Generation
An FNet based Auto Encoder for Long Sequence News Story Generation
Paul K. Mandal
Rakeshkumar V. Mahto
26
0
0
15 Nov 2022
QAmeleon: Multilingual QA with Only 5 Examples
QAmeleon: Multilingual QA with Only 5 Examples
Priyanka Agrawal
Chris Alberti
Fantine Huot
Joshua Maynez
Ji Ma
Sebastian Ruder
Kuzman Ganchev
Dipanjan Das
Mirella Lapata
67
30
0
15 Nov 2022
A Universal Discriminator for Zero-Shot Generalization
A Universal Discriminator for Zero-Shot Generalization
Haike Xu
Zongyu Lin
Jing Zhou
Yanan Zheng
Zhilin Yang
AI4CE
66
16
0
15 Nov 2022
GLUE-X: Evaluating Natural Language Understanding Models from an
  Out-of-distribution Generalization Perspective
GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective
Linyi Yang
Shuibai Zhang
Libo Qin
Yafu Li
Yidong Wang
Hanmeng Liu
Jindong Wang
Xingxu Xie
Yue Zhang
ELM
188
82
0
15 Nov 2022
A Benchmark and Dataset for Post-OCR text correction in Sanskrit
A Benchmark and Dataset for Post-OCR text correction in Sanskrit
Ayush Maheshwari
Nikhil Singh
Amrith Krishna
Ganesh Ramakrishnan
78
13
0
15 Nov 2022
The Lean Data Scientist: Recent Advances towards Overcoming the Data
  Bottleneck
The Lean Data Scientist: Recent Advances towards Overcoming the Data Bottleneck
Chen Shani
Jonathan Zarecki
Dafna Shahaf
41
6
0
15 Nov 2022
Breakpoint Transformers for Modeling and Tracking Intermediate Beliefs
Breakpoint Transformers for Modeling and Tracking Intermediate Beliefs
Kyle Richardson
Ronen Tamari
Oren Sultan
Reut Tsarfaty
Dafna Shahaf
Ashish Sabharwal
KELM
106
8
0
15 Nov 2022
YORO -- Lightweight End to End Visual Grounding
YORO -- Lightweight End to End Visual Grounding
Chih-Hui Ho
Srikar Appalaraju
Bhavan A. Jasani
R. Manmatha
Nuno Vasconcelos
ObjD
60
22
0
15 Nov 2022
Hierarchical Phrase-based Sequence-to-Sequence Learning
Hierarchical Phrase-based Sequence-to-Sequence Learning
Bailin Wang
Ivan Titov
Jacob Andreas
Yoon Kim
68
7
0
15 Nov 2022
A Survey for Efficient Open Domain Question Answering
A Survey for Efficient Open Domain Question Answering
Qin Zhang
Shan Chen
Dongkuan Xu
Qingqing Cao
Xiaojun Chen
Trevor Cohn
Meng Fang
90
36
0
15 Nov 2022
Prompting Language Models for Linguistic Structure
Prompting Language Models for Linguistic Structure
Terra Blevins
Hila Gonen
Luke Zettlemoyer
LRM
127
44
0
15 Nov 2022
Generative Aspect-Based Sentiment Analysis with Contrastive Learning and
  Expressive Structure
Generative Aspect-Based Sentiment Analysis with Contrastive Learning and Expressive Structure
Joseph Peper
Lu Wang
55
34
0
14 Nov 2022
Logical Tasks for Measuring Extrapolation and Rule Comprehension
Logical Tasks for Measuring Extrapolation and Rule Comprehension
Ippei Fujisawa
Ryota Kanai
ELMLRM
71
4
0
14 Nov 2022
EVA: Exploring the Limits of Masked Visual Representation Learning at
  Scale
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
Yuxin Fang
Wen Wang
Binhui Xie
Quan-Sen Sun
Ledell Yu Wu
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLMCLIP
254
730
0
14 Nov 2022
Follow the Wisdom of the Crowd: Effective Text Generation via Minimum
  Bayes Risk Decoding
Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding
Mirac Suzgun
Luke Melas-Kyriazi
Dan Jurafsky
76
46
0
14 Nov 2022
UGIF: UI Grounded Instruction Following
UGIF: UI Grounded Instruction Following
S. Venkatesh
Partha P. Talukdar
S. Narayanan
140
12
0
14 Nov 2022
Why Did the Chicken Cross the Road? Rephrasing and Analyzing Ambiguous
  Questions in VQA
Why Did the Chicken Cross the Road? Rephrasing and Analyzing Ambiguous Questions in VQA
Elias Stengel-Eskin
Jimena Guallar-Blasco
Yi Zhou
Benjamin Van Durme
UQLM
72
12
0
14 Nov 2022
CST5: Data Augmentation for Code-Switched Semantic Parsing
CST5: Data Augmentation for Code-Switched Semantic Parsing
Anmol Agarwal
Jigar Gupta
Rahul Goel
Shyam Upadhyay
Pankaj Joshi
R. Aravamudhan
74
9
0
14 Nov 2022
Calibrated Interpretation: Confidence Estimation in Semantic Parsing
Calibrated Interpretation: Confidence Estimation in Semantic Parsing
Elias Stengel-Eskin
Benjamin Van Durme
UQLM
162
25
0
14 Nov 2022
Multi-VQG: Generating Engaging Questions for Multiple Images
Multi-VQG: Generating Engaging Questions for Multiple Images
Min-Hsuan Yeh
Vicent Chen
Ting-Hao Haung
Lun-Wei Ku
CoGe
113
7
0
14 Nov 2022
Previous
123...144145146...197198199
Next