ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,984 papers shown
Title
Unsupervised Paraphrasing of Multiword Expressions
Unsupervised Paraphrasing of Multiword Expressions
Takashi Wada
Yuji Matsumoto
Timothy Baldwin
Jey Han Lau
66
0
0
02 Jun 2023
An Overview on Generative AI at Scale with Edge-Cloud Computing
An Overview on Generative AI at Scale with Edge-Cloud Computing
Yun Cheng Wang
Jintang Xue
Chengwei Wei
C.-C. Jay Kuo
75
35
0
02 Jun 2023
Automatic Translation of Hate Speech to Non-hate Speech in Social Media
  Texts
Automatic Translation of Hate Speech to Non-hate Speech in Social Media Texts
Ye. Kostiuk
A. Tonja
Grigori Sidorov
Olga Kolesnikova
62
0
0
02 Jun 2023
THiFLY Research at SemEval-2023 Task 7: A Multi-granularity System for
  CTR-based Textual Entailment and Evidence Retrieval
THiFLY Research at SemEval-2023 Task 7: A Multi-granularity System for CTR-based Textual Entailment and Evidence Retrieval
Yuxuan Zhou
Ziyun Jin
Meiwei Li
Miao Li
Xien Liu
Xinxin You
Ji Wu
48
11
0
02 Jun 2023
Adapting an Unadaptable ASR System
Adapting an Unadaptable ASR System
Rao Ma
Mengjie Qian
Mark Gales
Kate Knill
112
3
0
01 Jun 2023
Did You Read the Instructions? Rethinking the Effectiveness of Task
  Definitions in Instruction Learning
Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learning
Fan Yin
Jesse Vig
Philippe Laban
Shafiq Joty
Caiming Xiong
Chien-Sheng Wu
ALM
77
44
0
01 Jun 2023
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora
  with Web Data, and Web Data Only
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only
Guilherme Penedo
Quentin Malartic
Daniel Hesslow
Ruxandra-Aimée Cojocaru
Alessandro Cappelli
Hamza Alobeidli
B. Pannier
Ebtesam Almazrouei
Julien Launay
207
778
0
01 Jun 2023
Hierarchical Attention Encoder Decoder
Hierarchical Attention Encoder Decoder
Asier Mujika
BDL
66
3
0
01 Jun 2023
Reimagining Retrieval Augmented Language Models for Answering Queries
Reimagining Retrieval Augmented Language Models for Answering Queries
W. Tan
Yuliang Li
Pedro Rodriguez
Rich James
Xi Lin
A. Halevy
Scott Yih
KELMLRM
105
9
0
01 Jun 2023
StyleDrop: Text-to-Image Generation in Any Style
StyleDrop: Text-to-Image Generation in Any Style
Kihyuk Sohn
Nataniel Ruiz
Kimin Lee
Daniel Castro Chin
Irina Blok
...
Yuanzhen Li
Yuan Hao
Irfan Essa
Michael Rubinstein
Dilip Krishnan
86
152
0
01 Jun 2023
Discovering Failure Modes of Text-guided Diffusion Models via
  Adversarial Search
Discovering Failure Modes of Text-guided Diffusion Models via Adversarial Search
Qihao Liu
Adam Kortylewski
Yutong Bai
Song Bai
Alan Yuille
DiffM
125
12
0
01 Jun 2023
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion
  Models
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models
Chang-rui Liu
Haoning Wu
Yujie Zhong
Xiaoyu Zhang
Yanfeng Wang
Weidi Xie
DiffMVLM
169
44
0
01 Jun 2023
Make-Your-Video: Customized Video Generation Using Textual and
  Structural Guidance
Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Jinbo Xing
Menghan Xia
Yuxin Liu
Yuechen Zhang
Yong Zhang
...
Haoxin Chen
Xiaodong Cun
Xintao Wang
Ying Shan
T. Wong
VGenDiffM
85
93
0
01 Jun 2023
Topic-Guided Sampling For Data-Efficient Multi-Domain Stance Detection
Topic-Guided Sampling For Data-Efficient Multi-Domain Stance Detection
Erik Arakelyan
Arnav Arora
Isabelle Augenstein
66
10
0
01 Jun 2023
Explanation Graph Generation via Generative Pre-training over Synthetic
  Graphs
Explanation Graph Generation via Generative Pre-training over Synthetic Graphs
H. Cui
Sha Li
Yu Zhang
Qi Shi
135
1
0
01 Jun 2023
Wuerstchen: An Efficient Architecture for Large-Scale Text-to-Image
  Diffusion Models
Wuerstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models
Pablo Pernias
Dominic Rampas
Mats L. Richter
Christopher Pal
Marc Aubreville
DiffMVLM
119
45
0
01 Jun 2023
Effective Structured Prompting by Meta-Learning and Representative
  Verbalizer
Effective Structured Prompting by Meta-Learning and Representative Verbalizer
Weisen Jiang
Yu Zhang
James T. Kwok
VLMOffRL
102
18
0
01 Jun 2023
Layout and Task Aware Instruction Prompt for Zero-shot Document Image
  Question Answering
Layout and Task Aware Instruction Prompt for Zero-shot Document Image Question Answering
Wenjin Wang
Yunhao Li
Yixin Ou
Yin Zhang
VLM
135
26
0
01 Jun 2023
Revisiting Event Argument Extraction: Can EAE Models Learn Better When
  Being Aware of Event Co-occurrences?
Revisiting Event Argument Extraction: Can EAE Models Learn Better When Being Aware of Event Co-occurrences?
Yuxin He
Jing-Hao Hu
Buzhou Tang
81
30
0
01 Jun 2023
Make Pre-trained Model Reversible: From Parameter to Memory Efficient
  Fine-Tuning
Make Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning
Baohao Liao
Shaomu Tan
Christof Monz
KELM
116
30
0
01 Jun 2023
Divide, Conquer, and Combine: Mixture of Semantic-Independent Experts
  for Zero-Shot Dialogue State Tracking
Divide, Conquer, and Combine: Mixture of Semantic-Independent Experts for Zero-Shot Dialogue State Tracking
Qingyue Wang
Liang Ding
Yanan Cao
Yibing Zhan
Zheng Lin
Shi Wang
Dacheng Tao
Li Guo
MoMeMoE
102
12
0
01 Jun 2023
Uncertainty-Aware Unlikelihood Learning Improves Generative Aspect
  Sentiment Quad Prediction
Uncertainty-Aware Unlikelihood Learning Improves Generative Aspect Sentiment Quad Prediction
Mengting Hu
Yinhao Bai
Yike Wu
Zhen Zhang
Liqi Zhang
H. Gao
Shiwan Zhao
Min-Tzu Huang
117
18
0
01 Jun 2023
Adapting Pre-trained Language Models to Vision-Language Tasks via
  Dynamic Visual Prompting
Adapting Pre-trained Language Models to Vision-Language Tasks via Dynamic Visual Prompting
Shubin Huang
Qiong Wu
Yiyi Zhou
Weijie Chen
Rongsheng Zhang
Xiaoshuai Sun
Rongrong Ji
VLMVPVLMLRM
59
0
0
01 Jun 2023
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Shentao Yang
Shujian Zhang
Congying Xia
Yihao Feng
Caiming Xiong
Mi Zhou
146
28
0
01 Jun 2023
Better Context Makes Better Code Language Models: A Case Study on
  Function Call Argument Completion
Better Context Makes Better Code Language Models: A Case Study on Function Call Argument Completion
Hengzhi Pei
Jinman Zhao
Leonard Lausen
Sheng Zha
George Karypis
ELMLRM
67
22
0
01 Jun 2023
Large Scale Generative Multimodal Attribute Extraction for E-commerce
  Attributes
Large Scale Generative Multimodal Attribute Extraction for E-commerce Attributes
Anant Khandelwal
Happy Mittal
S. Kulkarni
D. Gupta
72
10
0
01 Jun 2023
How to Estimate Model Transferability of Pre-Trained Speech Models?
How to Estimate Model Transferability of Pre-Trained Speech Models?
Zih-Ching Chen
Chao-Han Huck Yang
Yue Liu
Yu Zhang
Nanxin Chen
Shoufeng Chang
Rohit Prabhavalkar
Hung-yi Lee
Tara N. Sainath
186
9
0
01 Jun 2023
FlexRound: Learnable Rounding based on Element-wise Division for
  Post-Training Quantization
FlexRound: Learnable Rounding based on Element-wise Division for Post-Training Quantization
J. H. Lee
Jeonghoon Kim
S. Kwon
Dongsoo Lee
MQ
124
38
0
01 Jun 2023
Training-free Neural Architecture Search for RNNs and Transformers
Training-free Neural Architecture Search for RNNs and Transformers
Aaron Serianni
Jugal Kalita
82
7
0
01 Jun 2023
Towards Foundation Models for Scientific Machine Learning:
  Characterizing Scaling and Transfer Behavior
Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior
Shashank Subramanian
P. Harrington
Kurt Keutzer
W. Bhimji
Dmitriy Morozov
Michael W. Mahoney
A. Gholami
AI4CE
130
80
0
01 Jun 2023
From Pixels to UI Actions: Learning to Follow Instructions via Graphical
  User Interfaces
From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces
Peter Shaw
Mandar Joshi
James Cohan
Jonathan Berant
Panupong Pasupat
Hexiang Hu
Urvashi Khandelwal
Kenton Lee
Kristina Toutanova
LLMAGLM&Ro
104
58
0
31 May 2023
Toward Understanding Why Adam Converges Faster Than SGD for Transformers
Toward Understanding Why Adam Converges Faster Than SGD for Transformers
Yan Pan
Yuanzhi Li
148
45
0
31 May 2023
An Invariant Learning Characterization of Controlled Text Generation
An Invariant Learning Characterization of Controlled Text Generation
Carolina Zheng
Claudia Shi
Keyon Vafa
Amir Feder
David M. Blei
OOD
103
8
0
31 May 2023
Measuring the Robustness of NLP Models to Domain Shifts
Measuring the Robustness of NLP Models to Domain Shifts
Nitay Calderon
Naveh Porat
Eyal Ben-David
Alexander Chapanin
Zorik Gekhman
Nadav Oved
Vitaly Shalumov
Roi Reichart
150
8
0
31 May 2023
Multilingual Multi-Figurative Language Detection
Multilingual Multi-Figurative Language Detection
Huiyuan Lai
Antonio Toral
Malvina Nissim
51
1
0
31 May 2023
Monotonic Location Attention for Length Generalization
Monotonic Location Attention for Length Generalization
Jishnu Ray Chowdhury
Cornelia Caragea
LLMAG
85
8
0
31 May 2023
Scalable Learning of Latent Language Structure With Logical Offline
  Cycle Consistency
Scalable Learning of Latent Language Structure With Logical Offline Cycle Consistency
Mayank Agarwal
Ramón Fernández Astudillo
Tahira Naseem
Subhajit Chaudhury
Pavan Kapanipathi
Salim Roukos
Alexander G. Gray
OffRL
70
0
0
31 May 2023
Correcting Semantic Parses with Natural Language through Dynamic Schema
  Encoding
Correcting Semantic Parses with Natural Language through Dynamic Schema Encoding
Parker Glenn
Parag Dakle
Preethi Raghavan
64
3
0
31 May 2023
How to Plant Trees in Language Models: Data and Architectural Effects on
  the Emergence of Syntactic Inductive Biases
How to Plant Trees in Language Models: Data and Architectural Effects on the Emergence of Syntactic Inductive Biases
Aaron Mueller
Tal Linzen
AI4CE
67
21
0
31 May 2023
AQE: Argument Quadruplet Extraction via a Quad-Tagging Augmented
  Generative Approach
AQE: Argument Quadruplet Extraction via a Quad-Tagging Augmented Generative Approach
Jianwen Guo
Liying Cheng
Wenxuan Zhang
Stanley Kok
Xin Li
Lidong Bing
60
9
0
31 May 2023
BEIR-PL: Zero Shot Information Retrieval Benchmark for the Polish
  Language
BEIR-PL: Zero Shot Information Retrieval Benchmark for the Polish Language
Konrad Wojtasik
Vadim Shishkin
Kacper Wolowiec
Arkadiusz Janz
Maciej Piasecki
70
11
0
31 May 2023
Deliberate then Generate: Enhanced Prompting Framework for Text
  Generation
Deliberate then Generate: Enhanced Prompting Framework for Text Generation
Bei Li
Rui Wang
Junliang Guo
Kaitao Song
Xuejiao Tan
Hany Hassan
Arul Menezes
Tong Xiao
Jiang Bian
JingBo Zhu
88
14
0
31 May 2023
Primal-Attention: Self-attention through Asymmetric Kernel SVD in Primal
  Representation
Primal-Attention: Self-attention through Asymmetric Kernel SVD in Primal Representation
Yingyi Chen
Qinghua Tao
F. Tonin
Johan A. K. Suykens
103
22
0
31 May 2023
Text-to-Speech Pipeline for Swiss German -- A comparison
Text-to-Speech Pipeline for Swiss German -- A comparison
Tobias Bollinger
Jan Deriu
Manfred Vogel
DiffM
60
0
0
31 May 2023
Red Teaming Language Model Detectors with Language Models
Red Teaming Language Model Detectors with Language Models
Zhouxing Shi
Yihan Wang
Fan Yin
Xiangning Chen
Kai-Wei Chang
Cho-Jui Hsieh
DeLMO
92
57
0
31 May 2023
What does the Failure to Reason with "Respectively" in Zero/Few-Shot
  Settings Tell Us about Language Models?
What does the Failure to Reason with "Respectively" in Zero/Few-Shot Settings Tell Us about Language Models?
Ruixiang Cui
Seolhwa Lee
Daniel Hershcovich
Anders Søgaard
54
2
0
31 May 2023
LAIT: Efficient Multi-Segment Encoding in Transformers with
  Layer-Adjustable Interaction
LAIT: Efficient Multi-Segment Encoding in Transformers with Layer-Adjustable Interaction
Jeremiah Milbauer
Annie Louis
Mohammad Javad Hosseini
Alex Fabrikant
Donald Metzler
Tal Schuster
121
9
0
31 May 2023
Large Language Models Are Not Strong Abstract Reasoners
Large Language Models Are Not Strong Abstract Reasoners
Gaël Gendron
Qiming Bao
Michael Witbrock
Gillian Dobbie
ELMLRM
129
37
0
31 May 2023
The Impact of Positional Encoding on Length Generalization in
  Transformers
The Impact of Positional Encoding on Length Generalization in Transformers
Amirhossein Kazemnejad
Inkit Padhi
Karthikeyan N. Ramamurthy
Payel Das
Siva Reddy
104
209
0
31 May 2023
BotArtist: Generic approach for bot detection in Twitter via semi-automatic machine learning pipeline
BotArtist: Generic approach for bot detection in Twitter via semi-automatic machine learning pipeline
Alexander Shevtsov
D. Antonakaki
Ioannis Lamprou
Polyvios Pratikakis
Sotiris Ioannidis
172
0
0
31 May 2023
Previous
123...127128129...198199200
Next