ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,983 papers shown
Title
Seeing Seeds Beyond Weeds: Green Teaming Generative AI for Beneficial
  Uses
Seeing Seeds Beyond Weeds: Green Teaming Generative AI for Beneficial Uses
Logan Stapleton
Jordan Taylor
Sarah E Fox
Tongshuang Wu
Haiyi Zhu
83
13
0
30 May 2023
Conceptual Design Generation Using Large Language Models
Conceptual Design Generation Using Large Language Models
Kevin Ma
Daniele Grandi
Christopher McComb
K. Goucher-Lambert
116
24
0
30 May 2023
LLM-BRAIn: AI-driven Fast Generation of Robot Behaviour Tree based on
  Large Language Model
LLM-BRAIn: AI-driven Fast Generation of Robot Behaviour Tree based on Large Language Model
Artem Lykov
Dzmitry Tsetserukou
LM&Ro
54
30
0
30 May 2023
Concise Answers to Complex Questions: Summarization of Long-form Answers
Concise Answers to Complex Questions: Summarization of Long-form Answers
Abhilash Potluri
Fangyuan Xu
Eunsol Choi
ELM
72
11
0
30 May 2023
Intriguing Properties of Quantization at Scale
Intriguing Properties of Quantization at Scale
Arash Ahmadian
Saurabh Dash
Hongyu Chen
Bharat Venkitesh
Stephen Gou
Phil Blunsom
Ahmet Üstün
Sara Hooker
MQ
131
38
0
30 May 2023
What Can We Learn from Unlearnable Datasets?
What Can We Learn from Unlearnable Datasets?
Pedro Sandoval-Segura
Vasu Singla
Jonas Geiping
Micah Goldblum
Tom Goldstein
86
16
0
30 May 2023
Preserving Pre-trained Features Helps Calibrate Fine-tuned Language
  Models
Preserving Pre-trained Features Helps Calibrate Fine-tuned Language Models
Guande He
Jianfei Chen
Jun Zhu
94
22
0
30 May 2023
Controlled Text Generation with Hidden Representation Transformations
Controlled Text Generation with Hidden Representation Transformations
Vaibhav Kumar
H. Koorehdavoudi
Masud Moshtaghi
Amita Misra
Ankit Chadha
Emilio Ferrara
70
3
0
30 May 2023
Cross Encoding as Augmentation: Towards Effective Educational Text
  Classification
Cross Encoding as Augmentation: Towards Effective Educational Text Classification
Hyun Seung Lee
Seungtaek Choi
Yunsung Lee
Hyeongdon Moon
Shinhyeok Oh
Myeongho Jeong
Hyojun Go
C. Wallraven
81
1
0
30 May 2023
Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic
  Sentence Segmentation
Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation
Benjamin Minixhofer
Jonas Pfeiffer
Ivan Vulić
101
18
0
30 May 2023
PreQuant: A Task-agnostic Quantization Approach for Pre-trained Language
  Models
PreQuant: A Task-agnostic Quantization Approach for Pre-trained Language Models
Zhuocheng Gong
Jiahao Liu
Qifan Wang
Yang Yang
Jingang Wang
Wei Wu
Yunsen Xian
Dongyan Zhao
Rui Yan
MQ
84
5
0
30 May 2023
Knowledge Graph-Augmented Language Models for Knowledge-Grounded
  Dialogue Generation
Knowledge Graph-Augmented Language Models for Knowledge-Grounded Dialogue Generation
Minki Kang
Jin Myung Kwak
Jinheon Baek
Sung Ju Hwang
RALM
99
63
0
30 May 2023
Generate then Select: Open-ended Visual Question Answering Guided by
  World Knowledge
Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge
Xingyu Fu
Shenmin Zhang
Gukyeong Kwon
Pramuditha Perera
Henghui Zhu
...
Zhiguo Wang
Vittorio Castelli
Patrick Ng
Dan Roth
Bing Xiang
90
22
0
30 May 2023
HiFA: High-fidelity Text-to-3D Generation with Advanced Diffusion
  Guidance
HiFA: High-fidelity Text-to-3D Generation with Advanced Diffusion Guidance
Junzhe Zhu
Peiye Zhuang
Oluwasanmi Koyejo
DiffM
106
79
0
30 May 2023
Domain Specialization as the Key to Make Large Language Models
  Disruptive: A Comprehensive Survey
Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey
Chen Ling
Xujiang Zhao
Jiaying Lu
Chengyuan Deng
Can Zheng
...
Chris White
Quanquan Gu
Jian Pei
Carl Yang
Liang Zhao
ALM
172
140
0
30 May 2023
LayerDiffusion: Layered Controlled Image Editing with Diffusion Models
LayerDiffusion: Layered Controlled Image Editing with Diffusion Models
Pengzhi Li
Qinxuan Huang
Yikang Ding
Zhiheng Li
DiffM
80
38
0
30 May 2023
Enhanced Chart Understanding in Vision and Language Task via Cross-modal
  Pre-training on Plot Table Pairs
Enhanced Chart Understanding in Vision and Language Task via Cross-modal Pre-training on Plot Table Pairs
Mingyang Zhou
Yi R. Fung
Long Chen
Christopher Thomas
Heng Ji
Shih-Fu Chang
115
13
0
29 May 2023
Alfred: A System for Prompted Weak Supervision
Alfred: A System for Prompted Weak Supervision
Peilin Yu
Stephen H. Bach
89
10
0
29 May 2023
How Effective Are Neural Networks for Fixing Security Vulnerabilities
How Effective Are Neural Networks for Fixing Security Vulnerabilities
Yi Wu
Nan Jiang
H. Pham
Thibaud Lutellier
Jordan Davis
Lin Tan
Petr Babkin
Sameena Shah
AAML
108
97
0
29 May 2023
Information Association for Language Model Updating by Mitigating
  LM-Logical Discrepancy
Information Association for Language Model Updating by Mitigating LM-Logical Discrepancy
Pengfei Yu
Heng Ji
KELM
82
10
0
29 May 2023
Brainformers: Trading Simplicity for Efficiency
Brainformers: Trading Simplicity for Efficiency
Yan-Quan Zhou
Nan Du
Yanping Huang
Daiyi Peng
Chang Lan
...
Zhifeng Chen
Quoc V. Le
Claire Cui
J.H.J. Laundon
J. Dean
MoE
92
27
0
29 May 2023
Contextual Object Detection with Multimodal Large Language Models
Contextual Object Detection with Multimodal Large Language Models
Yuhang Zang
Wei Li
Jun Han
Kaiyang Zhou
Chen Change Loy
ObjDVLMMLLM
129
88
0
29 May 2023
GlyphControl: Glyph Conditional Control for Visual Text Generation
GlyphControl: Glyph Conditional Control for Visual Text Generation
Yukang Yang
Dongnan Gui
Yuhui Yuan
Weicong Liang
Haisong Ding
Hang-Rui Hu
Kai Chen
DiffM
110
85
0
29 May 2023
A Critical Evaluation of Evaluations for Long-form Question Answering
A Critical Evaluation of Evaluations for Long-form Question Answering
Fangyuan Xu
Yixiao Song
Mohit Iyyer
Eunsol Choi
ELM
110
104
0
29 May 2023
GripRank: Bridging the Gap between Retrieval and Generation via the
  Generative Knowledge Improved Passage Ranking
GripRank: Bridging the Gap between Retrieval and Generation via the Generative Knowledge Improved Passage Ranking
Jiaqi Bai
Hongcheng Guo
Jiaheng Liu
Jian Yang
Xinnian Liang
Zhao Yan
Zhoujun Li
RALM
83
15
0
29 May 2023
Code Prompting: a Neural Symbolic Method for Complex Reasoning in Large
  Language Models
Code Prompting: a Neural Symbolic Method for Complex Reasoning in Large Language Models
Yitao Hu
Haotong Yang
Zhouchen Lin
Muhan Zhang
ReLMLRM
81
18
0
29 May 2023
ChatGPT-powered Conversational Drug Editing Using Retrieval and Domain
  Feedback
ChatGPT-powered Conversational Drug Editing Using Retrieval and Domain Feedback
Shengchao Liu
Jiong Wang
Yijin Yang
Chengpeng Wang
Ling Liu
Hongyu Guo
Chaowei Xiao
LM&MAKELMAI4MH
109
38
0
29 May 2023
The Utility of Large Language Models and Generative AI for Education
  Research
The Utility of Large Language Models and Generative AI for Education Research
Andrew Katz
Umair Shakir
B. Chambers
AI4CE
75
6
0
29 May 2023
A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark
  Datasets
A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets
Md Tahmid Rahman Laskar
M Saiful Bari
Mizanur Rahman
Md Amran Hossen Bhuiyan
Shafiq Joty
J. Huang
LM&MAELMALM
131
193
0
29 May 2023
Exploring the Compositional Generalization in Context Dependent
  Text-to-SQL Parsing
Exploring the Compositional Generalization in Context Dependent Text-to-SQL Parsing
Aiwei Liu
Wen Liu
Xuming Hu
Shuang Li
Fukun Ma
Yawen Yang
Lijie Wen
69
2
0
29 May 2023
InstructEdit: Improving Automatic Masks for Diffusion-based Image
  Editing With User Instructions
InstructEdit: Improving Automatic Masks for Diffusion-based Image Editing With User Instructions
Qian Wang
Biao Zhang
Michael Birsak
Peter Wonka
DiffM
69
37
0
29 May 2023
Faithfulness Tests for Natural Language Explanations
Faithfulness Tests for Natural Language Explanations
Pepa Atanasova
Oana-Maria Camburu
Christina Lioma
Thomas Lukasiewicz
J. Simonsen
Isabelle Augenstein
FAtt
122
67
0
29 May 2023
Conditional Score Guidance for Text-Driven Image-to-Image Translation
Conditional Score Guidance for Text-Driven Image-to-Image Translation
Hyunsoo Lee
Minsoo Kang
Bohyung Han
DiffM
64
15
0
29 May 2023
Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation
Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation
Jia-Bin Huang
Yi Ren
Rongjie Huang
Dongchao Yang
Zhenhui Ye
Chen Zhang
Jinglin Liu
Xiang Yin
Zejun Ma
Zhou Zhao
DiffM
123
64
0
29 May 2023
Aligning Optimization Trajectories with Diffusion Models for Constrained
  Design Generation
Aligning Optimization Trajectories with Diffusion Models for Constrained Design Generation
Giorgio Giannone
Akash Srivastava
Ole Winther
Faez Ahmed
DiffMAI4CE
99
36
0
29 May 2023
ContrastNER: Contrastive-based Prompt Tuning for Few-shot NER
ContrastNER: Contrastive-based Prompt Tuning for Few-shot NER
Amirhossein Layegh
A. H. Payberah
A. Soylu
Dumitru Roman
M. Matskin
VLM
80
8
0
29 May 2023
Federated Learning of Gboard Language Models with Differential Privacy
Federated Learning of Gboard Language Models with Differential Privacy
Zheng Xu
Yanxiang Zhang
Galen Andrew
Christopher A. Choquette-Choo
Peter Kairouz
H. B. McMahan
Jesse Rosenstock
Yuanbo Zhang
FedML
141
82
0
29 May 2023
Byte-Level Grammatical Error Correction Using Synthetic and Curated
  Corpora
Byte-Level Grammatical Error Correction Using Synthetic and Curated Corpora
Svanhvít Lilja Ingólfsdóttir
Pétur Orri Ragnarsson
H. Jónsson
Haukur Barri Símonarson
Vilhjálmur Þorsteinsson
Vésteinn Snæbjarnarson
SyDa
82
9
0
29 May 2023
LLM-QAT: Data-Free Quantization Aware Training for Large Language Models
LLM-QAT: Data-Free Quantization Aware Training for Large Language Models
Zechun Liu
Barlas Oğuz
Changsheng Zhao
Ernie Chang
Pierre Stock
Yashar Mehdad
Yangyang Shi
Raghuraman Krishnamoorthi
Vikas Chandra
MQ
137
209
0
29 May 2023
Vec2Gloss: definition modeling leveraging contextualized vectors with
  Wordnet gloss
Vec2Gloss: definition modeling leveraging contextualized vectors with Wordnet gloss
Yu-Hsiang Tseng
Mao-Chang Ku
Wei-Ling Chen
Yu-Lin Chang
S. Hsieh
59
2
0
29 May 2023
NOTABLE: Transferable Backdoor Attacks Against Prompt-based NLP Models
NOTABLE: Transferable Backdoor Attacks Against Prompt-based NLP Models
Kai Mei
Zheng Li
Zhenting Wang
Yang Zhang
Shiqing Ma
AAMLSILM
94
51
0
28 May 2023
A Quantitative Review on Language Model Efficiency Research
A Quantitative Review on Language Model Efficiency Research
Meng Jiang
Hy Dang
Lingbo Tong
78
0
0
28 May 2023
Semantic Segmentation with Bidirectional Language Models Improves
  Long-form ASR
Semantic Segmentation with Bidirectional Language Models Improves Long-form ASR
Wenjie Huang
Hao Zhang
Shankar Kumar
Shuo-yiin Chang
Tara N. Sainath
77
2
0
28 May 2023
GIMLET: A Unified Graph-Text Model for Instruction-Based Molecule
  Zero-Shot Learning
GIMLET: A Unified Graph-Text Model for Instruction-Based Molecule Zero-Shot Learning
Haiteng Zhao
Shengchao Liu
Chang Ma
Hannan Xu
Jie Fu
Zhihong Deng
Lingpeng Kong
Qi Liu
94
65
0
28 May 2023
Feature-Learning Networks Are Consistent Across Widths At Realistic
  Scales
Feature-Learning Networks Are Consistent Across Widths At Realistic Scales
Nikhil Vyas
Alexander B. Atanasov
Blake Bordelon
Depen Morwani
Sabarish Sainathan
Cengiz Pehlevan
131
26
0
28 May 2023
FuseCap: Leveraging Large Language Models for Enriched Fused Image
  Captions
FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions
Noam Rotstein
David Bensaid
Shaked Brody
Roy Ganz
Ron Kimmel
VLM
89
31
0
28 May 2023
Knowledge-Augmented Reasoning Distillation for Small Language Models in
  Knowledge-Intensive Tasks
Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive Tasks
Minki Kang
Seanie Lee
Jinheon Baek
Kenji Kawaguchi
Sung Ju Hwang
ALMLRM
123
66
0
28 May 2023
Emergent Modularity in Pre-trained Transformers
Emergent Modularity in Pre-trained Transformers
Zhengyan Zhang
Zhiyuan Zeng
Yankai Lin
Chaojun Xiao
Xiaozhi Wang
Xu Han
Zhiyuan Liu
Ruobing Xie
Maosong Sun
Jie Zhou
MoE
124
25
0
28 May 2023
One Network, Many Masks: Towards More Parameter-Efficient Transfer
  Learning
One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning
Guangtao Zeng
Peiyuan Zhang
Wei Lu
95
22
0
28 May 2023
Decoding the Underlying Meaning of Multimodal Hateful Memes
Decoding the Underlying Meaning of Multimodal Hateful Memes
Ming Shan Hee
Wen-Haw Chong
Roy Ka-wei Lee
98
43
0
28 May 2023
Previous
123...128129130...198199200
Next