Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,952 papers shown
Title
Symmetric Dot-Product Attention for Efficient Training of BERT Language Models
Martin Courtois
Malte Ostendorff
Leonhard Hennig
Georg Rehm
95
2
0
10 Jun 2024
Tx-LLM: A Large Language Model for Therapeutics
Juan Manuel Zambrano Chaves
Eric Wang
Tao Tu
E. D. Vaishnav
Byron Lee
S. S. Mahdavi
Christopher Semturs
David Fleet
Vivek Natarajan
Shekoofeh Azizi
LM&MA
125
20
0
10 Jun 2024
Learning Fine-Grained Controllability on Speech Generation via Efficient Fine-Tuning
Chung-Ming Chien
Andros Tjandra
Apoorv Vyas
Matt Le
Bowen Shi
Wei-Ning Hsu
82
0
0
10 Jun 2024
MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models
Zichun Yu
Spandan Das
Chenyan Xiong
128
37
0
10 Jun 2024
Multi-Prompting Decoder Helps Better Language Understanding
Zifeng Cheng
Zhaoling Chen
Zhiwei Jiang
Yafeng Yin
Shiping Ge
Shiping Ge
Qing Gu
AI4CE
102
1
0
10 Jun 2024
MolX: Enhancing Large Language Models for Molecular Learning with A Multi-Modal Extension
Khiem Le
Zhichun Guo
Kaiwen Dong
Xiaobao Huang
B. Nan
Roshni G. Iyer
Xiangliang Zhang
Olaf Wiest
Wei Wang
Nitesh Chawla
116
0
0
10 Jun 2024
Security Vulnerability Detection with Multitask Self-Instructed Fine-Tuning of Large Language Models
Aidan Z. H. Yang
Haoye Tian
He Ye
Ruben Martins
Claire Le Goues
59
5
0
09 Jun 2024
Feriji: A French-Zarma Parallel Corpus, Glossary & Translator
Mamadou K. Keita
Elysabhete Amadou Ibrahim
Habibatou Abdoulaye Alfari
Christopher Homan
95
1
0
09 Jun 2024
Attention as a Hypernetwork
Simon Schug
Seijin Kobayashi
Yassir Akram
João Sacramento
Razvan Pascanu
GNN
86
5
0
09 Jun 2024
RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation
Kiseung Kim
Jay-Yoon Lee
RALM
89
7
0
09 Jun 2024
A Superalignment Framework in Autonomous Driving with Large Language Models
Xiangrui Kong
Thomas Braunl
Marco Fahmi
Yue Wang
85
9
0
09 Jun 2024
Exploring the Benefits of Tokenization of Discrete Acoustic Units
Avihu Dekel
Raul Fernandez
87
2
0
08 Jun 2024
Generalist Multimodal AI: A Review of Architectures, Challenges and Opportunities
Sai Munikoti
Ian Stewart
Sameera Horawalavithana
Henry Kvinge
Tegan H. Emerson
Sandra E Thompson
Karl Pazdernik
111
2
0
08 Jun 2024
Teaching-Assistant-in-the-Loop: Improving Knowledge Distillation from Imperfect Teacher Models in Low-Budget Scenarios
Yuhang Zhou
Wei Ai
98
7
0
08 Jun 2024
Integrating Text and Image Pre-training for Multi-modal Algorithmic Reasoning
Zijian Zhang
Wei Liu
102
0
0
08 Jun 2024
SuperPos-Prompt: Enhancing Soft Prompt Tuning of Language Models with Superposition of Multi Token Embeddings
MohammadAli SadraeiJavaeri
Ehsaneddin Asgari
A. Mchardy
Hamid R. Rabiee
VLM
AAML
73
0
0
07 Jun 2024
VTrans: Accelerating Transformer Compression with Variational Information Bottleneck based Pruning
Oshin Dutta
Ritvik Gupta
Sumeet Agarwal
95
2
0
07 Jun 2024
Improving Logits-based Detector without Logits from Black-box LLMs
Cong Zeng
Shengkun Tang
Xianjun Yang
Yuanzhou Chen
Yiyou Sun
zhiqiang xu
Yao Li
Haifeng Chen
Wei Cheng
Dongkuan Xu
DeLMO
129
2
0
07 Jun 2024
CHIQ: Contextual History Enhancement for Improving Query Rewriting in Conversational Search
Fengran Mo
Abbas Ghaddar
Kelong Mao
Mehdi Rezagholizadeh
Boxing Chen
Qun Liu
Jian-Yun Nie
109
16
0
07 Jun 2024
Quantifying Geospatial in the Common Crawl Corpus
Ilya Ilyankou
Meihui Wang
Stefano Cavazzi
James Haworth
88
1
0
07 Jun 2024
Through the Thicket: A Study of Number-Oriented LLMs derived from Random Forest Models
M. Romaszewski
Przemysław Sekuła
P. Głomb
M. Cholewa
Katarzyna Kołodziej
68
0
0
07 Jun 2024
Annotating FrameNet via Structure-Conditioned Language Generation
Xinyue Cui
Swabha Swayamdipta
68
2
0
07 Jun 2024
BERTs are Generative In-Context Learners
David Samuel
85
8
0
07 Jun 2024
MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models
Sanjoy Chowdhury
Sayan Nag
K. J. Joseph
Balaji Vasan Srinivasan
Dinesh Manocha
DiffM
89
8
0
07 Jun 2024
MATTER: Memory-Augmented Transformer Using Heterogeneous Knowledge Sources
Dongkyu Lee
Chandana Satya Prakash
Jack G. M. FitzGerald
Jens Lehmann
RALM
83
2
0
07 Jun 2024
DiNeR: a Large Realistic Dataset for Evaluating Compositional Generalization
ChenGang Hu
Xiao Liu
Yansong Feng
CoGe
86
1
0
07 Jun 2024
Large Language Model-guided Document Selection
Xiang Kong
Tom Gunter
Ruoming Pang
72
4
0
07 Jun 2024
Key-Element-Informed sLLM Tuning for Document Summarization
Sangwon Ryu
Heejin Do
Yunsu Kim
G. G. Lee
Jungseul Ok
103
6
0
07 Jun 2024
SC2: Towards Enhancing Content Preservation and Style Consistency in Long Text Style Transfer
Jie Zhao
Ziyu Guan
Cai Xu
Wei Zhao
Yue Jiang
76
2
0
07 Jun 2024
Creating an AI Observer: Generative Semantic Workspaces
Pavan Holur
Shreyas Rajesh
David Chong
V. Roychowdhury
45
0
0
07 Jun 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert Nowak
265
2
0
07 Jun 2024
DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs
Lingchen Meng
Jianwei Yang
Rui Tian
Xiyang Dai
Zuxuan Wu
Jianfeng Gao
Yu-Gang Jiang
VLM
95
9
0
06 Jun 2024
Causal Estimation of Memorisation Profiles
Pietro Lesci
Clara Meister
Thomas Hofmann
Andreas Vlachos
Tiago Pimentel
97
8
0
06 Jun 2024
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
Lin Chen
Xilin Wei
Jinsong Li
Xiaoyi Dong
Pan Zhang
...
Li Yuan
Yu Qiao
Dahua Lin
Feng Zhao
Jiaqi Wang
149
183
0
06 Jun 2024
Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation
Adam Fisch
Joshua Maynez
R. A. Hofer
Bhuwan Dhingra
Amir Globerson
William W. Cohen
86
11
0
06 Jun 2024
Benchmark Data Contamination of Large Language Models: A Survey
Cheng Xu
Shuhao Guan
Derek Greene
Mohand-Tahar Kechadi
ELM
ALM
104
56
0
06 Jun 2024
FairytaleQA Translated: Enabling Educational Question and Answer Generation in Less-Resourced Languages
Bernardo Leite
T. Osório
Henrique Lopes Cardoso
AI4Ed
81
3
0
06 Jun 2024
Are We Done with MMLU?
Aryo Pradipta Gema
Joshua Ong Jun Leang
Giwon Hong
Alessio Devoto
Alberto Carlo Maria Mancino
...
R. McHardy
Joshua Harris
Jean Kaddour
Emile van Krieken
Pasquale Minervini
ELM
148
44
0
06 Jun 2024
Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation
Can Yaras
Peng Wang
Laura Balzano
Qing Qu
AI4CE
75
15
0
06 Jun 2024
Zero-Painter: Training-Free Layout Control for Text-to-Image Synthesis
Marianna Ohanyan
Hayk Manukyan
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
DiffM
103
2
0
06 Jun 2024
On The Persona-based Summarization of Domain-Specific Documents
Ankan Mullick
Sombit Bose
Rounak Saha
Ayan Kumar Bhowmick
Pawan Goyal
Niloy Ganguly
Prasenjit Dey
Ravi Kokku
61
3
0
06 Jun 2024
Exploring the Zero-Shot Capabilities of Vision-Language Models for Improving Gaze Following
Anshul Gupta
Pierre Vuillecard
Arya Farkhondeh
J. Odobez
VLM
121
3
0
06 Jun 2024
PALM: A Efficient Performance Simulator for Tiled Accelerators with Large-scale Model Training
Jiahao Fang
Huizheng Wang
Qize Yang
Dehao Kong
Xu Dai
Jinyi Deng
Yang Hu
Shouyi Yin
63
1
0
06 Jun 2024
Proactive Detection of Physical Inter-rule Vulnerabilities in IoT Services Using a Deep Learning Approach
Bing Huang
Chen Chen
K. Lam
Fuqun Huang
AAML
46
1
0
06 Jun 2024
Empirical Guidelines for Deploying LLMs onto Resource-constrained Edge Devices
Ruiyang Qin
Dancheng Liu
Zheyu Yan
Zhaoxuan Tan
Zixuan Pan
Zhenge Jia
Meng Jiang
Ahmed Abbasi
Jinjun Xiong
Yiyu Shi
103
15
0
06 Jun 2024
XL-HeadTags: Leveraging Multimodal Retrieval Augmentation for the Multilingual Generation of News Headlines and Tags
Faisal Tareque Shohan
Mir Tafseer Nayeem
Samsul Islam
Abu Ubaida Akash
Shafiq Joty
76
4
0
06 Jun 2024
Exploring the Latest LLMs for Leaderboard Extraction
Salomon Kabongo
Jennifer D'Souza
Sören Auer
67
2
0
06 Jun 2024
A Survey on Medical Large Language Models: Technology, Application, Trustworthiness, and Future Directions
Lei Liu
Xiaoyan Yang
Junchi Lei
Xiaoyang Liu
Yue Shen
...
Peng Wei
Jinjie Gu
Zhixuan Chu
Zhan Qin
Kui Ren
LM&MA
AILaw
105
19
0
06 Jun 2024
Improving Audio Codec-based Zero-Shot Text-to-Speech Synthesis with Multi-Modal Context and Large Language Model
Jinlong Xue
Yayue Deng
Yicheng Han
Yingming Gao
Ya Li
100
4
0
06 Jun 2024
Synthesizing Conversations from Unlabeled Documents using Automatic Response Segmentation
Fanyou Wu
Weijie Xu
Chandan K. Reddy
Srinivasan H. Sengamedu
66
0
0
06 Jun 2024
Previous
1
2
3
...
54
55
56
...
198
199
200
Next