Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2309.16265
Cited By
Semantic Proximity Alignment: Towards Human Perception-consistent Audio Tagging by Aligning with Label Text Description
28 September 2023
Youbin Jeon
Yanzhen Ren
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Semantic Proximity Alignment: Towards Human Perception-consistent Audio Tagging by Aligning with Label Text Description"
2 / 2 papers shown
Title
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection
Ke Chen
Xingjian Du
Bilei Zhu
Zejun Ma
Taylor Berg-Kirkpatrick
Shlomo Dubnov
ViT
118
264
0
02 Feb 2022
PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and Aggregation
Yuan Gong
Yu-An Chung
James R. Glass
VLM
104
144
0
02 Feb 2021
1