Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2306.00830
Cited By
Adapting a ConvNeXt model to audio classification on AudioSet
1 June 2023
Thomas Pellegrini
Ismail Khalfaoui-Hassani
Etienne Labbé
T. Masquelier
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Adapting a ConvNeXt model to audio classification on AudioSet"
17 / 17 papers shown
Title
Discrete Audio Representations for Automated Audio Captioning
Jingguang Tian
Haoqin Sun
Xinhui Hu
Xinkang Xu
11
0
0
21 May 2025
Hierarchical Label Propagation: A Model-Size-Dependent Performance Booster for AudioSet Tagging
Ludovic Tuncay
Etienne Labbé
Thomas Pellegrini
VLM
40
0
0
26 Mar 2025
Comparative Study of Spike Encoding Methods for Environmental Sound Classification
Andres Larroza
Javier Naranjo-Alcazar
Vicent Ortiz Castelló
P. Zuccarello
49
0
0
14 Mar 2025
DRCap: Decoding CLAP Latents with Retrieval-Augmented Generation for Zero-shot Audio Captioning
Xiquan Li
Wenxi Chen
Ziyang Ma
Xuenan Xu
Yuzhe Liang
Zhisheng Zheng
Qiuqiang Kong
Xie Chen
VLM
36
2
0
12 Oct 2024
Machine listening in a neonatal intensive care unit
Modan Tailleur
Vincent Lostanlen
Jean-Philippe Riviere
Pierre Aumond
26
0
0
16 Sep 2024
A Survey of Foundation Models for Music Understanding
Wenjun Li
Ying Cai
Ziyang Wu
Wenyi Zhang
Yifan Chen
...
Junwei Han
Bao Ge
Tianming Liu
Lin Gan
Tuo Zhang
63
2
0
15 Sep 2024
EnCLAP++: Analyzing the EnCLAP Framework for Optimizing Automated Audio Captioning Performance
Jaeyeon Kim
Minjeon Jeon
Jaeyoon Jung
Sang Hoon Woo
Jinjoo Lee
34
2
0
02 Sep 2024
Expanding on EnCLAP with Auxiliary Retrieval Model for Automated Audio Captioning
Jaeyeon Kim
Jaeyoon Jung
Minjeong Jeon
Sang Hoon Woo
Jinjoo Lee
24
1
0
02 Sep 2024
Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey
Hamza Kheddar
Mustapha Hemis
Yassine Himeur
OffRL
46
59
0
02 Mar 2024
EnCLAP: Combining Neural Audio Codec and Audio-Text Joint Embedding for Automated Audio Captioning
Jaeyeon Kim
Jaeyoon Jung
Jinjoo Lee
Sang Hoon Woo
CLIP
VLM
25
22
0
31 Jan 2024
Dynamic Convolutional Neural Networks as Efficient Pre-trained Audio Models
Florian Schmid
Khaled Koutini
Gerhard Widmer
18
11
0
24 Oct 2023
Audio classification with Dilated Convolution with Learnable Spacings
Ismail Khalfaoui-Hassani
T. Masquelier
Thomas Pellegrini
25
1
0
25 Sep 2023
Multilingual Audio Captioning using machine translated data
Matéo Cousin
Etienne Labbé
Thomas Pellegrini
30
4
0
14 Sep 2023
CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding
Etienne Labbé
Thomas Pellegrini
J. Pinquier
30
12
0
01 Sep 2023
Killing two birds with one stone: Can an audio captioning system also be used for audio-text retrieval?
Etienne Labbé
Thomas Pellegrini
J. Pinquier
23
5
0
29 Aug 2023
CED: Consistent ensemble distillation for audio tagging
Heinrich Dinkel
Yongqing Wang
Zhiyong Yan
Junbo Zhang
Yujun Wang
26
19
0
23 Aug 2023
Xception: Deep Learning with Depthwise Separable Convolutions
François Chollet
MDE
BDL
PINN
248
14,387
0
07 Oct 2016
1