Adapting a ConvNeXt model to audio classification on AudioSet

Adapting a ConvNeXt model to audio classification on AudioSet

1 June 2023

Thomas Pellegrini

Ismail Khalfaoui-Hassani

Papers citing "Adapting a ConvNeXt model to audio classification on AudioSet"

17 / 17 papers shown

Title
Discrete Audio Representations for Automated Audio Captioning Jingguang Tian Haoqin Sun Xinhui Hu Xinkang Xu 11 0 0 21 May 2025
Hierarchical Label Propagation: A Model-Size-Dependent Performance Booster for AudioSet Tagging Ludovic Tuncay Etienne Labbé Thomas Pellegrini VLM 40 0 0 26 Mar 2025
Comparative Study of Spike Encoding Methods for Environmental Sound Classification Andres Larroza Javier Naranjo-Alcazar Vicent Ortiz Castelló P. Zuccarello 49 0 0 14 Mar 2025
DRCap: Decoding CLAP Latents with Retrieval-Augmented Generation for Zero-shot Audio Captioning Xiquan Li Wenxi Chen Ziyang Ma Xuenan Xu Yuzhe Liang Zhisheng Zheng Qiuqiang Kong Xie Chen VLM 36 2 0 12 Oct 2024
Machine listening in a neonatal intensive care unit Modan Tailleur Vincent Lostanlen Jean-Philippe Riviere Pierre Aumond 26 0 0 16 Sep 2024
A Survey of Foundation Models for Music Understanding Wenjun Li Ying Cai Ziyang Wu Wenyi Zhang Yifan Chen ... Junwei Han Bao Ge Tianming Liu Lin Gan Tuo Zhang 63 2 0 15 Sep 2024
EnCLAP++: Analyzing the EnCLAP Framework for Optimizing Automated Audio Captioning Performance Jaeyeon Kim Minjeon Jeon Jaeyoon Jung Sang Hoon Woo Jinjoo Lee 34 2 0 02 Sep 2024
Expanding on EnCLAP with Auxiliary Retrieval Model for Automated Audio Captioning Jaeyeon Kim Jaeyoon Jung Minjeong Jeon Sang Hoon Woo Jinjoo Lee 24 1 0 02 Sep 2024
Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey Hamza Kheddar Mustapha Hemis Yassine Himeur OffRL 46 59 0 02 Mar 2024
EnCLAP: Combining Neural Audio Codec and Audio-Text Joint Embedding for Automated Audio Captioning Jaeyeon Kim Jaeyoon Jung Jinjoo Lee Sang Hoon Woo CLIP VLM 25 22 0 31 Jan 2024
Dynamic Convolutional Neural Networks as Efficient Pre-trained Audio Models Florian Schmid Khaled Koutini Gerhard Widmer 18 11 0 24 Oct 2023
Audio classification with Dilated Convolution with Learnable Spacings Ismail Khalfaoui-Hassani T. Masquelier Thomas Pellegrini 25 1 0 25 Sep 2023
Multilingual Audio Captioning using machine translated data Matéo Cousin Etienne Labbé Thomas Pellegrini 30 4 0 14 Sep 2023
CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding Etienne Labbé Thomas Pellegrini J. Pinquier 30 12 0 01 Sep 2023
Killing two birds with one stone: Can an audio captioning system also be used for audio-text retrieval? Etienne Labbé Thomas Pellegrini J. Pinquier 23 5 0 29 Aug 2023
CED: Consistent ensemble distillation for audio tagging Heinrich Dinkel Yongqing Wang Zhiyong Yan Junbo Zhang Yujun Wang 26 19 0 23 Aug 2023
Xception: Deep Learning with Depthwise Separable Convolutions François Chollet MDE BDL PINN 248 14,387 0 07 Oct 2016