ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.08440
  4. Cited By
A Framework for the Robust Evaluation of Sound Event Detection

A Framework for the Robust Evaluation of Sound Event Detection

18 October 2019
Cagdas Bilen
Giacomo Ferroni
Francesco Tuveri
Juan Azcarreta
Sacha Krstulović
ArXivPDFHTML

Papers citing "A Framework for the Robust Evaluation of Sound Event Detection"

50 / 72 papers shown
Title
TACOS: Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining
TACOS: Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining
Paul Primus
Florian Schmid
Gerhard Widmer
CLIP
AI4TS
VLM
33
0
0
12 May 2025
Temporal Attention Pooling for Frequency Dynamic Convolution in Sound Event Detection
Temporal Attention Pooling for Frequency Dynamic Convolution in Sound Event Detection
Hyeonuk Nam
Yong-Hwa Park
33
0
0
17 Apr 2025
Formula-Supervised Sound Event Detection: Pre-Training Without Real Data
Formula-Supervised Sound Event Detection: Pre-Training Without Real Data
Yuto Shibata
Keitaro Tanaka
Yoshiaki Bando
Keisuke Imoto
Hirokatsu Kataoka
Yoshimitsu Aoki
31
0
0
06 Apr 2025
JiTTER: Jigsaw Temporal Transformer for Event Reconstruction for Self-Supervised Sound Event Detection
JiTTER: Jigsaw Temporal Transformer for Event Reconstruction for Self-Supervised Sound Event Detection
Hyeonuk Nam
Yong-Hwa Park
42
1
0
28 Feb 2025
Leveraging LLM and Text-Queried Separation for Noise-Robust Sound Event Detection
Leveraging LLM and Text-Queried Separation for Noise-Robust Sound Event Detection
Han Yin
Yang Xiao
Jisheng Bai
Rohan Kumar Das
31
0
0
02 Nov 2024
LEAD Dataset: How Can Labels for Sound Event Detection Vary Depending on
  Annotators?
LEAD Dataset: How Can Labels for Sound Event Detection Vary Depending on Annotators?
Naoki Koga
Yoshiaki Bando
Keisuke Imoto
21
0
0
13 Oct 2024
Prototype based Masked Audio Model for Self-Supervised Learning of Sound
  Event Detection
Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event Detection
Pengfei Cai
Yan Song
Nan Jiang
Qing Gu
Ian Mcloughlin
32
2
0
26 Sep 2024
Effective Pre-Training of Audio Transformers for Sound Event Detection
Effective Pre-Training of Audio Transformers for Sound Event Detection
Florian Schmid
T. Morocutti
Francesco Foscarin
Jan Schluter
Paul Primus
Gerhard Widmer
ViT
28
2
0
14 Sep 2024
Energy Consumption Trends in Sound Event Detection Systems
Energy Consumption Trends in Sound Event Detection Systems
Constance Douwes
Romain Serizel
35
1
0
13 Sep 2024
Unified Audio Event Detection
Unified Audio Event Detection
Yidi Jiang
Ruijie Tao
Wen Huang
Qian Chen
Wen Wang
43
0
0
13 Sep 2024
MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for
  Heterogeneous Sound Event Detection
MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection
Zehao Wang
Haobo Yue
Zhicheng Zhang
Da Mu
Jin Tang
Jianqin Yin
35
0
0
10 Sep 2024
ICSD: An Open-source Dataset for Infant Cry and Snoring Detection
ICSD: An Open-source Dataset for Infant Cry and Snoring Detection
Qingyu Liu
Longfei Song
Dongxing Xu
Yanhua Long
42
0
0
20 Aug 2024
MAT-SED: A Masked Audio Transformer with Masked-Reconstruction Based
  Pre-training for Sound Event Detection
MAT-SED: A Masked Audio Transformer with Masked-Reconstruction Based Pre-training for Sound Event Detection
Pengfei Cai
Yan Song
Kang Li
Haoyu Song
Ian Mcloughlin
31
5
0
16 Aug 2024
Improving Audio Spectrogram Transformers for Sound Event Detection
  Through Multi-Stage Training
Improving Audio Spectrogram Transformers for Sound Event Detection Through Multi-Stage Training
Florian Schmid
Paul Primus
T. Morocutti
Jonathan Greif
Gerhard Widmer
32
5
0
17 Jul 2024
UCIL: An Unsupervised Class Incremental Learning Approach for Sound Event Detection
UCIL: An Unsupervised Class Incremental Learning Approach for Sound Event Detection
Yang Xiao
Rohan Kumar Das
44
3
0
04 Jul 2024
FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with
  Heterogeneous Training Dataset and Potentially Missing Labels
FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with Heterogeneous Training Dataset and Potentially Missing Labels
Yang Xiao
Han Yin
Jisheng Bai
Rohan Kumar Das
27
3
0
29 Jun 2024
Self Training and Ensembling Frequency Dependent Networks with Coarse
  Prediction Pooling and Sound Event Bounding Boxes
Self Training and Ensembling Frequency Dependent Networks with Coarse Prediction Pooling and Sound Event Bounding Boxes
Hyeonuk Nam
D. Min
Seungdeok Choi
Inhan Choi
Yong-Hwa Park
37
3
0
22 Jun 2024
Pushing the Limit of Sound Event Detection with Multi-Dilated Frequency
  Dynamic Convolution
Pushing the Limit of Sound Event Detection with Multi-Dilated Frequency Dynamic Convolution
Hyeonuk Nam
Yong-Hwa Park
26
5
0
19 Jun 2024
DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and
  Missing Labels
DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels
Samuele Cornell
Janek Ebbers
Constance Douwes
Irene Martín-Morató
Manu Harju
A. Mesaros
Romain Serizel
37
13
0
12 Jun 2024
Diversifying and Expanding Frequency-Adaptive Convolution Kernels for
  Sound Event Detection
Diversifying and Expanding Frequency-Adaptive Convolution Kernels for Sound Event Detection
Hyeonuk Nam
Seong-Hu Kim
D. Min
Junhyeok Lee
Yong-Hwa Park
26
3
0
08 Jun 2024
Sound Event Bounding Boxes
Sound Event Bounding Boxes
Janek Ebbers
François Germain
G. Wichern
Jonathan Le Roux
37
12
0
06 Jun 2024
Meta-Decomposition: Dynamic Segmentation Approach Selection in IoT-based
  Activity Recognition
Meta-Decomposition: Dynamic Segmentation Approach Selection in IoT-based Activity Recognition
Seyed Mohammad Reza Modaresi
A. Osmani
Mohammadreza Razzazi
A. Chibani
33
0
0
17 Apr 2024
Onset and offset weighted loss function for sound event detection
Onset and offset weighted loss function for sound event detection
Tao Song
25
0
0
20 Mar 2024
Frequency-aware convolution for sound event detection
Frequency-aware convolution for sound event detection
Tao Song
CVBM
47
0
0
20 Mar 2024
Fine-Grained Engine Fault Sound Event Detection Using Multimodal Signals
Fine-Grained Engine Fault Sound Event Detection Using Multimodal Signals
Dennis Fedorishin
Livio Forte
Philip Schneider
S. Setlur
Venugopal Govindaraju
25
0
0
16 Mar 2024
Dual Knowledge Distillation for Efficient Sound Event Detection
Dual Knowledge Distillation for Efficient Sound Event Detection
Yang Xiao
Rohan Kumar Das
34
12
0
05 Feb 2024
Contrastive Loss Based Frame-wise Feature disentanglement for Polyphonic
  Sound Event Detection
Contrastive Loss Based Frame-wise Feature disentanglement for Polyphonic Sound Event Detection
Yadong Guan
Jiqing Han
Hongwei Song
Wenjie Song
Guibin Zheng
Tieran Zheng
Yongjun He
25
0
0
11 Jan 2024
Full-frequency dynamic convolution: a physical frequency-dependent
  convolution for sound event detection
Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection
Haobo Yue
Zhicheng Zhang
Da Mu
Yonghao Dang
Jianqin Yin
Jin Tang
27
0
0
10 Jan 2024
Towards Weakly Supervised Text-to-Audio Grounding
Towards Weakly Supervised Text-to-Audio Grounding
Xuenan Xu
Ziyang Ma
Mengyue Wu
Kai Yu
AI4TS
30
9
0
05 Jan 2024
SECap: Speech Emotion Captioning with Large Language Model
SECap: Speech Emotion Captioning with Large Language Model
Yaoxun Xu
Hangting Chen
Jianwei Yu
Qiaochu Huang
Zhiyong Wu
Shixiong Zhang
Guangzhi Li
Yi Luo
Rongzhi Gu
25
22
0
16 Dec 2023
Performance and energy balance: a comprehensive study of
  state-of-the-art sound event detection systems
Performance and energy balance: a comprehensive study of state-of-the-art sound event detection systems
Francesca Ronchini
Romain Serizel
22
10
0
05 Oct 2023
Semi-supervised Sound Event Detection with Local and Global Consistency
  Regularization
Semi-supervised Sound Event Detection with Local and Global Consistency Regularization
Yiming Li
Xiangdong Wang
Hong Liu
Rui Tao
Long Yan
Kazushige Ouchi
24
3
0
15 Sep 2023
AGS: An Dataset and Taxonomy for Domestic Scene Sound Event Recognition
AGS: An Dataset and Taxonomy for Domestic Scene Sound Event Recognition
Nan Che
Chenrui Liu
Fei Yu
33
0
0
30 Aug 2023
Post-Processing Independent Evaluation of Sound Event Detection Systems
Post-Processing Independent Evaluation of Sound Event Detection Systems
Janek Ebbers
Reinhold Haeb-Umbach
Romain Serizel
21
7
0
27 Jun 2023
Frequency & Channel Attention for Computationally Efficient Sound Event
  Detection
Frequency & Channel Attention for Computationally Efficient Sound Event Detection
Hyeonuk Nam
Seong-Hu Kim
D. Min
Yong-Hwa Park
19
9
0
20 Jun 2023
Channel-Spatial-Based Few-Shot Bird Sound Event Detection
Channel-Spatial-Based Few-Shot Bird Sound Event Detection
Lingwen Liu
Yuxuan Feng
Haitao Fu
Yajie Yang
Xin Pan
Chenlei Jin
23
0
0
18 Jun 2023
Self-supervised Audio Teacher-Student Transformer for Both Clip-level
  and Frame-level Tasks
Self-supervised Audio Teacher-Student Transformer for Both Clip-level and Frame-level Tasks
Xian Li
Nian Shao
Xiaofei Li
ViT
CLIP
21
26
0
07 Jun 2023
AST-SED: An Effective Sound Event Detection Method Based on Audio
  Spectrogram Transformer
AST-SED: An Effective Sound Event Detection Method Based on Audio Spectrogram Transformer
Kang Li
Yan Song
Lirong Dai
Ian Mcloughlin
Xin Fang
Lin Liu
24
22
0
07 Mar 2023
Multi-dimensional frequency dynamic convolution with confident mean
  teacher for sound event detection
Multi-dimensional frequency dynamic convolution with confident mean teacher for sound event detection
Shengchang Xiao
Xueshuai Zhang
Pengyuan Zhang
13
10
0
18 Feb 2023
Tackling the Cocktail Fork Problem for Separation and Transcription of
  Real-World Soundtracks
Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks
Darius Petermann
G. Wichern
Aswin Shanmugam Subramanian
Zhong-Qiu Wang
Jonathan Le Roux
27
10
0
14 Dec 2022
Optimizing Temporal Resolution Of Convolutional Recurrent Neural
  Networks For Sound Event Detection
Optimizing Temporal Resolution Of Convolutional Recurrent Neural Networks For Sound Event Detection
Wim Boes
Hugo Van hamme
6
1
0
18 Oct 2022
A Hybrid System of Sound Event Detection Transformer and Frame-wise
  Model for DCASE 2022 Task 4
A Hybrid System of Sound Event Detection Transformer and Frame-wise Model for DCASE 2022 Task 4
Yiming Li
Zhifang Guo
Zhi-qin Ye
Xiangdong Wang
Hong Liu
Yueliang Qian
Ruijie Tao
Long Yan
Kazushige Ouchi
22
0
0
18 Oct 2022
Description and analysis of novelties introduced in DCASE Task 4 2022 on
  the baseline system
Description and analysis of novelties introduced in DCASE Task 4 2022 on the baseline system
Francesca Ronchini
Samuele Cornell
Romain Serizel
Nicolas Turpault
Eduardo Fonseca
D. Ellis
22
14
0
14 Oct 2022
Impact of temporal resolution on convolutional recurrent networks for
  audio tagging and sound event detection
Impact of temporal resolution on convolutional recurrent networks for audio tagging and sound event detection
Wim Boes
Hugo Van hamme
12
0
0
26 Sep 2022
Multi-encoder attention-based architectures for sound recognition with
  partial visual assistance
Multi-encoder attention-based architectures for sound recognition with partial visual assistance
Wim Boes
Hugo Van hamme
14
1
0
26 Sep 2022
Deep Learning-Based Acoustic Mosquito Detection in Noisy Conditions
  Using Trainable Kernels and Augmentations
Deep Learning-Based Acoustic Mosquito Detection in Noisy Conditions Using Trainable Kernels and Augmentations
Devesh Khandelwal
Sean Campos
Shwetha Nagaraj
F. Nugen
Alberto Todeschini
4
0
0
28 Jul 2022
Segment-level Metric Learning for Few-shot Bioacoustic Event Detection
Segment-level Metric Learning for Few-shot Bioacoustic Event Detection
Haohe Liu
Xubo Liu
Xinhao Mei
Qiuqiang Kong
Wenwu Wang
Mark D. Plumbley
28
8
0
15 Jul 2022
A Multi-grained based Attention Network for Semi-supervised Sound Event
  Detection
A Multi-grained based Attention Network for Semi-supervised Sound Event Detection
Ying Hu
Xiujuan Zhu
Yun Li
Hao-Ming Huang
Liang He
21
9
0
21 Jun 2022
The ACM Multimedia 2022 Computational Paralinguistics Challenge:
  Vocalisations, Stuttering, Activity, & Mosquitoes
The ACM Multimedia 2022 Computational Paralinguistics Challenge: Vocalisations, Stuttering, Activity, & Mosquitoes
Björn W. Schuller
A. Batliner
Shahin Amiriparian
Christian Bergler
Maurice Gerczuk
...
M. Pateraki
H. Coppock
Ivan Kiskin
Marianne E. Sinka
Stephen J. Roberts
40
26
0
13 May 2022
Sound Event Triage: Detecting Sound Events Considering Priority of
  Classes
Sound Event Triage: Detecting Sound Events Considering Priority of Classes
Noriyuki Tonami
Keisuke Imoto
24
1
0
13 Apr 2022
12
Next