ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1807.00129
  4. Cited By
Sound Event Localization and Detection of Overlapping Sources Using
  Convolutional Recurrent Neural Networks

Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks

30 June 2018
Sharath Adavanne
Archontis Politis
Joonas Nikunen
Tuomas Virtanen
ArXivPDFHTML

Papers citing "Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks"

50 / 64 papers shown
Title
Spatial Audio Processing with Large Language Model on Wearable Devices
Spatial Audio Processing with Large Language Model on Wearable Devices
Ayushi Mishra
Yang Bai
Priyadarshan Narayanasamy
Nakul Garg
Nirupam Roy
30
0
0
11 Apr 2025
Location-Oriented Sound Event Localization and Detection with Spatial Mapping and Regression Localization
Location-Oriented Sound Event Localization and Detection with Spatial Mapping and Regression Localization
Xueping Zhang
Yaxiong Chen
Ruilin Yao
Yunfei Zi
Shengwu Xiong
38
0
0
11 Apr 2025
CNN-based Robust Sound Source Localization with SRP-PHAT for the Extreme Edge
Jun Yin
Marian Verhelst
65
1
0
03 Mar 2025
Exploring Text-Queried Sound Event Detection with Audio Source Separation
Exploring Text-Queried Sound Event Detection with Audio Source Separation
Han Yin
Jisheng Bai
Yang Xiao
Hui Wang
Siqi Zheng
Yafeng Chen
Rohan Kumar Das
Chong Deng
Jianfeng Chen
40
3
0
20 Sep 2024
SELD-Mamba: Selective State-Space Model for Sound Event Localization and
  Detection with Source Distance Estimation
SELD-Mamba: Selective State-Space Model for Sound Event Localization and Detection with Source Distance Estimation
Da Mu
Zhicheng Zhang
Haobo Yue
Zehao Wang
Jin Tang
Jianqin Yin
Mamba
51
2
0
09 Aug 2024
Can Large Language Models Understand Spatial Audio?
Can Large Language Models Understand Spatial Audio?
Changli Tang
Wenyi Yu
Guangzhi Sun
Xianzhao Chen
Tian Tan
...
Jun Zhang
Lu Lu
Zejun Ma
Yuxuan Wang
Chao Zhang
54
4
0
12 Jun 2024
Guided Masked Self-Distillation Modeling for Distributed Multimedia
  Sensor Event Analysis
Guided Masked Self-Distillation Modeling for Distributed Multimedia Sensor Event Analysis
Masahiro Yasuda
Noboru Harada
Yasunori Ohishi
Shoichiro Saito
Akira Nakayama
Nobutaka Ono
41
3
0
12 Apr 2024
BAT: Learning to Reason about Spatial Sounds with Large Language Models
BAT: Learning to Reason about Spatial Sounds with Large Language Models
Zhisheng Zheng
Puyuan Peng
Ziyang Ma
Xie Chen
Eunsol Choi
David Harwath
LRM
37
14
0
02 Feb 2024
Feature Aggregation in Joint Sound Classification and Localization
  Neural Networks
Feature Aggregation in Joint Sound Classification and Localization Neural Networks
Brendan Healy
Patrick McNamee
Z. Nili Ahmadabadi
41
0
0
29 Oct 2023
Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Jinzheng Zhao
Yong-mei Xu
Xinyuan Qian
Davide Berghi
Peipei Wu
Meng Cui
Jianyuan Sun
Philip J. B. Jackson
Wenwu Wang
BDL
50
7
0
23 Oct 2023
Exploring Self-Supervised Contrastive Learning of Spatial Sound Event
  Representation
Exploring Self-Supervised Contrastive Learning of Spatial Sound Event Representation
Xilin Jiang
Cong Han
Yinghao Aaron Li
N. Mesgarani
SSL
37
1
0
27 Sep 2023
Permutation Invariant Recurrent Neural Networks for Sound Source
  Tracking Applications
Permutation Invariant Recurrent Neural Networks for Sound Source Tracking Applications
David Diaz-Guerra
Archontis Politis
A. Miguel
J. R. Beltrán
Tuomas Virtanen
33
0
0
14 Jun 2023
A Hybrid Neural Coding Approach for Pattern Recognition with Spiking
  Neural Networks
A Hybrid Neural Coding Approach for Pattern Recognition with Spiking Neural Networks
Xinyi Chen
Qu Yang
Jibin Wu
Haizhou Li
Kay Chen Tan
30
16
0
26 May 2023
Contrastive Representation Learning for Acoustic Parameter Estimation
Contrastive Representation Learning for Acoustic Parameter Estimation
Philipp Götz
Cagdas Tuna
Andreas Walther
Emanuel Habets
SSL
30
7
0
22 Feb 2023
Improving trajectory localization accuracy via direction-of-arrival
  derivative estimation
Improving trajectory localization accuracy via direction-of-arrival derivative estimation
Ruchi Pandey
Shreya Jaiswal
Huy P Phan
S. Nannuru
30
0
0
07 Dec 2022
Panoramic Video Salient Object Detection with Ambisonic Audio Guidance
Panoramic Video Salient Object Detection with Ambisonic Audio Guidance
Xiang Li
H. Cao
Shijie Zhao
Junlin Li
Li Zhang
Bhiksha Raj
42
14
0
26 Nov 2022
Position tracking of a varying number of sound sources with sliding
  permutation invariant training
Position tracking of a varying number of sound sources with sliding permutation invariant training
David Diaz-Guerra
Archontis Politis
Tuomas Virtanen
30
5
0
26 Oct 2022
CoLoC: Conditioned Localizer and Classifier for Sound Event Localization
  and Detection
CoLoC: Conditioned Localizer and Classifier for Sound Event Localization and Detection
Slawomir Kapka
J. Tkaczuk
23
0
0
25 Oct 2022
Deep Learning Based Stage-wise Two-dimensional Speaker Localization with
  Large Ad-hoc Microphone Arrays
Deep Learning Based Stage-wise Two-dimensional Speaker Localization with Large Ad-hoc Microphone Arrays
Shupei Liu
Linfeng Feng
Yijun Gong
Chengdong Liang
Chen Zhang
Xiao-Lei Zhang
Xuelong Li
20
3
0
19 Oct 2022
Binaural Signal Representations for Joint Sound Event Detection and
  Acoustic Scene Classification
Binaural Signal Representations for Joint Sound Event Detection and Acoustic Scene Classification
D. Krause
A. Mesaros
29
3
0
13 Sep 2022
A Track-Wise Ensemble Event Independent Network for Polyphonic Sound
  Event Localization and Detection
A Track-Wise Ensemble Event Independent Network for Polyphonic Sound Event Localization and Detection
Jinbo Hu
Yin Cao
Ming Wu
Qiuqiang Kong
Feiran Yang
Mark D. Plumbley
J. Yang
24
23
0
19 Mar 2022
SmartBelt: A Wearable Microphone Array for Sound Source Localization
  with Haptic Feedback
SmartBelt: A Wearable Microphone Array for Sound Source Localization with Haptic Feedback
Simon Michaud
Benjamin Moffett
Ana Tapia Rousiouk
Victoria Duda
François Grondin
HAI
15
2
0
28 Feb 2022
L-SpEx: Localized Target Speaker Extraction
L-SpEx: Localized Target Speaker Extraction
Meng Ge
Chenglin Xu
Longbiao Wang
Eng Siong Chng
J. Dang
Haizhou Li
32
21
0
21 Feb 2022
Multi-view and Multi-modal Event Detection Utilizing Transformer-based
  Multi-sensor fusion
Multi-view and Multi-modal Event Detection Utilizing Transformer-based Multi-sensor fusion
Masahiro Yasuda
Yasunori Ohishi
Shoichiro Saito
Noboru Harada
43
13
0
18 Feb 2022
Visual Sound Localization in the Wild by Cross-Modal Interference
  Erasing
Visual Sound Localization in the Wild by Cross-Modal Interference Erasing
Xian Liu
Rui Qian
Hang Zhou
Di Hu
Weiyao Lin
Ziwei Liu
Bolei Zhou
Xiaowei Zhou
18
25
0
13 Feb 2022
Self-Supervised Moving Vehicle Detection from Audio-Visual Cues
Self-Supervised Moving Vehicle Detection from Audio-Visual Cues
Jannik Zürn
Wolfram Burgard
SSL
39
8
0
30 Jan 2022
End-to-end Alexa Device Arbitration
End-to-end Alexa Device Arbitration
Jarred Barber
Yifeng Fan
Tao Zhang
27
3
0
08 Dec 2021
SALSA-Lite: A Fast and Effective Feature for Polyphonic Sound Event
  Localization and Detection with Microphone Arrays
SALSA-Lite: A Fast and Effective Feature for Polyphonic Sound Event Localization and Detection with Microphone Arrays
Thi Ngoc Tho Nguyen
Douglas L. Jones
Karn N. Watcharasupat
Huy P Phan
W. Gan
33
36
0
16 Nov 2021
Differentiable Tracking-Based Training of Deep Learning Sound Source
  Localizers
Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers
Sharath Adavanne
Archontis Politis
Tuomas Virtanen
31
16
0
29 Oct 2021
Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same
  Class with Auxiliary Duplicating Permutation Invariant Training
Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training
Kazuki Shimada
Yuichiro Koyama
Shusuke Takahashi
Naoya Takahashi
E. Tsunoo
Yuki Mitsufuji
13
63
0
14 Oct 2021
Visually Exploring Multi-Purpose Audio Data
Visually Exploring Multi-Purpose Audio Data
David Heise
Helen L. Bear
27
3
0
09 Oct 2021
PHNNs: Lightweight Neural Networks via Parameterized Hypercomplex
  Convolutions
PHNNs: Lightweight Neural Networks via Parameterized Hypercomplex Convolutions
Eleonora Grassucci
Aston Zhang
Danilo Comminiello
28
38
0
08 Oct 2021
A Few-Shot Learning Approach for Sound Source Distance Estimation Using
  Relation Networks
A Few-Shot Learning Approach for Sound Source Distance Estimation Using Relation Networks
Amirreza Sobhdel
R. Razavi-Far
47
4
0
22 Sep 2021
Joint Direction and Proximity Classification of Overlapping Sound Events
  from Binaural Audio
Joint Direction and Proximity Classification of Overlapping Sound Events from Binaural Audio
D. Krause
Archontis Politis
A. Mesaros
22
7
0
26 Jul 2021
Assessment of Self-Attention on Learned Features For Sound Event
  Localization and Detection
Assessment of Self-Attention on Learned Features For Sound Event Localization and Detection
Parthasaarathy Sudarsanam
Archontis Politis
Konstantinos Drossos
19
13
0
20 Jul 2021
DCASE 2021 Task 3: Spectrotemporally-aligned Features for Polyphonic
  Sound Event Localization and Detection
DCASE 2021 Task 3: Spectrotemporally-aligned Features for Polyphonic Sound Event Localization and Detection
Thi Ngoc Tho Nguyen
Karn N. Watcharasupat
Ngoc Khanh Nguyen
Douglas L. Jones
W. Gan
27
16
0
29 Jun 2021
A Dataset of Dynamic Reverberant Sound Scenes with Directional
  Interferers for Sound Event Localization and Detection
A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection
Archontis Politis
Sharath Adavanne
D. Krause
Antoine Deleforge
Prerak Srivastava
Tuomas Virtanen
31
66
0
13 Jun 2021
SoundDet: Polyphonic Moving Sound Event Detection and Localization from
  Raw Waveform
SoundDet: Polyphonic Moving Sound Event Detection and Localization from Raw Waveform
Yuhang He
A. Trigoni
Andrew Markham
34
19
0
13 Jun 2021
PILOT: Introducing Transformers for Probabilistic Sound Event
  Localization
PILOT: Introducing Transformers for Probabilistic Sound Event Localization
C. Schymura
Benedikt T. Bönninghoff
Tsubasa Ochiai
Marc Delcroix
K. Kinoshita
Tomohiro Nakatani
S. Araki
D. Kolossa
27
24
0
07 Jun 2021
Refinement of Direction of Arrival Estimators by
  Majorization-Minimization Optimization on the Array Manifold
Refinement of Direction of Arrival Estimators by Majorization-Minimization Optimization on the Array Manifold
Robin Scheibler
M. Togami
13
3
0
02 Jun 2021
BeamLearning: an end-to-end Deep Learning approach for the angular
  localization of sound sources using raw multichannel acoustic pressure data
BeamLearning: an end-to-end Deep Learning approach for the angular localization of sound sources using raw multichannel acoustic pressure data
Hadrien Pujol
Éric Bavu
Alexandre Garcia
44
22
0
27 Apr 2021
L3DAS21 Challenge: Machine Learning for 3D Audio Signal Processing
L3DAS21 Challenge: Machine Learning for 3D Audio Signal Processing
E. Guizzo
R. F. Gramaccioni
Saeid Jamili
Christian Marinoni
Edoardo Massaro
...
Marco Pennese
Sveva Pepe
Enrico Rocchi
A. Uncini
Danilo Comminiello
21
27
0
12 Apr 2021
Audio scene monitoring using redundant ad-hoc microphone array networks
Audio scene monitoring using redundant ad-hoc microphone array networks
Peter Gerstoft
Yihan Hu
Michael J. Bianco
Chaitanya Patil
Ardel Alegre
Y. Freund
François Grondin
39
11
0
02 Mar 2021
Deep Learning based Multi-Source Localization with Source Splitting and
  its Effectiveness in Multi-Talker Speech Recognition
Deep Learning based Multi-Source Localization with Source Splitting and its Effectiveness in Multi-Talker Speech Recognition
Aswin Shanmugam Subramanian
Chao Weng
Shinji Watanabe
Meng Yu
Dong Yu
34
78
0
16 Feb 2021
A Four-Stage Data Augmentation Approach to ResNet-Conformer Based
  Acoustic Modeling for Sound Event Localization and Detection
A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection
Qing Wang
Jun Du
Hua-Xin Wu
Jia Pan
Feng Ma
Chin-Hui Lee
15
79
0
08 Jan 2021
ACCDOA: Activity-Coupled Cartesian Direction of Arrival Representation
  for Sound Event Localization and Detection
ACCDOA: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization and Detection
Kazuki Shimada
Yuichiro Koyama
Naoya Takahashi
Shusuke Takahashi
Yuki Mitsufuji
23
86
0
29 Oct 2020
An Improved Event-Independent Network for Polyphonic Sound Event
  Localization and Detection
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection
Yin Cao
Turab Iqbal
Qiuqiang Kong
Y. Zhong
Wenwu Wang
Mark D. Plumbley
16
75
0
25 Oct 2020
Joint Analysis of Sound Events and Acoustic Scenes Using Multitask
  Learning
Joint Analysis of Sound Events and Acoustic Scenes Using Multitask Learning
Noriyuki Tonami
Keisuke Imoto
Ryosuke Yamanishi
Y. Yamashita
23
13
0
16 Oct 2020
Sound event localization and detection based on crnn using rectangular
  filters and channel rotation data augmentation
Sound event localization and detection based on crnn using rectangular filters and channel rotation data augmentation
Francesca Ronchini
Daniel Arteaga
Andrés Pérez-López
26
9
0
13 Oct 2020
Multi-microphone Complex Spectral Mapping for Utterance-wise and
  Continuous Speech Separation
Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation
Zhong-Qiu Wang
Peidong Wang
DeLiang Wang
35
88
0
04 Oct 2020
12
Next