A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection

8 January 2021
Qing Wang
Jun Du
Hua-Xin Wu
Jia Pan
Feng Ma
Chin-Hui Lee

Papers citing "A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection"

35 / 35 papers shown
Location-Oriented Sound Event Localization and Detection with Spatial Mapping and Regression Localization
Xueping Zhang
Yaxiong Chen
Ruilin Yao
Yunfei Zi
Shengwu Xiong
45
0
0
11 Apr 2025
Reverberation-based Features for Sound Event Localization and Detection with Distance Estimation
Davide Berghi
Philip J. B. Jackson
41
0
0
11 Apr 2025
CNN-based Robust Sound Source Localization with SRP-PHAT for the Extreme Edge
Jun Yin
Marian Verhelst
70
1
0
03 Mar 2025
PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection
Jinbo Hu
Yin Cao
Ming Wu
Fang Kang
Feiran Yang
Wenwu Wang
Mark D. Plumbley
J. Yang
41
0
0
10 Nov 2024
DOA-Aware Audio-Visual Self-Supervised Learning for Sound Event Localization and Detection
Yoto Fujita
Yoshiaki Bando
Keisuke Imoto
Masaki Onishi
Kazuyoshi Yoshii
48
2
0
30 Oct 2024
Leveraging Reverberation and Visual Depth Cues for Sound Event Localization and Detection with Distance Estimation
Davide Berghi
Philip J. B. Jackson
42
1
0
29 Oct 2024
Enhancing 1-Second 3D SELD Performance with Filter Bank Analysis and SCConv Integration in CST-Former
Zhehui Zhang
29
0
0
17 Oct 2024
Learning Multi-Target TDOA Features for Sound Event Localization and Detection
Axel Berg
Johanna Engman
Jens Gulin
Karl Åström
Magnus Oskarsson
44
1
0
30 Aug 2024
Audio-Visual Talker Localization in Video for Spatial Sound Reproduction
Davide Berghi
Philip J. B. Jackson
55
0
0
01 Jun 2024
Enhanced Sound Event Localization and Detection in Real 360-degree audio-visual soundscapes
Adrian S. Roman
Baladithya Balamurugan
Rithik Pothuganti
35
5
0
29 Jan 2024
Robust DOA estimation using deep acoustic imaging
Adrian S. Roman
Irán R. Román
J. P. Bello
27
2
0
16 Jan 2024
Selective-Memory Meta-Learning with Environment Representations for Sound Event Localization and Detection
Jinbo Hu
Yin Cao
Ming Wu
Qiuqiang Kong
Feiran Yang
Mark D. Plumbley
Jun Yang
40
1
0
27 Dec 2023
CST-former: Transformer with Channel-Spectro-Temporal Attention for Sound Event Localization and Detection
Yusun Shul
Jung-Woo Choi
46
8
0
20 Dec 2023
Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection
Davide Berghi
Peipei Wu
Jinzheng Zhao
Wenwu Wang
Philip J. B. Jackson
54
10
0
14 Dec 2023
Feature Aggregation in Joint Sound Classification and Localization Neural Networks
Brendan Healy
Patrick McNamee
Z. Nili Ahmadabadi
46
0
0
29 Oct 2023
Performance and energy balance: a comprehensive study of state-of-the-art sound event detection systems
Francesca Ronchini
Romain Serizel
35
11
0
05 Oct 2023
Exploring Self-Supervised Contrastive Learning of Spatial Sound Event Representation
Xilin Jiang
Cong Han
Yinghao Aaron Li
N. Mesgarani
SSL
49
1
0
27 Sep 2023
Two vs. Four-Channel Sound Event Localization and Detection
J. Wilkins
Magdalena Fuentes
Luca Bondi
Shabnam Ghaffarzadegan
A. Abavisani
J. P. Bello
21
1
0
23 Sep 2023
Sound Source Distance Estimation in Diverse and Dynamic Acoustic Conditions
Saksham Singh Kushwaha
Irán R. Román
Magdalena Fuentes
J. P. Bello
24
9
0
17 Sep 2023
A Real-Time Active Speaker Detection System Integrating an Audio-Visual Signal with a Spatial Querying Mechanism
I. Gurvich
Ido Leichter
Dharmendar Reddy Palle
Yossi Asher
Alon Vinnikov
Igor Abramovski
Vishak Gopal
Ross Cutler
Eyal Krupka
39
4
0
15 Sep 2023
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Kazuki Shimada
Archontis Politis
Parthasaarathy Sudarsanam
D. Krause
Kengo Uchida
...
Yuichiro Koyama
Naoya Takahashi
Shusuke Takahashi
Tuomas Virtanen
Yuki Mitsufuji
68
37
0
15 Jun 2023
Inference and Denoise: Causal Inference-based Neural Speech Enhancement
Tsun-An Hsieh
Chao-Han Huck Yang
Pin-Yu Chen
Sabato Marco Siniscalchi
Yu Tsao
CML
63
2
0
02 Nov 2022
Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function
Qing Wang
Hang Chen
Yannan Jiang
Zhe Wang
Yuyang Wang
Jun Du
Chin-Hui Lee
29
4
0
26 Oct 2022
A Track-Wise Ensemble Event Independent Network for Polyphonic Sound Event Localization and Detection
Jinbo Hu
Yin Cao
Ming Wu
Qiuqiang Kong
Feiran Yang
Mark D. Plumbley
J. Yang
29
23
0
19 Mar 2022
Locate This, Not That: Class-Conditioned Sound Event DOA Estimation
Olga Slizovskaia
Gordon Wichern
Zhong-Qiu Wang
Jonathan Le Roux
27
4
0
08 Mar 2022
SALSA-Lite: A Fast and Effective Feature for Polyphonic Sound Event Localization and Detection with Microphone Arrays
Thi Ngoc Tho Nguyen
Douglas L. Jones
Karn N. Watcharasupat
Huy P Phan
W. Gan
38
37
0
16 Nov 2021
Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training
Kazuki Shimada
Yuichiro Koyama
Shusuke Takahashi
Naoya Takahashi
E. Tsunoo
Yuki Mitsufuji
18
64
0
14 Oct 2021
Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection
Yuichiro Koyama
Kazuhide Shigemi
Masafumi Takahashi
Kazuki Shimada
Naoya Takahashi
E. Tsunoo
Shusuke Takahashi
Yuki Mitsufuji
41
12
0
13 Oct 2021
Spatial mixup: Directional loudness modification as data augmentation for sound event localization and detection
Ricardo Falcón Pérez
Kazuki Shimada
Yuichiro Koyama
Shusuke Takahashi
Yuki Mitsufuji
28
5
0
12 Oct 2021
SALSA: Spatial Cue-Augmented Log-Spectrogram Features for Polyphonic Sound Event Localization and Detection
Thi Ngoc Tho Nguyen
Karn N. Watcharasupat
Ngoc Khanh Nguyen
Douglas L. Jones
W. Gan
31
49
0
01 Oct 2021
A Survey of Sound Source Localization with Deep Learning Methods
Pierre-Amaury Grumiaux
Srdjan Kitić
Laurent Girin
Alexandre Guérin
43
247
0
08 Sep 2021
Assessment of Self-Attention on Learned Features For Sound Event Localization and Detection
Parthasaarathy Sudarsanam
Archontis Politis
Konstantinos Drossos
27
13
0
20 Jul 2021
DCASE 2021 Task 3: Spectrotemporally-aligned Features for Polyphonic Sound Event Localization and Detection
Thi Ngoc Tho Nguyen
Karn N. Watcharasupat
Ngoc Khanh Nguyen
Douglas L. Jones
W. Gan
32
16
0
29 Jun 2021
Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection
Kazuki Shimada
Naoya Takahashi
Yuichiro Koyama
Shusuke Takahashi
E. Tsunoo
Masafumi Takahashi
Yuki Mitsufuji
35
23
0
21 Jun 2021
A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection
Archontis Politis
Sharath Adavanne
D. Krause
Antoine Deleforge
Prerak Srivastava
Tuomas Virtanen
36
66
0
13 Jun 2021