Adaptive Multi-scale Detection of Acoustic Events

15 November 2019
Wenhao Ding
Liang He

Papers citing "Adaptive Multi-scale Detection of Acoustic Events"

27 / 27 papers shown
MelNet: A Generative Model for Audio in the Frequency Domain
Sean Vasquez
M. Lewis
DiffM
50
131
0
04 Jun 2019
Unifying Isolated and Overlapping Audio Event Detection with Multi-Label Multi-Task Convolutional Recurrent Neural Networks
Huy P Phan
Oliver Y. Chén
P. Koch
L. D. Pham
Ian Mcloughlin
Alfred Mertins
M. D. Vos
44
18
0
02 Nov 2018
Learning How to Listen: A Temporal-Frequential Attention Model for Sound Event Detection
Yuhan Shen
Ke-Xin He
Weiqiang Zhang
26
18
0
29 Oct 2018
Unsupervised Detection of Anomalous Sound based on Deep Learning and the Neyman-Pearson Lemma
Yuma Koizumi
Shoichiro Saito
Hisashi Uematsu
Noboru Harada
AAML
56
158
0
22 Oct 2018
A simple model for detection of rare sound events
Weiran Wang
Chieh-Chi Kao
Chao Wang
25
17
0
20 Aug 2018
R-CRNN: Region-based Convolutional Recurrent Neural Network for Audio Event Detection
Chieh-Chi Kao
Weiran Wang
Ming Sun
Chao Wang
39
57
0
20 Aug 2018
CornerNet: Detecting Objects as Paired Keypoints
Hei Law
Jia Deng
ObjD
67
3,613
0
03 Aug 2018
Large-Scale Weakly Labeled Semi-Supervised Sound Event Detection in Domestic Environments
Romain Serizel
Nicolas Turpault
Hamid Eghbalzadeh
Ankit Parag Shah
31
140
0
27 Jul 2018
Disentangling by Partitioning: A Representation Learning Framework for Multimodal Sensory Data
Wei-Ning Hsu
James R. Glass
DRL
64
43
0
29 May 2018
Vehicle Pose and Shape Estimation through Multiple Monocular Vision
Wenhao Ding
Shuaijun Li
Guilin Zhang
Xiangyu Lei
Huihuan Qian
62
31
0
10 Feb 2018
Cascaded Pyramid Network for Multi-Person Pose Estimation
Yilun Chen
Zhicheng Wang
Yuxiang Peng
Zhiqiang Zhang
Gang Yu
Jian Sun
123
1,426
0
20 Nov 2017
A report on sound event detection with different binaural features
Sharath Adavanne
Tuomas Virtanen
41
68
0
09 Oct 2017
Large-scale weakly supervised audio classification using gated convolutional neural network
Yong Xu
Qiuqiang Kong
Wenwu Wang
Mark D. Plumbley
62
232
0
01 Oct 2017
DNN and CNN with Weighted and Multi-task Loss Functions for Audio Event Detection
Huy P Phan
Martin Krawczyk-Becker
Timo Gerkmann
Alfred Mertins
49
38
0
10 Aug 2017
Sound Event Detection in Multichannel Audio Using Spatial and Harmonic Features
Sharath Adavanne
Giambattista Parascandolo
Pasi Pertilä
Toni Heittola
Tuomas Virtanen
45
116
0
07 Jun 2017
Mask R-CNN
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
ObjD
344
27,129
0
20 Mar 2017
6-DoF Object Pose from Semantic Keypoints
Georgios Pavlakos
Xiaowei Zhou
Aaron Chan
Konstantinos G. Derpanis
Kostas Daniilidis
119
394
0
14 Mar 2017
Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection
Emre Çakir
Giambattista Parascandolo
Toni Heittola
H. Huttunen
Tuomas Virtanen
ObjD
45
543
0
21 Feb 2017
What Makes Audio Event Detection Harder than Classification?
Huy P Phan
P. Koch
Fabrice Katzberg
M. Maass
Radoslaw Mazur
Ian Mcloughlin
Alfred Mertins
28
10
0
29 Dec 2016
Language Modeling with Gated Convolutional Networks
Yann N. Dauphin
Angela Fan
Michael Auli
David Grangier
212
2,391
0
23 Dec 2016
R-FCN: Object Detection via Region-based Fully Convolutional Networks
Jifeng Dai
Yi Li
Kaiming He
Jian Sun
ObjD
158
5,635
0
20 May 2016
Recurrent Neural Networks for Polyphonic Sound Event Detection in Real Life Recordings
Giambattista Parascandolo
H. Huttunen
Tuomas Virtanen
44
318
0
04 Apr 2016
Stacked Hourglass Networks for Human Pose Estimation
Alejandro Newell
Kaiyu Yang
Jia Deng
3DH
115
5,024
0
22 Mar 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
MedIm
2.0K
193,426
0
10 Dec 2015
An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition
Baoguang Shi
X. Bai
Cong Yao
VLM
191
2,484
0
21 Jul 2015
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
473
62,122
0
04 Jun 2015
Deep Multimodal Learning for Audio-Visual Speech Recognition
Youssef Mroueh
E. Marcheret
Vaibhava Goel
60
226
0
22 Jan 2015