Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.03588
Cited By
rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method
9 June 2019
Zheng-Hua Tan
A. Sarkar
Najim Dehak
Re-assign community
ArXiv
PDF
HTML
Papers citing
"rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method"
17 / 17 papers shown
Title
Noise-Robust Target-Speaker Voice Activity Detection Through Self-Supervised Pretraining
H. S. Bovbjerg
Jan Østergaard
Jesper Jensen
Zheng-Hua Tan
41
0
0
06 Jan 2025
InaGVAD : a Challenging French TV and Radio Corpus Annotated for Speech Activity Detection and Speaker Gender Segmentation
D. Doukhan
Christine Maertens
William Le Personnic
Ludovic Speroni
Reda Dehak
38
2
0
06 Jun 2024
Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition
Zhisheng Zheng
Ziyang Ma
Yu Wang
Xie Chen
36
2
0
28 Aug 2023
SVVAD: Personal Voice Activity Detection for Speaker Verification
Zuheng Kang
Jianzong Wang
Junqing Peng
Jing Xiao
11
2
0
31 May 2023
Integrating Voice-Based Machine Learning Technology into Complex Home Environments
Ye Gao
Jason J. Jabbour
Eun-Jung Ko
L. Wijayasingha
Sooyoung Kim
...
Meiyi Ma
Karen Rose
Kristin D. Gordon
Hongning Wang
John A. Stankovic
29
1
0
06 Nov 2022
Leveraging Domain Features for Detecting Adversarial Attacks Against Deep Speech Recognition in Noise
Christian Heider Nielsen
Zheng-Hua Tan
AAML
27
1
0
03 Nov 2022
Impact of annotation modality on label quality and model performance in the automatic assessment of laughter in-the-wild
Jose Vargas-Quiros
Laura Cabrera-Quiros
Catharine Oertel
Hayley Hung
27
5
0
02 Nov 2022
No-audio speaking status detection in crowded settings via visual pose-based filtering and wearable acceleration
Jose Vargas-Quiros
Laura Cabrera-Quiros
Hayley Hung
29
1
0
01 Nov 2022
SVLDL: Improved Speaker Age Estimation Using Selective Variance Label Distribution Learning
Zuheng Kang
Jianzong Wang
Junqing Peng
Jing Xiao
24
3
0
18 Oct 2022
Speech Sequence Embeddings using Nearest Neighbors Contrastive Learning
Algayres Robin
Adel Nabli
Benoît Sagot
Emmanuel Dupoux
SSL
23
8
0
11 Apr 2022
NAS-VAD: Neural Architecture Search for Voice Activity Detection
Daniel Rho
Jinhyeok Park
J. Ko
51
6
0
22 Jan 2022
On Training Targets and Activation Functions for Deep Representation Learning in Text-Dependent Speaker Verification
A. Sarkar
Zheng-Hua Tan
16
2
0
17 Jan 2022
Deep Spoken Keyword Spotting: An Overview
Iván López-Espejo
Zheng-Hua Tan
John H. L. Hansen
Jesper Jensen
21
102
0
20 Nov 2021
Voice activity detection in the wild: A data-driven approach using teacher-student training
Heinrich Dinkel
Shuai Wang
Xuenan Xu
Mengyue Wu
K. Yu
VLM
13
32
0
10 May 2021
Dataset artefacts in anti-spoofing systems: a case study on the ASVspoof 2017 benchmark
Bhusan Chettri
Emmanouil Benetos
Bob L. T. Sturm
34
27
0
15 Oct 2020
FEARLESS STEPS Challenge (FS-2): Supervised Learning with Massive Naturalistic Apollo Data
Aditya Sunil Joglekar
John H. L. Hansen
M. C. Shekhar
A. Sangwan
32
24
0
15 Aug 2020
Cotatron: Transcription-Guided Speech Encoder for Any-to-Many Voice Conversion without Parallel Data
Seung-won Park
Doo-young Kim
Myun-chul Joe
31
40
0
07 May 2020
1