rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection
Method

rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method

9 June 2019

Najim Dehak

Papers citing "rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method"

17 / 17 papers shown

Title
Noise-Robust Target-Speaker Voice Activity Detection Through Self-Supervised Pretraining H. S. Bovbjerg Jan Østergaard Jesper Jensen Zheng-Hua Tan 41 0 0 06 Jan 2025
InaGVAD : a Challenging French TV and Radio Corpus Annotated for Speech Activity Detection and Speaker Gender Segmentation D. Doukhan Christine Maertens William Le Personnic Ludovic Speroni Reda Dehak 38 2 0 06 Jun 2024
Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition Zhisheng Zheng Ziyang Ma Yu Wang Xie Chen 36 2 0 28 Aug 2023
SVVAD: Personal Voice Activity Detection for Speaker Verification Zuheng Kang Jianzong Wang Junqing Peng Jing Xiao 11 2 0 31 May 2023
Integrating Voice-Based Machine Learning Technology into Complex Home Environments Ye Gao Jason J. Jabbour Eun-Jung Ko L. Wijayasingha Sooyoung Kim ... Meiyi Ma Karen Rose Kristin D. Gordon Hongning Wang John A. Stankovic 29 1 0 06 Nov 2022
Leveraging Domain Features for Detecting Adversarial Attacks Against Deep Speech Recognition in Noise Christian Heider Nielsen Zheng-Hua Tan AAML 27 1 0 03 Nov 2022
Impact of annotation modality on label quality and model performance in the automatic assessment of laughter in-the-wild Jose Vargas-Quiros Laura Cabrera-Quiros Catharine Oertel Hayley Hung 27 5 0 02 Nov 2022
No-audio speaking status detection in crowded settings via visual pose-based filtering and wearable acceleration Jose Vargas-Quiros Laura Cabrera-Quiros Hayley Hung 29 1 0 01 Nov 2022
SVLDL: Improved Speaker Age Estimation Using Selective Variance Label Distribution Learning Zuheng Kang Jianzong Wang Junqing Peng Jing Xiao 24 3 0 18 Oct 2022
Speech Sequence Embeddings using Nearest Neighbors Contrastive Learning Algayres Robin Adel Nabli Benoît Sagot Emmanuel Dupoux SSL 23 8 0 11 Apr 2022
NAS-VAD: Neural Architecture Search for Voice Activity Detection Daniel Rho Jinhyeok Park J. Ko 51 6 0 22 Jan 2022
On Training Targets and Activation Functions for Deep Representation Learning in Text-Dependent Speaker Verification A. Sarkar Zheng-Hua Tan 16 2 0 17 Jan 2022
Deep Spoken Keyword Spotting: An Overview Iván López-Espejo Zheng-Hua Tan John H. L. Hansen Jesper Jensen 21 102 0 20 Nov 2021
Voice activity detection in the wild: A data-driven approach using teacher-student training Heinrich Dinkel Shuai Wang Xuenan Xu Mengyue Wu K. Yu VLM 13 32 0 10 May 2021
Dataset artefacts in anti-spoofing systems: a case study on the ASVspoof 2017 benchmark Bhusan Chettri Emmanouil Benetos Bob L. T. Sturm 34 27 0 15 Oct 2020
FEARLESS STEPS Challenge (FS-2): Supervised Learning with Massive Naturalistic Apollo Data Aditya Sunil Joglekar John H. L. Hansen M. C. Shekhar A. Sangwan 32 24 0 15 Aug 2020
Cotatron: Transcription-Guided Speech Encoder for Any-to-Many Voice Conversion without Parallel Data Seung-won Park Doo-young Kim Myun-chul Joe 31 40 0 07 May 2020