ResearchTrend.AI
rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method

9 June 2019
Zheng-Hua Tan
A. Sarkar
Najim Dehak

Papers citing "rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method"

17 / 17 papers shown
Noise-Robust Target-Speaker Voice Activity Detection Through Self-Supervised Pretraining
H. S. Bovbjerg
Jan Østergaard
Jesper Jensen
Zheng-Hua Tan
41
0
0
06 Jan 2025
InaGVAD : a Challenging French TV and Radio Corpus Annotated for Speech Activity Detection and Speaker Gender Segmentation
D. Doukhan
Christine Maertens
William Le Personnic
Ludovic Speroni
Reda Dehak
38
2
0
06 Jun 2024
Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition
Zhisheng Zheng
Ziyang Ma
Yu Wang
Xie Chen
36
2
0
28 Aug 2023
SVVAD: Personal Voice Activity Detection for Speaker Verification
Zuheng Kang
Jianzong Wang
Junqing Peng
Jing Xiao
11
2
0
31 May 2023
Integrating Voice-Based Machine Learning Technology into Complex Home Environments
Ye Gao
Jason J. Jabbour
Eun-Jung Ko
L. Wijayasingha
Sooyoung Kim
...
Meiyi Ma
Karen Rose
Kristin D. Gordon
Hongning Wang
John A. Stankovic
29
1
0
06 Nov 2022
Leveraging Domain Features for Detecting Adversarial Attacks Against Deep Speech Recognition in Noise
Christian Heider Nielsen
Zheng-Hua Tan
AAML
27
1
0
03 Nov 2022
Impact of annotation modality on label quality and model performance in the automatic assessment of laughter in-the-wild
Jose Vargas-Quiros
Laura Cabrera-Quiros
Catharine Oertel
Hayley Hung
27
5
0
02 Nov 2022
No-audio speaking status detection in crowded settings via visual pose-based filtering and wearable acceleration
Jose Vargas-Quiros
Laura Cabrera-Quiros
Hayley Hung
29
1
0
01 Nov 2022
SVLDL: Improved Speaker Age Estimation Using Selective Variance Label Distribution Learning
Zuheng Kang
Jianzong Wang
Junqing Peng
Jing Xiao
24
3
0
18 Oct 2022
Speech Sequence Embeddings using Nearest Neighbors Contrastive Learning
Algayres Robin
Adel Nabli
Benoît Sagot
Emmanuel Dupoux
SSL
23
8
0
11 Apr 2022
NAS-VAD: Neural Architecture Search for Voice Activity Detection
Daniel Rho
Jinhyeok Park
J. Ko
51
6
0
22 Jan 2022
On Training Targets and Activation Functions for Deep Representation Learning in Text-Dependent Speaker Verification
A. Sarkar
Zheng-Hua Tan
16
2
0
17 Jan 2022
Deep Spoken Keyword Spotting: An Overview
Iván López-Espejo
Zheng-Hua Tan
John H. L. Hansen
Jesper Jensen
21
102
0
20 Nov 2021
Voice activity detection in the wild: A data-driven approach using teacher-student training
Heinrich Dinkel
Shuai Wang
Xuenan Xu
Mengyue Wu
K. Yu
VLM
13
32
0
10 May 2021
Dataset artefacts in anti-spoofing systems: a case study on the ASVspoof 2017 benchmark
Bhusan Chettri
Emmanouil Benetos
Bob L. T. Sturm
34
27
0
15 Oct 2020
FEARLESS STEPS Challenge (FS-2): Supervised Learning with Massive Naturalistic Apollo Data
Aditya Sunil Joglekar
John H. L. Hansen
M. C. Shekhar
A. Sangwan
32
24
0
15 Aug 2020
Cotatron: Transcription-Guided Speech Encoder for Any-to-Many Voice Conversion without Parallel Data
Seung-won Park
Doo-young Kim
Myun-chul Joe
31
40
0
07 May 2020