ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2012.07551
  4. Cited By
Towards unsupervised phone and word segmentation using self-supervised
  vector-quantized neural networks

Towards unsupervised phone and word segmentation using self-supervised vector-quantized neural networks

14 December 2020
Herman Kamper
Benjamin van Niekerk
    SSL
    MQ
ArXivPDFHTML

Papers citing "Towards unsupervised phone and word segmentation using self-supervised vector-quantized neural networks"

24 / 24 papers shown
Title
A Simple HMM with Self-Supervised Representations for Phone Segmentation
A Simple HMM with Self-Supervised Representations for Phone Segmentation
Gene-Ping Yang
Hao Tang
SSL
35
0
0
15 Sep 2024
Unified Segment-to-Segment Framework for Simultaneous Sequence
  Generation
Unified Segment-to-Segment Framework for Simultaneous Sequence Generation
Shaolei Zhang
Yang Feng
23
7
0
27 Oct 2023
Rhythm Modeling for Voice Conversion
Rhythm Modeling for Voice Conversion
Benjamin van Niekerk
M. Carbonneau
Herman Kamper
37
5
0
12 Jul 2023
Visually grounded few-shot word learning in low-resource settings
Visually grounded few-shot word learning in low-resource settings
Leanne Nortje
Dan Oneaţă
Herman Kamper
VLM
17
4
0
20 Jun 2023
End-to-End Simultaneous Speech Translation with Differentiable
  Segmentation
End-to-End Simultaneous Speech Translation with Differentiable Segmentation
Shaolei Zhang
Yang Feng
23
17
0
25 May 2023
Visually grounded few-shot word acquisition with fewer shots
Visually grounded few-shot word acquisition with fewer shots
Leanne Nortje
Benjamin van Niekerk
Herman Kamper
28
1
0
25 May 2023
Unsupervised Word Segmentation Using Temporal Gradient Pseudo-Labels
Unsupervised Word Segmentation Using Temporal Gradient Pseudo-Labels
T. Fuchs
Yedid Hoshen
33
5
0
30 Mar 2023
Towards trustworthy phoneme boundary detection with autoregressive model
  and improved evaluation metric
Towards trustworthy phoneme boundary detection with autoregressive model and improved evaluation metric
Hyeongju Kim
Hyeong-Seok Choi
6
2
0
13 Dec 2022
Learning Dependencies of Discrete Speech Representations with Neural
  Hidden Markov Models
Learning Dependencies of Discrete Speech Representations with Neural Hidden Markov Models
Sung-Lin Yeh
Hao Tang
SSL
BDL
35
1
0
29 Oct 2022
On Compressing Sequences for Self-Supervised Speech Models
On Compressing Sequences for Self-Supervised Speech Models
Yen Meng
Hsuan-Jui Chen
Jiatong Shi
Shinji Watanabe
Paola García
Hung-yi Lee
Hao Tang
SSL
21
15
0
13 Oct 2022
Learning Phone Recognition from Unpaired Audio and Phone Sequences Based
  on Generative Adversarial Network
Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network
Da-Rong Liu
Po-Chun Hsu
Yi-Chen Chen
Sung-Feng Huang
Shun-Po Chuang
Da-Yi Wu
Hung-yi Lee
GAN
25
7
0
29 Jul 2022
A Temporal Extension of Latent Dirichlet Allocation for Unsupervised
  Acoustic Unit Discovery
A Temporal Extension of Latent Dirichlet Allocation for Unsupervised Acoustic Unit Discovery
W. V. D. Merwe
Herman Kamper
J. D. Preez
22
2
0
23 Jun 2022
DP-Parse: Finding Word Boundaries from Raw Speech with an Instance
  Lexicon
DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon
Robin Algayres
Tristan Ricoul
Julien Karadayi
Hugo Laurenccon
Salah Zaiem
Abdel-rahman Mohamed
Benoît Sagot
Emmanuel Dupoux
14
13
0
22 Jun 2022
Unsupervised Word Segmentation using K Nearest Neighbors
Unsupervised Word Segmentation using K Nearest Neighbors
T. Fuchs
Yedid Hoshen
Joseph Keshet
SSL
24
6
0
27 Apr 2022
Word Discovery in Visually Grounded, Self-Supervised Speech Models
Word Discovery in Visually Grounded, Self-Supervised Speech Models
Puyuan Peng
David Harwath
SSL
20
39
0
28 Mar 2022
A Brief Overview of Unsupervised Neural Speech Representation Learning
A Brief Overview of Unsupervised Neural Speech Representation Learning
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
Lars Maaløe
Christian Igel
BDL
AI4TS
SSL
19
11
0
01 Mar 2022
Word Segmentation on Discovered Phone Units with Dynamic Programming and
  Self-Supervised Scoring
Word Segmentation on Discovered Phone Units with Dynamic Programming and Self-Supervised Scoring
Herman Kamper
34
25
0
24 Feb 2022
Phone-to-audio alignment without text: A Semi-supervised Approach
Phone-to-audio alignment without text: A Semi-supervised Approach
Jian Zhu
Cong Zhang
David Jurgens
37
36
0
08 Oct 2021
Unsupervised Speech Segmentation and Variable Rate Representation
  Learning using Segmental Contrastive Predictive Coding
Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding
Saurabhchand Bhati
Jesús Villalba
Piotr Żelasko
Laureano Moro Velázquez
Najim Dehak
SSL
53
22
0
05 Oct 2021
Unsupervised Automatic Speech Recognition: A Review
Unsupervised Automatic Speech Recognition: A Review
Hanan Aldarmaki
Asad Ullah
Nazar Zaki
VLM
SSL
39
57
0
09 Jun 2021
Unsupervised Word Segmentation from Discrete Speech Units in
  Low-Resource Settings
Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings
Marcely Zanon Boito
Bolaji Yusuf
Lucas Ondel
Aline Villavicencio
Laurent Besacier
23
3
0
08 Jun 2021
Segmental Contrastive Predictive Coding for Unsupervised Word
  Segmentation
Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation
Saurabhchand Bhati
Jesús Villalba
Piotr Żelasko
Laureano Moro Velázquez
Najim Dehak
SSL
19
37
0
03 Jun 2021
Double Articulation Analyzer with Prosody for Unsupervised Word and
  Phoneme Discovery
Double Articulation Analyzer with Prosody for Unsupervised Word and Phoneme Discovery
Yasuaki Okuda
Ryo Ozaki
T. Taniguchi
28
5
0
15 Mar 2021
Evaluating the reliability of acoustic speech embeddings
Evaluating the reliability of acoustic speech embeddings
Robin Algayres
Mohamed Salah Zaiem
Benoît Sagot
Emmanuel Dupoux
38
29
0
27 Jul 2020
1