An embedded segmental K-means model for unsupervised segmentation and clustering of speech

23 March 2017

Papers citing "An embedded segmental K-means model for unsupervised segmentation and clustering of speech"

26 / 26 papers shown

Title
Unsupervised Word Discovery: Boundary Detection with Clustering vs. Dynamic Programming Simon Malan Benjamin van Niekerk Herman Kamper 30 0 0 22 Sep 2024
A Simple HMM with Self-Supervised Representations for Phone Segmentation Gene-Ping Yang Hao Tang SSL 35 0 0 15 Sep 2024
SD-HuBERT: Sentence-Level Self-Distillation Induces Syllabic Organization in HuBERT Cheol Jun Cho Abdelrahman Mohamed Shang-Wen Li Alan W. Black Gopala K. Anumanchipalli 39 8 0 16 Oct 2023
Leveraging multilingual transfer for unsupervised semantic acoustic word embeddings C. Jacobs Herman Kamper 32 1 0 05 Jul 2023
End-to-End Simultaneous Speech Translation with Differentiable Segmentation Shaolei Zhang Yang Feng 23 17 0 25 May 2023
Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model Puyuan Peng Shang-Wen Li Okko Rasanen Abdel-rahman Mohamed David Harwath SSL VLM 36 7 0 19 May 2023
Analyzing Acoustic Word Embeddings from Pre-trained Self-supervised Speech Models Ramon Sanabria Hao Tang Sharon Goldwater SSL 40 18 0 28 Oct 2022
Self-supervised language learning from raw audio: Lessons from the Zero Resource Speech Challenge Ewan Dunbar Nicolas Hamilakis Emmanuel Dupoux SSL 34 30 0 27 Oct 2022
Self-Supervised Speech Representation Learning: A Review Abdel-rahman Mohamed Hung-yi Lee Lasse Borgholt Jakob Drachmann Havtorn Joakim Edin ... Shang-Wen Li Karen Livescu Lars Maaløe Tara N. Sainath Shinji Watanabe SSL AI4TS 137 352 0 21 May 2022
Unsupervised Word Segmentation using K Nearest Neighbors T. Fuchs Yedid Hoshen Joseph Keshet SSL 24 6 0 27 Apr 2022
Speech Sequence Embeddings using Nearest Neighbors Contrastive Learning Algayres Robin Adel Nabli Benoît Sagot Emmanuel Dupoux SSL 23 8 0 11 Apr 2022
Word Discovery in Visually Grounded, Self-Supervised Speech Models Puyuan Peng David Harwath SSL 20 39 0 28 Mar 2022
Word Segmentation on Discovered Phone Units with Dynamic Programming and Self-Supervised Scoring Herman Kamper 34 25 0 24 Feb 2022
Towards Tokenized Human Dynamics Representation Kenneth Li Xiao Sun Zhirong Wu Fangyun Wei Stephen Lin 29 2 0 22 Nov 2021
Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding Saurabhchand Bhati Jesús Villalba Piotr Żelasko Laureano Moro Velázquez Najim Dehak SSL 53 22 0 05 Oct 2021
Multilingual transfer of acoustic word embeddings improves when training on languages related to the target zero-resource language C. Jacobs Herman Kamper 35 10 0 24 Jun 2021
Unsupervised Automatic Speech Recognition: A Review Hanan Aldarmaki Asad Ullah Nazar Zaki VLM SSL 39 57 0 09 Jun 2021
Towards unsupervised phone and word segmentation using self-supervised vector-quantized neural networks Herman Kamper Benjamin van Niekerk SSL MQ 20 35 0 14 Dec 2020
A Correspondence Variational Autoencoder for Unsupervised Acoustic Word Embeddings Puyuan Peng Herman Kamper Karen Livescu DRL SSL 14 14 0 03 Dec 2020
Unsupervised Discovery of Recurring Speech Patterns Using Probabilistic Adaptive Metrics Okko Rasanen María Andrea Cruz Blandón 24 25 0 03 Aug 2020
Multilingual acoustic word embedding models for processing zero-resource languages Herman Kamper Yevgen Matusevych Sharon Goldwater 31 24 0 06 Feb 2020
Unsupervised Phoneme and Word Discovery from Multiple Speakers using Double Articulation Analyzer and Neural Network with Parametric Bias Ryo Nakashima Ryo Ozaki T. Taniguchi 21 6 0 21 Jun 2019
From Semi-supervised to Almost-unsupervised Speech Recognition with Very-low Resource by Jointly Learning Phonetic Structures from Audio and Text Embeddings Yi-Chen Chen Sung-Feng Huang Hung-yi Lee Lin-Shan Lee SSL 19 0 0 10 Apr 2019
Unsupervised Speech Recognition via Segmental Empirical Output Distribution Matching Chih-Kuan Yeh Jianshu Chen Chengzhu Yu Dong Yu 13 40 0 23 Dec 2018
Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval Yi-Chen Chen Sung-Feng Huang Chia-Hao Shen Hung-yi Lee Lin-Shan Lee 46 37 0 21 Jul 2018
Sequence Prediction with Neural Segmental Models Hao Tang 29 2 0 05 Sep 2017