ResearchTrend.AI
An embedded segmental K-means model for unsupervised segmentation and clustering of speech

23 March 2017
Herman Kamper
Karen Livescu
Sharon Goldwater

Papers citing "An embedded segmental K-means model for unsupervised segmentation and clustering of speech"

26 / 26 papers shown
Unsupervised Word Discovery: Boundary Detection with Clustering vs. Dynamic Programming
Simon Malan
Benjamin van Niekerk
Herman Kamper
30
0
0
22 Sep 2024
A Simple HMM with Self-Supervised Representations for Phone Segmentation
Gene-Ping Yang
Hao Tang
SSL
35
0
0
15 Sep 2024
SD-HuBERT: Sentence-Level Self-Distillation Induces Syllabic Organization in HuBERT
Cheol Jun Cho
Abdelrahman Mohamed
Shang-Wen Li
Alan W. Black
Gopala K. Anumanchipalli
39
8
0
16 Oct 2023
Leveraging multilingual transfer for unsupervised semantic acoustic word embeddings
C. Jacobs
Herman Kamper
32
1
0
05 Jul 2023
End-to-End Simultaneous Speech Translation with Differentiable Segmentation
Shaolei Zhang
Yang Feng
23
17
0
25 May 2023
Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model
Puyuan Peng
Shang-Wen Li
Okko Rasanen
Abdel-rahman Mohamed
David Harwath
SSL
VLM
36
7
0
19 May 2023
Analyzing Acoustic Word Embeddings from Pre-trained Self-supervised Speech Models
Ramon Sanabria
Hao Tang
Sharon Goldwater
SSL
40
18
0
28 Oct 2022
Self-supervised language learning from raw audio: Lessons from the Zero Resource Speech Challenge
Ewan Dunbar
Nicolas Hamilakis
Emmanuel Dupoux
SSL
34
30
0
27 Oct 2022
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL
AI4TS
137
352
0
21 May 2022
Unsupervised Word Segmentation using K Nearest Neighbors
T. Fuchs
Yedid Hoshen
Joseph Keshet
SSL
24
6
0
27 Apr 2022
Speech Sequence Embeddings using Nearest Neighbors Contrastive Learning
Algayres Robin
Adel Nabli
Benoît Sagot
Emmanuel Dupoux
SSL
23
8
0
11 Apr 2022
Word Discovery in Visually Grounded, Self-Supervised Speech Models
Puyuan Peng
David Harwath
SSL
20
39
0
28 Mar 2022
Word Segmentation on Discovered Phone Units with Dynamic Programming and Self-Supervised Scoring
Herman Kamper
34
25
0
24 Feb 2022
Towards Tokenized Human Dynamics Representation
Kenneth Li
Xiao Sun
Zhirong Wu
Fangyun Wei
Stephen Lin
29
2
0
22 Nov 2021
Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding
Saurabhchand Bhati
Jesús Villalba
Piotr Żelasko
Laureano Moro Velázquez
Najim Dehak
SSL
53
22
0
05 Oct 2021
Multilingual transfer of acoustic word embeddings improves when training on languages related to the target zero-resource language
C. Jacobs
Herman Kamper
35
10
0
24 Jun 2021
Unsupervised Automatic Speech Recognition: A Review
Hanan Aldarmaki
Asad Ullah
Nazar Zaki
VLM
SSL
39
57
0
09 Jun 2021
Towards unsupervised phone and word segmentation using self-supervised vector-quantized neural networks
Herman Kamper
Benjamin van Niekerk
SSL
MQ
20
35
0
14 Dec 2020
A Correspondence Variational Autoencoder for Unsupervised Acoustic Word Embeddings
Puyuan Peng
Herman Kamper
Karen Livescu
DRL
SSL
14
14
0
03 Dec 2020
Unsupervised Discovery of Recurring Speech Patterns Using Probabilistic Adaptive Metrics
Okko Rasanen
María Andrea Cruz Blandón
24
25
0
03 Aug 2020
Multilingual acoustic word embedding models for processing zero-resource languages
Herman Kamper
Yevgen Matusevych
Sharon Goldwater
31
24
0
06 Feb 2020
Unsupervised Phoneme and Word Discovery from Multiple Speakers using Double Articulation Analyzer and Neural Network with Parametric Bias
Ryo Nakashima
Ryo Ozaki
T. Taniguchi
21
6
0
21 Jun 2019
From Semi-supervised to Almost-unsupervised Speech Recognition with Very-low Resource by Jointly Learning Phonetic Structures from Audio and Text Embeddings
Yi-Chen Chen
Sung-Feng Huang
Hung-yi Lee
Lin-Shan Lee
SSL
19
0
0
10 Apr 2019
Unsupervised Speech Recognition via Segmental Empirical Output Distribution Matching
Chih-Kuan Yeh
Jianshu Chen
Chengzhu Yu
Dong Yu
13
40
0
23 Dec 2018
Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval
Yi-Chen Chen
Sung-Feng Huang
Chia-Hao Shen
Hung-yi Lee
Lin-Shan Lee
46
37
0
21 Jul 2018
Sequence Prediction with Neural Segmental Models
Hao Tang
29
2
0
05 Sep 2017