ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2206.11332
  4. Cited By
DP-Parse: Finding Word Boundaries from Raw Speech with an Instance
  Lexicon

DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon

22 June 2022
Robin Algayres
Tristan Ricoul
Julien Karadayi
Hugo Laurenccon
Salah Zaiem
Abdel-rahman Mohamed
Benoît Sagot
Emmanuel Dupoux
ArXiv (abs)PDFHTML

Papers citing "DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon"

28 / 28 papers shown
Title
Towards Unsupervised Speech Recognition Without Pronunciation Models
Towards Unsupervised Speech Recognition Without Pronunciation Models
Junrui Ni
Liming Wang
Yang Zhang
Kaizhi Qian
Heting Gao
Mark Hasegawa-Johnson
Chang D. Yoo
SSLOffRL
129
0
0
10 Jan 2025
Unsupervised Word Discovery: Boundary Detection with Clustering vs. Dynamic Programming
Unsupervised Word Discovery: Boundary Detection with Clustering vs. Dynamic Programming
Simon Malan
Benjamin van Niekerk
Herman Kamper
71
0
0
22 Sep 2024
Speech Sequence Embeddings using Nearest Neighbors Contrastive Learning
Speech Sequence Embeddings using Nearest Neighbors Contrastive Learning
Algayres Robin
Adel Nabli
Benoît Sagot
Emmanuel Dupoux
SSL
50
8
0
11 Apr 2022
Word Segmentation on Discovered Phone Units with Dynamic Programming and
  Self-Supervised Scoring
Word Segmentation on Discovered Phone Units with Dynamic Programming and Self-Supervised Scoring
Herman Kamper
70
26
0
24 Feb 2022
HuBERT: Self-Supervised Speech Representation Learning by Masked
  Prediction of Hidden Units
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
Wei-Ning Hsu
Benjamin Bolte
Yao-Hung Hubert Tsai
Kushal Lakhotia
Ruslan Salakhutdinov
Abdel-rahman Mohamed
SSL
180
2,966
0
14 Jun 2021
Segmental Contrastive Predictive Coding for Unsupervised Word
  Segmentation
Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation
Saurabhchand Bhati
Jesús Villalba
Piotr Żelasko
Laureano Moro-Velazquez
Najim Dehak
SSL
70
37
0
03 Jun 2021
Acoustic word embeddings for zero-resource languages using
  self-supervised contrastive learning and multilingual adaptation
Acoustic word embeddings for zero-resource languages using self-supervised contrastive learning and multilingual adaptation
C. Jacobs
Yevgen Matusevych
Herman Kamper
50
21
0
19 Mar 2021
Generative Spoken Language Modeling from Raw Audio
Generative Spoken Language Modeling from Raw Audio
Kushal Lakhotia
Evgeny Kharitonov
Wei-Ning Hsu
Yossi Adi
Adam Polyak
...
Tu Nguyen
Jade Copet
Alexei Baevski
A. Mohamed
Emmanuel Dupoux
AuLLM
251
360
0
01 Feb 2021
Towards unsupervised phone and word segmentation using self-supervised
  vector-quantized neural networks
Towards unsupervised phone and word segmentation using self-supervised vector-quantized neural networks
Herman Kamper
Benjamin van Niekerk
SSLMQ
71
36
0
14 Dec 2020
Acoustic span embeddings for multilingual query-by-example search
Acoustic span embeddings for multilingual query-by-example search
Yushi Hu
Shane Settle
Karen Livescu
RALM
64
8
0
24 Nov 2020
The Zero Resource Speech Benchmark 2021: Metrics and baselines for
  unsupervised spoken language modeling
The Zero Resource Speech Benchmark 2021: Metrics and baselines for unsupervised spoken language modeling
Tu Nguyen
Maureen de Seyssel
Patricia Roze
M. Rivière
Evgeny Kharitonov
Alexei Baevski
Ewan Dunbar
Emmanuel Dupoux
SSL
126
107
0
23 Nov 2020
The Zero Resource Speech Challenge 2020: Discovering discrete subword
  and word units
The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units
Ewan Dunbar
Julien Karadayi
Mathieu Bernard
Xuan-Nga Cao
Robin Algayres
Lucas Ondel
Laurent Besacier
S. Sakti
Emmanuel Dupoux
SSL
106
61
0
12 Oct 2020
Evaluating the reliability of acoustic speech embeddings
Evaluating the reliability of acoustic speech embeddings
Robin Algayres
Mohamed Salah Zaiem
Benoît Sagot
Emmanuel Dupoux
67
29
0
27 Jul 2020
Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery
Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery
Saurabhchand Bhati
Jesús Villalba
Piotr Żelasko
Najim Dehak
SSL
58
16
0
26 Jul 2020
Multilingual Jointly Trained Acoustic and Written Word Embeddings
Multilingual Jointly Trained Acoustic and Written Word Embeddings
Yushi Hu
Shane Settle
Karen Livescu
44
23
0
24 Jun 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech
  Representations
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
285
5,801
0
20 Jun 2020
pyannote.audio: neural building blocks for speaker diarization
pyannote.audio: neural building blocks for speaker diarization
H. Bredin
Ruiqing Yin
Juan Manuel Coria
G. Gelly
Pavel Korshunov
Marvin Lavechin
D. Fustes
Hadrien Titeux
Wassim Bouaziz
Marie-Philippe Gill
229
325
0
04 Nov 2019
Truly unsupervised acoustic word embeddings using weak top-down
  constraints in encoder-decoder models
Truly unsupervised acoustic word embeddings using weak top-down constraints in encoder-decoder models
Herman Kamper
SSL
71
68
0
01 Nov 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLMSSLSSeg
1.8K
94,891
0
11 Oct 2018
Representation Learning with Contrastive Predictive Coding
Representation Learning with Contrastive Predictive Coding
Aaron van den Oord
Yazhe Li
Oriol Vinyals
DRLSSL
327
10,302
0
10 Jul 2018
Sampling strategies in Siamese Networks for unsupervised speech
  representation learning
Sampling strategies in Siamese Networks for unsupervised speech representation learning
Rachid Riad
Corentin Dancette
Julien Karadayi
Neil Zeghidour
Thomas Schatz
Emmanuel Dupoux
SSL
49
28
0
30 Apr 2018
Speech2Vec: A Sequence-to-Sequence Framework for Learning Word
  Embeddings from Speech
Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech
Yu-An Chung
James R. Glass
3DV
71
184
0
23 Mar 2018
The Zero Resource Speech Challenge 2017
The Zero Resource Speech Challenge 2017
Maarten Versteegh
Xuan-Nga Cao
Roland Thiollière
Thomas Schatz
Mathieu Bernard
A. Jansen
Xavier Anguera Miró
Emmanuel Dupoux
70
204
0
12 Dec 2017
An embedded segmental K-means model for unsupervised segmentation and
  clustering of speech
An embedded segmental K-means model for unsupervised segmentation and clustering of speech
Herman Kamper
Karen Livescu
Sharon Goldwater
54
96
0
23 Mar 2017
Discriminative Acoustic Word Embeddings: Recurrent Neural Network-Based
  Approaches
Discriminative Acoustic Word Embeddings: Recurrent Neural Network-Based Approaches
Shane Settle
Karen Livescu
61
87
0
08 Nov 2016
WaveNet: A Generative Model for Raw Audio
WaveNet: A Generative Model for Raw Audio
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
406
7,399
0
12 Sep 2016
A segmental framework for fully-unsupervised large-vocabulary speech
  recognition
A segmental framework for fully-unsupervised large-vocabulary speech recognition
Herman Kamper
A. Jansen
Sharon Goldwater
68
104
0
22 Jun 2016
Efficient Estimation of Word Representations in Vector Space
Efficient Estimation of Word Representations in Vector Space
Tomas Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
680
31,512
0
16 Jan 2013
1