ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.00403
  4. Cited By
Truly unsupervised acoustic word embeddings using weak top-down
  constraints in encoder-decoder models

Truly unsupervised acoustic word embeddings using weak top-down constraints in encoder-decoder models

1 November 2018
Herman Kamper
    SSL
ArXivPDFHTML

Papers citing "Truly unsupervised acoustic word embeddings using weak top-down constraints in encoder-decoder models"

50 / 50 papers shown
Title
LASER: Learning by Aligning Self-supervised Representations of Speech
  for Improving Content-related Tasks
LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks
Amit Meghanani
Thomas Hain
44
1
0
13 Jun 2024
Visually Grounded Speech Models have a Mutual Exclusivity Bias
Visually Grounded Speech Models have a Mutual Exclusivity Bias
Leanne Nortje
Dan Oneaţă
Yevgen Matusevych
Herman Kamper
SSL
47
0
0
20 Mar 2024
Improving Acoustic Word Embeddings through Correspondence Training of
  Self-supervised Speech Representations
Improving Acoustic Word Embeddings through Correspondence Training of Self-supervised Speech Representations
Amit Meghanani
Thomas Hain
SSL
40
1
0
13 Mar 2024
SCORE: Self-supervised Correspondence Fine-tuning for Improved Content
  Representations
SCORE: Self-supervised Correspondence Fine-tuning for Improved Content Representations
Amit Meghanani
Thomas Hain
41
3
0
10 Mar 2024
SD-HuBERT: Sentence-Level Self-Distillation Induces Syllabic Organization in HuBERT
SD-HuBERT: Sentence-Level Self-Distillation Induces Syllabic Organization in HuBERT
Cheol Jun Cho
Abdelrahman Mohamed
Shang-Wen Li
Alan W. Black
Gopala K. Anumanchipalli
39
8
0
16 Oct 2023
XLS-R fine-tuning on noisy word boundaries for unsupervised speech
  segmentation into words
XLS-R fine-tuning on noisy word boundaries for unsupervised speech segmentation into words
Robin Algayres
Pablo Diego-Simon
Benoît Sagot
Emmanuel Dupoux
44
1
0
08 Oct 2023
Generative Spoken Language Model based on continuous word-sized audio
  tokens
Generative Spoken Language Model based on continuous word-sized audio tokens
Robin Algayres
Yossi Adi
Tu Nguyen
Jade Copet
Gabriel Synnaeve
Benoît Sagot
Emmanuel Dupoux
AuLLM
46
13
0
08 Oct 2023
Leveraging multilingual transfer for unsupervised semantic acoustic word
  embeddings
Leveraging multilingual transfer for unsupervised semantic acoustic word embeddings
C. Jacobs
Herman Kamper
32
1
0
05 Jul 2023
Visually grounded few-shot word learning in low-resource settings
Visually grounded few-shot word learning in low-resource settings
Leanne Nortje
Dan Oneaţă
Herman Kamper
VLM
23
4
0
20 Jun 2023
Acoustic Word Embeddings for Untranscribed Target Languages with
  Continued Pretraining and Learned Pooling
Acoustic Word Embeddings for Untranscribed Target Languages with Continued Pretraining and Learned Pooling
Ramon Sanabria
Ondˇrej Klejch
Hao Tang
Sharon Goldwater
30
1
0
03 Jun 2023
Towards hate speech detection in low-resource languages: Comparing ASR
  to acoustic word embeddings on Wolof and Swahili
Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili
C. Jacobs
Nathanaël Carraz Rakotonirina
E. Chimoto
Bruce A. Bassett
Herman Kamper
27
5
0
01 Jun 2023
Visually grounded few-shot word acquisition with fewer shots
Visually grounded few-shot word acquisition with fewer shots
Leanne Nortje
Benjamin van Niekerk
Herman Kamper
30
1
0
25 May 2023
Analyzing the Representational Geometry of Acoustic Word Embeddings
Analyzing the Representational Geometry of Acoustic Word Embeddings
Badr M. Abdullah
Dietrich Klakow
21
3
0
08 Jan 2023
Analyzing Acoustic Word Embeddings from Pre-trained Self-supervised
  Speech Models
Analyzing Acoustic Word Embeddings from Pre-trained Self-supervised Speech Models
Ramon Sanabria
Hao Tang
Sharon Goldwater
SSL
40
19
0
28 Oct 2022
Bootstrapping meaning through listening: Unsupervised learning of spoken
  sentence embeddings
Bootstrapping meaning through listening: Unsupervised learning of spoken sentence embeddings
Jian Zhu
Zuoyu Tian
Yadong Liu
Cong Zhang
Chia-wen Lo
SSL
34
2
0
23 Oct 2022
Integrating Form and Meaning: A Multi-Task Learning Model for Acoustic
  Word Embeddings
Integrating Form and Meaning: A Multi-Task Learning Model for Acoustic Word Embeddings
Badr M. Abdullah
Bernd Möbius
Dietrich Klakow
13
3
0
14 Sep 2022
Learning Phone Recognition from Unpaired Audio and Phone Sequences Based
  on Generative Adversarial Network
Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network
Da-Rong Liu
Po-Chun Hsu
Yi-Chen Chen
Sung-Feng Huang
Shun-Po Chuang
Da-Yi Wu
Hung-yi Lee
GAN
31
7
0
29 Jul 2022
DP-Parse: Finding Word Boundaries from Raw Speech with an Instance
  Lexicon
DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon
Robin Algayres
Tristan Ricoul
Julien Karadayi
Hugo Laurenccon
Salah Zaiem
Abdel-rahman Mohamed
Benoît Sagot
Emmanuel Dupoux
14
13
0
22 Jun 2022
Self-Supervised Speech Representation Learning: A Review
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL
AI4TS
137
354
0
21 May 2022
A Brief Overview of Unsupervised Neural Speech Representation Learning
A Brief Overview of Unsupervised Neural Speech Representation Learning
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
Lars Maaløe
Christian Igel
BDL
AI4TS
SSL
19
11
0
01 Mar 2022
How Familiar Does That Sound? Cross-Lingual Representational Similarity
  Analysis of Acoustic Word Embeddings
How Familiar Does That Sound? Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings
Badr M. Abdullah
Iuliia Zaitova
T. Avgustinova
Bernd Möbius
Dietrich Klakow
37
10
0
21 Sep 2021
Multilingual transfer of acoustic word embeddings improves when training
  on languages related to the target zero-resource language
Multilingual transfer of acoustic word embeddings improves when training on languages related to the target zero-resource language
C. Jacobs
Herman Kamper
35
10
0
24 Jun 2021
Do Acoustic Word Embeddings Capture Phonological Similarity? An
  Empirical Study
Do Acoustic Word Embeddings Capture Phonological Similarity? An Empirical Study
Badr M. Abdullah
Marius Mosbach
Iuliia Zaitova
Bernd Möbius
Dietrich Klakow
33
14
0
16 Jun 2021
Unsupervised Automatic Speech Recognition: A Review
Unsupervised Automatic Speech Recognition: A Review
Hanan Aldarmaki
Asad Ullah
Nazar Zaki
VLM
SSL
39
57
0
09 Jun 2021
Interpreting intermediate convolutional layers of generative CNNs
  trained on waveforms
Interpreting intermediate convolutional layers of generative CNNs trained on waveforms
Gašper Beguš
Alan Zhou
30
7
0
19 Apr 2021
Acoustic word embeddings for zero-resource languages using
  self-supervised contrastive learning and multilingual adaptation
Acoustic word embeddings for zero-resource languages using self-supervised contrastive learning and multilingual adaptation
C. Jacobs
Yevgen Matusevych
Herman Kamper
17
21
0
19 Mar 2021
Double Articulation Analyzer with Prosody for Unsupervised Word and
  Phoneme Discovery
Double Articulation Analyzer with Prosody for Unsupervised Word and Phoneme Discovery
Yasuaki Okuda
Ryo Ozaki
T. Taniguchi
28
5
0
15 Mar 2021
A phonetic model of non-native spoken word processing
A phonetic model of non-native spoken word processing
Yevgen Matusevych
Herman Kamper
Thomas Schatz
Naomi H Feldman
Sharon Goldwater
19
7
0
27 Jan 2021
AudioViewer: Learning to Visualize Sounds
AudioViewer: Learning to Visualize Sounds
Chunjin Song
Yuchi Zhang
Willis Peng
Parmis Mohaghegh
Bastian Wandt
Helge Rhodin
30
1
0
22 Dec 2020
A comparison of self-supervised speech representations as input features
  for unsupervised acoustic word embeddings
A comparison of self-supervised speech representations as input features for unsupervised acoustic word embeddings
Lisa van Staden
Herman Kamper
SSL
31
16
0
14 Dec 2020
Direct multimodal few-shot learning of speech and images
Direct multimodal few-shot learning of speech and images
Leanne Nortje
Herman Kamper
SSL
27
10
0
10 Dec 2020
A Correspondence Variational Autoencoder for Unsupervised Acoustic Word
  Embeddings
A Correspondence Variational Autoencoder for Unsupervised Acoustic Word Embeddings
Puyuan Peng
Herman Kamper
Karen Livescu
DRL
SSL
14
14
0
03 Dec 2020
Acoustic span embeddings for multilingual query-by-example search
Acoustic span embeddings for multilingual query-by-example search
Yushi Hu
Shane Settle
Karen Livescu
RALM
33
8
0
24 Nov 2020
STEPs-RL: Speech-Text Entanglement for Phonetically Sound Representation
  Learning
STEPs-RL: Speech-Text Entanglement for Phonetically Sound Representation Learning
Prakamya Mishra
18
0
0
23 Nov 2020
Towards Semi-Supervised Semantics Understanding from Speech
Towards Semi-Supervised Semantics Understanding from Speech
Cheng-I Jeff Lai
Jin Cao
S. Bodapati
Shang-Wen Li
SSL
22
7
0
11 Nov 2020
Unsupervised vs. transfer learning for multimodal one-shot matching of
  speech and images
Unsupervised vs. transfer learning for multimodal one-shot matching of speech and images
Leanne Nortje
Herman Kamper
SSL
6
9
0
14 Aug 2020
Automatic Detection of Phonological Errors in Child Speech Using Siamese
  Recurrent Autoencoder
Automatic Detection of Phonological Errors in Child Speech Using Siamese Recurrent Autoencoder
Si-Ioi Ng
Tan Lee
9
7
0
07 Aug 2020
Evaluating computational models of infant phonetic learning across
  languages
Evaluating computational models of infant phonetic learning across languages
Yevgen Matusevych
Thomas Schatz
Herman Kamper
Naomi H Feldman
Sharon Goldwater
24
14
0
06 Aug 2020
Evaluating the reliability of acoustic speech embeddings
Evaluating the reliability of acoustic speech embeddings
Robin Algayres
Mohamed Salah Zaiem
Benoît Sagot
Emmanuel Dupoux
38
29
0
27 Jul 2020
Multilingual Jointly Trained Acoustic and Written Word Embeddings
Multilingual Jointly Trained Acoustic and Written Word Embeddings
Yushi Hu
Shane Settle
Karen Livescu
21
22
0
24 Jun 2020
CiwGAN and fiwGAN: Encoding information in acoustic data to model
  lexical learning with Generative Adversarial Networks
CiwGAN and fiwGAN: Encoding information in acoustic data to model lexical learning with Generative Adversarial Networks
Gašper Beguš
GAN
6
34
0
04 Jun 2020
Improved acoustic word embeddings for zero-resource languages using
  multilingual transfer
Improved acoustic word embeddings for zero-resource languages using multilingual transfer
Herman Kamper
Yevgen Matusevych
Sharon Goldwater
20
18
0
02 Jun 2020
Bayesian Subspace HMM for the Zerospeech 2020 Challenge
Bayesian Subspace HMM for the Zerospeech 2020 Challenge
Bolaji Yusuf
Lucas Ondel
BDL
21
0
0
19 May 2020
Improved Speech Representations with Multi-Target Autoregressive
  Predictive Coding
Improved Speech Representations with Multi-Target Autoregressive Predictive Coding
Yu-An Chung
James R. Glass
SSL
20
56
0
11 Apr 2020
Analyzing autoencoder-based acoustic word embeddings
Analyzing autoencoder-based acoustic word embeddings
Yevgen Matusevych
Herman Kamper
Sharon Goldwater
30
12
0
03 Apr 2020
Unsupervised feature learning for speech using correspondence and
  Siamese networks
Unsupervised feature learning for speech using correspondence and Siamese networks
Petri-Johan Last
H. Engelbrecht
Herman Kamper
SSL
15
18
0
28 Mar 2020
Multilingual acoustic word embedding models for processing zero-resource
  languages
Multilingual acoustic word embedding models for processing zero-resource languages
Herman Kamper
Yevgen Matusevych
Sharon Goldwater
31
24
0
06 Feb 2020
Generative Pre-Training for Speech with Autoregressive Predictive Coding
Generative Pre-Training for Speech with Autoregressive Predictive Coding
Yu-An Chung
James R. Glass
SSL
29
173
0
23 Oct 2019
Additional Shared Decoder on Siamese Multi-view Encoders for Learning
  Acoustic Word Embeddings
Additional Shared Decoder on Siamese Multi-view Encoders for Learning Acoustic Word Embeddings
Myunghun Jung
Hyungjun Lim
Jahyun Goo
Youngmoon Jung
Hoirin Kim
22
14
0
01 Oct 2019
Phonetic-and-Semantic Embedding of Spoken Words with Applications in
  Spoken Content Retrieval
Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval
Yi-Chen Chen
Sung-Feng Huang
Chia-Hao Shen
Hung-yi Lee
Lin-Shan Lee
46
37
0
21 Jul 2018
1