Truly unsupervised acoustic word embeddings using weak top-down constraints in encoder-decoder models

1 November 2018

Papers citing "Truly unsupervised acoustic word embeddings using weak top-down constraints in encoder-decoder models"

Title
LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks Amit Meghanani Thomas Hain 44 1 0 13 Jun 2024
Visually Grounded Speech Models have a Mutual Exclusivity Bias Leanne Nortje Dan Oneaţă Yevgen Matusevych Herman Kamper SSL 47 0 0 20 Mar 2024
Improving Acoustic Word Embeddings through Correspondence Training of Self-supervised Speech Representations Amit Meghanani Thomas Hain SSL 40 1 0 13 Mar 2024
SCORE: Self-supervised Correspondence Fine-tuning for Improved Content Representations Amit Meghanani Thomas Hain 41 3 0 10 Mar 2024
SD-HuBERT: Sentence-Level Self-Distillation Induces Syllabic Organization in HuBERT Cheol Jun Cho Abdelrahman Mohamed Shang-Wen Li Alan W. Black Gopala K. Anumanchipalli 39 8 0 16 Oct 2023
XLS-R fine-tuning on noisy word boundaries for unsupervised speech segmentation into words Robin Algayres Pablo Diego-Simon Benoît Sagot Emmanuel Dupoux 44 1 0 08 Oct 2023
Generative Spoken Language Model based on continuous word-sized audio tokens Robin Algayres Yossi Adi Tu Nguyen Jade Copet Gabriel Synnaeve Benoît Sagot Emmanuel Dupoux AuLLM 46 13 0 08 Oct 2023
Leveraging multilingual transfer for unsupervised semantic acoustic word embeddings C. Jacobs Herman Kamper 32 1 0 05 Jul 2023
Visually grounded few-shot word learning in low-resource settings Leanne Nortje Dan Oneaţă Herman Kamper VLM 23 4 0 20 Jun 2023
Acoustic Word Embeddings for Untranscribed Target Languages with Continued Pretraining and Learned Pooling Ramon Sanabria Ondřej Klejch Hao Tang Sharon Goldwater 30 1 0 03 Jun 2023
Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili C. Jacobs Nathanaël Carraz Rakotonirina E. Chimoto Bruce A. Bassett Herman Kamper 27 5 0 01 Jun 2023
Visually grounded few-shot word acquisition with fewer shots Leanne Nortje Benjamin van Niekerk Herman Kamper 30 1 0 25 May 2023
Analyzing the Representational Geometry of Acoustic Word Embeddings Badr M. Abdullah Dietrich Klakow 21 3 0 08 Jan 2023
Analyzing Acoustic Word Embeddings from Pre-trained Self-supervised Speech Models Ramon Sanabria Hao Tang Sharon Goldwater SSL 40 19 0 28 Oct 2022
Bootstrapping meaning through listening: Unsupervised learning of spoken sentence embeddings Jian Zhu Zuoyu Tian Yadong Liu Cong Zhang Chia-wen Lo SSL 34 2 0 23 Oct 2022
Integrating Form and Meaning: A Multi-Task Learning Model for Acoustic Word Embeddings Badr M. Abdullah Bernd Möbius Dietrich Klakow 13 3 0 14 Sep 2022
Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network Da-Rong Liu Po-Chun Hsu Yi-Chen Chen Sung-Feng Huang Shun-Po Chuang Da-Yi Wu Hung-yi Lee GAN 31 7 0 29 Jul 2022
DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon Robin Algayres Tristan Ricoul Julien Karadayi Hugo Laurençon Salah Zaiem Abdel-rahman Mohamed Benoît Sagot Emmanuel Dupoux 14 13 0 22 Jun 2022
Self-Supervised Speech Representation Learning: A Review Abdel-rahman Mohamed Hung-yi Lee Lasse Borgholt Jakob Drachmann Havtorn Joakim Edin ... Shang-Wen Li Karen Livescu Lars Maaløe Tara N. Sainath Shinji Watanabe SSL AI4TS 137 354 0 21 May 2022
A Brief Overview of Unsupervised Neural Speech Representation Learning Lasse Borgholt Jakob Drachmann Havtorn Joakim Edin Lars Maaløe Christian Igel BDL AI4TS SSL 19 11 0 01 Mar 2022
How Familiar Does That Sound? Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings Badr M. Abdullah Iuliia Zaitova T. Avgustinova Bernd Möbius Dietrich Klakow 37 10 0 21 Sep 2021
Multilingual transfer of acoustic word embeddings improves when training on languages related to the target zero-resource language C. Jacobs Herman Kamper 35 10 0 24 Jun 2021
Do Acoustic Word Embeddings Capture Phonological Similarity? An Empirical Study Badr M. Abdullah Marius Mosbach Iuliia Zaitova Bernd Möbius Dietrich Klakow 33 14 0 16 Jun 2021
Unsupervised Automatic Speech Recognition: A Review Hanan Aldarmaki Asad Ullah Nazar Zaki VLM SSL 39 57 0 09 Jun 2021
Interpreting intermediate convolutional layers of generative CNNs trained on waveforms Gašper Beguš Alan Zhou 30 7 0 19 Apr 2021
Acoustic word embeddings for zero-resource languages using self-supervised contrastive learning and multilingual adaptation C. Jacobs Yevgen Matusevych Herman Kamper 17 21 0 19 Mar 2021
Double Articulation Analyzer with Prosody for Unsupervised Word and Phoneme Discovery Yasuaki Okuda Ryo Ozaki T. Taniguchi 28 5 0 15 Mar 2021
A phonetic model of non-native spoken word processing Yevgen Matusevych Herman Kamper Thomas Schatz Naomi H Feldman Sharon Goldwater 19 7 0 27 Jan 2021
AudioViewer: Learning to Visualize Sounds Chunjin Song Yuchi Zhang Willis Peng Parmis Mohaghegh Bastian Wandt Helge Rhodin 30 1 0 22 Dec 2020
A comparison of self-supervised speech representations as input features for unsupervised acoustic word embeddings Lisa van Staden Herman Kamper SSL 31 16 0 14 Dec 2020
Direct multimodal few-shot learning of speech and images Leanne Nortje Herman Kamper SSL 27 10 0 10 Dec 2020
A Correspondence Variational Autoencoder for Unsupervised Acoustic Word Embeddings Puyuan Peng Herman Kamper Karen Livescu DRL SSL 14 14 0 03 Dec 2020
Acoustic span embeddings for multilingual query-by-example search Yushi Hu Shane Settle Karen Livescu RALM 33 8 0 24 Nov 2020
STEPs-RL: Speech-Text Entanglement for Phonetically Sound Representation Learning Prakamya Mishra 18 0 0 23 Nov 2020
Towards Semi-Supervised Semantics Understanding from Speech Cheng-I Jeff Lai Jin Cao S. Bodapati Shang-Wen Li SSL 22 7 0 11 Nov 2020
Unsupervised vs. transfer learning for multimodal one-shot matching of speech and images Leanne Nortje Herman Kamper SSL 6 9 0 14 Aug 2020
Automatic Detection of Phonological Errors in Child Speech Using Siamese Recurrent Autoencoder Si-Ioi Ng Tan Lee 9 7 0 07 Aug 2020
Evaluating computational models of infant phonetic learning across languages Yevgen Matusevych Thomas Schatz Herman Kamper Naomi H Feldman Sharon Goldwater 24 14 0 06 Aug 2020
Evaluating the reliability of acoustic speech embeddings Robin Algayres Mohamed Salah Zaiem Benoît Sagot Emmanuel Dupoux 38 29 0 27 Jul 2020
Multilingual Jointly Trained Acoustic and Written Word Embeddings Yushi Hu Shane Settle Karen Livescu 21 22 0 24 Jun 2020
CiwGAN and fiwGAN: Encoding information in acoustic data to model lexical learning with Generative Adversarial Networks Gašper Beguš GAN 6 34 0 04 Jun 2020
Improved acoustic word embeddings for zero-resource languages using multilingual transfer Herman Kamper Yevgen Matusevych Sharon Goldwater 20 18 0 02 Jun 2020
Bayesian Subspace HMM for the Zerospeech 2020 Challenge Bolaji Yusuf Lucas Ondel BDL 21 0 0 19 May 2020
Improved Speech Representations with Multi-Target Autoregressive Predictive Coding Yu-An Chung James R. Glass SSL 20 56 0 11 Apr 2020
Analyzing autoencoder-based acoustic word embeddings Yevgen Matusevych Herman Kamper Sharon Goldwater 30 12 0 03 Apr 2020
Unsupervised feature learning for speech using correspondence and Siamese networks Petri-Johan Last H. Engelbrecht Herman Kamper SSL 15 18 0 28 Mar 2020
Multilingual acoustic word embedding models for processing zero-resource languages Herman Kamper Yevgen Matusevych Sharon Goldwater 31 24 0 06 Feb 2020
Generative Pre-Training for Speech with Autoregressive Predictive Coding Yu-An Chung James R. Glass SSL 29 173 0 23 Oct 2019
Additional Shared Decoder on Siamese Multi-view Encoders for Learning Acoustic Word Embeddings Myunghun Jung Hyungjun Lim Jahyun Goo Youngmoon Jung Hoirin Kim 22 14 0 01 Oct 2019
Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval Yi-Chen Chen Sung-Feng Huang Chia-Hao Shen Hung-yi Lee Lin-Shan Lee 46 37 0 21 Jul 2018