Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1611.04496
Cited By
Multi-view Recurrent Neural Acoustic Word Embeddings
14 November 2016
Wanjia He
Weiran Wang
Karen Livescu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Multi-view Recurrent Neural Acoustic Word Embeddings"
50 / 50 papers shown
Title
MaLa-ASR: Multimedia-Assisted LLM-Based ASR
Guanrou Yang
Ziyang Ma
Fan Yu
Zhifu Gao
Shiliang Zhang
Xie Chen
AuLLM
44
2
0
09 Jun 2024
Relational Proxy Loss for Audio-Text based Keyword Spotting
Youngmoon Jung
Seungjin Lee
Joon-Young Yang
Jaeyoung Roh
C. Han
Hoon-Young Cho
40
0
0
08 Jun 2024
Improving Acoustic Word Embeddings through Correspondence Training of Self-supervised Speech Representations
Amit Meghanani
Thomas Hain
SSL
40
1
0
13 Mar 2024
What Do Self-Supervised Speech Models Know About Words?
Ankita Pasad
C. Chien
Shane Settle
Karen Livescu
SSL
40
26
0
30 Jun 2023
Timestamped Embedding-Matching Acoustic-to-Word CTC ASR
Woojay Jeon
21
0
0
20 Jun 2023
Matching Latent Encoding for Audio-Text based Keyword Spotting
K. Nishu
Minsik Cho
Devang Naik
9
14
0
08 Jun 2023
Improvements to Embedding-Matching Acoustic-to-Word ASR Using Multiple-Hypothesis Pronunciation-Based Embeddings
Hao Yen
Woojay Jeon
26
2
0
30 Oct 2022
AdaMS: Deep Metric Learning with Adaptive Margin and Adaptive Scale for Acoustic Word Discrimination
Myunghun Jung
Hoi-Rim Kim
13
1
0
26 Oct 2022
Spoken Term Detection and Relevance Score Estimation using Dot-Product of Pronunciation Embeddings
J. Svec
L. Smídl
J. Psutka
A. Pražák
11
6
0
21 Oct 2022
Integrating Form and Meaning: A Multi-Task Learning Model for Acoustic Word Embeddings
Badr M. Abdullah
Bernd Möbius
Dietrich Klakow
13
3
0
14 Sep 2022
Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network
Da-Rong Liu
Po-Chun Hsu
Yi-Chen Chen
Sung-Feng Huang
Shun-Po Chuang
Da-Yi Wu
Hung-yi Lee
GAN
15
7
0
29 Jul 2022
Asymmetric Proxy Loss for Multi-View Acoustic Word Embeddings
Myunghun Jung
Hoirin Kim
19
3
0
30 Mar 2022
Multilingual transfer of acoustic word embeddings improves when training on languages related to the target zero-resource language
C. Jacobs
Herman Kamper
35
10
0
24 Jun 2021
Do Acoustic Word Embeddings Capture Phonological Similarity? An Empirical Study
Badr M. Abdullah
Marius Mosbach
Iuliia Zaitova
Bernd Möbius
Dietrich Klakow
22
14
0
16 Jun 2021
Acoustic word embeddings for zero-resource languages using self-supervised contrastive learning and multilingual adaptation
C. Jacobs
Yevgen Matusevych
Herman Kamper
17
21
0
19 Mar 2021
CNN-based Spoken Term Detection and Localization without Dynamic Programming
T. Fuchs
Yael Segal
Joseph Keshet
9
5
0
07 Mar 2021
A Correspondence Variational Autoencoder for Unsupervised Acoustic Word Embeddings
Puyuan Peng
Herman Kamper
Karen Livescu
DRL
SSL
14
14
0
03 Dec 2020
Acoustic span embeddings for multilingual query-by-example search
Yushi Hu
Shane Settle
Karen Livescu
RALM
17
8
0
24 Nov 2020
Probing Acoustic Representations for Phonetic Properties
Danni Ma
Neville Ryant
M. Liberman
25
45
0
25 Oct 2020
A multi-view approach for Mandarin non-native mispronunciation verification
Zhenyu Wang
John H. L. Hansen
Yanlu Xie
14
3
0
05 Sep 2020
Acoustic Neighbor Embeddings
Woojay Jeon
12
7
0
20 Jul 2020
Whole-Word Segmental Speech Recognition with Acoustic Word Embeddings
Bowen Shi
Shane Settle
Karen Livescu
22
4
0
01 Jul 2020
Multilingual Jointly Trained Acoustic and Written Word Embeddings
Yushi Hu
Shane Settle
Karen Livescu
11
22
0
24 Jun 2020
Improved acoustic word embeddings for zero-resource languages using multilingual transfer
Herman Kamper
Yevgen Matusevych
Sharon Goldwater
15
18
0
02 Jun 2020
Effectiveness of self-supervised pre-training for speech recognition
Alexei Baevski
Michael Auli
Abdel-rahman Mohamed
SSL
27
147
0
10 Nov 2019
Additional Shared Decoder on Siamese Multi-view Encoders for Learning Acoustic Word Embeddings
Myunghun Jung
Hyungjun Lim
Jahyun Goo
Youngmoon Jung
Hoirin Kim
14
14
0
01 Oct 2019
Learning Joint Acoustic-Phonetic Word Embeddings
Mohamed El-Geish
DRL
SSL
8
2
0
01 Aug 2019
To Tune or Not To Tune? How About the Best of Both Worlds?
Ran A. Wang
Haibo Su
Chunye Wang
Kailin Ji
J. Ding
VLM
31
17
0
09 Jul 2019
Multimodal and Multi-view Models for Emotion Recognition
Gustavo Aguilar
Viktor Rozgic
Weiran Wang
Chao Wang
16
29
0
24 Jun 2019
Unsupervised End-to-End Learning of Discrete Linguistic Units for Voice Conversion
Andy T. Liu
Po-Chun Hsu
Hung-yi Lee
SSL
12
28
0
28 May 2019
On the Contributions of Visual and Textual Supervision in Low-Resource Semantic Speech Retrieval
Ankita Pasad
Bowen Shi
Herman Kamper
Karen Livescu
11
12
0
24 Apr 2019
Acoustically Grounded Word Embeddings for Improved Acoustics-to-Word Speech Recognition
Shane Settle
Kartik Audhkhasi
Karen Livescu
M. Picheny
19
34
0
29 Mar 2019
Modeling Acoustic-Prosodic Cues for Word Importance Prediction in Spoken Dialogues
Sushant Kafle
Cecilia Ovesdotter Alm
Matt Huenerfauth
11
3
0
28 Mar 2019
Learning Embodied Semantics via Music and Dance Semiotic Correlations
F. Raposo
David Martins de Matos
Ricardo Ribeiro
16
7
0
25 Mar 2019
Learned In Speech Recognition: Contextual Acoustic Word Embeddings
Shruti Palaskar
Vikas Raunak
Florian Metze
14
17
0
18 Feb 2019
Learning from Multiview Correlations in Open-Domain Videos
Nils Holzenberger
Shruti Palaskar
Pranava Madhyastha
Florian Metze
R. Arora
SSL
6
11
0
21 Nov 2018
Confusion2Vec: Towards Enriching Vector Space Word Representations with Representational Ambiguities
K. K. Thekumparampil
Zinan Lin
12
23
0
08 Nov 2018
Improved Audio Embeddings by Adjacency-Based Clustering with Applications in Spoken Term Detection
Sung-Feng Huang
Yi-Chen Chen
Hung-yi Lee
Lin-Shan Lee
AI4TS
19
5
0
07 Nov 2018
Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval
Yi-Chen Chen
Sung-Feng Huang
Chia-Hao Shen
Hung-yi Lee
Lin-Shan Lee
46
37
0
21 Jul 2018
Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces
Yu-An Chung
W. Weng
S. Tong
James R. Glass
17
99
0
18 May 2018
Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech
Yu-An Chung
James R. Glass
3DV
32
184
0
23 Mar 2018
Acoustic feature learning using cross-domain articulatory measurements
Qingming Tang
Weiran Wang
Karen Livescu
20
2
0
19 Mar 2018
Deep Cross-Modal Correlation Learning for Audio and Lyrics in Music Retrieval
Yi Yu
Suhua Tang
F. Raposo
Lei Chen
19
110
0
24 Nov 2017
Learning Word Embeddings from Speech
Yu-An Chung
James R. Glass
SSL
26
19
0
05 Nov 2017
Query-by-example Spoken Term Detection using Attention-based Multi-hop Networks
Chia-Wei Ao
Hung-yi Lee
9
21
0
01 Sep 2017
Language Transfer of Audio Word2Vec: Learning Audio Segment Representations without Target Language Data
Chia-Hao Shen
Janet Y. Sung
Hung-yi Lee
19
5
0
19 Jul 2017
Query-by-Example Search with Discriminative Neural Acoustic Word Embeddings
Shane Settle
Keith D. Levin
Herman Kamper
Karen Livescu
12
85
0
12 Jun 2017
An embedded segmental K-means model for unsupervised segmentation and clustering of speech
Herman Kamper
Karen Livescu
Sharon Goldwater
11
95
0
23 Mar 2017
Sound-Word2Vec: Learning Word Representations Grounded in Sounds
Ashwin K. Vijayakumar
Ramakrishna Vedantam
Devi Parikh
22
22
0
06 Mar 2017
End-to-End ASR-free Keyword Search from Speech
Kartik Audhkhasi
Andrew Rosenberg
A. Sethy
Bhuvana Ramabhadran
Brian Kingsbury
16
111
0
13 Jan 2017
1