ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.04496
  4. Cited By
Multi-view Recurrent Neural Acoustic Word Embeddings

Multi-view Recurrent Neural Acoustic Word Embeddings

14 November 2016
Wanjia He
Weiran Wang
Karen Livescu
ArXivPDFHTML

Papers citing "Multi-view Recurrent Neural Acoustic Word Embeddings"

50 / 50 papers shown
Title
MaLa-ASR: Multimedia-Assisted LLM-Based ASR
MaLa-ASR: Multimedia-Assisted LLM-Based ASR
Guanrou Yang
Ziyang Ma
Fan Yu
Zhifu Gao
Shiliang Zhang
Xie Chen
AuLLM
44
2
0
09 Jun 2024
Relational Proxy Loss for Audio-Text based Keyword Spotting
Relational Proxy Loss for Audio-Text based Keyword Spotting
Youngmoon Jung
Seungjin Lee
Joon-Young Yang
Jaeyoung Roh
C. Han
Hoon-Young Cho
40
0
0
08 Jun 2024
Improving Acoustic Word Embeddings through Correspondence Training of
  Self-supervised Speech Representations
Improving Acoustic Word Embeddings through Correspondence Training of Self-supervised Speech Representations
Amit Meghanani
Thomas Hain
SSL
40
1
0
13 Mar 2024
What Do Self-Supervised Speech Models Know About Words?
What Do Self-Supervised Speech Models Know About Words?
Ankita Pasad
C. Chien
Shane Settle
Karen Livescu
SSL
40
26
0
30 Jun 2023
Timestamped Embedding-Matching Acoustic-to-Word CTC ASR
Timestamped Embedding-Matching Acoustic-to-Word CTC ASR
Woojay Jeon
21
0
0
20 Jun 2023
Matching Latent Encoding for Audio-Text based Keyword Spotting
Matching Latent Encoding for Audio-Text based Keyword Spotting
K. Nishu
Minsik Cho
Devang Naik
9
14
0
08 Jun 2023
Improvements to Embedding-Matching Acoustic-to-Word ASR Using
  Multiple-Hypothesis Pronunciation-Based Embeddings
Improvements to Embedding-Matching Acoustic-to-Word ASR Using Multiple-Hypothesis Pronunciation-Based Embeddings
Hao Yen
Woojay Jeon
26
2
0
30 Oct 2022
AdaMS: Deep Metric Learning with Adaptive Margin and Adaptive Scale for
  Acoustic Word Discrimination
AdaMS: Deep Metric Learning with Adaptive Margin and Adaptive Scale for Acoustic Word Discrimination
Myunghun Jung
Hoi-Rim Kim
15
1
0
26 Oct 2022
Spoken Term Detection and Relevance Score Estimation using Dot-Product
  of Pronunciation Embeddings
Spoken Term Detection and Relevance Score Estimation using Dot-Product of Pronunciation Embeddings
J. Svec
L. Smídl
J. Psutka
A. Pražák
11
6
0
21 Oct 2022
Integrating Form and Meaning: A Multi-Task Learning Model for Acoustic
  Word Embeddings
Integrating Form and Meaning: A Multi-Task Learning Model for Acoustic Word Embeddings
Badr M. Abdullah
Bernd Möbius
Dietrich Klakow
13
3
0
14 Sep 2022
Learning Phone Recognition from Unpaired Audio and Phone Sequences Based
  on Generative Adversarial Network
Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network
Da-Rong Liu
Po-Chun Hsu
Yi-Chen Chen
Sung-Feng Huang
Shun-Po Chuang
Da-Yi Wu
Hung-yi Lee
GAN
17
7
0
29 Jul 2022
Asymmetric Proxy Loss for Multi-View Acoustic Word Embeddings
Asymmetric Proxy Loss for Multi-View Acoustic Word Embeddings
Myunghun Jung
Hoirin Kim
19
3
0
30 Mar 2022
Multilingual transfer of acoustic word embeddings improves when training
  on languages related to the target zero-resource language
Multilingual transfer of acoustic word embeddings improves when training on languages related to the target zero-resource language
C. Jacobs
Herman Kamper
35
10
0
24 Jun 2021
Do Acoustic Word Embeddings Capture Phonological Similarity? An
  Empirical Study
Do Acoustic Word Embeddings Capture Phonological Similarity? An Empirical Study
Badr M. Abdullah
Marius Mosbach
Iuliia Zaitova
Bernd Möbius
Dietrich Klakow
25
14
0
16 Jun 2021
Acoustic word embeddings for zero-resource languages using
  self-supervised contrastive learning and multilingual adaptation
Acoustic word embeddings for zero-resource languages using self-supervised contrastive learning and multilingual adaptation
C. Jacobs
Yevgen Matusevych
Herman Kamper
17
21
0
19 Mar 2021
CNN-based Spoken Term Detection and Localization without Dynamic
  Programming
CNN-based Spoken Term Detection and Localization without Dynamic Programming
T. Fuchs
Yael Segal
Joseph Keshet
11
5
0
07 Mar 2021
A Correspondence Variational Autoencoder for Unsupervised Acoustic Word
  Embeddings
A Correspondence Variational Autoencoder for Unsupervised Acoustic Word Embeddings
Puyuan Peng
Herman Kamper
Karen Livescu
DRL
SSL
14
14
0
03 Dec 2020
Acoustic span embeddings for multilingual query-by-example search
Acoustic span embeddings for multilingual query-by-example search
Yushi Hu
Shane Settle
Karen Livescu
RALM
19
8
0
24 Nov 2020
Probing Acoustic Representations for Phonetic Properties
Probing Acoustic Representations for Phonetic Properties
Danni Ma
Neville Ryant
M. Liberman
25
45
0
25 Oct 2020
A multi-view approach for Mandarin non-native mispronunciation
  verification
A multi-view approach for Mandarin non-native mispronunciation verification
Zhenyu Wang
John H. L. Hansen
Yanlu Xie
14
3
0
05 Sep 2020
Acoustic Neighbor Embeddings
Acoustic Neighbor Embeddings
Woojay Jeon
14
7
0
20 Jul 2020
Whole-Word Segmental Speech Recognition with Acoustic Word Embeddings
Whole-Word Segmental Speech Recognition with Acoustic Word Embeddings
Bowen Shi
Shane Settle
Karen Livescu
22
4
0
01 Jul 2020
Multilingual Jointly Trained Acoustic and Written Word Embeddings
Multilingual Jointly Trained Acoustic and Written Word Embeddings
Yushi Hu
Shane Settle
Karen Livescu
13
22
0
24 Jun 2020
Improved acoustic word embeddings for zero-resource languages using
  multilingual transfer
Improved acoustic word embeddings for zero-resource languages using multilingual transfer
Herman Kamper
Yevgen Matusevych
Sharon Goldwater
15
18
0
02 Jun 2020
Effectiveness of self-supervised pre-training for speech recognition
Effectiveness of self-supervised pre-training for speech recognition
Alexei Baevski
Michael Auli
Abdel-rahman Mohamed
SSL
27
147
0
10 Nov 2019
Additional Shared Decoder on Siamese Multi-view Encoders for Learning
  Acoustic Word Embeddings
Additional Shared Decoder on Siamese Multi-view Encoders for Learning Acoustic Word Embeddings
Myunghun Jung
Hyungjun Lim
Jahyun Goo
Youngmoon Jung
Hoirin Kim
14
14
0
01 Oct 2019
Learning Joint Acoustic-Phonetic Word Embeddings
Learning Joint Acoustic-Phonetic Word Embeddings
Mohamed El-Geish
DRL
SSL
8
2
0
01 Aug 2019
To Tune or Not To Tune? How About the Best of Both Worlds?
To Tune or Not To Tune? How About the Best of Both Worlds?
Ran A. Wang
Haibo Su
Chunye Wang
Kailin Ji
J. Ding
VLM
31
17
0
09 Jul 2019
Multimodal and Multi-view Models for Emotion Recognition
Multimodal and Multi-view Models for Emotion Recognition
Gustavo Aguilar
Viktor Rozgic
Weiran Wang
Chao Wang
16
29
0
24 Jun 2019
Unsupervised End-to-End Learning of Discrete Linguistic Units for Voice
  Conversion
Unsupervised End-to-End Learning of Discrete Linguistic Units for Voice Conversion
Andy T. Liu
Po-Chun Hsu
Hung-yi Lee
SSL
12
28
0
28 May 2019
On the Contributions of Visual and Textual Supervision in Low-Resource
  Semantic Speech Retrieval
On the Contributions of Visual and Textual Supervision in Low-Resource Semantic Speech Retrieval
Ankita Pasad
Bowen Shi
Herman Kamper
Karen Livescu
11
12
0
24 Apr 2019
Acoustically Grounded Word Embeddings for Improved Acoustics-to-Word
  Speech Recognition
Acoustically Grounded Word Embeddings for Improved Acoustics-to-Word Speech Recognition
Shane Settle
Kartik Audhkhasi
Karen Livescu
M. Picheny
19
34
0
29 Mar 2019
Modeling Acoustic-Prosodic Cues for Word Importance Prediction in Spoken
  Dialogues
Modeling Acoustic-Prosodic Cues for Word Importance Prediction in Spoken Dialogues
Sushant Kafle
Cecilia Ovesdotter Alm
Matt Huenerfauth
13
3
0
28 Mar 2019
Learning Embodied Semantics via Music and Dance Semiotic Correlations
Learning Embodied Semantics via Music and Dance Semiotic Correlations
F. Raposo
David Martins de Matos
Ricardo Ribeiro
16
7
0
25 Mar 2019
Learned In Speech Recognition: Contextual Acoustic Word Embeddings
Learned In Speech Recognition: Contextual Acoustic Word Embeddings
Shruti Palaskar
Vikas Raunak
Florian Metze
16
17
0
18 Feb 2019
Learning from Multiview Correlations in Open-Domain Videos
Learning from Multiview Correlations in Open-Domain Videos
Nils Holzenberger
Shruti Palaskar
Pranava Madhyastha
Florian Metze
R. Arora
SSL
6
11
0
21 Nov 2018
Confusion2Vec: Towards Enriching Vector Space Word Representations with
  Representational Ambiguities
Confusion2Vec: Towards Enriching Vector Space Word Representations with Representational Ambiguities
K. K. Thekumparampil
Zinan Lin
14
23
0
08 Nov 2018
Improved Audio Embeddings by Adjacency-Based Clustering with
  Applications in Spoken Term Detection
Improved Audio Embeddings by Adjacency-Based Clustering with Applications in Spoken Term Detection
Sung-Feng Huang
Yi-Chen Chen
Hung-yi Lee
Lin-Shan Lee
AI4TS
19
5
0
07 Nov 2018
Phonetic-and-Semantic Embedding of Spoken Words with Applications in
  Spoken Content Retrieval
Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval
Yi-Chen Chen
Sung-Feng Huang
Chia-Hao Shen
Hung-yi Lee
Lin-Shan Lee
46
37
0
21 Jul 2018
Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces
Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces
Yu-An Chung
W. Weng
S. Tong
James R. Glass
17
99
0
18 May 2018
Speech2Vec: A Sequence-to-Sequence Framework for Learning Word
  Embeddings from Speech
Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech
Yu-An Chung
James R. Glass
3DV
32
184
0
23 Mar 2018
Acoustic feature learning using cross-domain articulatory measurements
Acoustic feature learning using cross-domain articulatory measurements
Qingming Tang
Weiran Wang
Karen Livescu
22
2
0
19 Mar 2018
Deep Cross-Modal Correlation Learning for Audio and Lyrics in Music
  Retrieval
Deep Cross-Modal Correlation Learning for Audio and Lyrics in Music Retrieval
Yi Yu
Suhua Tang
F. Raposo
Lei Chen
19
110
0
24 Nov 2017
Learning Word Embeddings from Speech
Learning Word Embeddings from Speech
Yu-An Chung
James R. Glass
SSL
28
19
0
05 Nov 2017
Query-by-example Spoken Term Detection using Attention-based Multi-hop
  Networks
Query-by-example Spoken Term Detection using Attention-based Multi-hop Networks
Chia-Wei Ao
Hung-yi Lee
9
21
0
01 Sep 2017
Language Transfer of Audio Word2Vec: Learning Audio Segment
  Representations without Target Language Data
Language Transfer of Audio Word2Vec: Learning Audio Segment Representations without Target Language Data
Chia-Hao Shen
Janet Y. Sung
Hung-yi Lee
21
5
0
19 Jul 2017
Query-by-Example Search with Discriminative Neural Acoustic Word
  Embeddings
Query-by-Example Search with Discriminative Neural Acoustic Word Embeddings
Shane Settle
Keith D. Levin
Herman Kamper
Karen Livescu
14
85
0
12 Jun 2017
An embedded segmental K-means model for unsupervised segmentation and
  clustering of speech
An embedded segmental K-means model for unsupervised segmentation and clustering of speech
Herman Kamper
Karen Livescu
Sharon Goldwater
11
95
0
23 Mar 2017
Sound-Word2Vec: Learning Word Representations Grounded in Sounds
Sound-Word2Vec: Learning Word Representations Grounded in Sounds
Ashwin K. Vijayakumar
Ramakrishna Vedantam
Devi Parikh
24
22
0
06 Mar 2017
End-to-End ASR-free Keyword Search from Speech
End-to-End ASR-free Keyword Search from Speech
Kartik Audhkhasi
Andrew Rosenberg
A. Sethy
Bhuvana Ramabhadran
Brian Kingsbury
16
111
0
13 Jan 2017
1