ResearchTrend.AI
Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces

18 May 2018
Yu-An Chung
W. Weng
S. Tong
James R. Glass
arXiv: 1805.07467

Papers citing "Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces"

27 / 27 papers shown
Speechless: Speech Instruction Training Without Speech for Low Resource Languages
Alan Dao
Dinh Bach Vu
Huy Hoang Ha
Tuan Le Duc Anh
Shreyas Gopal
Yue Heng Yeo
Warren Keng Hoong Low
Eng Siong Chng
J. Yip
SyDa
29
0
0
23 May 2025
Towards Unsupervised Automatic Speech Recognition Trained by Unaligned Speech and Text only
Yi-Chen Chen
Chia-Hao Shen
Sung-Feng Huang
Hung-yi Lee
34
19
0
29 Mar 2018
Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech
Yu-An Chung
James R. Glass
3DV
55
184
0
23 Mar 2018
Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation Evaluation
A. Kocabiyikoglu
Laurent Besacier
Olivier Kraif
46
104
0
09 Feb 2018
Learning Word Embeddings from Speech
Yu-An Chung
James R. Glass
SSL
64
19
0
05 Nov 2017
Unsupervised Machine Translation Using Monolingual Corpora Only
Guillaume Lample
Alexis Conneau
Ludovic Denoyer
Marc'Aurelio Ranzato
SSL
91
1,091
0
31 Oct 2017
Unsupervised Neural Machine Translation
Mikel Artetxe
Gorka Labaka
Eneko Agirre
Kyunghyun Cho
74
772
0
30 Oct 2017
Word Translation Without Parallel Data
Alexis Conneau
Guillaume Lample
Marc'Aurelio Ranzato
Ludovic Denoyer
Hervé Jégou
265
1,646
0
11 Oct 2017
An embedded segmental K-means model for unsupervised segmentation and clustering of speech
Herman Kamper
Karen Livescu
Sharon Goldwater
39
96
0
23 Mar 2017
Offline bilingual word vectors, orthogonal transformations and the inverted softmax
Samuel L. Smith
David H. P. Turban
Steven Hamblin
Nils Y. Hammerla
OffRL
46
536
0
13 Feb 2017
Multi-view Recurrent Neural Acoustic Word Embeddings
Wanjia He
Weiran Wang
Karen Livescu
40
85
0
14 Nov 2016
Discriminative Acoustic Word Embeddings: Recurrent Neural Network-Based Approaches
Shane Settle
Karen Livescu
34
87
0
08 Nov 2016
Enriching Word Vectors with Subword Information
Piotr Bojanowski
Edouard Grave
Armand Joulin
Tomas Mikolov
NAI
SSL
VLM
192
9,944
0
15 Jul 2016
Learning Crosslingual Word Embeddings without Bilingual Corpora
Long Duong
H. Kanayama
Tengfei Ma
Steven Bird
Trevor Cohn
45
115
0
30 Jun 2016
A segmental framework for fully-unsupervised large-vocabulary speech recognition
Herman Kamper
A. Jansen
Sharon Goldwater
53
103
0
22 Jun 2016
Unsupervised word segmentation and lexicon discovery using acoustic word embeddings
Herman Kamper
A. Jansen
Sharon Goldwater
SSL
28
74
0
09 Mar 2016
Audio Word2Vec: Unsupervised Learning of Audio Segment Representations using Sequence-to-sequence Autoencoder
Yu-An Chung
Chao-Chung Wu
Chia-Hao Shen
Hung-yi Lee
Lin-Shan Lee
AI4TS
55
182
0
03 Mar 2016
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
...
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
103
2,965
0
08 Dec 2015
Deep convolutional acoustic word embeddings using word-pair side information
Herman Kamper
Weiran Wang
Karen Livescu
SSL
33
171
0
05 Oct 2015
Listen, Attend and Spell
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
131
2,261
0
05 Aug 2015
Attention-Based Models for Speech Recognition
J. Chorowski
Dzmitry Bahdanau
Dmitriy Serdyuk
Kyunghyun Cho
Yoshua Bengio
101
2,605
0
24 Jun 2015
Improving zero-shot learning by mitigating the hubness problem
Georgiana Dinu
Angeliki Lazaridou
Marco Baroni
VLM
48
379
0
20 Dec 2014
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
280
20,491
0
10 Sep 2014
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Kyunghyun Cho
B. V. Merrienboer
Çağlar Gülçehre
Dzmitry Bahdanau
Fethi Bougares
Holger Schwenk
Yoshua Bengio
AIMat
629
23,235
0
03 Jun 2014
Distributed Representations of Words and Phrases and their Compositionality
Tomas Mikolov
Ilya Sutskever
Kai Chen
G. Corrado
J. Dean
NAI
OCL
296
33,445
0
16 Oct 2013
Exploiting Similarities among Languages for Machine Translation
Tomas Mikolov
Quoc V. Le
Ilya Sutskever
53
1,594
0
17 Sep 2013
Speech Recognition with Deep Recurrent Neural Networks
Alex Graves
Abdel-rahman Mohamed
Geoffrey E. Hinton
154
8,503
0
22 Mar 2013