Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1702.01991
Cited By
Representations of language in a model of visually grounded speech signal
7 February 2017
Grzegorz Chrupała
Lieke Gelderloos
A. Alishahi
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Representations of language in a model of visually grounded speech signal"
34 / 84 papers shown
Title
Word Recognition, Competition, and Activation in a Model of Visually Grounded Speech
William N. Havard
Jean-Pierre Chevrot
Laurent Besacier
17
21
0
18 Sep 2019
Language learning using Speech to Image retrieval
Danny Merkx
S. Frank
M. Ernestus
14
43
0
09 Sep 2019
Do Cross Modal Systems Leverage Semantic Relationships?
Shah Nawaz
Muhammad Kamran Janjua
I. Gallo
Arif Mahmood
Alessandro Calefati
Faisal Shafait
19
8
0
03 Sep 2019
Higher-order Comparisons of Sentence Encoder Representations
Mostafa Abdou
Artur Kulmizev
Felix Hill
D. Low
Anders Søgaard
20
16
0
01 Sep 2019
MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible
Marcely Zanon Boito
William N. Havard
Mahault Garnerin
Éric Le Ferrand
Laurent Besacier
22
47
0
30 Jul 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
20
132
0
22 Jul 2019
Transfer Learning from Audio-Visual Grounding to Speech Recognition
Wei-Ning Hsu
David F. Harwath
James R. Glass
SSL
18
32
0
09 Jul 2019
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition
Yonatan Belinkov
Ahmed M. Ali
James R. Glass
17
32
0
09 Jul 2019
On the Contributions of Visual and Textual Supervision in Low-Resource Semantic Speech Retrieval
Ankita Pasad
Bowen Shi
Herman Kamper
Karen Livescu
11
12
0
24 Apr 2019
Self-Supervised Audio-Visual Co-Segmentation
Andrew Rouditchenko
Hang Zhao
Chuang Gan
Josh H. McDermott
Antonio Torralba
VLM
SSL
9
104
0
18 Apr 2019
Semantic query-by-example speech search using visual grounding
Herman Kamper
Aristotelis Anastassiou
Karen Livescu
11
29
0
15 Apr 2019
Learning semantic sentence representations from visually grounded language without lexical knowledge
Danny Merkx
S. Frank
SSL
11
13
0
27 Mar 2019
Towards Visually Grounded Sub-Word Speech Unit Discovery
David F. Harwath
James R. Glass
16
35
0
21 Feb 2019
Models of Visually Grounded Speech Signal Pay Attention To Nouns: a Bilingual Experiment on English and Japanese
William N. Havard
Jean-Pierre Chevrot
Laurent Besacier
15
24
0
08 Feb 2019
Symbolic inductive bias for visually grounded learning of spoken language
Grzegorz Chrupała
14
28
0
21 Dec 2018
Analysis Methods in Neural Language Processing: A Survey
Yonatan Belinkov
James R. Glass
30
547
0
21 Dec 2018
Interpretable Textual Neuron Representations for NLP
Nina Poerner
Benjamin Roth
Hinrich Schütze
FAtt
AI4CE
MILM
8
26
0
19 Sep 2018
Explaining Character-Aware Neural Networks for Word-Level Prediction: Do They Discover Linguistic Rules?
Fréderic Godin
Kris Demuynck
J. Dambre
W. D. Neve
T. Demeester
AI4CE
16
17
0
28 Aug 2018
Revisiting Cross Modal Retrieval
Shah Nawaz
Muhammad Kamran Janjua
Alessandro Calefati
I. Gallo
6
6
0
19 Jul 2018
Revisiting the Hierarchical Multiscale LSTM
Ákos Kádár
Marc-Alexandre Côté
Grzegorz Chrupała
A. Alishahi
11
13
0
10 Jul 2018
Visually grounded cross-lingual keyword spotting in speech
Herman Kamper
Michael Roth
11
34
0
13 Jun 2018
Vision as an Interlingua: Learning Multilingual Semantic Embeddings of Untranscribed Speech
David F. Harwath
Galen Chuang
James R. Glass
12
58
0
09 Apr 2018
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input
David F. Harwath
Adrià Recasens
Dídac Surís
Galen Chuang
Antonio Torralba
James R. Glass
24
201
0
04 Apr 2018
On the difficulty of a distributional semantics of spoken language
Grzegorz Chrupała
Lieke Gelderloos
Ákos Kádár
A. Alishahi
17
6
0
23 Mar 2018
Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop
O. Scharenborg
Laurent Besacier
A. Black
M. Hasegawa-Johnson
Florian Metze
...
Elin Larsen
Danny Merkx
Rachid Riad
Liming Wang
Emmanuel Dupoux
20
33
0
14 Feb 2018
Object Referring in Visual Scene with Spoken Language
A. Vasudevan
Dengxin Dai
Luc Van Gool
29
18
0
10 Nov 2017
Semantic speech retrieval with a visually grounded model of untranscribed speech
Herman Kamper
Gregory Shakhnarovich
Karen Livescu
21
53
0
05 Oct 2017
Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems
Yonatan Belinkov
James R. Glass
16
83
0
13 Sep 2017
Acoustic Feature Learning via Deep Variational Canonical Correlation Analysis
Qingming Tang
Weiran Wang
Karen Livescu
DRL
13
20
0
11 Aug 2017
SPEECH-COCO: 600k Visually Grounded Spoken Captions Aligned to MSCOCO Data Set
William N. Havard
Laurent Besacier
O. Rosec
15
28
0
26 Jul 2017
Encoding of phonology in a recurrent neural model of grounded speech
A. Alishahi
Marie Barking
Grzegorz Chrupała
10
58
0
12 Jun 2017
Imagination improves Multimodal Translation
Desmond Elliott
Ákos Kádár
23
136
0
11 May 2017
Visually grounded learning of keyword prediction from untranscribed speech
Herman Kamper
Shane Settle
Gregory Shakhnarovich
Karen Livescu
11
63
0
23 Mar 2017
Pixel Recurrent Neural Networks
Aaron van den Oord
Nal Kalchbrenner
Koray Kavukcuoglu
SSeg
GAN
233
2,547
0
25 Jan 2016
Previous
1
2