Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2202.01107
Cited By
Keyword localisation in untranscribed speech using visually grounded speech models
2 February 2022
Kayode Olaleye
Dan Oneaţă
Herman Kamper
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Keyword localisation in untranscribed speech using visually grounded speech models"
9 / 9 papers shown
Title
Visually Grounded Speech Models have a Mutual Exclusivity Bias
Leanne Nortje
Dan Oneaţă
Yevgen Matusevych
Herman Kamper
SSL
47
0
0
20 Mar 2024
Towards visually prompted keyword localisation for zero-resource spoken languages
Leanne Nortje
Herman Kamper
29
6
0
12 Oct 2022
YFACC: A Yorùbá speech-image dataset for cross-lingual keyword localisation through visual grounding
Kayode Olaleye
Dan Oneaţă
Herman Kamper
ObjD
34
6
0
10 Oct 2022
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model
Yi-Jen Shih
Hsuan-Fu Wang
Heng-Jui Chang
Layne Berry
Hung-yi Lee
David Harwath
VLM
CLIP
50
32
0
03 Oct 2022
Word Segmentation on Discovered Phone Units with Dynamic Programming and Self-Supervised Scoring
Herman Kamper
31
25
0
24 Feb 2022
On the Difficulty of Segmenting Words with Attention
Ramon Sanabria
Hao Tang
Sharon Goldwater
35
5
0
21 Sep 2021
MLP-Mixer: An all-MLP Architecture for Vision
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
...
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
274
2,603
0
04 May 2021
Neural Baby Talk
Jiasen Lu
Jianwei Yang
Dhruv Batra
Devi Parikh
VLM
200
434
0
27 Mar 2018
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,926
0
17 Aug 2015
1