ResearchTrend.AI

Visually grounded learning of keyword prediction from untranscribed speech

23 March 2017
Herman Kamper
Shane Settle
Gregory Shakhnarovich
Karen Livescu
arXiv: 1703.08136

Papers citing "Visually grounded learning of keyword prediction from untranscribed speech"

11 / 11 papers shown
Leveraging multilingual transfer for unsupervised semantic acoustic word embeddings
C. Jacobs
Herman Kamper
32
1
0
05 Jul 2023
Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples
H. Ryu
Arda Senocak
In So Kweon
Joon Son Chung
VLM
26
8
0
30 Mar 2023
Towards visually prompted keyword localisation for zero-resource spoken languages
Leanne Nortje
Herman Kamper
26
6
0
12 Oct 2022
Seeing the advantage: visually grounding word embeddings to better capture human semantic knowledge
Danny Merkx
S. Frank
M. Ernestus
19
4
0
21 Feb 2022
Keyword localisation in untranscribed speech using visually grounded speech models
Kayode Olaleye
Dan Oneaţă
Herman Kamper
26
7
0
02 Feb 2022
Voice-assisted Image Labelling for Endoscopic Ultrasound Classification using Neural Networks
E. Bonmati
Yipeng Hu
A. Grimwood
G. Johnson
G. Goodchild
...
K. Gurusamy
Brian P. Davidson
Matthew J. Clarkson
Stephen P. Pereira
D. Barratt
21
15
0
12 Oct 2021
Word Order Does Not Matter For Speech Recognition
Vineel Pratap
Qiantong Xu
Tatiana Likhomanenko
Gabriel Synnaeve
R. Collobert
35
4
0
12 Oct 2021
Large scale weakly and semi-supervised learning for low-resource video ASR
Kritika Singh
Vimal Manohar
Alex Xiao
Sergey Edunov
Ross B. Girshick
Vitaliy Liptchinsky
Christian Fuegen
Yatharth Saraf
Geoffrey Zweig
Abdel-rahman Mohamed
28
9
0
16 May 2020
Effectiveness of self-supervised pre-training for speech recognition
Alexei Baevski
Michael Auli
Abdel-rahman Mohamed
SSL
27
147
0
10 Nov 2019
Multimodal Language Analysis with Recurrent Multistage Fusion
Paul Pu Liang
Liu Ziyin
Amir Zadeh
Louis-Philippe Morency
30
198
0
12 Aug 2018
Semantic speech retrieval with a visually grounded model of untranscribed speech
Herman Kamper
Gregory Shakhnarovich
Karen Livescu
23
53
0
05 Oct 2017