ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.13758
  4. Cited By
Bridging the Gap: Using Deep Acoustic Representations to Learn Grounded
  Language from Percepts and Raw Speech

Bridging the Gap: Using Deep Acoustic Representations to Learn Grounded Language from Percepts and Raw Speech

27 December 2021
Gaoussou Youssouf Kebe
Luke E. Richards
Edward Raff
Francis Ferraro
Cynthia Matuszek
    SSL
ArXivPDFHTML

Papers citing "Bridging the Gap: Using Deep Acoustic Representations to Learn Grounded Language from Percepts and Raw Speech"

26 / 26 papers shown
Title
Towards Measuring Fairness in Speech Recognition: Casual Conversations
  Dataset Transcriptions
Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions
Chunxi Liu
M. Picheny
Leda Sari
Pooja Chitkara
Alex Xiao
Xiaohui Zhang
Mark Chou
Andres Alvarado
C. Hazirbas
Yatharth Saraf
62
42
0
18 Nov 2021
Spoken Language Interaction with Robots: Research Issues and
  Recommendations, Report from the NSF Future Directions Workshop
Spoken Language Interaction with Robots: Research Issues and Recommendations, Report from the NSF Future Directions Workshop
M. Marge
C. Espy-Wilson
Roger K. Moore
51
79
0
11 Nov 2020
Robot Object Retrieval with Contextual Natural Language Queries
Robot Object Retrieval with Contextual Natural Language Queries
Thao Nguyen
N. Gopalan
Roma Patel
Matt Corsaro
Ellie Pavlick
Stefanie Tellex
LM&Ro
45
53
0
23 Jun 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech
  Representations
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
241
5,774
0
20 Jun 2020
It's Morphin' Time! Combating Linguistic Discrimination with
  Inflectional Perturbations
It's Morphin' Time! Combating Linguistic Discrimination with Inflectional Perturbations
Samson Tan
Shafiq Joty
Min-Yen Kan
R. Socher
213
105
0
09 May 2020
Deep Contextualized Acoustic Representations For Semi-Supervised Speech
  Recognition
Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition
Shaoshi Ling
Yuzong Liu
Julian Salazar
Katrin Kirchhoff
SSL
56
139
0
03 Dec 2019
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations
Alexei Baevski
Steffen Schneider
Michael Auli
SSL
144
666
0
12 Oct 2019
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for
  Vision-and-Language Tasks
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Jiasen Lu
Dhruv Batra
Devi Parikh
Stefan Lee
SSL
VLM
217
3,667
0
06 Aug 2019
Symbolic inductive bias for visually grounded learning of spoken
  language
Symbolic inductive bias for visually grounded learning of spoken language
Grzegorz Chrupała
44
28
0
21 Dec 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.6K
94,511
0
11 Oct 2018
Translating Navigation Instructions in Natural Language to a High-Level
  Plan for Behavioral Robot Navigation
Translating Navigation Instructions in Natural Language to a High-Level Plan for Behavioral Robot Navigation
Xiaoxue Zang
Ashwini Pokle
Nathan Tsoi
Kevin Chen
Juan Carlos Niebles
Á. Soto
Silvio Savarese
LM&Ro
36
30
0
24 Sep 2018
Unsupervised Stylish Image Description Generation via Domain Layer Norm
Unsupervised Stylish Image Description Generation via Domain Layer Norm
Cheng Kuan Chen
Zhufeng Pan
Min Sun
Ming-Yuan Liu
46
29
0
11 Sep 2018
FollowNet: Robot Navigation by Following Natural Language Directions
  with Deep Reinforcement Learning
FollowNet: Robot Navigation by Following Natural Language Directions with Deep Reinforcement Learning
Pararth Shah
Marek Fiser
Aleksandra Faust
J. Kew
Dilek Z. Hakkani-Tür
58
52
0
16 May 2018
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory
  Input
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input
David Harwath
Adrià Recasens
Dídac Surís
Galen Chuang
Antonio Torralba
James R. Glass
68
201
0
04 Apr 2018
Reconstruction Network for Video Captioning
Reconstruction Network for Video Captioning
Bairui Wang
Lin Ma
Wei Zhang
Wen Liu
116
318
0
30 Mar 2018
Deep contextualized word representations
Deep contextualized word representations
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
198
11,542
0
15 Feb 2018
Racial Disparity in Natural Language Processing: A Case Study of Social
  Media African-American English
Racial Disparity in Natural Language Processing: A Case Study of Social Media African-American English
Su Lin Blodgett
Brendan O'Connor
58
148
0
30 Jun 2017
In Defense of the Triplet Loss for Person Re-Identification
In Defense of the Triplet Loss for Person Re-Identification
Alexander Hermans
Lucas Beyer
Bastian Leibe
DML
76
3,200
0
22 Mar 2017
Representations of language in a model of visually grounded speech
  signal
Representations of language in a model of visually grounded speech signal
Grzegorz Chrupała
Lieke Gelderloos
Afra Alishahi
73
131
0
07 Feb 2017
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.1K
193,426
0
10 Dec 2015
Natural Language Object Retrieval
Natural Language Object Retrieval
Ronghang Hu
Huazhe Xu
Marcus Rohrbach
Jiashi Feng
Kate Saenko
Trevor Darrell
ObjD
90
552
0
13 Nov 2015
Resolving References to Objects in Photographs using the
  Words-As-Classifiers Model
Resolving References to Objects in Photographs using the Words-As-Classifiers Model
David Schlangen
Sina Zarrieß
C. Kennington
40
48
0
07 Oct 2015
Multimodal Deep Learning for Robust RGB-D Object Recognition
Multimodal Deep Learning for Robust RGB-D Object Recognition
Andreas Eitel
Jost Tobias Springenberg
Luciano Spinello
Martin Riedmiller
Wolfram Burgard
63
646
0
24 Jul 2015
FaceNet: A Unified Embedding for Face Recognition and Clustering
FaceNet: A Unified Embedding for Face Recognition and Clustering
Florian Schroff
Dmitry Kalenichenko
James Philbin
3DH
349
13,134
0
12 Mar 2015
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.6K
149,842
0
22 Dec 2014
Distributed Representations of Words and Phrases and their
  Compositionality
Distributed Representations of Words and Phrases and their Compositionality
Tomas Mikolov
Ilya Sutskever
Kai Chen
G. Corrado
J. Dean
NAI
OCL
367
33,520
0
16 Oct 2013
1