Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.13758
Cited By
Bridging the Gap: Using Deep Acoustic Representations to Learn Grounded Language from Percepts and Raw Speech
27 December 2021
Gaoussou Youssouf Kebe
Luke E. Richards
Edward Raff
Francis Ferraro
Cynthia Matuszek
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Bridging the Gap: Using Deep Acoustic Representations to Learn Grounded Language from Percepts and Raw Speech"
26 / 26 papers shown
Title
Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions
Chunxi Liu
M. Picheny
Leda Sari
Pooja Chitkara
Alex Xiao
Xiaohui Zhang
Mark Chou
Andres Alvarado
C. Hazirbas
Yatharth Saraf
62
42
0
18 Nov 2021
Spoken Language Interaction with Robots: Research Issues and Recommendations, Report from the NSF Future Directions Workshop
M. Marge
C. Espy-Wilson
Roger K. Moore
51
79
0
11 Nov 2020
Robot Object Retrieval with Contextual Natural Language Queries
Thao Nguyen
N. Gopalan
Roma Patel
Matt Corsaro
Ellie Pavlick
Stefanie Tellex
LM&Ro
45
53
0
23 Jun 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
241
5,774
0
20 Jun 2020
It's Morphin' Time! Combating Linguistic Discrimination with Inflectional Perturbations
Samson Tan
Shafiq Joty
Min-Yen Kan
R. Socher
213
105
0
09 May 2020
Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition
Shaoshi Ling
Yuzong Liu
Julian Salazar
Katrin Kirchhoff
SSL
56
139
0
03 Dec 2019
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations
Alexei Baevski
Steffen Schneider
Michael Auli
SSL
144
666
0
12 Oct 2019
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Jiasen Lu
Dhruv Batra
Devi Parikh
Stefan Lee
SSL
VLM
217
3,667
0
06 Aug 2019
Symbolic inductive bias for visually grounded learning of spoken language
Grzegorz Chrupała
44
28
0
21 Dec 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.6K
94,511
0
11 Oct 2018
Translating Navigation Instructions in Natural Language to a High-Level Plan for Behavioral Robot Navigation
Xiaoxue Zang
Ashwini Pokle
Nathan Tsoi
Kevin Chen
Juan Carlos Niebles
Á. Soto
Silvio Savarese
LM&Ro
36
30
0
24 Sep 2018
Unsupervised Stylish Image Description Generation via Domain Layer Norm
Cheng Kuan Chen
Zhufeng Pan
Min Sun
Ming-Yuan Liu
46
29
0
11 Sep 2018
FollowNet: Robot Navigation by Following Natural Language Directions with Deep Reinforcement Learning
Pararth Shah
Marek Fiser
Aleksandra Faust
J. Kew
Dilek Z. Hakkani-Tür
58
52
0
16 May 2018
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input
David Harwath
Adrià Recasens
Dídac Surís
Galen Chuang
Antonio Torralba
James R. Glass
68
201
0
04 Apr 2018
Reconstruction Network for Video Captioning
Bairui Wang
Lin Ma
Wei Zhang
Wen Liu
116
318
0
30 Mar 2018
Deep contextualized word representations
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
198
11,542
0
15 Feb 2018
Racial Disparity in Natural Language Processing: A Case Study of Social Media African-American English
Su Lin Blodgett
Brendan O'Connor
58
148
0
30 Jun 2017
In Defense of the Triplet Loss for Person Re-Identification
Alexander Hermans
Lucas Beyer
Bastian Leibe
DML
76
3,200
0
22 Mar 2017
Representations of language in a model of visually grounded speech signal
Grzegorz Chrupała
Lieke Gelderloos
Afra Alishahi
73
131
0
07 Feb 2017
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.1K
193,426
0
10 Dec 2015
Natural Language Object Retrieval
Ronghang Hu
Huazhe Xu
Marcus Rohrbach
Jiashi Feng
Kate Saenko
Trevor Darrell
ObjD
90
552
0
13 Nov 2015
Resolving References to Objects in Photographs using the Words-As-Classifiers Model
David Schlangen
Sina Zarrieß
C. Kennington
40
48
0
07 Oct 2015
Multimodal Deep Learning for Robust RGB-D Object Recognition
Andreas Eitel
Jost Tobias Springenberg
Luciano Spinello
Martin Riedmiller
Wolfram Burgard
63
646
0
24 Jul 2015
FaceNet: A Unified Embedding for Face Recognition and Clustering
Florian Schroff
Dmitry Kalenichenko
James Philbin
3DH
349
13,134
0
12 Mar 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.6K
149,842
0
22 Dec 2014
Distributed Representations of Words and Phrases and their Compositionality
Tomas Mikolov
Ilya Sutskever
Kai Chen
G. Corrado
J. Dean
NAI
OCL
367
33,520
0
16 Oct 2013
1