Bridging the Gap: Using Deep Acoustic Representations to Learn Grounded Language from Percepts and Raw Speech

27 December 2021

Gaoussou Youssouf Kebe

Papers citing "Bridging the Gap: Using Deep Acoustic Representations to Learn Grounded Language from Percepts and Raw Speech"

26 / 26 papers shown

Title
Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions Chunxi Liu M. Picheny Leda Sari Pooja Chitkara Alex Xiao Xiaohui Zhang Mark Chou Andres Alvarado C. Hazirbas Yatharth Saraf 62 42 0 18 Nov 2021
Spoken Language Interaction with Robots: Research Issues and Recommendations, Report from the NSF Future Directions Workshop M. Marge C. Espy-Wilson Roger K. Moore 51 79 0 11 Nov 2020
Robot Object Retrieval with Contextual Natural Language Queries Thao Nguyen N. Gopalan Roma Patel Matt Corsaro Ellie Pavlick Stefanie Tellex LM&Ro 45 53 0 23 Jun 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations Alexei Baevski Henry Zhou Abdel-rahman Mohamed Michael Auli SSL 241 5,774 0 20 Jun 2020
It's Morphin' Time! Combating Linguistic Discrimination with Inflectional Perturbations Samson Tan Shafiq Joty Min-Yen Kan R. Socher 213 105 0 09 May 2020
Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition Shaoshi Ling Yuzong Liu Julian Salazar Katrin Kirchhoff SSL 56 139 0 03 Dec 2019
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations Alexei Baevski Steffen Schneider Michael Auli SSL 144 666 0 12 Oct 2019
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks Jiasen Lu Dhruv Batra Devi Parikh Stefan Lee SSL VLM 217 3,667 0 06 Aug 2019
Symbolic inductive bias for visually grounded learning of spoken language Grzegorz Chrupała 44 28 0 21 Dec 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Jacob Devlin Ming-Wei Chang Kenton Lee Kristina Toutanova VLM SSL SSeg 1.6K 94,511 0 11 Oct 2018
Translating Navigation Instructions in Natural Language to a High-Level Plan for Behavioral Robot Navigation Xiaoxue Zang Ashwini Pokle Nathan Tsoi Kevin Chen Juan Carlos Niebles Á. Soto Silvio Savarese LM&Ro 36 30 0 24 Sep 2018
Unsupervised Stylish Image Description Generation via Domain Layer Norm Cheng Kuan Chen Zhufeng Pan Min Sun Ming-Yuan Liu 46 29 0 11 Sep 2018
FollowNet: Robot Navigation by Following Natural Language Directions with Deep Reinforcement Learning Pararth Shah Marek Fiser Aleksandra Faust J. Kew Dilek Z. Hakkani-Tür 58 52 0 16 May 2018
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input David Harwath Adrià Recasens Dídac Surís Galen Chuang Antonio Torralba James R. Glass 68 201 0 04 Apr 2018
Reconstruction Network for Video Captioning Bairui Wang Lin Ma Wei Zhang Wen Liu 116 318 0 30 Mar 2018
Deep contextualized word representations Matthew E. Peters Mark Neumann Mohit Iyyer Matt Gardner Christopher Clark Kenton Lee Luke Zettlemoyer NAI 198 11,542 0 15 Feb 2018
Racial Disparity in Natural Language Processing: A Case Study of Social Media African-American English Su Lin Blodgett Brendan O'Connor 58 148 0 30 Jun 2017
In Defense of the Triplet Loss for Person Re-Identification Alexander Hermans Lucas Beyer Bastian Leibe DML 76 3,200 0 22 Mar 2017
Representations of language in a model of visually grounded speech signal Grzegorz Chrupała Lieke Gelderloos Afra Alishahi 73 131 0 07 Feb 2017
Deep Residual Learning for Image Recognition Kaiming He Xinming Zhang Shaoqing Ren Jian Sun MedIm 2.1K 193,426 0 10 Dec 2015
Natural Language Object Retrieval Ronghang Hu Huazhe Xu Marcus Rohrbach Jiashi Feng Kate Saenko Trevor Darrell ObjD 90 552 0 13 Nov 2015
Resolving References to Objects in Photographs using the Words-As-Classifiers Model David Schlangen Sina Zarrieß C. Kennington 40 48 0 07 Oct 2015
Multimodal Deep Learning for Robust RGB-D Object Recognition Andreas Eitel Jost Tobias Springenberg Luciano Spinello Martin Riedmiller Wolfram Burgard 63 646 0 24 Jul 2015
FaceNet: A Unified Embedding for Face Recognition and Clustering Florian Schroff Dmitry Kalenichenko James Philbin 3DH 349 13,134 0 12 Mar 2015
Adam: A Method for Stochastic Optimization Diederik P. Kingma Jimmy Ba ODL 1.6K 149,842 0 22 Dec 2014
Distributed Representations of Words and Phrases and their Compositionality Tomas Mikolov Ilya Sutskever Kai Chen G. Corrado J. Dean NAI OCL 367 33,520 0 16 Oct 2013