General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework

3 February 2021

Papers citing "General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework"

22 / 22 papers shown

Title
TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech Andy T. Liu Shang-Wen Li Hung-yi Lee SSL 132 358 0 12 Jul 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations Alexei Baevski Henry Zhou Abdel-rahman Mohamed Michael Auli SSL 295 5,837 0 20 Jun 2020
The INTERSPEECH 2020 Deep Noise Suppression Challenge: Datasets, Subjective Testing Framework, and Challenge Results Chandan K. A. Reddy Vishak Gopal Ross Cutler Ebrahim Beyrami R. Cheng ... A. Aazami Sebastian Braun Puneet Rana Sriram Srinivasan J. Gehrke 94 318 0 16 May 2020
Improved Speech Representations with Multi-Target Autoregressive Predictive Coding Yu-An Chung James R. Glass SSL 67 56 0 11 Apr 2020
A Simple Framework for Contrastive Learning of Visual Representations Ting-Li Chen Simon Kornblith Mohammad Norouzi Geoffrey E. Hinton SSL 375 18,866 0 13 Feb 2020
Multi-task self-supervised learning for Robust Speech Recognition Mirco Ravanelli Jianyuan Zhong Santiago Pascual P. Swietojanski João Monteiro J. Trmal Yoshua Bengio SSL 281 290 0 25 Jan 2020
Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition Shaoshi Ling Yuzong Liu Julian Salazar Katrin Kirchhoff SSL 66 139 0 03 Dec 2019
Momentum Contrast for Unsupervised Visual Representation Learning Kaiming He Haoqi Fan Yuxin Wu Saining Xie Ross B. Girshick SSL 210 12,124 0 13 Nov 2019
M3ER: Multiplicative Multimodal Emotion Recognition Using Facial, Textual, and Speech Cues Trisha Mittal Uttaran Bhattacharya Rohan Chandra Aniket Bera Tianyi Zhou 73 240 0 09 Nov 2019
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders Andy T. Liu Shu-Wen Yang Po-Han Chi Po-Chun Hsu Hung-yi Lee SSL 150 374 0 25 Oct 2019
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations Alexei Baevski Steffen Schneider Michael Auli SSL 163 667 0 12 Oct 2019
wav2vec: Unsupervised Pre-training for Speech Recognition Steffen Schneider Alexei Baevski R. Collobert Michael Auli SSL 76 418 0 11 Apr 2019
Who Needs Words? Lexicon-Free Speech Recognition Tatiana Likhomanenko Gabriel Synnaeve R. Collobert 35 27 0 09 Apr 2019
Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks Santiago Pascual Mirco Ravanelli Joan Serrà Antonio Bonafonte Yoshua Bengio SSL 126 251 0 06 Apr 2019
An Unsupervised Autoregressive Model for Speech Representation Learning Yu-An Chung Wei-Ning Hsu Hao Tang James R. Glass SSL 84 409 0 05 Apr 2019
wav2letter++: The Fastest Open-source Speech Recognition System Vineel Pratap Awni Y. Hannun Qiantong Xu Jeff Cai Jacob Kahn Gabriel Synnaeve Vitaliy Liptchinsky R. Collobert VLM 54 156 0 18 Dec 2018
SDR - half-baked or well done? F. Sánchez-Martínez M. Esplà-Gomis Hakan Erdogan J. Hershey 153 1,204 0 06 Nov 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Jacob Devlin Ming-Wei Chang Kenton Lee Kristina Toutanova VLM SSL SSeg 1.8K 95,175 0 11 Oct 2018
Representation Learning with Contrastive Predictive Coding Aaron van den Oord Yazhe Li Oriol Vinyals DRL SSL 351 10,356 0 10 Jul 2018
Deep contextualized word representations Matthew E. Peters Mark Neumann Mohit Iyyer Matt Gardner Christopher Clark Kenton Lee Luke Zettlemoyer NAI 230 11,566 0 15 Feb 2018
Generalized End-to-End Loss for Speaker Verification Li Wan Quan Wang Alan Papir Ignacio López Moreno VLM 68 933 0 28 Oct 2017
Attention Is All You Need Ashish Vaswani Noam M. Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan Gomez Lukasz Kaiser Illia Polosukhin 3DV 732 132,363 0 12 Jun 2017