Conditional independence for pretext task selection in Self-supervised
speech representation learning

Conditional independence for pretext task selection in Self-supervised speech representation learning

15 April 2021

Titouan Parcollet

Papers citing "Conditional independence for pretext task selection in Self-supervised speech representation learning"

18 / 18 papers shown

Title
SpeechBrain: A General-Purpose Speech Toolkit Mirco Ravanelli Titouan Parcollet Peter William VanHarn Plantinga Aku Rouhe Samuele Cornell ... William Aris Hwidong Na Yan Gao R. Mori Yoshua Bengio 69 763 0 08 Jun 2021
Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning Dongwei Jiang Wubo Li Miao Cao Wei Zou Xiangang Li SSL 48 65 0 27 Oct 2020
Contrastive Learning of General-Purpose Audio Representations Aaqib Saeed David Grangier Neil Zeghidour VLM SSL 62 269 0 21 Oct 2020
Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition Yu Zhang James Qin Daniel S. Park Wei Han Chung-Cheng Chiu Ruoming Pang Quoc V. Le Yonghui Wu VLM SSL 176 309 0 20 Oct 2020
Evaluating the reliability of acoustic speech embeddings Robin Algayres Mohamed Salah Zaiem Benoît Sagot Emmanuel Dupoux 64 29 0 27 Jul 2020
Learning Speech Representations from Raw Audio by Joint Audiovisual Self-Supervision Abhinav Shukla Stavros Petridis Maja Pantic SSL 39 16 0 08 Jul 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations Alexei Baevski Henry Zhou Abdel-rahman Mohamed Michael Auli SSL 228 5,774 0 20 Jun 2020
Self-supervised Learning for Speech Enhancement Yuchun Wang Shrikant Venkataramani Paris Smaragdis SSL 65 31 0 18 Jun 2020
A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning Sameer Khurana Antoine Laurent Wei-Ning Hsu J. Chorowski A. Lancucki R. Marxer James R. Glass SSL BDL 46 29 0 03 Jun 2020
Common Voice: A Massively-Multilingual Speech Corpus Rosana Ardila Megan Branson Kelly Davis Michael Henretty M. Kohler Josh Meyer Reuben Morais Lindsay Saunders Francis M. Tyers Gregor Weber VLM 87 1,592 0 13 Dec 2019
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders Andy T. Liu Shu-Wen Yang Po-Han Chi Po-Chun Hsu Hung-yi Lee SSL 132 373 0 25 Oct 2019
Speech-XLNet: Unsupervised Acoustic Model Pretraining For Self-Attention Networks Xingcheng Song Guangsen Wang Zhiyong Wu Yiheng Huang Dan Su Dong Yu Helen Meng SSL 62 49 0 23 Oct 2019
Multitask learning for frame-level instrument recognition Yun-Ning Hung Yian Chen Yi-Hsuan Yang 106 33 0 03 Nov 2018
Objects that Sound Relja Arandjelović Andrew Zisserman ObjD VOS 92 529 0 18 Dec 2017
VoxCeleb: a large-scale speaker identification dataset Arsha Nagrani Joon Son Chung Andrew Zisserman 117 2,273 0 26 Jun 2017
Unsupervised Learning of Visual Representations by Solving Jigsaw Puzzles M. Noroozi Paolo Favaro SSL 154 2 0 30 Mar 2016
Unsupervised Visual Representation Learning by Context Prediction Carl Doersch Abhinav Gupta Alexei A. Efros DRL SSL 164 2,782 0 19 May 2015
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification Kaiming He Xinming Zhang Shaoqing Ren Jian Sun VLM 280 18,587 0 06 Feb 2015