Generative Pre-Training for Speech with Autoregressive Predictive Coding

23 October 2019

Papers citing "Generative Pre-Training for Speech with Autoregressive Predictive Coding"

50 / 115 papers shown

DeLoRes: Decorrelating Latent Spaces for Low-Resource Audio Representation Learning Sreyan Ghosh Ashish Seth Deepak Mittal Maneesh Singh S. Umesh SSL 27 6 0 25 Mar 2022
Federated Self-Supervised Learning for Acoustic Event Classification Meng Feng Chieh-Chi Kao Qingming Tang Ming Sun Viktor Rozgic Spyros Matsoukas Chao Wang 41 11 0 22 Mar 2022
Audio Self-supervised Learning: A Survey Shuo Liu Adria Mallol-Ragolta Emilia Parada-Cabaleiro Kun Qian Xingshuo Jing Alexander Kathan Bin Hu Bjoern W. Schuller SSL 35 106 0 02 Mar 2022
A Brief Overview of Unsupervised Neural Speech Representation Learning Lasse Borgholt Jakob Drachmann Havtorn Joakim Edin Lars Maaløe Christian Igel BDL AI4TS SSL 19 11 0 01 Mar 2022
Assessing the State of Self-Supervised Human Activity Recognition using Wearables H. Haresamudram Irfan Essa Thomas Plötz SSL 42 86 0 22 Feb 2022
Self-supervised Speaker Recognition Training Using Human-Machine Dialogues Metehan Cekic Ruirui Li Zeya Chen Yuguang Yang A. Stolcke Upamanyu Madhow SSL 27 2 0 07 Feb 2022
Sentiment-Aware Automatic Speech Recognition pre-training for enhanced Speech Emotion Recognition Ayoub Ghriss Bo Yang Viktor Rozgic Elizabeth Shriberg Chao Wang 27 21 0 27 Jan 2022
SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-Training Wenyong Huang Zhenhe Zhang Y. Yeung Xin Jiang Qun Liu 35 23 0 25 Jan 2022
On Training Targets and Activation Functions for Deep Representation Learning in Text-Dependent Speaker Verification A. Sarkar Zheng-Hua Tan 16 2 0 17 Jan 2022
Self-Supervised Learning for speech recognition with Intermediate layer supervision Chengyi Wang Yu-Huan Wu Sanyuan Chen Shujie Liu Jinyu Li Yao Qian Zhenglu Yang SSL 24 28 0 16 Dec 2021
Recent Advances in End-to-End Automatic Speech Recognition Jinyu Li VLM 35 363 0 02 Nov 2021
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing Sanyuan Chen Chengyi Wang Zhengyang Chen Yu-Huan Wu Shujie Liu ... Yao Qian Jian Wu Michael Zeng Xiangzhan Yu Furu Wei SSL 118 1,715 0 26 Oct 2021
Contrastively Disentangled Sequential Variational Autoencoder M. Kiener Weiran Wang Michael Gerndt CoGe DRL 27 40 0 22 Oct 2021
DECAR: Deep Clustering for learning general-purpose Audio Representations Sreyan Ghosh Sandesh V Katta Ashish Seth S. Umesh SSL 36 12 0 17 Oct 2021
Don't speak too fast: The impact of data bias on self-supervised speech models Yen Meng Yi-Hui Chou Andy T. Liu Hung-yi Lee 34 26 0 15 Oct 2021
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing Junyi Ao Rui Wang Long Zhou Chengyi Wang Shuo Ren ... Yu Zhang Zhihua Wei Yao Qian Jinyu Li Furu Wei 118 193 0 14 Oct 2021
UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training Sanyuan Chen Yu Wu Chengyi Wang Zhengyang Chen Zhuo Chen ... Jian Wu Yao Qian Furu Wei Jinyu Li Xiangzhan Yu SSL 30 85 0 12 Oct 2021
Wav2vec-Switch: Contrastive Learning from Original-noisy Speech Pairs for Robust Speech Recognition Yiming Wang Jinyu Li Heming Wang Yao Qian Chengyi Wang Yu Wu 38 48 0 11 Oct 2021
Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning DongSeon Hwang Ananya Misra Zhouyuan Huo Nikhil Siddhartha Shefali Garg David Qiu K. Sim Trevor Strohman F. Beaufays Yanzhang He 65 34 0 01 Oct 2021
Incremental Layer-wise Self-Supervised Learning for Efficient Speech Domain Adaptation On Device Zhouyuan Huo Dong-Gyo Hwang K. Sim Shefali Garg Ananya Misra Nikhil Siddhartha Trevor Strohman Françoise Beaufays 48 7 0 01 Oct 2021
Comparison of Self-Supervised Speech Pre-Training Methods on Flemish Dutch Jakob Poncelet Hugo Van hamme SSL 28 1 0 29 Sep 2021
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation Yuanxun Lu Jinxiang Chai Xun Cao 29 82 0 22 Sep 2021
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition Guolin Zheng Yubei Xiao Ke Gong Pan Zhou Xiaodan Liang Liang Lin 32 26 0 19 Sep 2021
Improving Accent Identification and Accented Speech Recognition Under a Framework of Self-supervised Learning Keqi Deng Songjun Cao Long Ma 14 29 0 15 Sep 2021
Injecting Text in Self-Supervised Speech Pretraining Zhehuai Chen Yu Zhang Andrew Rosenberg Bhuvana Ramabhadran Gary Wang Pedro J. Moreno SSL 25 36 0 27 Aug 2021
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training Yu-An Chung Yu Zhang Wei Han Chung-Cheng Chiu James Qin Ruoming Pang Yonghui Wu SSL VLM 12 412 0 07 Aug 2021
Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021 Takashi Maekaku Xuankai Chang Yuya Fujita Li-Wei Chen Shinji Watanabe Alexander I. Rudnicky 115 13 0 13 Jul 2021
Layer-wise Analysis of a Self-supervised Speech Representation Model Ankita Pasad Ju-Chieh Chou Karen Livescu SSL 26 288 0 10 Jul 2021
As easy as APC: overcoming missing data and class imbalance in time series with self-supervised learning Fiorella Wever Thomas Anderson Keller L. Symul Victor Garcia SSL AI4TS 28 1 0 29 Jun 2021
Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition Yosuke Higuchi Niko Moritz Jonathan Le Roux Takaaki Hori VLM 35 51 0 16 Jun 2021
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units Wei-Ning Hsu Benjamin Bolte Yao-Hung Hubert Tsai Kushal Lakhotia Ruslan Salakhutdinov Abdel-rahman Mohamed SSL 55 2,770 0 14 Jun 2021
Scaling Laws for Acoustic Models J. Droppo Oguz H. Elibol 15 22 0 11 Jun 2021
Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning Haibin Wu Xu Li Andy T. Liu Zhiyong Wu Helen Meng Hung-yi Lee AAML SSL 44 29 0 01 Jun 2021
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech Solène Evain H. Nguyen Hang Le Marcely Zanon Boito Salima Mdhaffar ... François Portet Solange Rossato F. Ringeval D. Schwab Laurent Besacier SSL 33 70 0 23 Apr 2021
S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised Pretrained Representations Jheng-hao Lin Yist Y. Lin C. Chien Hung-yi Lee 30 56 0 07 Apr 2021
General Robot Dynamics Learning and Gen2Real Dengpeng Xing Jiale Li Yiming Yang Bo Xu DRL AI4CE 21 3 0 06 Apr 2021
Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized Networks Haoqi Li Brian R. Baucom Shrikanth Narayanan P. Georgiou 30 1 0 01 Apr 2021
Fast Development of ASR in African Languages using Self Supervised Speech Representation Learning Jama Hussein Mohamud Lloyd Thompson A. Ndoye Laurent Besacier 29 4 0 16 Mar 2021
Wav2vec-C: A Self-supervised Model for Speech Representation Learning Samik Sadhu Di He Che-Wei Huang Sri Harish Reddy Mallidi Minhua Wu Ariya Rastrow A. Stolcke J. Droppo Roland Maas SSL 20 48 0 09 Mar 2021
Contrastive Semi-supervised Learning for ASR Alex Xiao Christian Fuegen Abdel-rahman Mohamed 26 20 0 09 Mar 2021
Data Generation Using Pass-phrase-dependent Deep Auto-encoders for Text-Dependent Speaker Verification A. K. Sarkar Md. Sahidullah Zheng-Hua Tan 7 0 0 03 Feb 2021
On Scaling Contrastive Representations for Low-Resource Speech Recognition Lasse Borgholt T. M. S. Tax Jakob Drachmann Havtorn Lars Maaløe Christian Igel SSL 13 5 0 01 Feb 2021
Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition Yubei Xiao Ke Gong Pan Zhou Guolin Zheng Xiaodan Liang Liang Lin 30 34 0 22 Dec 2020
Contrastive Predictive Coding for Human Activity Recognition H. Haresamudram Irfan Essa Thomas Ploetz 32 118 0 09 Dec 2020
Vocal Tract Length Perturbation for Text-Dependent Speaker Verification with Autoregressive Prediction Coding A. Sarkar Zheng-Hua Tan 9 13 0 25 Nov 2020
The Zero Resource Speech Benchmark 2021: Metrics and baselines for unsupervised spoken language modeling Tu Nguyen Maureen de Seyssel Patricia Roze M. Rivière Evgeny Kharitonov Alexei Baevski Ewan Dunbar Emmanuel Dupoux SSL 16 101 0 23 Nov 2020
Towards Semi-Supervised Semantics Understanding from Speech Cheng-I Jeff Lai Jin Cao S. Bodapati Shang-Wen Li SSL 22 7 0 11 Nov 2020
Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning Dongwei Jiang Wubo Li Miao Cao Wei Zou Xiangang Li SSL 21 65 0 27 Oct 2020
Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations Wen-Chin Huang Yi-Chiao Wu Tomoki Hayashi T. Toda BDL 49 37 0 23 Oct 2020
Similarity Analysis of Self-Supervised Speech Representations Yu-An Chung Yonatan Belinkov James R. Glass SSL 36 36 0 22 Oct 2020