High-Fidelity Audio Generation and Representation Learning with Guided Adversarial Autoencoder

1 June 2020

Björn W Schuller

Papers citing "High-Fidelity Audio Generation and Representation Learning with Guided Adversarial Autoencoder"

50 / 73 papers shown

Title
Generative Adversarial Networks Gilad Cohen Raja Giryes GAN 223 30,089 0 01 Mar 2022
Guided Generative Adversarial Neural Network for Representation Learning and High Fidelity Audio Generation using Fewer Labelled Audio Data Kazi Nazmul Haque R. Rana John H. L. Hansen Björn Schuller GAN 38 3 0 05 Mar 2020
Disentangled Speech Embeddings using Cross-modal Self-supervision Arsha Nagrani Joon Son Chung Samuel Albanie Andrew Zisserman SSL 57 88 0 20 Feb 2020
Unsupervised pretraining transfers well across languages M. Rivière Armand Joulin Pierre-Emmanuel Mazaré Emmanuel Dupoux SSL VLM 42 208 0 07 Feb 2020
Learning Robust and Multilingual Speech Representations Kazuya Kawakami Luyu Wang Chris Dyer Phil Blunsom Aaron van den Oord SSL 68 100 0 29 Jan 2020
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends S. Latif R. Rana Sara Khalifa Raja Jurdak Junaid Qadir Björn W. Schuller AI4TS 84 82 0 02 Jan 2020
PyTorch: An Imperative Style, High-Performance Deep Learning Library Adam Paszke Sam Gross Francisco Massa Adam Lerer James Bradbury ... Sasank Chilamkurthy Benoit Steiner Lu Fang Junjie Bai Soumith Chintala ODL 354 42,299 0 03 Dec 2019
Analyzing and Improving the Image Quality of StyleGAN Tero Karras S. Laine M. Aittala Janne Hellsten J. Lehtinen Timo Aila GAN 260 5,797 0 03 Dec 2019
Effectiveness of self-supervised pre-training for speech recognition Alexei Baevski Michael Auli Abdel-rahman Mohamed SSL 69 147 0 10 Nov 2019
Learning audio representations via phase prediction Félix de Chaumont Quitry Marco Tagliasacchi Dominik Roblek SSL AI4TS 33 10 0 25 Oct 2019
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations Alexei Baevski Steffen Schneider Michael Auli SSL 128 666 0 12 Oct 2019
VL-BERT: Pre-training of Generic Visual-Linguistic Representations Weijie Su Xizhou Zhu Yue Cao Bin Li Lewei Lu Furu Wei Jifeng Dai VLM MLLM SSL 142 1,661 0 22 Aug 2019
Multi-Task Semi-Supervised Adversarial Autoencoding for Speech Emotion Recognition S. Latif R. Rana Sara Khalifa Raja Jurdak J. Epps Björn W. Schuller 75 99 0 13 Jul 2019
Large Scale Adversarial Representation Learning Jeff Donahue Karen Simonyan SSL 122 543 0 04 Jul 2019
Self-Supervised Dialogue Learning Jiawei Wu Xin Eric Wang William Yang Wang SSL 33 58 0 30 Jun 2019
Self-Supervised Learning for Contextualized Extractive Summarization Hong Wang Xin Eric Wang Wenhan Xiong Mo Yu Xiaoxiao Guo Shiyu Chang William Yang Wang SSL 88 56 0 11 Jun 2019
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition Daniel S. Park William Chan Yu Zhang Chung-Cheng Chiu Barret Zoph E. D. Cubuk Quoc V. Le VLM 159 3,451 0 18 Apr 2019
Direct Modelling of Speech Emotion from Raw Speech S. Latif R. Rana Sara Khalifa Raja Jurdak J. Epps 65 103 0 08 Apr 2019
Self-Supervised Learning via Conditional Motion Propagation Xiaohang Zhan Xingang Pan Ziwei Liu Dahua Lin Chen Change Loy SSL 61 47 0 27 Mar 2019
High-Fidelity Image Generation With Fewer Labels Mario Lucic Michael Tschannen Marvin Ritter Xiaohua Zhai Olivier Bachem Sylvain Gelly GAN OOD 77 158 0 06 Mar 2019
GANSynth: Adversarial Neural Audio Synthesis Jesse Engel Kumar Krishna Agrawal Shuo Chen Ishaan Gulrajani Chris Donahue Adam Roberts 79 392 0 23 Feb 2019
Adversarial Generation of Time-Frequency Features with application in audio synthesis Andrés Marafioti Nicki Holighaus Nathanael Perraudin P. Majdak 33 68 0 11 Feb 2019
Transfer Learning From Sound Representations For Anger Detection in Speech M. Elshaer Scott Wisdom Taniya Mishra 75 17 0 06 Feb 2019
Self-Supervised Generalisation with Meta Auxiliary Learning Shikun Liu Andrew J. Davison Edward Johns SSL OOD 50 163 0 25 Jan 2019
Unsupervised speech representation learning using WaveNet autoencoders J. Chorowski Ron J. Weiss Samy Bengio Aaron van den Oord SSL 72 318 0 25 Jan 2019
A Style-Based Generator Architecture for Generative Adversarial Networks Tero Karras S. Laine Timo Aila 529 10,527 0 12 Dec 2018
Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations Francesco Locatello Stefan Bauer Mario Lucic Gunnar Rätsch Sylvain Gelly Bernhard Schölkopf Olivier Bachem OOD 111 1,466 0 29 Nov 2018
Training neural audio classifiers with few data Jordi Pons Joan Serrà Xavier Serra 62 57 0 24 Oct 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Jacob Devlin Ming-Wei Chang Kenton Lee Kristina Toutanova VLM SSL SSeg 1.4K 94,511 0 11 Oct 2018
Large Scale GAN Training for High Fidelity Natural Image Synthesis Andrew Brock Jeff Donahue Karen Simonyan 237 5,381 0 28 Sep 2018
How good is my GAN? K. Shmelkov Cordelia Schmid Alahari Karteek GAN EGVM 51 348 0 25 Jul 2018
Representation Learning with Contrastive Predictive Coding Aaron van den Oord Yazhe Li Oriol Vinyals DRL SSL 280 10,253 0 10 Jul 2018
Adversarial Auto-encoders for Speech Based Emotion Recognition Saurabh Sahu Rahul Gupta Ganesh Sivaraman Wael AbdAlmageed C. Espy-Wilson GAN 50 66 0 06 Jun 2018
Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition Pete Warden 74 1,615 0 09 Apr 2018
Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech Yu-An Chung James R. Glass 3DV 64 184 0 23 Mar 2018
Unsupervised Representation Learning by Predicting Image Rotations Spyros Gidaris Praveer Singh N. Komodakis OOD SSL DRL 227 3,283 0 21 Mar 2018
Adversarial Audio Synthesis Chris Donahue Julian McAuley M. Puckette GAN 133 610 0 12 Feb 2018
Image denoising and restoration with CNN-LSTM Encoder Decoder with Direct Attention Kazi Nazmul Haque M. Yousuf R. Rana 3DV 36 21 0 16 Jan 2018
A Note on the Inception Score Shane T. Barratt Rishi Sharma EGVM 86 691 0 06 Jan 2018
Knowledge Transfer from Weakly Labeled Audio using Convolutional Neural Network for Sound Events and Scenes Anurag Kumar Maksim Khadkevich C. Fügen 46 140 0 04 Nov 2017
Progressive Growing of GANs for Improved Quality, Stability, and Variation Tero Karras Timo Aila S. Laine J. Lehtinen GAN 118 7,339 0 27 Oct 2017
Mixed Precision Training Paulius Micikevicius Sharan Narang Jonah Alben G. Diamos Erich Elsen ... Boris Ginsburg Michael Houston Oleksii Kuchaiev Ganesh Venkatesh Hao Wu 149 1,792 0 10 Oct 2017
Semi-supervised Conditional GANs K. Sricharan R. Bala Matthew Shreve Hui Ding K. Saketh J. Sun GAN 41 54 0 19 Aug 2017
Unsupervised Domain Adaptation for Robust Speech Recognition via Variational Autoencoder-Based Data Augmentation Wei-Ning Hsu Yu Zhang James R. Glass 44 127 0 19 Jul 2017
Guiding InfoGAN with Semi-Supervision Adrian Spurr Emre Aksan Otmar Hilliges GAN 57 47 0 14 Jul 2017
Adversarial Network Bottleneck Features for Noise Robust Speaker Verification Hong Yu Zheng-Hua Tan Zhanyu Ma Jun Guo AAML 42 33 0 11 Jun 2017
InfoVAE: Information Maximizing Variational Autoencoders Shengjia Zhao Jiaming Song Stefano Ermon DRL 79 445 0 07 Jun 2017
Learning Representations of Emotional Speech with Deep Convolutional Generative Adversarial Networks Jonathan D. Chang Stefan Scherer SSL GAN 43 104 0 22 Apr 2017
Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders Jesse Engel Cinjon Resnick Adam Roberts Sander Dieleman Douglas Eck Karen Simonyan Mohammad Norouzi 106 623 0 05 Apr 2017
Tacotron: Towards End-to-End Speech Synthesis Yuxuan Wang RJ Skerry-Ryan Daisy Stanton Yonghui Wu Ron J. Weiss ... Samy Bengio Quoc V. Le Yannis Agiomyrgiannakis R. Clark Rif A. Saurous 153 1,819 0 29 Mar 2017