Neural Discrete Representation Learning

2 November 2017

Papers citing "Neural Discrete Representation Learning"

50 / 3,267 papers shown

Title
End-to-End Text-to-Speech using Latent Duration based on VQ-VAE Yusuke Yasuda Xin Wang Junichi Yamagishi 75 17 0 19 Oct 2020
CLAR: Contrastive Learning of Auditory Representations Haider Al-Tahan Y. Mohsenzadeh SSL 198 56 0 19 Oct 2020
Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders Masha Itkina Boris Ivanovic Ransalu Senanayake Mykel J. Kochenderfer Marco Pavone 128 18 0 19 Oct 2020
CQ-VAE: Coordinate Quantized VAE for Uncertainty Estimation with Application to Disk Shape Analysis from Lumbar Spine MRI Images Linchen Qian Jiasong Chen Timur Urakov Weiyong Gu Liang Liang 25 3 0 17 Oct 2020
Latent Vector Recovery of Audio GANs Andrew Keyes N. Bayat Vahid Reza Khazaie Y. Mohsenzadeh 13 3 0 16 Oct 2020
The NeteaseGames System for Voice Conversion Challenge 2020 with Vector-quantization Variational Autoencoder and WaveNet Haitong Zhang DRL 38 4 0 15 Oct 2020
Smaller World Models for Reinforcement Learning Jan Robine Tobias Uelwer Stefan Harmeling DRL 59 3 0 12 Oct 2020
Deep Sequence Learning for Video Anticipation: From Discrete and Deterministic to Continuous and Stochastic S. Aliakbarian AI4TS 40 0 0 09 Oct 2020
Event Representation with Sequential, Semi-Supervised Discrete Variables Mehdi Rezaee Francis Ferraro BDL DRL 59 14 0 09 Oct 2020
FastVC: Fast Voice Conversion with non-parallel data Oriol Barbany Milos Cernak 51 7 0 08 Oct 2020
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics Hirokazu Kameoka Takuhiro Kaneko Kou Tanaka Nobukatsu Hojo Shogo Seki DiffM 124 21 0 06 Oct 2020
A Contrastive Learning Approach for Training Variational Autoencoder Priors J. Aneja Alex Schwing Jan Kautz Arash Vahdat DRL 130 83 0 06 Oct 2020
The Academia Sinica Systems of Voice Conversion for VCC2020 Yu-Huai Peng Cheng-Hung Hu A. Kang Hung-Shin Lee Pin-Yuan Chen Yu Tsao Hsin-Min Wang 68 2 0 06 Oct 2020
Implicit Rank-Minimizing Autoencoder Li Jing Jure Zbontar Yann LeCun SSL DRL 83 49 0 01 Oct 2020
The Utility of Decorrelating Colour Spaces in Vector Quantised Variational Autoencoders A. Akbarinia Raquel Gil-Rodríguez Alban Flachot Matteo Toscani DRL 20 0 0 30 Sep 2020
Controllable Text Generation with Focused Variation Lei Shu Alexandros Papangelis Yi-Chia Wang Gokhan Tur Hu Xu Zhaleh Feizollahi Bing-Quan Liu Piero Molino 94 11 0 25 Sep 2020
A Unifying Review of Deep and Shallow Anomaly Detection Lukas Ruff Jacob R. Kauffmann Robert A. Vandermeulen G. Montavon Wojciech Samek Marius Kloft Thomas G. Dietterich Klaus-Robert Muller UQCV 152 806 0 24 Sep 2020
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers Jaemin Cho Jiasen Lu Dustin Schwenk Hannaneh Hajishirzi Aniruddha Kembhavi VLM MLLM 95 102 0 23 Sep 2020
Generative Model without Prior Distribution Matching Cong Geng Jia Wang Lixing Chen Zhiyong Gao GAN 392 1 0 23 Sep 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis Zhifeng Kong Ming-Yu Liu Jiaji Huang Kexin Zhao Bryan Catanzaro DiffM BDL 280 1,472 0 21 Sep 2020
Target Conditioning for One-to-Many Generation Marie-Anne Lachaux Armand Joulin Guillaume Lample 56 13 0 21 Sep 2020
DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition Chaoyou Fu Xiang Wu Yibo Hu Huaibo Huang Ran He CVBM 78 86 0 20 Sep 2020
Discond-VAE: Disentangling Continuous Factors from the Discrete Jaewoong Choi Geonho Hwang Myung-joo Kang CoGe CML 56 4 0 17 Sep 2020
Recurrent autoencoder with sequence-aware encoding Robert Susik AI4TS 31 6 0 15 Sep 2020
Brain2Word: Decoding Brain Activity for Language Generation Nicolas Affolter Béni Egressy Damian Pascual Roger Wattenhofer 107 24 0 10 Sep 2020
not-so-BigGAN: Generating High-Fidelity Images on Small Compute with Wavelet-based Super-Resolution Seung-Jun Han Akash Srivastava C. Hurwitz P. Sattigeri David D. Cox 63 8 0 09 Sep 2020
Multilinear Latent Conditioning for Generating Unseen Attribute Combinations Markos Georgopoulos Grigorios G. Chrysos Maja Pantic Yannis Panagakis GAN DRL 68 17 0 09 Sep 2020
Deep data compression for approximate ultrasonic image formation G. Pilikos L. Horchens K. Batenburg Tristan van Leeuwen F. Lucka 41 8 0 04 Sep 2020
End-to-End Learning of Neuromorphic Wireless Systems for Low-Power Edge Artificial Intelligence N. Skatchkovsky Hyeryung Jang Osvaldo Simeone 62 24 0 03 Sep 2020
Stochastic Graph Recurrent Neural Network Tijin Yan Hongwei Zhang Zirui Li Yuanqing Xia GNN BDL 33 5 0 01 Sep 2020
GIF: Generative Interpretable Faces Partha Ghosh Pravir Singh Gupta Roy Uziel Anurag Ranjan Michael J. Black Timo Bolkart CVBM AI4CE 114 77 0 31 Aug 2020
Hierarchical Timbre-Painting and Articulation Generation Michael Michelashvili Lior Wolf 90 12 0 30 Aug 2020
Dynamical Variational Autoencoders: A Comprehensive Review Laurent Girin Simon Leglaive Xiaoyu Bie Julien Diard Thomas Hueber Xavier Alameda-Pineda BDL 154 223 0 28 Aug 2020
Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion Yi Zhao Wen-Chin Huang Xiaohai Tian Junichi Yamagishi Rohan Kumar Das Tomi Kinnunen Zhenhua Ling Tomoki Toda 96 211 0 28 Aug 2020
Nonparallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks Hirokazu Kameoka Takuhiro Kaneko Kou Tanaka Nobukatsu Hojo 107 20 0 27 Aug 2020
asya: Mindful verbal communication using deep learning Ē. Urtāns Ariel Tabaks VLM 121 1 0 20 Aug 2020
Unsupervised Acoustic Unit Representation Learning for Voice Conversion using WaveNet Auto-encoders Mingjie Chen Thomas Hain SSL DRL 54 15 0 16 Aug 2020
Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion Recognition Shamane Siriwardhana Andrew Reis Rivindu Weerasekera Suranga Nanayakkara 92 112 0 15 Aug 2020
Audio- and Gaze-driven Facial Animation of Codec Avatars Alexander Richard Colin S. Lea Shugao Ma Juergen Gall Fernando de la Torre Yaser Sheikh CVBM 79 81 0 11 Aug 2020
Unsupervised Learning For Sequence-to-sequence Text-to-speech For Low-resource Languages Haitong Zhang Yue Lin 60 30 0 11 Aug 2020
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning Berrak Sisman Junichi Yamagishi Simon King Haizhou Li BDL 152 329 0 09 Aug 2020
Learning Sampling in Financial Statement Audits using Vector Quantised Autoencoder Neural Networks Marco Schreyer Timur Sattarov Anita Gierbl Bernd Reimer Damian Borth DRL 37 3 0 06 Aug 2020
Optimal Variance Control of the Score Function Gradient Estimator for Importance Weighted Bounds Valentin Liévin Andrea Dittadi Anders Christensen Ole Winther DRL 61 6 0 05 Aug 2020
Timbre latent space: exploration and creative aspects Antoine Caillon Adrien Bitton Brice Gatinet P. Esling 67 1 0 04 Aug 2020
Learning from Few Samples: A Survey Nihar Bendre Hugo Terashima-Marín Peyman Najafirad VLM BDL 87 54 0 30 Jul 2020
Foveation for Segmentation of Ultra-High Resolution Images Chen Jin Ryutaro Tanno Moucheng Xu T. Mertzanidou Daniel C. Alexander AI4TS 53 4 0 29 Jul 2020
dMelodies: A Music Dataset for Disentanglement Learning Ashis Pati Siddharth Gururani Alexander Lerch CoGe DRL 66 10 0 29 Jul 2020
Privacy-preserving Voice Analysis via Disentangled Representations Ranya Aloufi Hamed Haddadi David E. Boyle DRL 132 58 0 29 Jul 2020
Generative networks as inverse problems with fractional wavelet scattering networks Jiasong Wu Jing Zhang Fuzhi Wu Youyong Kong Guanyu Yang L. Senhadji H. Shu GAN 54 1 0 28 Jul 2020
Unsupervised Subword Modeling Using Autoregressive Pretraining and Cross-Lingual Phone-Aware Modeling Siyuan Feng O. Scharenborg SSL 72 4 0 25 Jul 2020