ResearchTrend.AI
Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
    BDL, SSL, OCL

Papers citing "Neural Discrete Representation Learning"

50 / 3,267 papers shown
End-to-End Text-to-Speech using Latent Duration based on VQ-VAE
Yusuke Yasuda
Xin Wang
Junichi Yamagishi
75
17
0
19 Oct 2020
CLAR: Contrastive Learning of Auditory Representations
Haider Al-Tahan
Y. Mohsenzadeh
SSL
198
56
0
19 Oct 2020
Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders
Masha Itkina
Boris Ivanovic
Ransalu Senanayake
Mykel J. Kochenderfer
Marco Pavone
128
18
0
19 Oct 2020
CQ-VAE: Coordinate Quantized VAE for Uncertainty Estimation with Application to Disk Shape Analysis from Lumbar Spine MRI Images
Linchen Qian
Jiasong Chen
Timur Urakov
Weiyong Gu
Liang Liang
25
3
0
17 Oct 2020
Latent Vector Recovery of Audio GANs
Andrew Keyes
N. Bayat
Vahid Reza Khazaie
Y. Mohsenzadeh
13
3
0
16 Oct 2020
The NeteaseGames System for Voice Conversion Challenge 2020 with Vector-quantization Variational Autoencoder and WaveNet
Haitong Zhang
DRL
38
4
0
15 Oct 2020
Smaller World Models for Reinforcement Learning
Jan Robine
Tobias Uelwer
Stefan Harmeling
DRL
59
3
0
12 Oct 2020
Deep Sequence Learning for Video Anticipation: From Discrete and Deterministic to Continuous and Stochastic
S. Aliakbarian
AI4TS
40
0
0
09 Oct 2020
Event Representation with Sequential, Semi-Supervised Discrete Variables
Mehdi Rezaee
Francis Ferraro
BDL, DRL
59
14
0
09 Oct 2020
FastVC: Fast Voice Conversion with non-parallel data
Oriol Barbany
Milos Cernak
51
7
0
08 Oct 2020
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
Shogo Seki
DiffM
124
21
0
06 Oct 2020
A Contrastive Learning Approach for Training Variational Autoencoder Priors
J. Aneja
Alex Schwing
Jan Kautz
Arash Vahdat
DRL
130
83
0
06 Oct 2020
The Academia Sinica Systems of Voice Conversion for VCC2020
Yu-Huai Peng
Cheng-Hung Hu
A. Kang
Hung-Shin Lee
Pin-Yuan Chen
Yu Tsao
Hsin-Min Wang
68
2
0
06 Oct 2020
Implicit Rank-Minimizing Autoencoder
Li Jing
Jure Zbontar
Yann LeCun
SSL, DRL
83
49
0
01 Oct 2020
The Utility of Decorrelating Colour Spaces in Vector Quantised Variational Autoencoders
A. Akbarinia
Raquel Gil-Rodríguez
Alban Flachot
Matteo Toscani
DRL
20
0
0
30 Sep 2020
Controllable Text Generation with Focused Variation
Lei Shu
Alexandros Papangelis
Yi-Chia Wang
Gokhan Tur
Hu Xu
Zhaleh Feizollahi
Bing-Quan Liu
Piero Molino
94
11
0
25 Sep 2020
A Unifying Review of Deep and Shallow Anomaly Detection
Lukas Ruff
Jacob R. Kauffmann
Robert A. Vandermeulen
G. Montavon
Wojciech Samek
Marius Kloft
Thomas G. Dietterich
Klaus-Robert Muller
UQCV
152
806
0
24 Sep 2020
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers
Jaemin Cho
Jiasen Lu
Dustin Schwenk
Hannaneh Hajishirzi
Aniruddha Kembhavi
VLM, MLLM
95
102
0
23 Sep 2020
Generative Model without Prior Distribution Matching
Cong Geng
Jia Wang
Lixing Chen
Zhiyong Gao
GAN
392
1
0
23 Sep 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffM, BDL
280
1,472
0
21 Sep 2020
Target Conditioning for One-to-Many Generation
Marie-Anne Lachaux
Armand Joulin
Guillaume Lample
56
13
0
21 Sep 2020
DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition
Chaoyou Fu
Xiang Wu
Yibo Hu
Huaibo Huang
Ran He
CVBM
78
86
0
20 Sep 2020
Discond-VAE: Disentangling Continuous Factors from the Discrete
Jaewoong Choi
Geonho Hwang
Myung-joo Kang
CoGe, CML
56
4
0
17 Sep 2020
Recurrent autoencoder with sequence-aware encoding
Robert Susik
AI4TS
31
6
0
15 Sep 2020
Brain2Word: Decoding Brain Activity for Language Generation
Nicolas Affolter
Béni Egressy
Damian Pascual
Roger Wattenhofer
107
24
0
10 Sep 2020
not-so-BigGAN: Generating High-Fidelity Images on Small Compute with Wavelet-based Super-Resolution
Seung-Jun Han
Akash Srivastava
C. Hurwitz
P. Sattigeri
David D. Cox
63
8
0
09 Sep 2020
Multilinear Latent Conditioning for Generating Unseen Attribute Combinations
Markos Georgopoulos
Grigorios G. Chrysos
Maja Pantic
Yannis Panagakis
GAN, DRL
68
17
0
09 Sep 2020
Deep data compression for approximate ultrasonic image formation
G. Pilikos
L. Horchens
K. Batenburg
Tristan van Leeuwen
F. Lucka
41
8
0
04 Sep 2020
End-to-End Learning of Neuromorphic Wireless Systems for Low-Power Edge Artificial Intelligence
N. Skatchkovsky
Hyeryung Jang
Osvaldo Simeone
62
24
0
03 Sep 2020
Stochastic Graph Recurrent Neural Network
Tijin Yan
Hongwei Zhang
Zirui Li
Yuanqing Xia
GNN, BDL
33
5
0
01 Sep 2020
GIF: Generative Interpretable Faces
Partha Ghosh
Pravir Singh Gupta
Roy Uziel
Anurag Ranjan
Michael J. Black
Timo Bolkart
CVBM, AI4CE
114
77
0
31 Aug 2020
Hierarchical Timbre-Painting and Articulation Generation
Michael Michelashvili
Lior Wolf
90
12
0
30 Aug 2020
Dynamical Variational Autoencoders: A Comprehensive Review
Laurent Girin
Simon Leglaive
Xiaoyu Bie
Julien Diard
Thomas Hueber
Xavier Alameda-Pineda
BDL
154
223
0
28 Aug 2020
Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion
Yi Zhao
Wen-Chin Huang
Xiaohai Tian
Junichi Yamagishi
Rohan Kumar Das
Tomi Kinnunen
Zhenhua Ling
Tomoki Toda
96
211
0
28 Aug 2020
Nonparallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
107
20
0
27 Aug 2020
asya: Mindful verbal communication using deep learning
Ē. Urtāns
Ariel Tabaks
VLM
121
1
0
20 Aug 2020
Unsupervised Acoustic Unit Representation Learning for Voice Conversion using WaveNet Auto-encoders
Mingjie Chen
Thomas Hain
SSL, DRL
54
15
0
16 Aug 2020
Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion Recognition
Shamane Siriwardhana
Andrew Reis
Rivindu Weerasekera
Suranga Nanayakkara
92
112
0
15 Aug 2020
Audio- and Gaze-driven Facial Animation of Codec Avatars
Alexander Richard
Colin S. Lea
Shugao Ma
Juergen Gall
Fernando de la Torre
Yaser Sheikh
CVBM
79
81
0
11 Aug 2020
Unsupervised Learning For Sequence-to-sequence Text-to-speech For Low-resource Languages
Haitong Zhang
Yue Lin
60
30
0
11 Aug 2020
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
152
329
0
09 Aug 2020
Learning Sampling in Financial Statement Audits using Vector Quantised Autoencoder Neural Networks
Marco Schreyer
Timur Sattarov
Anita Gierbl
Bernd Reimer
Damian Borth
DRL
37
3
0
06 Aug 2020
Optimal Variance Control of the Score Function Gradient Estimator for Importance Weighted Bounds
Valentin Liévin
Andrea Dittadi
Anders Christensen
Ole Winther
DRL
61
6
0
05 Aug 2020
Timbre latent space: exploration and creative aspects
Antoine Caillon
Adrien Bitton
Brice Gatinet
P. Esling
67
1
0
04 Aug 2020
Learning from Few Samples: A Survey
Nihar Bendre
Hugo Terashima-Marín
Peyman Najafirad
VLM, BDL
87
54
0
30 Jul 2020
Foveation for Segmentation of Ultra-High Resolution Images
Chen Jin
Ryutaro Tanno
Moucheng Xu
T. Mertzanidou
Daniel C. Alexander
AI4TS
53
4
0
29 Jul 2020
dMelodies: A Music Dataset for Disentanglement Learning
Ashis Pati
Siddharth Gururani
Alexander Lerch
CoGe, DRL
66
10
0
29 Jul 2020
Privacy-preserving Voice Analysis via Disentangled Representations
Ranya Aloufi
Hamed Haddadi
David E. Boyle
DRL
132
58
0
29 Jul 2020
Generative networks as inverse problems with fractional wavelet scattering networks
Jiasong Wu
Jing Zhang
Fuzhi Wu
Youyong Kong
Guanyu Yang
L. Senhadji
H. Shu
GAN
54
1
0
28 Jul 2020
Unsupervised Subword Modeling Using Autoregressive Pretraining and Cross-Lingual Phone-Aware Modeling
Siyuan Feng
O. Scharenborg
SSL
72
4
0
25 Jul 2020