1711.00937
Neural Discrete Representation Learning
2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
Papers citing
"Neural Discrete Representation Learning"
50 / 2,748 papers shown
Title
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers
Jaemin Cho
Jiasen Lu
Dustin Schwenk
Hannaneh Hajishirzi
Aniruddha Kembhavi
VLM
MLLM
30
102
0
23 Sep 2020
Generative Model without Prior Distribution Matching
Cong Geng
Jia Wang
L. Chen
Zhiyong Gao
GAN
162
1
0
23 Sep 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffM
BDL
34
1,392
0
21 Sep 2020
Brain2Word: Decoding Brain Activity for Language Generation
Nicolas Affolter
Béni Egressy
Damian Pascual
Roger Wattenhofer
11
21
0
10 Sep 2020
Multilinear Latent Conditioning for Generating Unseen Attribute Combinations
Markos Georgopoulos
Grigorios G. Chrysos
M. Pantic
Yannis Panagakis
GAN
DRL
19
19
0
09 Sep 2020
GIF: Generative Interpretable Faces
Partha Ghosh
Pravir Singh Gupta
Roy Uziel
Anurag Ranjan
Michael J. Black
Timo Bolkart
CVBM
AI4CE
8
76
0
31 Aug 2020
asya: Mindful verbal communication using deep learning
Ē. Urtāns
Ariel Tabaks
VLM
33
1
0
20 Aug 2020
Audio- and Gaze-driven Facial Animation of Codec Avatars
Alexander Richard
Colin S. Lea
Shugao Ma
Juergen Gall
Fernando de la Torre
Yaser Sheikh
CVBM
21
81
0
11 Aug 2020
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
41
317
0
09 Aug 2020
Timbre latent space: exploration and creative aspects
Antoine Caillon
Adrien Bitton
Brice Gatinet
P. Esling
18
1
0
04 Aug 2020
Learning from Few Samples: A Survey
Nihar Bendre
Hugo Terashima-Marín
Peyman Najafirad
VLM
BDL
26
54
0
30 Jul 2020
Privacy-preserving Voice Analysis via Disentangled Representations
Ranya Aloufi
Hamed Haddadi
David E. Boyle
DRL
19
58
0
29 Jul 2020
Incorporating Reinforced Adversarial Learning in Autoregressive Image Generation
Kenan E. Ak
N. Xu
Zhe-nan Lin
Yilin Wang
19
12
0
20 Jul 2020
Failure Modes of Variational Autoencoders and Their Effects on Downstream Tasks
Yaniv Yacoby
Weiwei Pan
Finale Doshi-Velez
CML
DRL
27
25
0
14 Jul 2020
Vector-Quantized Timbre Representation
Adrien Bitton
P. Esling
Tatsuya Harada
20
12
0
13 Jul 2020
Prosodic Prominence and Boundaries in Sequence-to-Sequence Speech Synthesis
Antti Suni
Sofoklis Kakouros
M. Vainio
J. Šimko
19
17
0
29 Jun 2020
Locally Masked Convolution for Autoregressive Models
Ajay Jain
Pieter Abbeel
Deepak Pathak
DiffM
OffRL
39
31
0
22 Jun 2020
Deep Residual Mixture Models
Perttu Hämäläinen
Martin Trapp
Tuure Saloheimo
Arno Solin
36
8
0
22 Jun 2020
Set Distribution Networks: a Generative Model for Sets of Images
Shuangfei Zhai
Walter A. Talbott
Miguel Angel Bautista
Carlos Guestrin
J. Susskind
GAN
29
2
0
18 Jun 2020
Latent Video Transformer
Ruslan Rakhimov
Denis Volkhonskiy
Alexey Artemov
Denis Zorin
Evgeny Burnaev
VGen
33
118
0
18 Jun 2020
Temporal Phenotyping using Deep Predictive Clustering of Disease Progression
Changhee Lee
M. Schaar
OOD
24
53
0
15 Jun 2020
Self-supervised Learning: Generative or Contrastive
Xiao Liu
Fanjin Zhang
Zhenyu Hou
Zhaoyu Wang
Li Mian
Jing Zhang
Jie Tang
SSL
50
1,586
0
15 Jun 2020
Deep generative models for musical audio synthesis
M. Huzaifah
L. Wyse
27
20
0
10 Jun 2020
Probabilistic Autoencoder
Vanessa Böhm
U. Seljak
UQCV
BDL
DRL
24
32
0
09 Jun 2020
Unsupervised Paraphrase Generation using Pre-trained Language Models
Chaitra V. Hegde
Shrikumar Patil
LRM
21
38
0
09 Jun 2020
A Survey on Generative Adversarial Networks: Variants, Applications, and Training
Abdul Jabbar
Xi Li
Bourahla Omar
25
266
0
09 Jun 2020
Variational Variance: Simple, Reliable, Calibrated Heteroscedastic Noise Variance Parameterization
Andrew Stirn
David A. Knowles
DRL
18
10
0
08 Jun 2020
Speech-to-Singing Conversion based on Boundary Equilibrium GAN
Da-Yi Wu
Yi-Hsuan Yang
GAN
8
8
0
28 May 2020
Contrastive Predictive Coding Supported Factorized Variational Autoencoder for Unsupervised Learning of Disentangled Speech Representations
Janek Ebbers
Michael Kuhlmann
Tobias Cord-Landwehr
Reinhold Haeb-Umbach
DRL
CoGe
SSL
31
4
0
26 May 2020
Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search
Jaehyeon Kim
Sungwon Kim
Jungil Kong
Sungroh Yoon
54
475
0
22 May 2020
Vector-quantized neural networks for acoustic unit discovery in the ZeroSpeech 2020 challenge
Benjamin van Niekerk
Leanne Nortje
Herman Kamper
13
115
0
19 May 2020
Robust Training of Vector Quantized Bottleneck Models
A. Lancucki
J. Chorowski
Guillaume Sanchez
R. Marxer
Nanxin Chen
Hans J. G. A. Dolfing
Sameer Khurana
Tanel Alumäe
Antoine Laurent
29
58
0
18 May 2020
DiscreTalk: Text-to-Speech as a Machine Translation Problem
Tomoki Hayashi
Shinji Watanabe
27
32
0
12 May 2020
Lossy Compression with Distortion Constrained Optimization
T. V. Rozendaal
Guillaume Sautière
Taco S. Cohen
31
13
0
08 May 2020
A Batch Normalized Inference Network Keeps the KL Vanishing Away
Qile Zhu
Jianlin Su
Wei Bi
Xiaojiang Liu
Xiyao Ma
Xiaolin Li
D. Wu
BDL
DRL
34
61
0
27 Apr 2020
Vector Quantized Contrastive Predictive Coding for Template-based Music Generation
Gaëtan Hadjeres
Léopold Crestel
34
18
0
21 Apr 2020
Hybrid Classification and Reasoning for Image-based Constraint Solving
Maxime Mulamba
Jayanta Mandi
Rocsildes Canoy
Tias Guns
33
11
0
24 Mar 2020
Characterizing and Avoiding Problematic Global Optima of Variational Autoencoders
Yaniv Yacoby
Weiwei Pan
Finale Doshi-Velez
DRL
21
4
0
17 Mar 2020
Uncertainty Estimation Using a Single Deep Deterministic Neural Network
Joost R. van Amersfoort
Lewis Smith
Yee Whye Teh
Y. Gal
UQCV
BDL
14
55
0
04 Mar 2020
Learning Representations by Predicting Bags of Visual Words
Spyros Gidaris
Andrei Bursuc
N. Komodakis
P. Pérez
Matthieu Cord
SSL
28
117
0
27 Feb 2020
Predictive Sampling with Forecasting Autoregressive Models
Auke Wiggers
Emiel Hoogeboom
BDL
27
16
0
23 Feb 2020
Disentangled Speech Embeddings using Cross-modal Self-supervision
Arsha Nagrani
Joon Son Chung
Samuel Albanie
Andrew Zisserman
SSL
21
88
0
20 Feb 2020
Latent Normalizing Flows for Many-to-Many Cross-Domain Mappings
Shweta Mahajan
Iryna Gurevych
Stefan Roth
DRL
21
36
0
16 Feb 2020
Many-to-Many Voice Conversion using Conditional Cycle-Consistent Adversarial Networks
Shindong Lee
Bonggu Ko
Keonnyeong Lee
In-Chul Yoo
Dongsuk Yook
GAN
30
33
0
15 Feb 2020
Efficient And Scalable Neural Residual Waveform Coding With Collaborative Quantization
Kai Zhen
Mi Suk Lee
Jongmo Sung
Seungkwon Beack
Minje Kim
35
20
0
13 Feb 2020
Variational Autoencoders with Riemannian Brownian Motion Priors
Dimitris Kalatzis
David Eklund
Georgios Arvanitidis
Søren Hauberg
BDL
DRL
60
48
0
12 Feb 2020
Content Based Singing Voice Extraction From a Musical Mixture
Pritish Chandna
Merlijn Blaauw
J. Bonada
E. Gómez
28
14
0
12 Feb 2020
Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior
Guangzhi Sun
Yu Zhang
Ron J. Weiss
Yuan Cao
Heiga Zen
Andrew Rosenberg
Bhuvana Ramabhadran
Yonghui Wu
DiffM
25
92
0
06 Feb 2020
Semi-supervised Grasp Detection by Representation Learning in a Vector Quantized Latent Space
Mridul Mahajan
Tryambak Bhattacharjee
Arya Krishnan
Priya Shukla
G. C. Nandi
DRL
SSL
16
3
0
23 Jan 2020
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn W. Schuller
AI4TS
32
81
0
02 Jan 2020