ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.00937
  4. Cited By
Neural Discrete Representation Learning
v1v2 (latest)

Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
    BDLSSLOCL
ArXiv (abs)PDFHTML

Papers citing "Neural Discrete Representation Learning"

50 / 3,267 papers shown
Title
Geometry-Free View Synthesis: Transformers and no 3D Priors
Geometry-Free View Synthesis: Transformers and no 3D Priors
Robin Rombach
Patrick Esser
Bjorn Ommer
ViT
114
95
0
15 Apr 2021
Spectrogram Inpainting for Interactive Generation of Instrument Sounds
Spectrogram Inpainting for Interactive Generation of Instrument Sounds
Théis Bazin
Gaëtan Hadjeres
P. Esling
M. Malt
61
11
0
15 Apr 2021
FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice
  Conversion
FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice Conversion
Hirokazu Kameoka
Kou Tanaka
Takuhiro Kaneko
85
21
0
14 Apr 2021
NoiseVC: Towards High Quality Zero-Shot Voice Conversion
NoiseVC: Towards High Quality Zero-Shot Voice Conversion
Shijun Wang
Damian Borth
DRL
80
6
0
13 Apr 2021
Visiting the Invisible: Layer-by-Layer Completed Scene Decomposition
Visiting the Invisible: Layer-by-Layer Completed Scene Decomposition
Chuanxia Zheng
Duy-Son Dao
Guoxian Song
Tat-Jen Cham
Jianfei Cai
47
20
0
12 Apr 2021
Boltzmann Tuning of Generative Models
Boltzmann Tuning of Generative Models
Victor Berger
Michele Sebag
62
0
0
12 Apr 2021
Learned transform compression with optimized entropy encoding
Learned transform compression with optimized entropy encoding
Magda Gregorova
Marc Desaules
Alexandros Kalousis
36
2
0
07 Apr 2021
Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language
  Representation Learning
Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning
Zhicheng Huang
Zhaoyang Zeng
Yupan Huang
Bei Liu
Dongmei Fu
Jianlong Fu
VLMViT
167
274
0
07 Apr 2021
Creativity and Machine Learning: A Survey
Creativity and Machine Learning: A Survey
Giorgio Franceschelli
Mirco Musolesi
VLMAI4CE
129
43
0
06 Apr 2021
Defending Against Image Corruptions Through Adversarial Augmentations
Defending Against Image Corruptions Through Adversarial Augmentations
D. A. Calian
Florian Stimberg
Olivia Wiles
Sylvestre-Alvise Rebuffi
András Gyorgy
Timothy A. Mann
Sven Gowal
AAML
93
41
0
02 Apr 2021
Unsupervised Acoustic Unit Discovery by Leveraging a
  Language-Independent Subword Discriminative Feature Representation
Unsupervised Acoustic Unit Discovery by Leveraging a Language-Independent Subword Discriminative Feature Representation
Siyuan Feng
Piotr Żelasko
Laureano Moro-Velazquez
O. Scharenborg
71
4
0
02 Apr 2021
Speech Resynthesis from Discrete Disentangled Self-Supervised
  Representations
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations
Adam Polyak
Yossi Adi
Jade Copet
Eugene Kharitonov
Kushal Lakhotia
Wei-Ning Hsu
Abdel-rahman Mohamed
Emmanuel Dupoux
149
318
0
01 Apr 2021
A Closer Look at Fourier Spectrum Discrepancies for CNN-generated Images
  Detection
A Closer Look at Fourier Spectrum Discrepancies for CNN-generated Images Detection
Keshigeyan Chandrasegaran
Ngoc-Trung Tran
Ngai-Man Cheung
94
83
0
31 Mar 2021
Unsupervised Disentanglement of Linear-Encoded Facial Semantics
Unsupervised Disentanglement of Linear-Encoded Facial Semantics
Yutong Zheng
Yu-Kai Huang
R. Tao
Zhiqiang Shen
Marios Savvides
CVBMDRL
66
12
0
30 Mar 2021
PixelTransformer: Sample Conditioned Signal Generation
PixelTransformer: Sample Conditioned Signal Generation
Shubham Tulsiani
Abhinav Gupta
76
17
0
29 Mar 2021
Scalable and Efficient Neural Speech Coding: A Hybrid Design
Scalable and Efficient Neural Speech Coding: A Hybrid Design
Kai Zhen
Jongmo Sung
Mi Suk Lee
Seung-Wha Beack
Minje Kim
95
14
0
27 Mar 2021
Decomposing Normal and Abnormal Features of Medical Images into Discrete
  Latent Codes for Content-Based Image Retrieval
Decomposing Normal and Abnormal Features of Medical Images into Discrete Latent Codes for Content-Based Image Retrieval
Kazuma Kobayashi
Ryuichiro Hataya
Y. Kurose
M. Miyake
Masamichi Takahashi
Akiko Nakagawa
Tatsuya Harada
Ryuji Hamamoto
MedIm
96
19
0
23 Mar 2021
Tiny Transformers for Environmental Sound Classification at the Edge
Tiny Transformers for Environmental Sound Classification at the Edge
David Elliott
Carlos E. Otero
Steven Wyatt
Evan Martino
83
16
0
22 Mar 2021
Generating Diverse Structure for Image Inpainting With Hierarchical
  VQ-VAE
Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE
Jialun Peng
Dong Liu
Songcen Xu
Houqiang Li
DiffM
61
196
0
18 Mar 2021
Variable-rate discrete representation learning
Variable-rate discrete representation learning
Sander Dieleman
C. Nash
Jesse Engel
Karen Simonyan
BDLDRL
82
24
0
10 Mar 2021
Wav2vec-C: A Self-supervised Model for Speech Representation Learning
Wav2vec-C: A Self-supervised Model for Speech Representation Learning
Samik Sadhu
Di He
Che-Wei Huang
Sri Harish Reddy Mallidi
Minhua Wu
Ariya Rastrow
A. Stolcke
J. Droppo
Roland Maas
SSL
70
49
0
09 Mar 2021
Deep Generative Modelling: A Comparative Review of VAEs, GANs,
  Normalizing Flows, Energy-Based and Autoregressive Models
Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models
Sam Bond-Taylor
Adam Leach
Yang Long
Chris G. Willcocks
VLMTPM
203
511
0
08 Mar 2021
Learning to Generate 3D Shapes with Generative Cellular Automata
Learning to Generate 3D Shapes with Generative Cellular Automata
Dongsu Zhang
Changwoon Choi
Jeonghwan Kim
Y. Kim
84
24
0
06 Mar 2021
Generating Images with Sparse Representations
Generating Images with Sparse Representations
C. Nash
Jacob Menick
Sander Dieleman
Peter W. Battaglia
102
211
0
05 Mar 2021
crank: An Open-Source Software for Nonparallel Voice Conversion Based on
  Vector-Quantized Variational Autoencoder
crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder
Kazuhiro Kobayashi
Wen-Chin Huang
Yi-Chiao Wu
Patrick Lumban Tobing
Tomoki Hayashi
Tomoki Toda
BDLDRL
79
19
0
04 Mar 2021
Enabling Visual Action Planning for Object Manipulation through Latent
  Space Roadmap
Enabling Visual Action Planning for Object Manipulation through Latent Space Roadmap
M. Lippi
Petra Poklukar
Michael C. Welle
Anastasia Varava
Hang Yin
Alessandro Marino
Danica Kragic
62
14
0
03 Mar 2021
Predicting Video with VQVAE
Predicting Video with VQVAE
Jacob Walker
Ali Razavi
Aaron van den Oord
DRL
152
69
0
02 Mar 2021
A survey on Variational Autoencoders from a GreenAI perspective
A survey on Variational Autoencoders from a GreenAI perspective
Andrea Asperti
David Evangelista
E. Loli Piccolomini
DRL
91
53
0
01 Mar 2021
M6: A Chinese Multimodal Pretrainer
M6: A Chinese Multimodal Pretrainer
Junyang Lin
Rui Men
An Yang
Chan Zhou
Ming Ding
...
Yong Li
Wei Lin
Jingren Zhou
J. Tang
Hongxia Yang
VLMMoE
163
134
0
01 Mar 2021
Zero-Shot Text-to-Image Generation
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
444
5,020
0
24 Feb 2021
Unsupervised Brain Anomaly Detection and Segmentation with Transformers
Unsupervised Brain Anomaly Detection and Segmentation with Transformers
W. H. Pinaya
Petru-Daniel Tudosiu
Robert J. Gray
G. Rees
P. Nachev
Sebastien Ourselin
M. Jorge Cardoso
ViTMedIm
74
61
0
23 Feb 2021
Anytime Sampling for Autoregressive Models via Ordered Autoencoding
Anytime Sampling for Autoregressive Models via Ordered Autoencoding
Yilun Xu
Yang Song
Sahaj Garg
Linyuan Gong
Rui Shu
Aditya Grover
Stefano Ermon
DiffM
93
11
0
23 Feb 2021
Uncertainty Estimation Using Riemannian Model Dynamics for Offline
  Reinforcement Learning
Uncertainty Estimation Using Riemannian Model Dynamics for Offline Reinforcement Learning
Guy Tennenholtz
Shie Mannor
OffRL
61
12
0
22 Feb 2021
Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding
Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding
Yangjun Ruan
Karen Ullrich
Daniel de Souza Severo
James Townsend
Ashish Khisti
Arnaud Doucet
Alireza Makhzani
Chris J. Maddison
118
25
0
22 Feb 2021
Measuring the Stability of Learned Features
Measuring the Stability of Learned Features
Kris Sankaran
OOD
32
0
0
20 Feb 2021
Preventing Oversmoothing in VAE via Generalized Variance
  Parameterization
Preventing Oversmoothing in VAE via Generalized Variance Parameterization
Yuhta Takida
Wei-Hsiang Liao
Chieh-Hsin Lai
Toshimitsu Uesaka
Shusuke Takahashi
Yuki Mitsufuji
DRL
94
15
0
17 Feb 2021
Enhancing into the codec: Noise Robust Speech Coding with
  Vector-Quantized Autoencoders
Enhancing into the codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders
Jonah Casebeer
Vinjai Vale
Umut Isik
J. Valin
Ritwik Giri
A. Krishnaswamy
100
20
0
12 Feb 2021
VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep
  VAE with Residual Attention
VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention
Peng Liu
Yuewen Cao
Songxiang Liu
Na Hu
Guangzhi Li
Chao Weng
Jane Polak Scowcroft
95
22
0
12 Feb 2021
Self-Supervised VQ-VAE for One-Shot Music Style Transfer
Self-Supervised VQ-VAE for One-Shot Music Style Transfer
Ondřej Cífka
A. Ozerov
Umut Simsekli
G. Richard
79
28
0
10 Feb 2021
Inducing Meaningful Units from Character Sequences with Dynamic Capacity
  Slot Attention
Inducing Meaningful Units from Character Sequences with Dynamic Capacity Slot Attention
Melika Behjati
James Henderson
OCL
76
1
0
01 Feb 2021
Generative Spoken Language Modeling from Raw Audio
Generative Spoken Language Modeling from Raw Audio
Kushal Lakhotia
Evgeny Kharitonov
Wei-Ning Hsu
Yossi Adi
Adam Polyak
...
Tu Nguyen
Jade Copet
Alexei Baevski
A. Mohamed
Emmanuel Dupoux
AuLLM
313
366
0
01 Feb 2021
CNN with large memory layers
CNN with large memory layers
R. Karimov
Yury Malkov
Karim Iskakov
Victor Lempitsky
61
0
0
27 Jan 2021
Disentangled Sequence Clustering for Human Intention Inference
Disentangled Sequence Clustering for Human Intention Inference
Mark Zolotas
Y. Demiris
DRL
104
5
0
23 Jan 2021
Hierarchical disentangled representation learning for singing voice
  conversion
Hierarchical disentangled representation learning for singing voice conversion
Naoya Takahashi
M. Singh
Yuki Mitsufuji
DRL
60
14
0
18 Jan 2021
Cauchy-Schwarz Regularized Autoencoder
Cauchy-Schwarz Regularized Autoencoder
Linh-Tam Tran
Maja Pantic
M. Deisenroth
DRLBDL
77
18
0
06 Jan 2021
HAVANA: Hierarchical and Variation-Normalized Autoencoder for Person
  Re-identification
HAVANA: Hierarchical and Variation-Normalized Autoencoder for Person Re-identification
Jiawei Ren
Xiao Ma
Chen Xu
Haiyu Zhao
Shuai Yi
BDL
70
4
0
06 Jan 2021
Psychoacoustic Calibration of Loss Functions for Efficient End-to-End
  Neural Audio Coding
Psychoacoustic Calibration of Loss Functions for Efficient End-to-End Neural Audio Coding
Kai Zhen
Mi Suk Lee
Jongmo Sung
Seung-Wha Beack
Minje Kim
69
21
0
31 Dec 2020
Discovering Dialog Structure Graph for Open-Domain Dialog Generation
Discovering Dialog Structure Graph for Open-Domain Dialog Generation
Jun Xu
Zeyang Lei
Haifeng Wang
Zheng-Yu Niu
Hua Wu
Wanxiang Che
Ting Liu
45
6
0
31 Dec 2020
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units
Wei-Ning Hsu
David Harwath
Christopher Song
James R. Glass
CLIP
92
67
0
31 Dec 2020
Interpretable NLG for Task-oriented Dialogue Systems with Heterogeneous
  Rendering Machines
Interpretable NLG for Task-oriented Dialogue Systems with Heterogeneous Rendering Machines
Yangming Li
Kaisheng Yao
54
5
0
29 Dec 2020
Previous
123...575859...646566
Next