Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1711.00937
Cited By
v1
v2 (latest)
Neural Discrete Representation Learning
2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Neural Discrete Representation Learning"
50 / 3,267 papers shown
Title
ShapeCrafter: A Recursive Text-Conditioned 3D Shape Generation Model
Rao Fu
Xiaoyu Zhan
Yiwen Chen
Daniel E. Ritchie
Srinath Sridhar
128
79
0
19 Jul 2022
FewGAN: Generating from the Joint Distribution of a Few Images
Lior Ben-Moshe
Sagie Benaim
Lior Wolf
GAN
112
2
0
18 Jul 2022
Latent-Domain Predictive Neural Speech Coding
Xue Jiang
Xiulian Peng
Huaying Xue
Yuan Zhang
Yan Lu
86
18
0
18 Jul 2022
Guaranteed Discovery of Control-Endogenous Latent States with Multi-Step Inverse Models
Alex Lamb
Riashat Islam
Yonathan Efroni
Aniket Didolkar
Dipendra Kumar Misra
Dylan J. Foster
Lekan Molu
Rajan Chari
A. Krishnamurthy
John Langford
99
24
0
17 Jul 2022
FashionViL: Fashion-Focused Vision-and-Language Representation Learning
Xiaoping Han
Licheng Yu
Xiatian Zhu
Li Zhang
Yi-Zhe Song
Tao Xiang
AI4TS
54
49
0
17 Jul 2022
Position Prediction as an Effective Pretraining Strategy
Shuangfei Zhai
Navdeep Jaitly
Jason Ramapuram
Dan Busbridge
Tatiana Likhomanenko
Joseph Y. Cheng
Walter A. Talbott
Chen Huang
Hanlin Goh
J. Susskind
ViT
91
25
0
15 Jul 2022
Comparing the latent space of generative models
Andrea Asperti
Valerio Tonelli
DRL
84
13
0
14 Jul 2022
N-Grammer: Augmenting Transformers with latent n-grams
Aurko Roy
Rohan Anil
Guangda Lai
Benjamin Lee
Jeffrey Zhao
...
Yu
Phuong Dao
Christopher Fifty
Zhiwen Chen
Yonghui Wu
77
8
0
13 Jul 2022
Collaborative Quantization Embeddings for Intra-Subject Prostate MR Image Registration
Ziyi Shen
Qianye Yang
Yuming Shen
F. Giganti
V. Stavrinides
...
M. Rusu
G. Sonn
Philip Torr
D. Barratt
Yipeng Hu
69
3
0
13 Jul 2022
Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent Variable Inference for Text Generation
Jinyi Hu
Xiaoyuan Yi
Wenhao Li
Maosong Sun
Xing Xie
122
21
0
13 Jul 2022
Eliminating Gradient Conflict in Reference-based Line-Art Colorization
Zekun Li
Zhengyang Geng
Zhao Kang
Wenyu Chen
Yibo Yang
109
37
0
13 Jul 2022
Learning Representations for CSI Adaptive Quantization and Feedback
Valentina Rizzello
Matteo Nerini
M. Joham
B. Clerckx
Wolfgang Utschick
MQ
32
6
0
13 Jul 2022
Earthformer: Exploring Space-Time Transformers for Earth System Forecasting
Zhihan Gao
Xingjian Shi
Hao Wang
Yi Zhu
Yuyang Wang
Mu Li
Dit-Yan Yeung
AI4TS
100
159
0
12 Jul 2022
SkexGen: Autoregressive Generation of CAD Construction Sequences with Disentangled Codebooks
Xiang Xu
Karl D. D. Willis
Joseph G. Lambourne
Chin-Yi Cheng
P. Jayaraman
Yasutaka Furukawa
97
78
0
11 Jul 2022
A Comparative Study of Self-supervised Speech Representation Based Voice Conversion
Wen-Chin Huang
Shu-Wen Yang
Tomoki Hayashi
Tomoki Toda
68
17
0
10 Jul 2022
Towards Highly Expressive Machine Learning Models of Non-Melanoma Skin Cancer
S. Thomas
J. Lefevre
Glenn W. Baxter
N. Hamilton
MedIm
87
2
0
09 Jul 2022
Generative Adversarial Networks and Other Generative Models
Markus T. Wenzel
GAN
113
12
0
08 Jul 2022
Hidden Schema Networks
Ramses J. Sanchez
L. Conrads
Pascal Welke
K. Cvejoski
C. Ojeda
NAI
MILM
65
3
0
08 Jul 2022
End-to-End Binaural Speech Synthesis
Wen-Chin Huang
Dejan Marković
Alexander Richard
I. D. Gebru
Anjali Menon
65
9
0
08 Jul 2022
NESC: Robust Neural End-2-End Speech Coding with GANs
N. Pia
Kishan Gupta
Srikanth Korse
M. Multrus
Guillaume Fuchs
105
16
0
07 Jul 2022
Cross-Scale Vector Quantization for Scalable Neural Speech Coding
Xue Jiang
Xiulian Peng
Huaying Xue
Yuan Zhang
Yan Lu
MQ
117
10
0
07 Jul 2022
Ultra-Low-Bitrate Speech Coding with Pretrained Transformers
Ali Siahkoohi
Michael Chinen
Tom Denton
W. Kleijn
Jan Skoglund
58
9
0
05 Jul 2022
Transformer based Models for Unsupervised Anomaly Segmentation in Brain MR Images
Ahmed Ghorbel
Ahmed Aldahdooh
Shadi Albarqouni
Neuherberg
ViT
MedIm
77
6
0
05 Jul 2022
Vector Quantisation for Robust Segmentation
Ainkaran Santhirasekaram
Avinash Kori
Mathias Winkler
A. Rockall
Ben Glocker
OOD
77
9
0
05 Jul 2022
Hierarchical Symbolic Reasoning in Hyperbolic Space for Deep Discriminative Models
Ainkaran Santhirasekaram
Avinash Kori
A. Rockall
Mathias Winkler
Francesca Toni
Ben Glocker
FAtt
71
4
0
05 Jul 2022
TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts
Chuan Guo
Xinxin Xuo
Sen Wang
Li Cheng
VGen
197
244
0
04 Jul 2022
GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion
Magdalena Proszewska
Grzegorz Beringer
Daniel Sáez-Trigueros
Thomas Merritt
Abdelhamid Ezzerg
Roberto Barra-Chicote
70
6
0
04 Jul 2022
TMGAN-PLC: Audio Packet Loss Concealment using Temporal Memory Generative Adversarial Network
Yuansheng Guan
Guochen Yu
Andong Li
C. Zheng
Jie Wang
125
9
0
04 Jul 2022
Generating gender-ambiguous voices for privacy-preserving speech recognition
Dimitrios Stoidis
Andrea Cavallaro
52
14
0
03 Jul 2022
Dissecting Self-Supervised Learning Methods for Surgical Computer Vision
Sanat Ramesh
V. Srivastav
Deepak Alapatt
Tong Yu
Aditya Murali
...
Saurav Sharma
A. Fleurentin
Georgios Exarchakis
Alexandros Karargyris
N. Padoy
134
46
0
01 Jul 2022
Towards Human-Agent Communication via the Information Bottleneck Principle
Mycal Tucker
J. Shah
R. Levy
Noga Zaslavsky
84
14
0
30 Jun 2022
iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre
Guangyan Zhang
Ying Qin
Weinan Zhang
Jialun Wu
Mei Li
Yu Gai
Feijun Jiang
Tan Lee
108
27
0
29 Jun 2022
Masked World Models for Visual Control
Younggyo Seo
Danijar Hafner
Hao Liu
Fangchen Liu
Stephen James
Kimin Lee
Pieter Abbeel
OffRL
188
149
0
28 Jun 2022
Megapixel Image Generation with Step-Unrolled Denoising Autoencoders
Alex F. McKinney
Chris G. Willcocks
DiffM
47
0
0
24 Jun 2022
MaskViT: Masked Visual Pre-Training for Video Prediction
Agrim Gupta
Stephen Tian
Yunzhi Zhang
Jiajun Wu
Roberto Martín-Martín
Li Fei-Fei
199
121
0
23 Jun 2022
A Temporal Extension of Latent Dirichlet Allocation for Unsupervised Acoustic Unit Discovery
W. V. D. Merwe
Herman Kamper
J. D. Preez
57
2
0
23 Jun 2022
Entropy-driven Sampling and Training Scheme for Conditional Diffusion Generation
Sheng-liang Li
Guangcong Zheng
Haibo Wang
Taiping Yao
Yang Chen
Shoudong Ding
Xi Li
DiffM
86
22
0
23 Jun 2022
Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Shangchen Zhou
Kelvin C. K. Chan
Chongyi Li
Chen Change Loy
CVBM
100
239
0
22 Jun 2022
Automated Cancer Subtyping via Vector Quantization Mutual Information Maximization
Zheng Chen
Lingwei Zhu
Ziwei Yang
Takashi Matsubara
116
7
0
22 Jun 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
302
1,134
0
22 Jun 2022
SemMAE: Semantic-Guided Masking for Learning Masked Autoencoders
Gang Li
Heliang Zheng
Daqing Liu
Chaoyue Wang
Fuchun Sun
Changwen Zheng
135
130
0
21 Jun 2022
Identifiability of deep generative models without auxiliary information
Bohdan Kivva
Goutham Rajendran
Pradeep Ravikumar
Bryon Aragam
DRL
117
53
0
20 Jun 2022
Latent Variable Modelling Using Variational Autoencoders: A survey
Vasanth Kalingeri
CML
DRL
72
2
0
20 Jun 2022
Self-supervised speech unit discovery from articulatory and acoustic features using VQ-VAE
Marc-Antoine Georges
J. Schwartz
Thomas Hueber
SSL
116
5
0
17 Jun 2022
TUSK: Task-Agnostic Unsupervised Keypoints
Yuhe Jin
Weiwei Sun
J. Hosang
Eduard Trulls
K. M. Yi
76
5
0
16 Jun 2022
iBoot: Image-bootstrapped Self-Supervised Video Representation Learning
F. Saleh
Fuwen Tan
Adrian Bulat
Georgios Tzimiropoulos
Brais Martínez
SSL
104
1
0
16 Jun 2022
Pythae: Unifying Generative Autoencoders in Python -- A Benchmarking Use Case
Clément Chadebec
Louis J. Vincent
S. Allassonnière
DRL
98
30
0
16 Jun 2022
Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation
Ye Zhu
Yuehua Wu
Kyle Olszewski
Jian Ren
Sergey Tulyakov
Yan Yan
DiffM
109
49
0
15 Jun 2022
Write and Paint: Generative Vision-Language Models are Unified Modal Learners
Shizhe Diao
Wangchunshu Zhou
Xinsong Zhang
Jiawei Wang
MLLM
AI4CE
99
17
0
15 Jun 2022
A Survey of Automated Data Augmentation Algorithms for Deep Learning-based Image Classification Tasks
Z. Yang
Richard Sinnott
James Bailey
Qiuhong Ke
86
45
0
14 Jun 2022
Previous
1
2
3
...
48
49
50
...
64
65
66
Next