Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1711.00937
Cited By
v1
v2 (latest)
Neural Discrete Representation Learning
2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Neural Discrete Representation Learning"
50 / 3,267 papers shown
Title
BEiT: BERT Pre-Training of Image Transformers
Hangbo Bao
Li Dong
Songhao Piao
Furu Wei
ViT
423
2,858
0
15 Jun 2021
Improved Transformer for High-Resolution GANs
Long Zhao
Zizhao Zhang
Ting Chen
Dimitris N. Metaxas
Han Zhang
ViT
133
96
0
14 Jun 2021
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
Wei-Ning Hsu
Benjamin Bolte
Yao-Hung Hubert Tsai
Kushal Lakhotia
Ruslan Salakhutdinov
Abdel-rahman Mohamed
SSL
214
3,017
0
14 Jun 2021
Learning the Imaging Landmarks: Unsupervised Key point Detection in Lung Ultrasound Videos
Arpan Tripathi
Mahesh Raveendranatha Panicker
A. Hareendranathan
Yale Tung Chen
Jacob L. Jaremko
K. Narayan
C. Kesavadas
24
0
0
13 Jun 2021
Inverting Adversarially Robust Networks for Image Synthesis
Renan A. Rojas-Gomez
Raymond A. Yeh
Minh Do
A. Nguyen
68
5
0
13 Jun 2021
D2C: Diffusion-Denoising Models for Few-shot Conditional Generation
Abhishek Sinha
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
140
121
0
12 Jun 2021
Robust Representation Learning via Perceptual Similarity Metrics
Saeid Asgari Taghanaki
Kristy Choi
Amir Khasahmadi
Anirudh Goyal
SSL
64
29
0
11 Jun 2021
Decoupled Greedy Learning of CNNs for Synchronous and Asynchronous Distributed Learning
Eugene Belilovsky
Louis Leconte
Lucas Caccia
Michael Eickenberg
Edouard Oyallon
45
7
0
11 Jun 2021
Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Jaehyeon Kim
Jungil Kong
Juhee Son
DRL
170
903
0
11 Jun 2021
Score-based Generative Modeling in Latent Space
Arash Vahdat
Karsten Kreis
Jan Kautz
DiffM
142
688
0
10 Jun 2021
Cross-Modal Discrete Representation Learning
Alexander H. Liu
SouYoung Jin
Cheng-I Jeff Lai
Andrew Rouditchenko
A. Oliva
James R. Glass
SSL
84
41
0
10 Jun 2021
Domain Specific Transporter Framework to Detect Fractures in Ultrasound
Arpan Tripathi
A. Hareendranathan
Mahesh Raveendranatha Panicker
Jack Zhang
Naveenjyote Boora
Jacob L. Jaremko
12
0
0
09 Jun 2021
Vector Quantized Models for Planning
Sherjil Ozair
Yazhe Li
Ali Razavi
Ioannis Antonoglou
Aaron van den Oord
Oriol Vinyals
OffRL
96
51
0
08 Jun 2021
Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings
Marcely Zanon Boito
Bolaji Yusuf
Lucas Ondel
Aline Villavicencio
Laurent Besacier
69
3
0
08 Jun 2021
NWT: Towards natural audio-to-video generation with representation learning
Rayhane Mama
Marc S. Tyndel
Hashiam Kadhim
Cole Clifford
Ragavan Thurairatnam
VGen
112
12
0
08 Jun 2021
Interpretable agent communication from scratch (with a generic visual processor emerging on the side)
Roberto Dessì
Eugene Kharitonov
Marco Baroni
95
28
0
08 Jun 2021
Weakly-supervised word-level pronunciation error detection in non-native English speech
Daniel Korzekwa
Jaime Lorenzo-Trueba
Thomas Drugman
Shira Calamaro
B. Kostek
37
13
0
07 Jun 2021
Neural Distributed Source Coding
Jay Whang
Alliot Nagle
Anish Acharya
Hyeji Kim
A. Dimakis
93
21
0
05 Jun 2021
On Perceptual Lossy Compression: The Cost of Perceptual Reconstruction and An Optimal Training Framework
Zeyu Yan
Fei Wen
R. Ying
Chao Ma
Peilin Liu
86
38
0
05 Jun 2021
The Image Local Autoregressive Transformer
Chenjie Cao
Yue Hong
Xiang Li
Chengrong Wang
C. Xu
Xiangyang Xue
Yanwei Fu
82
13
0
04 Jun 2021
Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation
Saurabhchand Bhati
Jesús Villalba
Piotr Żelasko
Laureano Moro-Velazquez
Najim Dehak
SSL
86
37
0
03 Jun 2021
A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion
Wen-Chin Huang
Kazuhiro Kobayashi
Yu-Huai Peng
Ching-Feng Liu
Yu Tsao
Hsin-Min Wang
Tomoki Toda
86
11
0
02 Jun 2021
Container: Context Aggregation Network
Peng Gao
Jiasen Lu
Hongsheng Li
Roozbeh Mottaghi
Aniruddha Kembhavi
ViT
108
72
0
02 Jun 2021
What Can I Do Here? Learning New Skills by Imagining Visual Affordances
Alexander Khazatsky
Ashvin Nair
Dan Jing
Sergey Levine
LM&Ro
84
33
0
01 Jun 2021
DISSECT: Disentangled Simultaneous Explanations via Concept Traversals
Asma Ghandeharioun
Been Kim
Chun-Liang Li
Brendan Jou
B. Eoff
Rosalind W. Picard
AAML
102
54
0
31 May 2021
Factorising Meaning and Form for Intent-Preserving Paraphrasing
Tom Hosking
Mirella Lapata
OOD
86
41
0
31 May 2021
Cascaded Diffusion Models for High Fidelity Image Generation
Jonathan Ho
Chitwan Saharia
William Chan
David J. Fleet
Mohammad Norouzi
Tim Salimans
306
1,246
0
30 May 2021
Diffusion-Based Representation Learning
K. Abstreiter
Sarthak Mittal
Stefan Bauer
Bernhard Schölkopf
Arash Mehrjou
DiffM
96
58
0
29 May 2021
M6-UFC: Unifying Multi-Modal Controls for Conditional Image Synthesis via Non-Autoregressive Generative Transformers
Zhu Zhang
Jianxin Ma
Chang Zhou
Rui Men
Zhikang Li
Ming Ding
Jie Tang
Jingren Zhou
Hongxia Yang
110
47
0
29 May 2021
CogView: Mastering Text-to-Image Generation via Transformers
Ming Ding
Zhuoyi Yang
Wenyi Hong
Wendi Zheng
Chang Zhou
...
Junyang Lin
Xu Zou
Zhou Shao
Hongxia Yang
Jie Tang
ViT
VLM
164
784
0
26 May 2021
Deep Neural Networks and End-to-End Learning for Audio Compression
Daniela N. Rim
I. Jang
Heeyoul Choi
64
9
0
25 May 2021
EXoN: EXplainable encoder Network
SeungHwan An
Hosik Choi
Jong-June Jeon
BDL
DRL
76
5
0
23 May 2021
Compositional Fine-Grained Low-Shot Learning
Dat T. Huynh
Ehsan Elhamifar
83
4
0
21 May 2021
Combining Transformer Generators with Convolutional Discriminators
Ricard Durall
Stanislav Frolov
Jörn Hees
Federico Raue
Franz-Josef Pfreundt
Andreas Dengel
J. Keuper
ViT
76
16
0
21 May 2021
Parallel and Flexible Sampling from Autoregressive Models via Langevin Dynamics
V. Jayaram
John Thickstun
DiffM
107
25
0
17 May 2021
Priors in Bayesian Deep Learning: A Review
Vincent Fortuin
UQCV
BDL
141
134
0
14 May 2021
High-Resolution Complex Scene Synthesis with Transformers
Manuel Jahn
Robin Rombach
Bjorn Ommer
ViT
85
37
0
13 May 2021
Autoencoding Under Normalization Constraints
Sangwoong Yoon
Yung-Kyun Noh
Frank C. Park
OODD
UQCV
86
39
0
12 May 2021
Discrete representations in neural models of spoken language
Bertrand Higy
Lieke Gelderloos
Afra Alishahi
Grzegorz Chrupała
144
6
0
12 May 2021
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
562
8,017
0
11 May 2021
MuseMorphose: Full-Song and Fine-Grained Piano Music Style Transfer with One Transformer VAE
Shih-Lun Wu
Yi-Hsuan Yang
ViT
109
55
0
10 May 2021
Voice Conversion Based Speaker Normalization for Acoustic Unit Discovery
Thomas Glarner
Janek Ebbers
Reinhold Häb-Umbach
DRL
30
1
0
04 May 2021
VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive Coding
J. Nistal
Cyran Aouameur
Stefan Lattner
G. Richard
105
7
0
04 May 2021
OCTOPUS: Overcoming Performance andPrivatization Bottlenecks in Distributed Learning
Shuo Wang
Surya Nepal
Kristen Moore
M. Grobler
Carsten Rudolph
A. Abuadbba
FedML
76
8
0
03 May 2021
GODIVA: Generating Open-DomaIn Videos from nAtural Descriptions
Chenfei Wu
Lun Huang
Qianxi Zhang
Binyang Li
Lei Ji
Fan Yang
Guillermo Sapiro
Nan Duan
DiffM
VGen
120
245
0
30 Apr 2021
Eccentric Regularization: Minimizing Hyperspherical Energy without explicit projection
Xuefeng Li
Alan Blair
62
0
0
23 Apr 2021
Protecting gender and identity with disentangled speech representations
Dimitrios Stoidis
Andrea Cavallaro
74
10
0
22 Apr 2021
IB-DRR: Incremental Learning with Information-Back Discrete Representation Replay
Jian Jiang
Edoardo Cetin
Oya Celiktutan
61
9
0
21 Apr 2021
VideoGPT: Video Generation using VQ-VAE and Transformers
Wilson Yan
Yunzhi Zhang
Pieter Abbeel
A. Srinivas
ViT
VGen
345
514
0
20 Apr 2021
Conditional Variational Capsule Network for Open Set Recognition
Yunrui Guo
Guglielmo Camporese
Wenjing Yang
A. Sperduti
Lamberto Ballan
BDL
90
45
0
19 Apr 2021
Previous
1
2
3
...
56
57
58
...
64
65
66
Next