1711.00937
Cited By
v1
v2 (latest)
Neural Discrete Representation Learning
2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
Papers citing
"Neural Discrete Representation Learning"
50 / 3,267 papers shown
Title
Multiresolution Signal Processing of Financial Market Objects
Ioana Boier
35
2
0
28 Oct 2022
One-Shot Acoustic Matching Of Audio Signals -- Learning to Hear Music In Any Room/ Concert Hall
Prateek Verma
C. Chafe
J. Berger
67
1
0
27 Oct 2022
Broken Neural Scaling Laws
Ethan Caballero
Kshitij Gupta
Irina Rish
David M. Krueger
194
76
0
26 Oct 2022
Leveraging Demonstrations with Latent Space Priors
Jonas Gehring
Deepak Gopinath
Jungdam Won
Andreas Krause
Gabriel Synnaeve
Nicolas Usunier
81
6
0
26 Oct 2022
Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models
Filippos Christianos
Peter Karkus
Boris Ivanovic
Stefano V. Albrecht
Marco Pavone
97
10
0
26 Oct 2022
Discovering Design Concepts for CAD Sketches
Yuezhi Yang
Hao Pan
73
12
0
26 Oct 2022
A Survey on Artificial Intelligence for Music Generation: Agents, Domains and Perspectives
Carlos Hernandez-Olivan
Javier Hernandez-Olivan
J. R. Beltrán
MGen
103
7
0
25 Oct 2022
High Fidelity Neural Audio Compression
Alexandre Défossez
Jade Copet
Gabriel Synnaeve
Yossi Adi
136
674
0
24 Oct 2022
Language Model Pre-Training with Sparse Latent Typing
Liliang Ren
Zixuan Zhang
H. Wang
Clare R. Voss
Chengxiang Zhai
Heng Ji
101
3
0
23 Oct 2022
VIOLA: Imitation Learning for Vision-Based Manipulation with Object Proposal Priors
Yifeng Zhu
Abhishek Joshi
Peter Stone
Yuke Zhu
LM&Ro
97
134
0
20 Oct 2022
Coordinates Are NOT Lonely -- Codebook Prior Helps Implicit Neural 3D Representations
Fukun Yin
Wen Liu
Zilong Huang
Pei Cheng
Tao Chen
Gang Yu
63
19
0
20 Oct 2022
DisC-VC: Disentangled and F0-Controllable Neural Voice Conversion
Chihiro Watanabe
Hirokazu Kameoka
DRL
116
0
0
20 Oct 2022
Representation Learning with Diffusion Models
Jeremias Traub
DiffM
97
8
0
20 Oct 2022
OCR-VQGAN: Taming Text-within-Image Generation
Juan A. Rodriguez
David Vazquez
I. Laradji
M. Pedersoli
Pau Rodríguez López
154
20
0
19 Oct 2022
PoseGPT: Quantization-based 3D Human Motion Generation and Forecasting
Thomas Lucas
Fabien Baradel
Philippe Weinzaepfel
Grégory Rogez
109
75
0
19 Oct 2022
FedForgery: Generalized Face Forgery Detection with Residual Federated Learning
Decheng Liu
Zhan Dang
Chunlei Peng
Yu Zheng
Shuang Li
N. Wang
Xinbo Gao
FedML
88
35
0
18 Oct 2022
Explaining Image Classification with Visual Debates
Avinash Kori
Ben Glocker
Francesca Toni
69
1
0
17 Oct 2022
Improving Object-centric Learning with Query Optimization
Baoxiong Jia
Yu Liu
Siyuan Huang
OCL
99
52
0
17 Oct 2022
DiffGAR: Model-Agnostic Restoration from Generative Artifacts Using Image-to-Image Diffusion Models
Yueqin Yin
Lianghua Huang
Yu Liu
Kaiqiang Huang
DiffM
76
12
0
16 Oct 2022
Character-Centric Story Visualization via Visual Planning and Token Alignment
Hong Chen
Rujun Han
Te-Lin Wu
Hideki Nakayama
Nanyun Peng
DiffM
VGen
96
32
0
16 Oct 2022
Generalization with Lossy Affordances: Leveraging Broad Offline Data for Learning Visuomotor Tasks
Kuan Fang
Patrick Yin
Ashvin Nair
Homer Walke
Ge Yan
Sergey Levine
OffRL
96
25
0
12 Oct 2022
Anomaly Detection using Generative Models and Sum-Product Networks in Mammography Scans
Marc Dietrichstein
David Major
Martin Trapp
M. Wimmer
Dimitrios Lenis
Philip Winter
Astrid Berg
Theresa Neubauer
Katja Bühler
MedIm
49
4
0
12 Oct 2022
JukeDrummer: Conditional Beat-aware Audio-domain Drum Accompaniment Generation via Transformer VQ-VAE
Yueh-Kao Wu
Ching-Yu Chiu
Yi-Hsuan Yang
ViT
79
15
0
12 Oct 2022
3D Brain and Heart Volume Generative Models: A Survey
Yanbin Liu
Girish Dwivedi
F. Boussaïd
Bennamoun
MedIm
AI4CE
107
6
0
12 Oct 2022
Underspecification in Scene Description-to-Depiction Tasks
Ben Hutchinson
Jason Baldridge
Vinodkumar Prabhakaran
DiffM
128
34
0
11 Oct 2022
Automatic Speech Recognition of Low-Resource Languages Based on Chukchi
Anastasia N. Safonova
Tatiana Yudina
Emil Nadimanov
Cydnie Davenport
54
3
0
11 Oct 2022
Style-Guided Inference of Transformer for High-resolution Image Synthesis
Jonghwa Yim
Minjae Kim
ViT
103
0
0
11 Oct 2022
Hybrid Inverted Index Is a Robust Accelerator for Dense Retrieval
Peitian Zhang
Zheng Liu
Shitao Xiao
Zhicheng Dou
Jing Yao
98
6
0
11 Oct 2022
LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward
DaeJin Jo
Sungwoong Kim
D. W. Nam
Taehwan Kwon
Seungeun Rho
Jongmin Kim
Donghoon Lee
OffRL
76
10
0
11 Oct 2022
Continual Learning by Modeling Intra-Class Variation
L. Yu
Tianyang Hu
Lanqing Hong
Zhen Liu
Adrian Weller
Weiyang Liu
CLL
83
13
0
11 Oct 2022
Race Bias Analysis of Bona Fide Errors in face anti-spoofing
Latifah Abduh
I. Ivrissimtzis
CVBM
98
2
0
11 Oct 2022
It Takes Two: Masked Appearance-Motion Modeling for Self-supervised Video Transformer Pre-training
Yuxin Song
Min Yang
Wenhao Wu
Dongliang He
Fu Li
Jingdong Wang
ViT
150
9
0
11 Oct 2022
Markup-to-Image Diffusion Models with Scheduled Sampling
Yuntian Deng
Noriyuki Kojima
Alexander M. Rush
DiffM
86
4
0
11 Oct 2022
ConchShell: A Generative Adversarial Networks that Turns Pictures into Piano Music
Wanshu Fan
Yu-Chuan Su
Yuxin Huang
GAN
36
2
0
11 Oct 2022
f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation
Jiatao Gu
Shuangfei Zhai
Yizhe Zhang
Miguel Angel Bautista
J. Susskind
DiffM
103
27
0
10 Oct 2022
LMQFormer: A Laplace-Prior-Guided Mask Query Transformer for Lightweight Snow Removal
Junhong Lin
Nanfeng Jiang
Zhentao Zhang
Weiling Chen
Tiesong Zhao
88
21
0
10 Oct 2022
HORIZON: High-Resolution Semantically Controlled Panorama Synthesis
Kun Yan
Lei Ji
Chenfei Wu
Jian Liang
Ming Zhou
Nan Duan
Shuai Ma
80
0
0
10 Oct 2022
Scaling Up Probabilistic Circuits by Latent Variable Distillation
Hoang Trung-Dung
Honghua Zhang
Guy Van den Broeck
TPM
73
27
0
10 Oct 2022
Dual-distribution discrepancy with self-supervised refinement for anomaly detection in medical images
Yu Cai
Hao Chen
Xin Yang
Yu Zhou
Kwang-Ting Cheng
133
51
0
09 Oct 2022
Visualize Before You Write: Imagination-Guided Open-Ended Text Generation
Wanrong Zhu
An Yan
Yujie Lu
Wenda Xu
Xinze Wang
Miguel P. Eckstein
William Yang Wang
137
36
0
07 Oct 2022
SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training
Zi-Hua Zhang
Long Zhou
Junyi Ao
Shujie Liu
Lirong Dai
Jinyu Li
Furu Wei
134
58
0
07 Oct 2022
Dynamic Latent Separation for Deep Learning
Yi-Lin Tuan
Zih-Yun Chiu
William Yang Wang
89
0
0
07 Oct 2022
A deep learning approach for detection and localization of leaf anomalies
David M. Calabro
Massimiliano Lupo Pasini
Nicola Ferro
S. Perotto
44
1
0
07 Oct 2022
Compressed Vision for Efficient Video Understanding
Olivia Wiles
João Carreira
Iain Barr
Andrew Zisserman
Mateusz Malinowski
56
7
0
06 Oct 2022
Phenaki: Variable Length Video Generation From Open Domain Textual Description
Ruben Villegas
Mohammad Babaeizadeh
Pieter-Jan Kindermans
Hernan Moraldo
Han Zhang
M. Saffar
Santiago Castro
Julius Kunze
D. Erhan
DiffM
VGen
173
396
0
05 Oct 2022
Temporally Consistent Transformers for Video Generation
Wilson Yan
Danijar Hafner
Stephen James
Pieter Abbeel
DiffM
94
31
0
05 Oct 2022
Progressive Text-to-Image Generation
Zhengcong Fei
Mingyuan Fan
Li Zhu
Junshi Huang
185
4
0
05 Oct 2022
Vision+X: A Survey on Multimodal Learning in the Light of Data
Ye Zhu
Yuehua Wu
N. Sebe
Yan Yan
119
19
0
05 Oct 2022
Stateful active facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning
Dianbo Liu
Vedant Shah
Oussama Boussif
Cristian Meo
Anirudh Goyal
Tianmin Shu
Michael C. Mozer
N. Heess
Yoshua Bengio
134
8
0
04 Oct 2022
Enhancing Spatiotemporal Prediction Model using Modular Design and Beyond
Haoyu Pan
Hao Wu
Tan Yang
AI4TS
80
0
0
04 Oct 2022