1711.00937
Neural Discrete Representation Learning
2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
Papers citing
"Neural Discrete Representation Learning"
50 / 3,267 papers shown
Title
Physics Driven Domain Specific Transporter Framework with Attention Mechanism for Ultrasound Imaging
Arpan Tripathi
A. Rakkunedeth
Mahesh Raveendranatha Panicker
Jack Zhang
Naveenjyote Boora
Jessica Knight
Jacob L. Jaremko
Yale Tung Chen
K. Narayan
C. Kesavadas
OOD
MedIm
97
0
0
13 Sep 2021
Incorporating Real-world Noisy Speech in Neural-network-based Speech Enhancement Systems
Yangyang Xia
Buye Xu
Anurag Kumar
40
7
0
11 Sep 2021
StreamHover: Livestream Transcript Summarization and Annotation
Sangwoo Cho
Franck Dernoncourt
Timothy Jeewun Ganter
Trung Bui
Nedim Lipka
Walter Chang
Hailin Jin
Jonathan Brandt
H. Foroosh
Fei Liu
3DGS
AI4TS
79
29
0
11 Sep 2021
Text-Free Prosody-Aware Generative Spoken Language Modeling
Eugene Kharitonov
Ann Lee
Adam Polyak
Yossi Adi
Jade Copet
...
Tu Nguyen
M. Rivière
Abdel-rahman Mohamed
Emmanuel Dupoux
Wei-Ning Hsu
118
122
0
07 Sep 2021
Self-supervised Tumor Segmentation through Layer Decomposition
Xiaoman Zhang
Weidi Xie
Chaoqin Huang
Yanfeng Wang
Ya Zhang
Xin Chen
Qi Tian
83
6
0
07 Sep 2021
Aspect-Controllable Opinion Summarization
Reinald Kim Amplayo
Stefanos Angelidis
Mirella Lapata
76
75
0
07 Sep 2021
Multi-Agent Variational Occlusion Inference Using People as Sensors
Masha Itkina
Ye-Ji Mun
Katherine Driggs-Campbell
Mykel J. Kochenderfer
96
25
0
05 Sep 2021
What Users Want? WARHOL: A Generative Model for Recommendation
Jules Samaran
Ugo Tanielian
Romain Beaumont
Flavian Vasile
HAI
42
0
0
02 Sep 2021
Learning Disentangled Representations in the Imaging Domain
Xiao Liu
Pedro Sanchez
Spyridon Thermos
Alison Q. O'Neil
Sotirios A. Tsaftaris
OOD
DRL
202
72
0
26 Aug 2021
Adversarially Robust One-class Novelty Detection
Shao-Yuan Lo
Poojan Oza
Vishal M. Patel
AAML
78
32
0
25 Aug 2021
ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis
Patrick Esser
Robin Rombach
A. Blattmann
Bjorn Ommer
DiffM
119
162
0
19 Aug 2021
Transformers predicting the future. Applying attention in next-frame and time series forecasting
Radostin Cholakov
T. Kolev
AI4TS
55
17
0
18 Aug 2021
Cross-modal Spectrum Transformation Network For Acoustic Scene classification
Yang Liu
A. Neophytou
Sunando Sengupta
Eric Sommerlade
97
9
0
13 Aug 2021
PixelSynth: Generating a 3D-Consistent Experience from a Single Image
C. Rockwell
David Fouhey
Justin Johnson
VGen
146
86
0
12 Aug 2021
Analysis of ODE2VAE with Examples
Batuhan Koyuncu
DRL
38
0
0
10 Aug 2021
StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition
Shoki Sakamoto
Akira Taniguchi
T. Taniguchi
Hirokazu Kameoka
BDL
63
5
0
10 Aug 2021
Information Bottleneck Approach to Spatial Attention Learning
Qiuxia Lai
Yu Li
Ailing Zeng
Minhao Liu
Hanqiu Sun
Qiang Xu
98
9
0
07 Aug 2021
Applying the Information Bottleneck Principle to Prosodic Representation Learning
Guangyan Zhang
Ying Qin
Daxin Tan
Tan Lee
79
4
0
05 Aug 2021
RockGPT: Reconstructing three-dimensional digital rocks from single two-dimensional slice from the perspective of video generation
Qi Zheng
Dongxiao Zhang
33
22
0
05 Aug 2021
Deep Quantized Representation for Enhanced Reconstruction
Akash Gupta
Abhishek Aich
Kevin Rodriguez
G. Reddy
Amit K. Roy-Chowdhury
15
5
0
29 Jul 2021
Fast and Scalable Image Search For Histology
Chengkuan Chen
Ming Y. Lu
Drew F. K. Williamson
Tiffany Y. Chen
A. J. Schaumberg
Faisal Mahmood
34
2
0
28 Jul 2021
Unsupervised Learning of Neurosymbolic Encoders
Eric Zhan
Jennifer J. Sun
Ann Kennedy
Yisong Yue
Swarat Chaudhuri
102
14
0
28 Jul 2021
Improving Robot Localisation by Ignoring Visual Distraction
Oscar Alejandro Mendez Maldonado
M. Vowels
Richard Bowden
35
1
0
25 Jul 2021
Introducing: DeepHead, Wide-band Electromagnetic Imaging Paradigm
Ahmed Al-Saffar
L. Guo
A. Abbosh
MedIm
21
0
0
23 Jul 2021
Abstract Reasoning via Logic-guided Generation
Sihyun Yu
Sangwoo Mo
SungSoo Ahn
Jinwoo Shin
72
6
0
22 Jul 2021
Data synthesis and adversarial networks: A review and meta-analysis in cancer imaging
Richard Osuala
Kaisar Kushibar
Lidia Garrucho
Akis Linardos
Zuzanna Szafranowska
Stefan Klein
Ben Glocker
Oliver Díaz
Karim Lekadir
MedIm
106
45
0
20 Jul 2021
Generative Video Transformer: Can Objects be the Words?
Yi-Fu Wu
Jaesik Yoon
Sungjin Ahn
ViT
114
34
0
20 Jul 2021
GenRadar: Self-supervised Probabilistic Camera Synthesis based on Radar Frequencies
Carsten Ditzel
Klaus C. J. Dietmayer
56
3
0
19 Jul 2021
Translatotron 2: High-quality direct speech-to-speech translation with voice preservation
Ye Jia
Michelle Tadmor Ramanovich
Tal Remez
Roi Pomerantz
107
73
0
19 Jul 2021
Unsupervised Skill-Discovery and Skill-Learning in Minecraft
J. J. Nieto
Roger Creus
Xavier Giró-i-Nieto
SSL
DRL
71
4
0
18 Jul 2021
Learning De-identified Representations of Prosody from Raw Audio
J. Weston
R. Lenain
U. Meepegama
E. Fristed
SSL
68
17
0
17 Jul 2021
CCVS: Context-aware Controllable Video Synthesis
G. L. Moing
Jean Ponce
Cordelia Schmid
105
81
0
16 Jul 2021
Codified audio language modeling learns useful representations for music information retrieval
Rodrigo Castellon
Chris Donahue
Percy Liang
148
91
0
12 Jul 2021
PocketVAE: A Two-step Model for Groove Generation and Control
Kyungyun Lee
Wonil Kim
Juhan Nam
46
1
0
11 Jul 2021
SoundStream: An End-to-End Neural Audio Codec
Neil Zeghidour
Alejandro Luebs
Ahmed Omran
Jan Skoglund
Marco Tagliasacchi
AI4TS
120
806
0
07 Jul 2021
Discrete-Valued Neural Communication
Dianbo Liu
Alex Lamb
Kenji Kawaguchi
Anirudh Goyal
Chen Sun
Michael C. Mozer
Yoshua Bengio
95
52
0
06 Jul 2021
Classical Planning in Deep Latent Space
Masataro Asai
Hiroshi Kajino
A. Fukunaga
Christian Muise
VLM
98
19
0
30 Jun 2021
A Generative Model for Raw Audio Using Transformer Architectures
Prateek Verma
C. Chafe
89
29
0
30 Jun 2021
CLIPDraw: Exploring Text-to-Drawing Synthesis through Language-Image Encoders
Kevin Frans
Lisa Soros
Olaf Witkowski
CLIP
105
212
0
28 Jun 2021
Transflower: probabilistic autoregressive dance generation with multimodal attention
Guillermo Valle Pérez
G. Henter
Jonas Beskow
A. Holzapfel
Pierre-Yves Oudeyer
Simon Alexanderson
133
43
0
25 Jun 2021
On Incorporating Inductive Biases into VAEs
Ning Miao
Emile Mathieu
N. Siddharth
Yee Whye Teh
Tom Rainforth
CML
DRL
99
11
0
25 Jun 2021
Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance
Hieu-Thi Luong
Junichi Yamagishi
85
0
0
25 Jun 2021
NP-DRAW: A Non-Parametric Structured Latent Variable Model for Image Generation
Xiaohui Zeng
R. Urtasun
R. Zemel
Sanja Fidler
Renjie Liao
DiffM
45
2
0
25 Jun 2021
Generative Modeling for Multi-task Visual Learning
Zhipeng Bao
M. Hebert
Yu-Xiong Wang
64
17
0
25 Jun 2021
Handling Data Heterogeneity with Generative Replay in Collaborative Learning for Medical Imaging
Liangqiong Qu
N. Balachandar
Miao Zhang
D. Rubin
MedIm
89
23
0
24 Jun 2021
VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive Learning
Hao Tan
Jie Lei
Thomas Wolf
Joey Tianyi Zhou
120
67
0
21 Jun 2021
Trainable Class Prototypes for Few-Shot Learning
Jianyi Li
Guizhong Liu
VLM
49
2
0
21 Jun 2021
Discrete Auto-regressive Variational Attention Models for Text Modeling
Xianghong Fang
Haoli Bai
Jian Li
Zenglin Xu
Michael Lyu
Irwin King
73
3
0
16 Jun 2021
Test Sample Accuracy Scales with Training Sample Density in Neural Networks
Xu Ji
Razvan Pascanu
Devon Hjelm
Balaji Lakshminarayanan
Andrea Vedaldi
70
8
0
15 Jun 2021
Self-Supervised Learning with Kernel Dependence Maximization
Yazhe Li
Roman Pogodin
Danica J. Sutherland
Arthur Gretton
SSL
100
85
0
15 Jun 2021