Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1711.00937
Cited By
v1
v2 (latest)
Neural Discrete Representation Learning
2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Neural Discrete Representation Learning"
50 / 3,267 papers shown
Title
An Empirical Study of Training End-to-End Vision-and-Language Transformers
Zi-Yi Dou
Yichong Xu
Zhe Gan
Jianfeng Wang
Shuohang Wang
...
Pengchuan Zhang
Lu Yuan
Nanyun Peng
Zicheng Liu
Michael Zeng
VLM
106
381
0
03 Nov 2021
Recent Advancements in Self-Supervised Paradigms for Visual Feature Representation
Mrinal Anand
Aditya Garg
SSL
46
2
0
03 Nov 2021
PatchGame: Learning to Signal Mid-level Patches in Referential Games
Kamal Gupta
Gowthami Somepalli
Anubhav Gupta
Vinoj Jayasundara
Matthias Zwicker
Abhinav Shrivastava
79
4
0
02 Nov 2021
Adjacency constraint for efficient hierarchical reinforcement learning
Tianren Zhang
Shangqi Guo
Tian Tan
Xiao M Hu
Feng Chen
106
17
0
30 Oct 2021
Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations
Hyeong-Seok Choi
Juheon Lee
W. Kim
Jie Hwan Lee
Hoon Heo
Kyogu Lee
116
158
0
27 Oct 2021
Zero-shot Voice Conversion via Self-supervised Prosody Representation Learning
Shijun Wang
Dimche Kostadinov
Damian Borth
93
11
0
27 Oct 2021
Towards artificial general intelligence via a multimodal foundation model
Nanyi Fei
Zhiwu Lu
Yizhao Gao
Guoxing Yang
Yuqi Huo
...
Ruihua Song
Xin Gao
Tao Xiang
Haoran Sun
Jiling Wen
AI4CE
LRM
99
230
0
27 Oct 2021
VQ-GNN: A Universal Framework to Scale up Graph Neural Networks using Vector Quantization
Mucong Ding
Kezhi Kong
Jingling Li
Chen Zhu
John P. Dickerson
Furong Huang
Tom Goldstein
GNN
MQ
107
49
0
27 Oct 2021
TopicNet: Semantic Graph-Guided Topic Discovery
Zhibin Duan
Yishi Xu
Bo Chen
Dongsheng Wang
Chaojie Wang
Mingyuan Zhou
BDL
92
15
0
27 Oct 2021
Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models
Phil Chen
Masha Itkina
Ransalu Senanayake
Mykel J. Kochenderfer
69
6
0
27 Oct 2021
Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning
Jongjin Park
Younggyo Seo
Chang-Shu Liu
Li Zhao
Tao Qin
Jinwoo Shin
Tie-Yan Liu
CML
OffRL
55
16
0
27 Oct 2021
Fragment-based Sequential Translation for Molecular Optimization
Benson Chen
Xiang Fu
Regina Barzilay
Tommi Jaakkola
58
7
0
26 Oct 2021
Discrete Acoustic Space for an Efficient Sampling in Neural Text-To-Speech
Mu Li
Jonas Rohnke
Antonio Bonafonte
Mateusz Lajszczak
Trevor Wood
DRL
108
2
0
24 Oct 2021
SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training
Ankur Bapna
Yu-An Chung
Na Wu
Anmol Gulati
Ye Jia
J. Clark
Melvin Johnson
Jason Riesa
Alexis Conneau
Yu Zhang
VLM
139
96
0
20 Oct 2021
Momentum Contrastive Autoencoder: Using Contrastive Learning for Latent Space Distribution Matching in WAE
Devansh Arpit
Aadyot Bhatnagar
Huan Wang
Caiming Xiong
49
0
0
19 Oct 2021
Wideband and Entropy-Aware Deep Soft Bit Quantization
Marius Arvinte
Jonathan I. Tamir
MQ
28
0
0
18 Oct 2021
CycleFlow: Purify Information Factors by Cycle Loss
Haoran Sun
Chen Chen
Lantian Li
Dong Wang
72
1
0
18 Oct 2021
Illiterate DALL-E Learns to Compose
Gautam Singh
Fei Deng
Sungjin Ahn
CoGe
OCL
133
139
0
17 Oct 2021
3D-RETR: End-to-End Single and Multi-View 3D Reconstruction with Transformers
Z. Shi
Zhao Meng
Yiran Xing
Yunpu Ma
Roger Wattenhofer
ViT
82
35
0
17 Oct 2021
Taming Visually Guided Sound Generation
Vladimir E. Iashin
Esa Rahtu
VLM
131
128
0
17 Oct 2021
Guiding Visual Question Generation
Nihir Vedd
Zixu Wang
Marek Rei
Yishu Miao
Lucia Specia
140
22
0
15 Oct 2021
Towards Identity Preserving Normal to Dysarthric Voice Conversion
Wen-Chin Huang
B. Halpern
Lester Phillip Violeta
O. Scharenborg
Tomoki Toda
108
23
0
15 Oct 2021
MaGNET: Uniform Sampling from Deep Generative Network Manifolds Without Retraining
Ahmed Imtiaz Humayun
Randall Balestriero
Richard Baraniuk
OOD
141
31
0
15 Oct 2021
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing
Junyi Ao
Rui Wang
Long Zhou
Chengyi Wang
Shuo Ren
...
Yu Zhang
Zhihua Wei
Yao Qian
Jinyu Li
Furu Wei
177
203
0
14 Oct 2021
The Deep Generative Decoder: MAP estimation of representations improves modeling of single-cell RNA data
Viktoria Schuster
A. Krogh
83
4
0
13 Oct 2021
DiscoDVT: Generating Long Text with Discourse-Aware Discrete Variational Transformer
Haozhe Ji
Minlie Huang
103
23
0
12 Oct 2021
Unsupervised Source Separation via Bayesian Inference in the Latent Domain
Michele Mancusi
Emilian Postolache
Giorgio Mariani
Marco Fumero
Andrea Santilli
Luca Cosmo
Emanuele Rodolà
BDL
62
2
0
11 Oct 2021
Injecting Text and Cross-lingual Supervision in Few-shot Learning from Self-Supervised Models
Sanjeev Khudanpur
Desh Raj
Sanjeev Khudanpur
103
6
0
10 Oct 2021
Vector-quantized Image Modeling with Improved VQGAN
Jiahui Yu
Xin Li
Jing Yu Koh
Han Zhang
Ruoming Pang
James Qin
Alexander Ku
Yuanzhong Xu
Jason Baldridge
Yonghui Wu
ViT
VLM
DRL
207
527
0
09 Oct 2021
Cognitive Coding of Speech
Reza Lotfidereshgi
P. Gournay
60
5
0
08 Oct 2021
KaraSinger: Score-Free Singing Voice Synthesis with VQ-VAE using Mel-spectrograms
Chien-Feng Liao
Jen-Yu Liu
Yi-Hsuan Yang
69
5
0
08 Oct 2021
Attention is All You Need? Good Embeddings with Statistics are enough:Large Scale Audio Understanding without Transformers/ Convolutions/ BERTs/ Mixers/ Attention/ RNNs or ....
Prateek Verma
AI4TS
102
2
0
07 Oct 2021
DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation
Gwanghyun Kim
Taesung Kwon
Jong Chul Ye
DiffM
262
657
0
06 Oct 2021
CLIP-Forge: Towards Zero-Shot Text-to-Shape Generation
Aditya Sanghi
Hang Chu
Joseph G. Lambourne
Ye Wang
Chin-Yi Cheng
Marco Fumero
Kamal Rahimi Malekshan
CLIP
136
296
0
06 Oct 2021
Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding
Saurabhchand Bhati
Jesús Villalba
Piotr Żelasko
Laureano Moro-Velazquez
Najim Dehak
SSL
134
23
0
05 Oct 2021
Truth-Conditional Captioning of Time Series Data
Harsh Jhamtani
Taylor Berg-Kirkpatrick
AI4TS
75
8
0
05 Oct 2021
Proxy-bridged Image Reconstruction Network for Anomaly Detection in Medical Images
Kang Zhou
Jing Li
Weixin Luo
Zhengxin Li
Jianlong Yang
Huazhu Fu
Jun Cheng
Jiang-Dong Liu
Shenghua Gao
193
34
0
05 Oct 2021
Causal Representation Learning for Context-Aware Face Transfer
Gege Gao
Huaibo Huang
Chaoyou Fu
Ran He
CVBM
72
0
0
04 Oct 2021
A Review of the Gumbel-max Trick and its Extensions for Discrete Stochasticity in Machine Learning
Iris A. M. Huijben
W. Kool
Max B. Paulus
Ruud J. G. van Sloun
115
99
0
04 Oct 2021
An Unsupervised Video Game Playstyle Metric via State Discretization
Chiu-Chou Lin
W. Chiu
I-Chen Wu
32
3
0
03 Oct 2021
Calibrated Multiple-Output Quantile Regression with Representation Learning
Shai Feldman
Stephen Bates
Yaniv Romano
210
36
0
02 Oct 2021
Audio-to-Image Cross-Modal Generation
Maciej Żelaszczyk
Jacek Mańdziuk
DiffM
118
17
0
27 Sep 2021
Fully Spiking Variational Autoencoder
Hiromichi Kamata
Yusuke Mukuta
Tatsuya Harada
BDL
DRL
106
43
0
26 Sep 2021
Learnable Triangulation for Deep Learning-based 3D Reconstruction of Objects of Arbitrary Topology from Single RGB Images
Tarek Ben Charrada
Hedi Tabia
A. Chetouani
Hamid Laga
3DV
74
0
0
24 Sep 2021
Multi-view Contrastive Self-Supervised Learning of Accounting Data Representations for Downstream Audit Tasks
Marco Schreyer
Timur Sattarov
Damian Borth
MLAU
76
15
0
23 Sep 2021
Noisy-to-Noisy Voice Conversion Framework with Denoising Model
Chao Xie
Yi-Chiao Wu
Patrick Lumban Tobing
Wen-Chin Huang
Tomoki Toda
67
8
0
22 Sep 2021
Intuitive and Efficient Roof Modeling for Reconstruction and Synthesis
Jing Ren
Biao Zhang
Bojian Wu
Jianqiang Huang
Lubin Fan
M. Ovsjanikov
Peter Wonka
70
19
0
16 Sep 2021
Disentangling Generative Factors in Natural Language with Discrete Variational Autoencoders
Giangiacomo Mercatali
André Freitas
CoGe
DRL
59
23
0
15 Sep 2021
fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit
Changhan Wang
Wei-Ning Hsu
Yossi Adi
Adam Polyak
Ann Lee
Peng-Jen Chen
Jiatao Gu
J. Pino
VLM
106
32
0
14 Sep 2021
A Temporal Variational Model for Story Generation
David Wilmot
Frank Keller
DRL
113
9
0
14 Sep 2021
Previous
1
2
3
...
54
55
56
...
64
65
66
Next