Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1711.00937
Cited By
v1
v2 (latest)
Neural Discrete Representation Learning
2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Neural Discrete Representation Learning"
50 / 3,267 papers shown
Title
StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis
Zhiheng Li
Martin Renqiang Min
Keqin Li
Chenliang Xu
EGVM
79
40
0
29 Mar 2022
Diverse Plausible 360-Degree Image Outpainting for Efficient 3DCG Background Creation
Naofumi Akimoto
Yuhi Matsuo
Y. Aoki
106
36
0
28 Mar 2022
Single-Stream Multi-Level Alignment for Vision-Language Pretraining
Zaid Khan
B. Vijaykumar
Xiang Yu
S. Schulter
Manmohan Chandraker
Y. Fu
CLIP
VLM
125
17
0
27 Mar 2022
Self-supervised Semantic Segmentation Grounded in Visual Concepts
Wenbin He
William C. Surmeier
A. Shekar
Liangke Gou
Liu Ren
SSL
76
7
0
25 Mar 2022
Efficient-VDVAE: Less is more
Louay Hazami
Rayhane Mama
Ragavan Thurairatnam
BDL
104
28
0
25 Mar 2022
MISC: A MIxed Strategy-Aware Model Integrating COMET for Emotional Support Conversation
Quan Tu
Yanran Li
Jianwei Cui
Bin Wang
Jiaxin Wen
Rui Yan
101
101
0
25 Mar 2022
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
Oran Gafni
Adam Polyak
Oron Ashual
Shelly Sheynin
Devi Parikh
Yaniv Taigman
DiffM
97
526
0
24 Mar 2022
Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory
Lian Siyao
Weijiang Yu
Tianpei Gu
Chunze Lin
Quan Wang
Chao Qian
Chen Change Loy
Ziwei Liu
SLR
137
196
0
24 Mar 2022
Competency Assessment for Autonomous Agents using Deep Generative Models
Aastha Acharya
Rebecca L. Russell
Nisar R. Ahmed
81
11
0
23 Mar 2022
Pixel VQ-VAEs for Improved Pixel Art Representation
Akash Saravanan
Matthew J. Guzdial
58
8
0
23 Mar 2022
QS-Craft: Learning to Quantize, Scrabble and Craft for Conditional Human Motion Animation
Yuxin Hong
Xuelin Qian
Simian Luo
Xiangyang Xue
Yanwei Fu
50
2
0
22 Mar 2022
VQ-Flows: Vector Quantized Local Normalizing Flows
Sahil Sidheekh
Chris B. Dock
Tushar Jain
R. Balan
M. Singh
73
8
0
22 Mar 2022
Interpreting Class Conditional GANs with Channel Awareness
Yin-Yin He
Zhiyi Zhang
Jiapeng Zhu
Yujun Shen
Qifeng Chen
GAN
66
1
0
21 Mar 2022
The Conceptual VAE
R. A. Shaikh
Sara Sabrina Zemljič
Sean Tull
S. Clark
76
4
0
21 Mar 2022
PublicCheck: Public Integrity Verification for Services of Run-time Deep Models
Shuo Wang
Sharif Abuadbba
Sidharth Agarwal
Kristen Moore
Ruoxi Sun
Minhui Xue
Surya Nepal
S. Çamtepe
S. Kanhere
HILM
68
7
0
21 Mar 2022
Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression
Xiaosu Zhu
Jingkuan Song
Lianli Gao
Fengcai Zheng
Hengtao Shen
57
64
0
21 Mar 2022
SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization
Canjie Luo
Lianwen Jin
Jingdong Chen
SSL
AI4TS
92
30
0
20 Mar 2022
ViewFormer: NeRF-free Neural Rendering from Few Images Using Transformers
Jonávs Kulhánek
Erik Derner
Torsten Sattler
Robert Babuvska
ViT
112
75
0
18 Mar 2022
Alleviating Adversarial Attacks on Variational Autoencoders with MCMC
Anna Kuzina
Max Welling
Jakub M. Tomczak
AAML
DRL
102
12
0
18 Mar 2022
Improve few-shot voice cloning using multi-modal learning
Haitong Zhang
Yue Lin
51
8
0
18 Mar 2022
AutoSDF: Shape Priors for 3D Completion, Reconstruction and Generation
Paritosh Mittal
Y. Cheng
Maneesh Singh
Shubham Tulsiani
133
230
0
17 Mar 2022
Transframer: Arbitrary Frame Prediction with Generative Models
C. Nash
João Carreira
Jacob Walker
Iain Barr
Andrew Jaegle
Mateusz Malinowski
Peter W. Battaglia
ViT
123
38
0
17 Mar 2022
UNIMO-2: End-to-End Unified Vision-Language Grounded Learning
Wei Li
Can Gao
Guocheng Niu
Xinyan Xiao
Hao Liu
Jiachen Liu
Hua Wu
Haifeng Wang
MLLM
51
22
0
17 Mar 2022
DU-VLG: Unifying Vision-and-Language Generation via Dual Sequence-to-Sequence Pre-training
Luyang Huang
Guocheng Niu
Jiachen Liu
Xinyan Xiao
Hua Wu
VLM
CoGe
58
8
0
17 Mar 2022
Implicit Feature Decoupling with Depthwise Quantization
Iordanis Fostiropoulos
Barry W. Boehm
52
2
0
15 Mar 2022
Text-free non-parallel many-to-many voice conversion using normalising flows
Thomas Merritt
Abdelhamid Ezzerg
Piotr Bilinski
Magdalena Proszewska
Kamil Pokora
Roberto Barra-Chicote
Daniel Korzekwa
116
15
0
15 Mar 2022
Style Transformer for Image Inversion and Editing
Xueqi Hu
Qiusheng Huang
Zhengyi Shi
Siyuan Li
Changxin Gao
Li Sun
Qingli Li
89
56
0
15 Mar 2022
Unsupervised Extractive Opinion Summarization Using Sparse Coding
Somnath Basu Roy Chowdhury
Chao Zhao
Snigdha Chaturvedi
83
25
0
15 Mar 2022
Privacy-Preserving Speech Representation Learning using Vector Quantization
Pierre Champion
D. Jouvet
Anthony Larcher
SSL
18
0
0
15 Mar 2022
Modelling word learning and recognition using visually grounded speech
Danny Merkx
Sebastiaan Scholten
S. Frank
M. Ernestus
O. Scharenborg
SSL
130
0
0
14 Mar 2022
Semi-Discrete Normalizing Flows through Differentiable Tessellation
Ricky T. Q. Chen
Brandon Amos
Maximilian Nickel
88
10
0
14 Mar 2022
MVP: Multimodality-guided Visual Pre-training
Longhui Wei
Lingxi Xie
Wen-gang Zhou
Houqiang Li
Qi Tian
91
108
0
10 Mar 2022
KPE: Keypoint Pose Encoding for Transformer-based Image Generation
Soon Yau Cheong
A. Mustafa
Andrew Gilbert
ViT
85
10
0
09 Mar 2022
FlexIT: Towards Flexible Semantic Image Translation
Guillaume Couairon
Asya Grechka
Jakob Verbeek
Holger Schwenk
Matthieu Cord
DiffM
114
38
0
09 Mar 2022
Practical cognitive speech compression
Reza Lotfidereshgi
P. Gournay
59
2
0
08 Mar 2022
Hierarchical Sketch Induction for Paraphrase Generation
Tom Hosking
Hao Tang
Mirella Lapata
BDL
116
32
0
07 Mar 2022
Sparsity-Inducing Categorical Prior Improves Robustness of the Information Bottleneck
Anirban Samaddar
Sandeep Madireddy
Prasanna Balaprakash
Tapabrata Maiti
Gustavo de los Campos
Ian Fischer
57
1
0
04 Mar 2022
Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning
Ligong Han
Jian Ren
Hsin-Ying Lee
Francesco Barbieri
Kyle Olszewski
Shervin Minaee
Dimitris N. Metaxas
Sergey Tulyakov
DiffM
VGen
142
41
0
04 Mar 2022
Differentiable Causal Discovery Under Latent Interventions
Gonccalo R. A. Faria
André F. T. Martins
Mário A. T. Figueiredo
BDL
CML
OOD
95
23
0
04 Mar 2022
Autoregressive Image Generation using Residual Quantization
Doyup Lee
Chiheon Kim
Saehoon Kim
Minsu Cho
Wook-Shin Han
VGen
295
378
0
03 Mar 2022
Audio Self-supervised Learning: A Survey
Shuo Liu
Adria Mallol-Ragolta
Emilia Parada-Cabeleiro
Kun Qian
Xingshuo Jing
Alexander Kathan
Bin Hu
Bjoern W. Schuller
SSL
104
109
0
02 Mar 2022
Variational Autoencoders Without the Variation
Gregory A. Daly
J. Fieldsend
G. Tabor
68
2
0
01 Mar 2022
CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP
Zihao Wang
Wei Liu
Qian He
Xin-ru Wu
Zili Yi
CLIP
VLM
268
75
0
01 Mar 2022
A Brief Overview of Unsupervised Neural Speech Representation Learning
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
Lars Maaløe
Christian Igel
BDL
AI4TS
SSL
101
11
0
01 Mar 2022
LISA: Learning Interpretable Skill Abstractions from Language
Divyansh Garg
Skanda Vaidyanath
Kuno Kim
Jiaming Song
Stefano Ermon
LM&Ro
OffRL
253
30
0
28 Feb 2022
Weakly Supervised Disentangled Representation for Goal-conditioned Reinforcement Learning
Zhifeng Qian
Mingyu You
Hongjun Zhou
Bin He
DRL
OffRL
70
7
0
28 Feb 2022
Variational Autoencoder with Disentanglement Priors for Low-Resource Task-Specific Natural Language Generation
Zhuang Li
Zhuang Li
Xingliang Yuan
Tongtong Wu
Tianyang Zhan
Gholamreza Haffari
CoGe
UD
DRL
122
4
0
27 Feb 2022
Controllable Natural Language Generation with Contrastive Prefixes
Jing Qian
Li Dong
Yelong Shen
Furu Wei
Weizhu Chen
101
100
0
27 Feb 2022
Real-World Blind Super-Resolution via Feature Matching with Implicit High-Resolution Priors
Chaofeng Chen
Xinyu Shi
Yipeng Qin
Xiaoming Li
Xiaoguang Han
Taojiannan Yang
Shihui Guo
106
118
0
26 Feb 2022
Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph
Dacheng Yin
Xuanchi Ren
Chong Luo
Yuwang Wang
Zhiwei Xiong
Wenjun Zeng
114
13
0
24 Feb 2022
Previous
1
2
3
...
51
52
53
...
64
65
66
Next