ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.00937
  4. Cited By
Neural Discrete Representation Learning
v1v2 (latest)

Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
    BDLSSLOCL
ArXiv (abs)PDFHTML

Papers citing "Neural Discrete Representation Learning"

50 / 3,267 papers shown
Title
StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis
StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis
Zhiheng Li
Martin Renqiang Min
Keqin Li
Chenliang Xu
EGVM
79
40
0
29 Mar 2022
Diverse Plausible 360-Degree Image Outpainting for Efficient 3DCG
  Background Creation
Diverse Plausible 360-Degree Image Outpainting for Efficient 3DCG Background Creation
Naofumi Akimoto
Yuhi Matsuo
Y. Aoki
106
36
0
28 Mar 2022
Single-Stream Multi-Level Alignment for Vision-Language Pretraining
Single-Stream Multi-Level Alignment for Vision-Language Pretraining
Zaid Khan
B. Vijaykumar
Xiang Yu
S. Schulter
Manmohan Chandraker
Y. Fu
CLIPVLM
125
17
0
27 Mar 2022
Self-supervised Semantic Segmentation Grounded in Visual Concepts
Self-supervised Semantic Segmentation Grounded in Visual Concepts
Wenbin He
William C. Surmeier
A. Shekar
Liangke Gou
Liu Ren
SSL
76
7
0
25 Mar 2022
Efficient-VDVAE: Less is more
Efficient-VDVAE: Less is more
Louay Hazami
Rayhane Mama
Ragavan Thurairatnam
BDL
104
28
0
25 Mar 2022
MISC: A MIxed Strategy-Aware Model Integrating COMET for Emotional
  Support Conversation
MISC: A MIxed Strategy-Aware Model Integrating COMET for Emotional Support Conversation
Quan Tu
Yanran Li
Jianwei Cui
Bin Wang
Jiaxin Wen
Rui Yan
101
101
0
25 Mar 2022
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
Oran Gafni
Adam Polyak
Oron Ashual
Shelly Sheynin
Devi Parikh
Yaniv Taigman
DiffM
97
526
0
24 Mar 2022
Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic
  Memory
Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory
Lian Siyao
Weijiang Yu
Tianpei Gu
Chunze Lin
Quan Wang
Chao Qian
Chen Change Loy
Ziwei Liu
SLR
137
196
0
24 Mar 2022
Competency Assessment for Autonomous Agents using Deep Generative Models
Competency Assessment for Autonomous Agents using Deep Generative Models
Aastha Acharya
Rebecca L. Russell
Nisar R. Ahmed
81
11
0
23 Mar 2022
Pixel VQ-VAEs for Improved Pixel Art Representation
Pixel VQ-VAEs for Improved Pixel Art Representation
Akash Saravanan
Matthew J. Guzdial
58
8
0
23 Mar 2022
QS-Craft: Learning to Quantize, Scrabble and Craft for Conditional Human
  Motion Animation
QS-Craft: Learning to Quantize, Scrabble and Craft for Conditional Human Motion Animation
Yuxin Hong
Xuelin Qian
Simian Luo
Xiangyang Xue
Yanwei Fu
50
2
0
22 Mar 2022
VQ-Flows: Vector Quantized Local Normalizing Flows
VQ-Flows: Vector Quantized Local Normalizing Flows
Sahil Sidheekh
Chris B. Dock
Tushar Jain
R. Balan
M. Singh
73
8
0
22 Mar 2022
Interpreting Class Conditional GANs with Channel Awareness
Interpreting Class Conditional GANs with Channel Awareness
Yin-Yin He
Zhiyi Zhang
Jiapeng Zhu
Yujun Shen
Qifeng Chen
GAN
66
1
0
21 Mar 2022
The Conceptual VAE
The Conceptual VAE
R. A. Shaikh
Sara Sabrina Zemljič
Sean Tull
S. Clark
76
4
0
21 Mar 2022
PublicCheck: Public Integrity Verification for Services of Run-time Deep
  Models
PublicCheck: Public Integrity Verification for Services of Run-time Deep Models
Shuo Wang
Sharif Abuadbba
Sidharth Agarwal
Kristen Moore
Ruoxi Sun
Minhui Xue
Surya Nepal
S. Çamtepe
S. Kanhere
HILM
68
7
0
21 Mar 2022
Unified Multivariate Gaussian Mixture for Efficient Neural Image
  Compression
Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression
Xiaosu Zhu
Jingkuan Song
Lianli Gao
Fengcai Zheng
Hengtao Shen
57
64
0
21 Mar 2022
SimAN: Exploring Self-Supervised Representation Learning of Scene Text
  via Similarity-Aware Normalization
SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization
Canjie Luo
Lianwen Jin
Jingdong Chen
SSLAI4TS
92
30
0
20 Mar 2022
ViewFormer: NeRF-free Neural Rendering from Few Images Using
  Transformers
ViewFormer: NeRF-free Neural Rendering from Few Images Using Transformers
Jonávs Kulhánek
Erik Derner
Torsten Sattler
Robert Babuvska
ViT
112
75
0
18 Mar 2022
Alleviating Adversarial Attacks on Variational Autoencoders with MCMC
Alleviating Adversarial Attacks on Variational Autoencoders with MCMC
Anna Kuzina
Max Welling
Jakub M. Tomczak
AAMLDRL
102
12
0
18 Mar 2022
Improve few-shot voice cloning using multi-modal learning
Improve few-shot voice cloning using multi-modal learning
Haitong Zhang
Yue Lin
51
8
0
18 Mar 2022
AutoSDF: Shape Priors for 3D Completion, Reconstruction and Generation
AutoSDF: Shape Priors for 3D Completion, Reconstruction and Generation
Paritosh Mittal
Y. Cheng
Maneesh Singh
Shubham Tulsiani
133
230
0
17 Mar 2022
Transframer: Arbitrary Frame Prediction with Generative Models
Transframer: Arbitrary Frame Prediction with Generative Models
C. Nash
João Carreira
Jacob Walker
Iain Barr
Andrew Jaegle
Mateusz Malinowski
Peter W. Battaglia
ViT
123
38
0
17 Mar 2022
UNIMO-2: End-to-End Unified Vision-Language Grounded Learning
UNIMO-2: End-to-End Unified Vision-Language Grounded Learning
Wei Li
Can Gao
Guocheng Niu
Xinyan Xiao
Hao Liu
Jiachen Liu
Hua Wu
Haifeng Wang
MLLM
51
22
0
17 Mar 2022
DU-VLG: Unifying Vision-and-Language Generation via Dual
  Sequence-to-Sequence Pre-training
DU-VLG: Unifying Vision-and-Language Generation via Dual Sequence-to-Sequence Pre-training
Luyang Huang
Guocheng Niu
Jiachen Liu
Xinyan Xiao
Hua Wu
VLMCoGe
58
8
0
17 Mar 2022
Implicit Feature Decoupling with Depthwise Quantization
Implicit Feature Decoupling with Depthwise Quantization
Iordanis Fostiropoulos
Barry W. Boehm
52
2
0
15 Mar 2022
Text-free non-parallel many-to-many voice conversion using normalising
  flows
Text-free non-parallel many-to-many voice conversion using normalising flows
Thomas Merritt
Abdelhamid Ezzerg
Piotr Bilinski
Magdalena Proszewska
Kamil Pokora
Roberto Barra-Chicote
Daniel Korzekwa
116
15
0
15 Mar 2022
Style Transformer for Image Inversion and Editing
Style Transformer for Image Inversion and Editing
Xueqi Hu
Qiusheng Huang
Zhengyi Shi
Siyuan Li
Changxin Gao
Li Sun
Qingli Li
89
56
0
15 Mar 2022
Unsupervised Extractive Opinion Summarization Using Sparse Coding
Unsupervised Extractive Opinion Summarization Using Sparse Coding
Somnath Basu Roy Chowdhury
Chao Zhao
Snigdha Chaturvedi
83
25
0
15 Mar 2022
Privacy-Preserving Speech Representation Learning using Vector
  Quantization
Privacy-Preserving Speech Representation Learning using Vector Quantization
Pierre Champion
D. Jouvet
Anthony Larcher
SSL
18
0
0
15 Mar 2022
Modelling word learning and recognition using visually grounded speech
Modelling word learning and recognition using visually grounded speech
Danny Merkx
Sebastiaan Scholten
S. Frank
M. Ernestus
O. Scharenborg
SSL
130
0
0
14 Mar 2022
Semi-Discrete Normalizing Flows through Differentiable Tessellation
Semi-Discrete Normalizing Flows through Differentiable Tessellation
Ricky T. Q. Chen
Brandon Amos
Maximilian Nickel
88
10
0
14 Mar 2022
MVP: Multimodality-guided Visual Pre-training
MVP: Multimodality-guided Visual Pre-training
Longhui Wei
Lingxi Xie
Wen-gang Zhou
Houqiang Li
Qi Tian
91
108
0
10 Mar 2022
KPE: Keypoint Pose Encoding for Transformer-based Image Generation
KPE: Keypoint Pose Encoding for Transformer-based Image Generation
Soon Yau Cheong
A. Mustafa
Andrew Gilbert
ViT
85
10
0
09 Mar 2022
FlexIT: Towards Flexible Semantic Image Translation
FlexIT: Towards Flexible Semantic Image Translation
Guillaume Couairon
Asya Grechka
Jakob Verbeek
Holger Schwenk
Matthieu Cord
DiffM
114
38
0
09 Mar 2022
Practical cognitive speech compression
Practical cognitive speech compression
Reza Lotfidereshgi
P. Gournay
59
2
0
08 Mar 2022
Hierarchical Sketch Induction for Paraphrase Generation
Hierarchical Sketch Induction for Paraphrase Generation
Tom Hosking
Hao Tang
Mirella Lapata
BDL
116
32
0
07 Mar 2022
Sparsity-Inducing Categorical Prior Improves Robustness of the
  Information Bottleneck
Sparsity-Inducing Categorical Prior Improves Robustness of the Information Bottleneck
Anirban Samaddar
Sandeep Madireddy
Prasanna Balaprakash
Tapabrata Maiti
Gustavo de los Campos
Ian Fischer
57
1
0
04 Mar 2022
Show Me What and Tell Me How: Video Synthesis via Multimodal
  Conditioning
Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning
Ligong Han
Jian Ren
Hsin-Ying Lee
Francesco Barbieri
Kyle Olszewski
Shervin Minaee
Dimitris N. Metaxas
Sergey Tulyakov
DiffMVGen
142
41
0
04 Mar 2022
Differentiable Causal Discovery Under Latent Interventions
Differentiable Causal Discovery Under Latent Interventions
Gonccalo R. A. Faria
André F. T. Martins
Mário A. T. Figueiredo
BDLCMLOOD
95
23
0
04 Mar 2022
Autoregressive Image Generation using Residual Quantization
Autoregressive Image Generation using Residual Quantization
Doyup Lee
Chiheon Kim
Saehoon Kim
Minsu Cho
Wook-Shin Han
VGen
295
378
0
03 Mar 2022
Audio Self-supervised Learning: A Survey
Audio Self-supervised Learning: A Survey
Shuo Liu
Adria Mallol-Ragolta
Emilia Parada-Cabeleiro
Kun Qian
Xingshuo Jing
Alexander Kathan
Bin Hu
Bjoern W. Schuller
SSL
104
109
0
02 Mar 2022
Variational Autoencoders Without the Variation
Variational Autoencoders Without the Variation
Gregory A. Daly
J. Fieldsend
G. Tabor
68
2
0
01 Mar 2022
CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP
CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP
Zihao Wang
Wei Liu
Qian He
Xin-ru Wu
Zili Yi
CLIPVLM
268
75
0
01 Mar 2022
A Brief Overview of Unsupervised Neural Speech Representation Learning
A Brief Overview of Unsupervised Neural Speech Representation Learning
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
Lars Maaløe
Christian Igel
BDLAI4TSSSL
101
11
0
01 Mar 2022
LISA: Learning Interpretable Skill Abstractions from Language
LISA: Learning Interpretable Skill Abstractions from Language
Divyansh Garg
Skanda Vaidyanath
Kuno Kim
Jiaming Song
Stefano Ermon
LM&RoOffRL
253
30
0
28 Feb 2022
Weakly Supervised Disentangled Representation for Goal-conditioned
  Reinforcement Learning
Weakly Supervised Disentangled Representation for Goal-conditioned Reinforcement Learning
Zhifeng Qian
Mingyu You
Hongjun Zhou
Bin He
DRLOffRL
70
7
0
28 Feb 2022
Variational Autoencoder with Disentanglement Priors for Low-Resource
  Task-Specific Natural Language Generation
Variational Autoencoder with Disentanglement Priors for Low-Resource Task-Specific Natural Language Generation
Zhuang Li
Zhuang Li
Xingliang Yuan
Tongtong Wu
Tianyang Zhan
Gholamreza Haffari
CoGeUDDRL
122
4
0
27 Feb 2022
Controllable Natural Language Generation with Contrastive Prefixes
Controllable Natural Language Generation with Contrastive Prefixes
Jing Qian
Li Dong
Yelong Shen
Furu Wei
Weizhu Chen
101
100
0
27 Feb 2022
Real-World Blind Super-Resolution via Feature Matching with Implicit
  High-Resolution Priors
Real-World Blind Super-Resolution via Feature Matching with Implicit High-Resolution Priors
Chaofeng Chen
Xinyu Shi
Yipeng Qin
Xiaoming Li
Xiaoguang Han
Taojiannan Yang
Shihui Guo
106
118
0
26 Feb 2022
Retriever: Learning Content-Style Representation as a Token-Level
  Bipartite Graph
Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph
Dacheng Yin
Xuanchi Ren
Chong Luo
Yuwang Wang
Zhiwei Xiong
Wenjun Zeng
114
13
0
24 Feb 2022
Previous
123...515253...646566
Next