ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.00937
  4. Cited By
Neural Discrete Representation Learning
v1v2 (latest)

Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
    BDLSSLOCL
ArXiv (abs)PDFHTML

Papers citing "Neural Discrete Representation Learning"

50 / 3,267 papers shown
Title
Rhythmic Gesticulator: Rhythm-Aware Co-Speech Gesture Synthesis with
  Hierarchical Neural Embeddings
Rhythmic Gesticulator: Rhythm-Aware Co-Speech Gesture Synthesis with Hierarchical Neural Embeddings
Tenglong Ao
Qingzhe Gao
Yuke Lou
Baoquan Chen
Libin Liu
SLR
94
64
0
04 Oct 2022
Visual Prompt Tuning for Generative Transfer Learning
Visual Prompt Tuning for Generative Transfer Learning
Kihyuk Sohn
Yuan Hao
José Lezama
Luisa F. Polanía
Huiwen Chang
Han Zhang
Irfan Essa
Lu Jiang
VPVLMVLM
166
89
0
03 Oct 2022
Unsupervised Multi-View Object Segmentation Using Radiance Field
  Propagation
Unsupervised Multi-View Object Segmentation Using Radiance Field Propagation
Xinhang Liu
Jiaben Chen
Huai Yu
Yu-Wing Tai
Chi-Keung Tang
154
28
0
02 Oct 2022
Compositional Generalization in Unsupervised Compositional
  Representation Learning: A Study on Disentanglement and Emergent Language
Compositional Generalization in Unsupervised Compositional Representation Learning: A Study on Disentanglement and Emergent Language
Zhenlin Xu
Marc Niethammer
Colin Raffel
CoGeOODOCLDRL
111
44
0
02 Oct 2022
Contrastive Corpus Attribution for Explaining Representations
Contrastive Corpus Attribution for Explaining Representations
Christy Lin
Hugh Chen
Chanwoo Kim
Su-In Lee
SSL
61
8
0
30 Sep 2022
Rethinking the Learning Paradigm for Facial Expression Recognition
Rethinking the Learning Paradigm for Facial Expression Recognition
Weijie Wang
N. Sebe
Bruno Lepri
91
3
0
30 Sep 2022
AudioGen: Textually Guided Audio Generation
AudioGen: Textually Guided Audio Generation
Felix Kreuk
Gabriel Synnaeve
Adam Polyak
Uriel Singer
Alexandre Défossez
Jade Copet
Devi Parikh
Yaniv Taigman
Yossi Adi
DiffM
154
309
0
30 Sep 2022
Mind Reader: Reconstructing complex images from brain activities
Mind Reader: Reconstructing complex images from brain activities
Sikun Lin
Thomas C. Sprague
Ambuj K. Singh
DiffM
192
91
0
30 Sep 2022
Bridging the Gap to Real-World Object-Centric Learning
Bridging the Gap to Real-World Object-Centric Learning
Maximilian Seitzer
Max Horn
Andrii Zadaianchuk
Dominik Zietlow
Tianjun Xiao
...
Tong He
Zheng Zhang
Bernhard Schölkopf
Thomas Brox
Francesco Locatello
OCL
143
153
0
29 Sep 2022
Training β-VAE by Aggregating a Learned Gaussian Posterior with a
  Decoupled Decoder
Training β-VAE by Aggregating a Learned Gaussian Posterior with a Decoupled Decoder
Jianning Li
Jana Fragemann
Seyed-Ahmad Ahmadi
Jens Kleesiek
Jan Egger
DRL
70
5
0
29 Sep 2022
Learning Parsimonious Dynamics for Generalization in Reinforcement
  Learning
Learning Parsimonious Dynamics for Generalization in Reinforcement Learning
Tankred Saanum
Eric Schulz
60
1
0
29 Sep 2022
Re-Imagen: Retrieval-Augmented Text-to-Image Generator
Re-Imagen: Retrieval-Augmented Text-to-Image Generator
Wenhu Chen
Hexiang Hu
Chitwan Saharia
William W. Cohen
VLM
218
178
0
29 Sep 2022
Multi-Sample Training for Neural Image Compression
Multi-Sample Training for Neural Image Compression
Tongda Xu
Yan Wang
Dailan He
Chenjian Gao
Han-yi Gao
Kun Liu
Hongwei Qin
71
5
0
28 Sep 2022
Deep Generative Multimedia Children's Literature
Deep Generative Multimedia Children's Literature
Matthew Lyle Olson
52
0
0
27 Sep 2022
Learning to Drop Out: An Adversarial Approach to Training Sequence VAEs
Learning to Drop Out: An Adversarial Approach to Training Sequence VAEs
Ðorðe Miladinovic
Kumar Shridhar
Kushal Kumar Jain
Max B. Paulus
J. M. Buhmann
Mrinmaya Sachan
Carl Allen
DRL
116
5
0
26 Sep 2022
Vector Quantized Semantic Communication System
Vector Quantized Semantic Communication System
Qifan Fu
Huiqiang Xie
Zhijin Qin
Greg Slabaugh
Xiaoming Tao
90
45
0
23 Sep 2022
Variational Open-Domain Question Answering
Variational Open-Domain Question Answering
Valentin Liévin
Andreas Geert Motzfeldt
Ida Riis Jensen
Ole Winther
OODBDL
80
9
0
23 Sep 2022
Implementing and Experimenting with Diffusion Models for Text-to-Image
  Generation
Implementing and Experimenting with Diffusion Models for Text-to-Image Generation
Robin Zbinden
46
3
0
22 Sep 2022
A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural
  TTS
A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS
Haohan Guo
Fenglong Xie
Frank Soong
Xixin Wu
Helen M. Meng
87
12
0
22 Sep 2022
Attention Beats Concatenation for Conditioning Neural Fields
Attention Beats Concatenation for Conditioning Neural Fields
Daniel Rebain
Mark J. Matthews
K. M. Yi
Gopal Sharma
Dmitry Lagun
Andrea Tagliasacchi
AI4CE
90
23
0
21 Sep 2022
Continuous Mixtures of Tractable Probabilistic Models
Continuous Mixtures of Tractable Probabilistic Models
Alvaro H. C. Correia
G. Gala
Erik Quaeghebeur
Cassio de Campos
Robert Peharz
TPM
96
18
0
21 Sep 2022
Robust Information Bottleneck for Task-Oriented Communication with
  Digital Modulation
Robust Information Bottleneck for Task-Oriented Communication with Digital Modulation
Songjie Xie
Shuaijie Ma
Ming Ding
Yuanming Shi
Ming-Fu Tang
Youlong Wu
120
74
0
21 Sep 2022
Deep Learning for Multi-User MIMO Systems: Joint Design of Pilot,
  Limited Feedback, and Precoding
Deep Learning for Multi-User MIMO Systems: Joint Design of Pilot, Limited Feedback, and Precoding
Jeonghyeon Jang
Hoon Lee
Il-Min Kim
Inkyu Lee
34
25
0
21 Sep 2022
Text2Light: Zero-Shot Text-Driven HDR Panorama Generation
Text2Light: Zero-Shot Text-Driven HDR Panorama Generation
Zhaoxi Chen
Guangcong Wang
Ziwei Liu
178
30
0
20 Sep 2022
MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation
MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation
Chuanxia Zheng
L. Vuong
Jianfei Cai
Dinh Q. Phung
MQ
149
80
0
19 Sep 2022
Adaptive Multi-stage Density Ratio Estimation for Learning Latent Space
  Energy-based Model
Adaptive Multi-stage Density Ratio Estimation for Learning Latent Space Energy-based Model
Zhisheng Xiao
Tian Han
108
15
0
19 Sep 2022
Can segmentation models be trained with fully synthetically generated
  data?
Can segmentation models be trained with fully synthetically generated data?
Virginia Fernandez
W. H. Pinaya
Pedro Borges
Petru-Daniel Tudosiu
M. Graham
Tom Vercauteren
M. Jorge Cardoso
DiffMMedIm
112
47
0
17 Sep 2022
Learning Distinct and Representative Styles for Image Captioning
Learning Distinct and Representative Styles for Image Captioning
Qi Chen
Chaorui Deng
Qi Wu
VLM
86
24
0
17 Sep 2022
Enhance the Visual Representation via Discrete Adversarial Training
Enhance the Visual Representation via Discrete Adversarial Training
Xiaofeng Mao
YueFeng Chen
Ranjie Duan
Yao Zhu
Gege Qi
Shaokai Ye
Xiaodan Li
Rong Zhang
Hui Xue
116
33
0
16 Sep 2022
One-Shot Synthesis of Images and Segmentation Masks
One-Shot Synthesis of Images and Segmentation Masks
V. Sushko
Dan Zhang
Juergen Gall
Anna Khoreva
99
6
0
15 Sep 2022
HARP: Autoregressive Latent Video Prediction with High-Fidelity Image
  Generator
HARP: Autoregressive Latent Video Prediction with High-Fidelity Image Generator
Younggyo Seo
Kimin Lee
Fangchen Liu
Stephen James
Pieter Abbeel
VGen
70
29
0
15 Sep 2022
Fair Inference for Discrete Latent Variable Models
Fair Inference for Discrete Latent Variable Models
Rashidul Islam
Shimei Pan
James R. Foulds
FaML
93
1
0
15 Sep 2022
Non-Parallel Voice Conversion for ASR Augmentation
Non-Parallel Voice Conversion for ASR Augmentation
Gary Wang
Andrew Rosenberg
Bhuvana Ramabhadran
Fadi Biadsy
Yinghui Huang
Jesse Emond
P. M. Mengibar
106
2
0
15 Sep 2022
StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story
  Continuation
StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story Continuation
A. Maharana
Darryl Hannan
Joey Tianyi Zhou
DiffM
112
83
0
13 Sep 2022
SeRP: Self-Supervised Representation Learning Using Perturbed Point
  Clouds
SeRP: Self-Supervised Representation Learning Using Perturbed Point Clouds
Siddhant Garg
Mudit Chaudhary
3DPC
81
2
0
13 Sep 2022
Unsupervised representation learning with recognition-parametrised
  probabilistic models
Unsupervised representation learning with recognition-parametrised probabilistic models
William I. Walker
Hugo Soulat
Changmin Yu
M. Sahani
BDL
65
5
0
13 Sep 2022
Residual Correction in Real-Time Traffic Forecasting
Residual Correction in Real-Time Traffic Forecasting
Daejin Kim
Young Cho
Dongmin Kim
Cheonbok Park
Jaegul Choo
108
7
0
12 Sep 2022
Diffusion Models in Vision: A Survey
Diffusion Models in Vision: A Survey
Florinel-Alin Croitoru
Vlad Hondru
Radu Tudor Ionescu
M. Shah
DiffMVLMMedIm
376
1,260
0
10 Sep 2022
Improved Masked Image Generation with Token-Critic
Improved Masked Image Generation with Token-Critic
José Lezama
Huiwen Chang
Lu Jiang
Irfan Essa
DiffM
255
48
0
09 Sep 2022
Dr. Neurosymbolic, or: How I Learned to Stop Worrying and Accept
  Statistics
Dr. Neurosymbolic, or: How I Learned to Stop Worrying and Accept Statistics
Masataro Asai
144
0
0
08 Sep 2022
Text-Free Learning of a Natural Language Interface for Pretrained Face
  Generators
Text-Free Learning of a Natural Language Interface for Pretrained Face Generators
Xiaodan Du
Raymond A. Yeh
Nicholas I. Kolkin
Eli Shechtman
Gregory Shakhnarovich
CLIP
68
1
0
08 Sep 2022
Foundations and Trends in Multimodal Machine Learning: Principles,
  Challenges, and Open Questions
Foundations and Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions
Paul Pu Liang
Amir Zadeh
Louis-Philippe Morency
114
90
0
07 Sep 2022
Morphology-preserving Autoregressive 3D Generative Modelling of the
  Brain
Morphology-preserving Autoregressive 3D Generative Modelling of the Brain
Petru-Daniel Tudosiu
W. H. Pinaya
M. Graham
Pedro Borges
Virginia Fernandez
...
Disha Mehra
M. Vella
P. Nachev
Sebastien Ourselin
M. Jorge Cardoso
3DHDiffMMedIm
63
21
0
07 Sep 2022
AudioLM: a Language Modeling Approach to Audio Generation
AudioLM: a Language Modeling Approach to Audio Generation
Zalan Borsos
Raphaël Marinier
Damien Vincent
Eugene Kharitonov
Olivier Pietquin
...
Dominik Roblek
O. Teboul
David Grangier
Marco Tagliasacchi
Neil Zeghidour
AuLLM
200
617
0
07 Sep 2022
A Survey on Generative Diffusion Model
A Survey on Generative Diffusion Model
Hanqun Cao
Cheng Tan
Zhangyang Gao
Yilun Xu
Guangyong Chen
Pheng-Ann Heng
Stan Z. Li
MedIm
332
239
0
06 Sep 2022
Semantic Image Synthesis with Semantically Coupled VQ-Model
Semantic Image Synthesis with Semantically Coupled VQ-Model
Stephan Alaniz
Thomas Hummel
Zeynep Akata
60
6
0
06 Sep 2022
Forensicability Assessment of Questioned Images in Recapturing Detection
Forensicability Assessment of Questioned Images in Recapturing Detection
Changsheng Chen
Lin Zhao
Rizhao Cai
Zitong Yu
Jiwu Huang
Alex C. Kot
AAMLCVBM
53
0
0
05 Sep 2022
An Empirical Study of End-to-End Video-Language Transformers with Masked
  Visual Modeling
An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling
Tsu-Jui Fu
Linjie Li
Zhe Gan
Kevin Qinghong Lin
William Yang Wang
Lijuan Wang
Zicheng Liu
VLM
144
65
0
04 Sep 2022
DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for
  Text-to-Image Generation
DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for Text-to-Image Generation
Mengqi Huang
Zhendong Mao
Penghui Wang
Quang Wang
Yongdong Zhang
70
21
0
03 Sep 2022
Visual Prompting via Image Inpainting
Visual Prompting via Image Inpainting
Amir Bar
Yossi Gandelsman
Trevor Darrell
Amir Globerson
Alexei A. Efros
VLMVPVLM
91
212
0
01 Sep 2022
Previous
123...464748...646566
Next