Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1711.00937
Cited By
Neural Discrete Representation Learning
2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Neural Discrete Representation Learning"
50 / 2,788 papers shown
Title
MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection
Youngmin Oh
Hyung-Il Kim
Seong Tae Kim
Jung Uk Kim
DiffM
44
2
0
23 Jul 2024
On Differentially Private 3D Medical Image Synthesis with Controllable Latent Diffusion Models
Deniz Daum
Richard Osuala
Anneliese Riess
Georgios Kaissis
Julia A. Schnabel
Maxime Di Folco
MedIm
58
0
0
23 Jul 2024
CarFormer: Self-Driving with Learned Object-Centric Representations
Shadi S. Hamdan
Fatma Guney
3DPC
OCL
51
3
0
22 Jul 2024
QueST: Self-Supervised Skill Abstractions for Learning Continuous Control
Atharva Mete
Haotian Xue
Albert Wilcox
Yongxin Chen
Animesh Garg
SSL
40
17
0
22 Jul 2024
Decomposition of Neural Discrete Representations for Large-Scale 3D Mapping
Minseong Park
Suhan Woo
Euntai Kim
3DV
38
0
0
22 Jul 2024
WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation
Zirui Shao
Feiyu Gao
Hangdi Xing
Zepeng Zhu
Zhi Yu
Jiajun Bu
Qi Zheng
Cong Yao
36
2
0
22 Jul 2024
DiffX: Guide Your Layout to Cross-Modal Generative Modeling
Zeyu Wang
Jingyu Lin
Yifei Qian
Yi Huang
Shicen Tian
...
Qu Yang
Lan Du
Cunjian Chen
Yufei Guo
Kejie Huang
DiffM
VLM
30
2
0
22 Jul 2024
Diverse Image Harmonization
Xinhao Tao
Tianyuan Qiu
Junyan Cao
Li Niu
40
0
0
22 Jul 2024
Exploring the Effectiveness of Object-Centric Representations in Visual Question Answering: Comparative Insights with Foundation Models
Amir Mohammad Karimi Mamaghan
Samuele Papa
Karl Henrik Johansson
Stefan Bauer
Andrea Dittadi
OCL
53
5
0
22 Jul 2024
Improving EEG Classification Through Randomly Reassembling Original and Generated Data with Transformer-based Diffusion Models
Mingzhi Chen
Yiyu Gui
Yuqi Su
Yuesheng Zhu
Guibo Luo
Yuchao Yang
DiffM
MedIm
26
0
0
20 Jul 2024
Intelligent Artistic Typography: A Comprehensive Review of Artistic Text Design and Generation
Yuhang Bai
Zichuan Huang
Wenshuo Gao
Shuai Yang
Jiaying Liu
49
5
0
20 Jul 2024
M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models
Seung-geun Chi
Hyung-Gun Chi
Hengbo Ma
Nakul Agarwal
Faizan Siddiqui
Karthik Ramani
Kwonjoon Lee
DiffM
51
11
0
19 Jul 2024
Decomposed Vector-Quantized Variational Autoencoder for Human Grasp Generation
Zhe Zhao
Mengshi Qi
Huadong Ma
DRL
49
2
0
19 Jul 2024
LIMT: Language-Informed Multi-Task Visual World Models
Elie Aljalbout
Nikolaos Sotirakis
Patrick van der Smagt
Maximilian Karl
Nutan Chen
55
5
0
18 Jul 2024
LLM-Empowered State Representation for Reinforcement Learning
Boyuan Wang
Yun Qu
Yuhang Jiang
Jianzhun Shao
Chang-rui Liu
Wenming Yang
Xiangyang Ji
45
7
0
18 Jul 2024
High-Quality Tabular Data Generation using Post-Selected VAE
Volodymyr Shulakov
31
1
0
17 Jul 2024
GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook Retrieval
Han Zhou
Wei Dong
Xiaohong Liu
Shuaicheng Liu
Xiongkuo Min
Guangtao Zhai
Jun Chen
69
13
0
17 Jul 2024
HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects
Xintao Lv
Liang Xu
Yichao Yan
Xin Jin
Congsheng Xu
...
Yifan Liu
Lincheng Li
Mengxiao Bi
Wenjun Zeng
Xiaokang Yang
49
7
0
17 Jul 2024
Exploiting Inter-Image Similarity Prior for Low-Bitrate Remote Sensing Image Compression
Junhui Li
Xingsong Hou
34
1
0
17 Jul 2024
Quantised Global Autoencoder: A Holistic Approach to Representing Visual Data
Tim Elsner
Paula Usinger
Victor Czech
Gregor Kobsik
Yanjiang He
I. Lim
Leif Kobbelt
49
1
0
16 Jul 2024
Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Large Models
Chen Ju
Haicheng Wang
Haozhe Cheng
Xu Chen
Zhonghua Zhai
Weilin Huang
Jinsong Lan
Shuai Xiao
Bo Zheng
VLM
59
5
0
16 Jul 2024
UP-Diff: Latent Diffusion Model for Remote Sensing Urban Prediction
Zeyu Wang
Zecheng Hao
Jingyu Lin
Yuchao Feng
Yufei Guo
37
2
0
16 Jul 2024
Length-Aware Motion Synthesis via Latent Diffusion
Alessio Sampieri
Alessio Palma
Indro Spinelli
Fabio Galasso
VGen
DiffM
59
7
0
16 Jul 2024
Semi-Supervised Generative Models for Disease Trajectories: A Case Study on Systemic Sclerosis
Cécile Trottet
Manuel Schürch
Ahmed Allam
Imon Barua
L. Petelytska
...
Mislav Radic
Oliver Distler
A. Hoffmann-Vold
Michael Krauthammer
Eustar collaborators
MedIm
34
0
0
16 Jul 2024
COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation
Liu He
Daniel G. Aliaga
AI4TS
59
8
0
16 Jul 2024
LiteFocus: Accelerated Diffusion Inference for Long Audio Synthesis
Zhenxiong Tan
Xinyin Ma
Gongfan Fang
Xinchao Wang
44
3
0
15 Jul 2024
BandControlNet: Parallel Transformers-based Steerable Popular Music Generation with Fine-Grained Spatiotemporal Features
Jing Luo
Xinyu Yang
Dorien Herremans
39
3
0
15 Jul 2024
Exploring the Potentials and Challenges of Deep Generative Models in Product Design Conception
Phillip Mueller
Lars Mikelsons
AI4CE
46
1
0
15 Jul 2024
Disrupting Diffusion-based Inpainters with Semantic Digression
Geonho Son
Juhun Lee
Simon S. Woo
DiffM
42
3
0
14 Jul 2024
Latent Spaces Enable Transformer-Based Dose Prediction in Complex Radiotherapy Plans
E. Wang
Ryan Au
Pencilla Lang
Sarah Mattonen
MedIm
41
0
0
11 Jul 2024
A Comprehensive Survey on Human Video Generation: Challenges, Methods, and Insights
Wentao Lei
Jinting Wang
Fengji Ma
Guanjie Huang
Li Liu
VGen
EGVM
70
8
0
11 Jul 2024
Several questions of visual generation in 2024
Shuyang Gu
40
1
0
11 Jul 2024
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
Wanggui He
Siming Fu
Mushui Liu
Xierui Wang
Wenyi Xiao
...
Zhelun Yu
Haoyuan Li
Ziwei Huang
Leilei Gan
Hao Jiang
DiffM
29
23
0
10 Jul 2024
Video In-context Learning: Autoregressive Transformers are Zero-Shot Video Imitators
Wentao Zhang
Junliang Guo
Tianyu He
Li Zhao
Linli Xu
Jiang Bian
47
3
0
10 Jul 2024
Latent Space Imaging
Matheus Souza
Yidan Zheng
Kaizhang Kang
Yogeshwar Nath Mishra
Qiang Fu
Wolfgang Heidrich
65
0
0
09 Jul 2024
ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation
Ethan Chern
Jiadi Su
Yan Ma
Pengfei Liu
MLLM
34
29
0
08 Jul 2024
PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models
Jinhua Zhang
Hualian Sheng
Sijia Cai
Bing Deng
Qiao Liang
Wen Li
Ying Fu
Jieping Ye
Shuhang Gu
DiffM
39
2
0
08 Jul 2024
Unmasking Trees for Tabular Data
Calvin McCarter
50
3
0
08 Jul 2024
Read, Watch and Scream! Sound Generation from Text and Video
Yujin Jeong
Yunji Kim
Sanghyuk Chun
Jiyoung Lee
VGen
DiffM
42
12
0
08 Jul 2024
Balance of Number of Embedding and their Dimensions in Vector Quantization
Hang Chen
Sankepally Sainath Reddy
Ziwei Chen
Dianbo Liu
54
1
0
06 Jul 2024
Improving ensemble extreme precipitation forecasts using generative artificial intelligence
Yingkai Sha
Ryan Sobash
David John Gagne II
38
0
0
05 Jul 2024
Feature Attenuation of Defective Representation Can Resolve Incomplete Masking on Anomaly Detection
Yeonghyeon Park
Sungho Kang
Myung Jin Kim
Hyeong Seok Kim
Juneho Yi
AAML
44
0
0
05 Jul 2024
MS2SL: Multimodal Spoken Data-Driven Continuous Sign Language Production
Jian Ma
Wenguan Wang
Yi Yang
Feng Zheng
50
1
0
04 Jul 2024
NEBULA: Neural Empirical Bayes Under Latent Representations for Efficient and Controllable Design of Molecular Libraries
E. Nowara
Pedro H. O. Pinheiro
Sai Pooja Mahajan
Omar Mahmood
Andrew Watkins
Saeed Saremi
Michael R. Maser
BDL
DiffM
49
2
0
03 Jul 2024
HoloHisto: End-to-end Gigapixel WSI Segmentation with 4K Resolution Sequential Tokenization
Yucheng Tang
Yufan He
Vishwesh Nath
Pengfeig Guo
Ruining Deng
...
Ziyue Xu
Holger Roth
Daguang Xu
Haichun Yang
Yuankai Huo
30
4
0
03 Jul 2024
DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents
Yilun Xu
Gabriele Corso
Tommi Jaakkola
Arash Vahdat
Karsten Kreis
49
12
0
03 Jul 2024
Non-Adversarial Learning: Vector-Quantized Common Latent Space for Multi-Sequence MRI
Luyi Han
T. Tan
Tianyu Zhang
Xin Wang
Yuan Gao
Chunyao Lu
Xinglong Liang
Haoran Dou
Yunzhi Huang
Ritse Mann
DRL
MedIm
25
0
0
03 Jul 2024
Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation
Zhibin Lan
Liqiang Niu
Fandong Meng
Jie Zhou
Min Zhang
Jinsong Su
VLM
39
6
0
03 Jul 2024
Uniform Transformation: Refining Latent Representation in Variational Autoencoders
Ye Shi
C. S. G. Lee
OOD
DRL
37
0
0
02 Jul 2024
MORPHEUS: Modeling Role from Personalized Dialogue History by Exploring and Utilizing Latent Space
Yihong Tang
Bo Wang
Dongming Zhao
Xiaojia Jin
Jijun Zhang
Ruifang He
Yuexian Hou
50
2
0
02 Jul 2024
Previous
1
2
3
...
15
16
17
...
54
55
56
Next