1711.00937
Neural Discrete Representation Learning
2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
Papers citing "Neural Discrete Representation Learning"
50 / 2,790 papers shown
Title
Conditional Distribution Modelling for Few-Shot Image Synthesis with Diffusion Models
Parul Gupta
Munawar Hayat
Abhinav Dhall
Thanh-Toan Do
DiffM
51
1
0
25 Apr 2024
An Analysis of Recent Advances in Deepfake Image Detection in an Evolving Threat Landscape
Sifat Muhammad Abdullah
Aravind Cheruvu
Shravya Kanchi
Taejoong Chung
Peng Gao
Murtuza Jadliwala
Bimal Viswanath
AAML
34
12
0
24 Apr 2024
RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance
Chengrui Wang
Pengfei Liu
Min Zhou
Ming Zeng
Xubin Li
Tiezheng Ge
Bo Zheng
DiffM
60
5
0
22 Apr 2024
Motion-aware Latent Diffusion Models for Video Frame Interpolation
Zhilin Huang
Yijie Yu
Ling Yang
C. Qin
Bing Zheng
Xiawu Zheng
Zikun Zhou
Yaowei Wang
Wenming Yang
VGen
DiffM
40
7
0
21 Apr 2024
HybridFlow: Infusing Continuity into Masked Codebook for Extreme Low-Bitrate Image Compression
Lei Lu
Yanyue Xie
Wei Jiang
Wei Wang
Xue Lin
Yanzhi Wang
50
5
0
20 Apr 2024
Purposer: Putting Human Motion Generation in Context
Nicolas Ugrinovic
Thomas Lucas
Fabien Baradel
Philippe Weinzaepfel
Grégory Rogez
Francesc Moreno-Noguer
DiffM
54
2
0
19 Apr 2024
Learn2Talk: 3D Talking Face Learns from 2D Talking Face
Yixiang Zhuang
Baoping Cheng
Yao Cheng
Yuntao Jin
Renshuai Liu
Chengyang Li
Xuan Cheng
Jing Liao
Juncong Lin
CVBM
3DH
42
6
0
19 Apr 2024
MCM: Multi-condition Motion Synthesis Framework
Zeyu Ling
Bo Han
Yongkang Wang
Han Lin
Mohan Kankanhalli
Weidong Geng
48
1
0
19 Apr 2024
DISC: Latent Diffusion Models with Self-Distillation from Separated Conditions for Prostate Cancer Grading
M. M. Ho
Elham Ghelichkhan
Yosep Chong
Yufei Zhou
Beatrice Knudsen
Tolga Tasdizen
MedIm
DiffM
42
2
0
19 Apr 2024
GenVideo: One-shot Target-image and Shape Aware Video Editing using T2I Diffusion Models
Sai Sree Harsha
Ambareesh Revanur
Dhwanit Agarwal
Shradha Agrawal
VGen
DiffM
48
3
0
18 Apr 2024
G-HOP: Generative Hand-Object Prior for Interaction Reconstruction and Grasp Synthesis
Yufei Ye
Abhinav Gupta
Kris Kitani
Shubham Tulsiani
49
16
0
18 Apr 2024
An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training
Jin Gao
Shubo Lin
Shaoru Wang
Yutong Kou
Zeming Li
Liang Li
Congxuan Zhang
Xiaoqin Zhang
Yizheng Wang
Weiming Hu
52
1
0
18 Apr 2024
MIDGET: Music Conditioned 3D Dance Generation
Jinwu Wang
Wei Mao
Miaomiao Liu
42
0
0
18 Apr 2024
A Data-Driven Representation for Sign Language Production
Harry Walsh
Abolfazl Ravanshad
Mariam Rahmani
Richard Bowden
SLR
34
3
0
17 Apr 2024
Closely Interactive Human Reconstruction with Proxemics and Physics-Guided Adaption
Buzhen Huang
Chen Li
Chongyang Xu
Liang Pan
Yangang Wang
Gim Hee Lee
38
5
0
17 Apr 2024
Personalized Heart Disease Detection via ECG Digital Twin Generation
Yaojun Hu
Jintai Chen
Lianting Hu
Dantong Li
Jiahuan Yan
Haochao Ying
Huiying Liang
Jian Wu
36
4
0
17 Apr 2024
LongVQ: Long Sequence Modeling with Vector Quantization on Structured Memory
Zicheng Liu
Li Wang
Siyuan Li
Zedong Wang
Haitao Lin
Stan Z. Li
VLM
32
4
0
17 Apr 2024
Variational quantization for state space models
Etienne David
Jean Bellot
Sylvain Le Corff
AI4TS
BDL
33
1
0
17 Apr 2024
Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning
Kyle Hsu
Jubayer Ibn Hamid
Kaylee Burns
Chelsea Finn
Jiajun Wu
CML
33
5
0
16 Apr 2024
Exploring Text-to-Motion Generation with Human Preference
Jenny Sheng
Matthieu Lin
Andrew Zhao
Kevin Pruvost
Yu-Hui Wen
Yangguang Li
Gao Huang
Yong-Jin Liu
VGen
42
1
0
15 Apr 2024
Foundational GPT Model for MEG
Richard Csaky
M. Es
Oiwi Parker Jones
M. Woolrich
45
2
0
14 Apr 2024
LoopAnimate: Loopable Salient Object Animation
Fanyi Wang
Peng Liu
Haotian Hu
Dan Meng
Jingwen Su
Jinjin Xu
Yanhao Zhang
Xiaoming Ren
Zhiwang Zhang
VGen
47
2
0
14 Apr 2024
MaSkel: A Model for Human Whole-body X-rays Generation from Human Masking Images
Yingjie Xi
Boyuan Cheng
Jingyao Cai
Jian Jun Zhang
Xiaosong Yang
MedIm
52
0
0
13 Apr 2024
Study of Emotion Concept Formation by Integrating Vision, Physiology, and Word Information using Multilayered Multimodal Latent Dirichlet Allocation
Kazuki Tsurumaki
Chie Hieida
Kazuki Miyazawa
28
0
0
12 Apr 2024
Context-aware Video Anomaly Detection in Long-Term Datasets
Zhengye Yang
Richard J. Radke
36
6
0
11 Apr 2024
Encoding Urban Ecologies: Automated Building Archetype Generation through Self-Supervised Learning for Energy Modeling
Xinwei Zhuang
Zixun Huang
Wentao Zeng
Luisa Caldas
25
0
0
11 Apr 2024
Deep Generative Data Assimilation in Multimodal Setting
Yongquan Qu
Juan Nathaniel
Shuolin Li
Pierre Gentine
3DGS
36
15
0
10 Apr 2024
GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis
Srikumar Sastry
Subash Khanal
Aayush Dhakal
Nathan Jacobs
69
7
0
09 Apr 2024
CodeEnhance: A Codebook-Driven Approach for Low-Light Image Enhancement
Xu Wu
Xianxu Hou
Zhihui Lai
Jie Zhou
Ya-Nan Zhang
Witold Pedrycz
Linlin Shen
48
3
0
08 Apr 2024
CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis
Gyeongjin Kang
Younggeun Lee
Seungjun Oh
Eunbyung Park
VGen
40
1
0
07 Apr 2024
Contextual Chart Generation for Cyber Deception
David D. Nguyen
David Liebowitz
Surya Nepal
S. Kanhere
Sharif Abuadbba
54
0
0
07 Apr 2024
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Shenghai Yuan
Jinfa Huang
Yujun Shi
Yongqi Xu
Ruijie Zhu
Bin Lin
Xinhua Cheng
Li-xin Yuan
Jiebo Luo
VGen
88
34
0
07 Apr 2024
Training LLMs over Neurally Compressed Text
Brian Lester
Jaehoon Lee
A. Alemi
Jeffrey Pennington
Adam Roberts
Jascha Narain Sohl-Dickstein
Noah Constant
45
6
0
04 Apr 2024
SemGrasp: Semantic Grasp Generation via Language Aligned Discretization
Kailin Li
Jingbo Wang
Lixin Yang
Cewu Lu
Bo Dai
51
16
0
04 Apr 2024
Information-Theoretic Generalization Bounds for Deep Neural Networks
Haiyun He
Christina Lee Yu
50
5
0
04 Apr 2024
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction
Keyu Tian
Yi Jiang
Zehuan Yuan
Bingyue Peng
Liwei Wang
VGen
61
268
0
03 Apr 2024
LidarDM: Generative LiDAR Simulation in a Generated World
Vlas Zyrianov
Henry Che
Zhijian Liu
Shenlong Wang
VGen
54
20
0
03 Apr 2024
CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech
Jaehyeon Kim
Keon Lee
Seungjun Chung
Jaewoong Cho
74
42
0
03 Apr 2024
Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model
Xu He
Qiaochu Huang
Zhensong Zhang
Zhiwei Lin
Zhiyong Wu
Sicheng Yang
Minglei Li
Zhiyi Chen
Songcen Xu
Xiaofei Wu
40
15
0
02 Apr 2024
Neuromorphic Wireless Device-Edge Co-Inference via the Directed Information Bottleneck
Yuzhen Ke
Zoran Utkovski
Mehdi Heshmati
Osvaldo Simeone
Johannes Dommel
Sławomir Stańczak
50
2
0
02 Apr 2024
MotionChain: Conversational Motion Controllers via Multimodal Prompts
Biao Jiang
Xin Chen
C. Zhang
Fukun Yin
Zhuoyuan Li
Gang Yu
Jiayuan Fan
VGen
LRM
40
10
0
02 Apr 2024
Distributed and Rate-Adaptive Feature Compression
Aditya Deshmukh
V. Veeravalli
Gunjan Verma
14
0
0
02 Apr 2024
CosmicMan: A Text-to-Image Foundation Model for Humans
Shikai Li
Jianglin Fu
Kaiyuan Liu
Wentao Wang
Kwan-Yee Lin
Wayne Wu
DiffM
47
19
0
01 Apr 2024
LLMs are Good Sign Language Translators
Jia Gong
Lin Geng Foo
Yixuan He
Hossein Rahmani
Jun Liu
SLR
76
25
0
01 Apr 2024
Towards Realistic Scene Generation with LiDAR Diffusion Models
Haoxi Ran
Vitor Campagnolo Guizilini
Yue Wang
DiffM
AI4CE
35
25
0
31 Mar 2024
Variational Autoencoders for exteroceptive perception in reinforcement learning-based collision avoidance
T. N. Larsen
Eirik Runde Barlaug
Adil Rasheed
DRL
36
1
0
31 Mar 2024
CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models
Xiang Li
Fan Bu
Ambuj Mehrish
Yingting Li
Jiale Han
Bo Cheng
Soujanya Poria
DiffM
40
6
0
31 Mar 2024
LLMs are Good Action Recognizers
Haoxuan Qu
Yujun Cai
Jun Liu
45
16
0
31 Mar 2024
Transformer based Pluralistic Image Completion with Reduced Information Loss
Qiankun Liu
Yuqi Jiang
Zhentao Tan
DongDong Chen
Ying Fu
Qi Chu
Gang Hua
Nenghai Yu
ViT
73
11
0
31 Mar 2024
Towards Variable and Coordinated Holistic Co-Speech Motion Generation
Yifei Liu
Qiong Cao
Yandong Wen
Huaiguang Jiang
Changxing Ding
SLR
71
15
0
30 Mar 2024