ResearchTrend.AI
Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
    BDL
    SSL
    OCL

Papers citing "Neural Discrete Representation Learning"

50 / 2,793 papers shown
Monocular Identity-Conditioned Facial Reflectance Reconstruction
Xingyu Ren
Jiankang Deng
Yuhao Cheng
Jia Guo
Chao Ma
Yichao Yan
Wenhan Zhu
Xiaokang Yang
3DH
55
3
0
30 Mar 2024
LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion
Pancheng Zhao
Peng Xu
Pengda Qin
Deng-Ping Fan
Zhicheng Zhang
Guoli Jia
Bowen Zhou
Jufeng Yang
43
9
0
30 Mar 2024
Beyond Talking -- Generating Holistic 3D Human Dyadic Motion for Communication
Mingze Sun
Chao Xu
Xinyu Jiang
Yang Liu
Baigui Sun
Ruqi Huang
54
3
0
28 Mar 2024
Learning Sampling Distribution and Safety Filter for Autonomous Driving with VQ-VAE and Differentiable Optimization
Simon Idoko
Basant Sharma
A. K. Singh
32
1
0
28 Mar 2024
BAMM: Bidirectional Autoregressive Motion Model
Ekkasit Pinyoanuntapong
Muhammad Usama Saleem
Pu Wang
Minwoo Lee
Srijan Das
Chong Chen
VGen
42
23
0
28 Mar 2024
Generative Medical Segmentation
Jiayu Huo
Ouyang Xi
Sébastien Ourselin
Rachel Sparks
MedIm
42
1
0
27 Mar 2024
Don't Look into the Dark: Latent Codes for Pluralistic Image Inpainting
Haiwei Chen
Yajie Zhao
DiffM
24
2
0
27 Mar 2024
AID: Attention Interpolation of Text-to-Image Diffusion
Qiyuan He
Jinghao Wang
Ziwei Liu
Angela Yao
DiffM
45
9
0
26 Mar 2024
Neural Embedding Compression For Efficient Multi-Task Earth Observation Modelling
Carlos Gomes
Thomas Brunschwiler
37
0
0
26 Mar 2024
Deepfake Generation and Detection: A Benchmark and Survey
Gan Pei
Jiangning Zhang
Menghan Hu
Zhenyu Zhang
Chengjie Wang
Yunsheng Wu
Guangtao Zhai
Jian Yang
Chunhua Shen
Dacheng Tao
59
28
0
26 Mar 2024
SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer
Rui Zhu
Yingwei Pan
Yehao Li
Ting Yao
Zhenglong Sun
Tao Mei
C. Chen
50
25
0
25 Mar 2024
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild
Puyuan Peng
Po-Yao (Bernie) Huang
Daniel Li
Abdelrahman Mohamed
David Harwath
74
64
0
25 Mar 2024
GLAD: Improving Latent Graph Generative Modeling with Simple Quantization
Van Khoa Nguyen
Yoann Boget
Frantzeska Lavda
Alexandros Kalousis
31
2
0
25 Mar 2024
SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions
Yuda Song
Zehao Sun
Xuanwu Yin
VLM
51
17
0
25 Mar 2024
Pose-Guided Self-Training with Two-Stage Clustering for Unsupervised Landmark Discovery
Siddharth Tourani
Ahmed Alwheibi
Arif Mahmood
Muhammad Haris Khan
DiffM
46
1
0
24 Mar 2024
Contact-aware Human Motion Generation from Textual Descriptions
Sihan Ma
Qiong Cao
Jing Zhang
Dacheng Tao
41
7
0
23 Mar 2024
Improve Cross-domain Mixed Sampling with Guidance Training for Adaptive Segmentation
Wenlve Zhou
Zhiheng Zhou
Tianlei Wang
Delu Zeng
51
0
0
22 Mar 2024
Denoising Diffusion Models for 3D Healthy Brain Tissue Inpainting
Alicia Durrer
J. Wolleb
Florentin Bieder
Paul Friedrich
L. Melie-García
...
Ozgur Yaldizli
Cristina Granziera
Bjoern H. Menze
Philippe C. Cattin
Florian Kofler
MedIm
28
6
0
21 Mar 2024
Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
Sihyun Yu
Weili Nie
De-An Huang
Boyi Li
Jinwoo Shin
A. Anandkumar
VGen
DiffM
36
15
0
21 Mar 2024
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Roberto Henschel
Levon Khachatryan
Daniil Hayrapetyan
Hayk Poghosyan
Vahram Tadevosyan
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
DiffM
VGen
101
79
0
21 Mar 2024
CoMo: Controllable Motion Generation through Language Guided Pose Code Editing
Yiming Huang
Weilin Wan
Yue Yang
Chris Callison-Burch
Mark Yatskar
Lingjie Liu
44
22
0
20 Mar 2024
Learning to Infer Generative Template Programs for Visual Concepts
R. K. Jones
S. Chaudhuri
Daniel E. Ritchie
NAI
BDL
37
2
0
20 Mar 2024
Towards Principled Representation Learning from Videos for Reinforcement Learning
Dipendra Kumar Misra
Akanksha Saran
Tengyang Xie
Alex Lamb
John Langford
SSL
OffRL
45
5
0
20 Mar 2024
DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
Yibo Wang
Ruiyuan Gao
Kai Chen
Kaiqiang Zhou
Yingjie Cai
...
Zhenguo Li
Lihui Jiang
Dit-Yan Yeung
Qiang Xu
Kai Zhang
DiffM
131
21
0
20 Mar 2024
Text-to-3D Shape Generation
Han-Hung Lee
Manolis Savva
Angel X. Chang
37
11
0
20 Mar 2024
Ultra-High-Resolution Image Synthesis with Pyramid Diffusion Model
Jiajie Yang
45
0
0
19 Mar 2024
PostoMETRO: Pose Token Enhanced Mesh Transformer for Robust 3D Human Mesh Recovery
Wendi Yang
Zihang Jiang
Shang Zhao
S. Kevin Zhou
53
0
0
19 Mar 2024
SC-Diff: 3D Shape Completion with Latent Diffusion Models
Juan D. Galvis
Xingxing Zuo
Simon Schaefer
Stefan Leutenegger
DiffM
49
3
0
19 Mar 2024
VQ-NeRV: A Vector Quantized Neural Representation for Videos
Y. Xu
Xiang Feng
Feiwei Qin
Ruiquan Ge
Yong Peng
Changmiao Wang
49
5
0
19 Mar 2024
Advancing Time Series Classification with Multimodal Language Modeling
Mingyue Cheng
Yiheng Chen
Qi Liu
Zhiding Liu
Yucong Luo
AI4TS
45
11
0
19 Mar 2024
CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility
Bojia Zi
Shihao Zhao
Xianbiao Qi
Jianan Wang
Yukai Shi
Qianyu Chen
Bin Liang
Kam-Fai Wong
Lei Zhang
DiffM
VGen
40
15
0
18 Mar 2024
IDF-CR: Iterative Diffusion Process for Divide-and-Conquer Cloud Removal in Remote-sensing Images
Meilin Wang
Yexing Song
Pengxu Wei
Xiaoyu Xian
Yukai Shi
Liang Lin
DiffM
51
12
0
18 Mar 2024
Spatio-Temporal Fluid Dynamics Modeling via Physical-Awareness and Parameter Diffusion Guidance
Hao Wu
Fan Xu
Yifan Duan
Ziwei Niu
Weiyan Wang
Gaofeng Lu
Kun Wang
Keli Zhang
Yang Wang
DiffM
AI4CE
47
8
0
18 Mar 2024
Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models
Emilian Postolache
Giorgio Mariani
Luca Cosmo
Emmanouil Benetos
Emanuele Rodolà
DiffM
48
9
0
18 Mar 2024
Urban Scene Diffusion through Semantic Occupancy Map
Junge Zhang
Qihang Zhang
Li Zhang
Ramana Rao Kompella
Gaowen Liu
Bolei Zhou
52
4
0
18 Mar 2024
HyperVQ: MLR-based Vector Quantization in Hyperbolic Space
Nabarun Goswami
Yusuke Mukuta
Tatsuya Harada
42
4
0
18 Mar 2024
Boosting Flow-based Generative Super-Resolution Models via Learned Prior
Li-Yuan Tsao
Yi-Chen Lo
Chia-Che Chang
Hao-Wei Chen
Roy Tseng
Chien Feng
Chun-Yi Lee
SupR
41
4
0
16 Mar 2024
Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives
Ronghui Li
YuXiang Zhang
Yachao Zhang
Hongwen Zhang
Jie Guo
Yan Zhang
Yebin Liu
Xiu Li
DiffM
54
28
0
15 Mar 2024
Isotropic3D: Image-to-3D Generation Based on a Single CLIP Embedding
Pengkun Liu
Yikai Wang
Gang Hua
Jiafang Li
Hang Xiao
Hongxiang Xue
Xinzhou Wang
56
8
0
15 Mar 2024
RangeLDM: Fast Realistic LiDAR Point Cloud Generation
Q. Hu
Zhimin Zhang
Wei Hu
DiffM
38
12
0
15 Mar 2024
Codebook Transfer with Part-of-Speech for Vector-Quantized Image Modeling
Baoquan Zhang
Huaibin Wang
Chuyao Luo
Xutao Li
Guotao Liang
Yunming Ye
Xiaochen Qi
Yao He
40
11
0
15 Mar 2024
AD3: Implicit Action is the Key for World Models to Distinguish the Diverse Visual Distractors
Yucen Wang
Shenghua Wan
Le Gan
Shuai Feng
De-Chuan Zhan
VGen
32
4
0
15 Mar 2024
Faceptor: A Generalist Model for Face Perception
Lixiong Qin
Mei Wang
Xuannan Liu
Yuhang Zhang
Weihong Deng
Xiaoshuai Song
Weiran Xu
Weihong Deng
CVBM
37
6
0
14 Mar 2024
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models
Zunnan Xu
Yukang Lin
Haonan Han
Sicheng Yang
Ronghui Li
Yachao Zhang
Xiu Li
Mamba
54
25
0
14 Mar 2024
InfoCon: Concept Discovery with Generative and Discriminative Informativeness
Ruizhe Liu
Qian Luo
Yanchao Yang
49
2
0
14 Mar 2024
GiT: Towards Generalist Vision Transformer through Universal Language Interface
Haiyang Wang
Hao Tang
Li Jiang
Shaoshuai Shi
Muhammad Ferjad Naeem
Hongsheng Li
Bernt Schiele
Liwei Wang
VLM
51
10
0
14 Mar 2024
Towards Faster Training of Diffusion Models: An Inspiration of A Consistency Phenomenon
Tianshuo Xu
Peng Mi
Ruilin Wang
Yingcong Chen
DiffM
49
6
0
14 Mar 2024
UniCode: Learning a Unified Codebook for Multimodal Large Language Models
Sipeng Zheng
Bohan Zhou
Yicheng Feng
Ye Wang
Zongqing Lu
VLM
MLLM
46
7
0
14 Mar 2024
Dyadic Interaction Modeling for Social Behavior Generation
Minh Tran
Di Chang
Maksim Siniukov
Mohammad Soleymani
VGen
45
7
0
14 Mar 2024
Masked Generative Story Transformer with Character Guidance and Caption Augmentation
Christos Papadimitriou
Giorgos Filandrianos
Maria Lymperaiou
Giorgos Stamou
DiffM
102
1
0
13 Mar 2024