Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
    BDL
    SSL
    OCL

Papers citing "Neural Discrete Representation Learning"

50 / 2,748 papers shown
CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers
D. She
Mushui Liu
Jingxuan Pang
Jin Wang
Zhen Yang
...
Yi Wang
Qihan Huang
Haobin Tang
YunLong Yu
Siming Fu
VGen
96
4
0
21 Feb 2025
Animate Your Thoughts: Decoupled Reconstruction of Dynamic Natural Vision from Slow Brain Activity
Yizhuo Lu
Changde Du
Chong Wang
Xuanliu Zhu
Liuyun Jiang
Xujin Li
Huiguang He
VGen
125
4
0
20 Feb 2025
From Principles to Applications: A Comprehensive Survey of Discrete Tokenizers in Generation, Comprehension, Recommendation, and Information Retrieval
Jian Jia
Jingtong Gao
Ben Xue
Junhao Wang
Qingpeng Cai
Quan Chen
Xiangyu Zhao
Peng Jiang
Kun Gai
OffRL
77
0
0
18 Feb 2025
Architect of the Bits World: Masked Autoregressive Modeling for Circuit Generation Guided by Truth Table
Haoyuan Wu
Haisheng Zheng
Shoubo Hu
Zhuolun He
Bei Yu
53
0
0
18 Feb 2025
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
Sihyun Yu
Meera Hahn
Dan Kondratyuk
Jinwoo Shin
Agrim Gupta
José Lezama
Irfan Essa
David A. Ross
Jonathan Huang
DiffM
VGen
77
0
0
18 Feb 2025
MagicArticulate: Make Your 3D Models Articulation-Ready
Chaoyue Song
Jianfeng Zhang
Xiu Li
Fan Yang
Yiwen Chen
...
Jun Hao Liew
Xiaoyang Guo
Fayao Liu
Jiashi Feng
Guosheng Lin
74
1
0
17 Feb 2025
Leader and Follower: Interactive Motion Generation under Trajectory Constraints
Runqi Wang
Caoyuan Ma
Jian Zhao
Hanrui Xu
Dongfang Sun
Haoyang Chen
Lin Xiong
Z. Wang
Xianrui Li
VGen
51
0
0
17 Feb 2025
MARS: Mesh AutoRegressive Model for 3D Shape Detailization
Jingnan Gao
Weizhe Liu
Weixuan Sun
Senbo Wang
Xibin Song
...
Shenzhou Chen
Hongdong Li
Xiaoyu Yang
Yichao Yan
Pan Ji
82
2
0
17 Feb 2025
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling
Theodoros Kouzelis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
DRL
80
5
0
17 Feb 2025
Designing a Conditional Prior Distribution for Flow-Based Generative Models
Noam Issachar
Mohammad Salama
Raanan Fattal
Sagie Benaim
91
0
0
13 Feb 2025
TokenSynth: A Token-based Neural Synthesizer for Instrument Cloning and Text-to-Instrument
Kyungsu Kim
Junghyun Koo
Sungho Lee
Haesun Joung
Kyogu Lee
58
0
0
13 Feb 2025
Towards Virtual Clinical Trials of Radiology AI with Conditional Generative Modeling
Benjamin Killeen
Bohua Wan
Aditya V. Kulkarni
Nathan G. Drenkow
Michael Oberst
Paul H. Yi
Mathias Unberath
MedIm
67
0
0
13 Feb 2025
HistoSmith: Single-Stage Histology Image-Label Generation via Conditional Latent Diffusion for Enhanced Cell Segmentation and Classification
V. Vadori
Jean-Marie Graic
A. Peruffo
L. Finos
Ujwala Kiran Chaudhari
Enrico Grisan
DiffM
MedIm
105
0
0
12 Feb 2025
Image Watermarking of Generative Diffusion Models
Yunzhuo Chen
J. Vice
Naveed Akhtar
Nur Al Hasan Haldar
Ajmal Mian
WIGM
56
0
0
12 Feb 2025
PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning
Angel Villar-Corrales
Sven Behnke
85
3
0
11 Feb 2025
Fast and Accurate Antibody Sequence Design via Structure Retrieval
Xingyi Zhang
Kun Xie
Ningqiao Huang
Wei Liu
Peilin Zhao
Sibo Wang
Kangfei Zhao
Biaobin Jiang
46
0
0
11 Feb 2025
The Case for Cleaner Biosignals: High-fidelity Neural Compressor Enables Transfer from Cleaner iEEG to Noisier EEG
Francesco Stefano Carzaniga
Gary Tom Hoppeler
Michael Hersche
Kaspar Anton Schindler
Abbas Rahimi
51
0
0
10 Feb 2025
Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene
Tai-Yu Pan
Sooyoung Jeon
Mengdi Fan
Jinsu Yoo
Zhenyang Feng
Mark E. Campbell
Kilian Q. Weinberger
Bharath Hariharan
Wei-Lun Chao
106
0
0
10 Feb 2025
UniMoD: Efficient Unified Multimodal Transformers with Mixture-of-Depths
Weijia Mao
Zhengyuan Yang
Mike Zheng Shou
MoE
78
0
0
10 Feb 2025
Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance
Li Hu
Guangyuan Wang
Zhen Shen
Xin Gao
Dechao Meng
Lian Zhuo
Peng Zhang
Bang Zhang
Liefeng Bo
DiffM
VGen
101
9
0
10 Feb 2025
MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation
Zhengyuan Yang
Keyang Lu
Chao Zhang
Jiaxing Qi
Hanqi Jiang
...
Yifan Xu
Mingzhe Xing
Zhen Xiao
Jieyi Long
Xiangde Liu
58
4
0
09 Feb 2025
IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Wei Deng
Siyi Zhou
Jingchen Shu
Jinchao Wang
Lu Wang
VLM
47
1
0
08 Feb 2025
MoFM: A Large-Scale Human Motion Foundation Model
Mohammadreza Baharani
Ghazal Alinezhad Noghre
Armin Danesh Pazho
Gabriel Maldonado
Hamed Tabkhi
AI4CE
188
1
0
08 Feb 2025
L2GNet: Optimal Local-to-Global Representation of Anatomical Structures for Generalized Medical Image Segmentation
Vandan Gorade
Sparsh Mittal
N. Dasu
Rekha Singhal
KC Santosh
Debesh Jha
36
0
0
06 Feb 2025
Beyond the Final Layer: Hierarchical Query Fusion Transformer with Agent-Interpolation Initialization for 3D Instance Segmentation
Jiahao Lu
Jiacheng Deng
Tianzhu Zhang
90
2
0
06 Feb 2025
AudioMiXR: Spatial Audio Object Manipulation with 6DoF for Sound Design in Augmented Reality
Brandon Woodard
Margarita Geleta
Joseph J. LaViola Jr.
Andrea Fanelli
Rhonda Wilson
57
1
0
05 Feb 2025
LoCA: Location-Aware Cosine Adaptation for Parameter-Efficient Fine-Tuning
Zhekai Du
Yinjie Min
Jingjing Li
Ke Lu
Changliang Zou
Liuhua Peng
Tingjin Chu
Mingming Gong
186
1
0
05 Feb 2025
BRIDLE: Generalized Self-supervised Learning with Quantization
Hoang M. Nguyen
Satya Narayan Shukla
Qiang Zhang
Hanchao Yu
Sreya D. Roy
Taipeng Tian
Lingjiong Zhu
Yuchen Liu
SSL
MQ
84
0
0
04 Feb 2025
Particle Trajectory Representation Learning with Masked Point Modeling
Sam Young
Yeon-jae Jwa
Kazuhiro Terao
3DPC
69
1
0
04 Feb 2025
CASIM: Composite Aware Semantic Injection for Text to Motion Generation
Che-Jui Chang
Qingze Tony Liu
H. Zhou
Vladimir Pavlovic
Mubbasir Kapadia
107
0
0
04 Feb 2025
ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling
Yi-Chiao Wu
Dejan Marković
Steven Krenn
I. D. Gebru
Alexander Richard
66
0
0
04 Feb 2025
UVGS: Reimagining Unstructured 3D Gaussian Splatting using UV Mapping
Aashish Rai
Dilin Wang
Mihir Jain
N. Sarafianos
Arthur Chen
Srinath Sridhar
Aayush Prakash
3DGS
74
1
0
03 Feb 2025
Categorical Schrödinger Bridge Matching
Grigoriy Ksenofontov
Alexander Korotin
61
0
0
03 Feb 2025
Turn That Frown Upside Down: FaceID Customization via Cross-Training Data
Shuhe Wang
Xiaoya Li
Xiaofei Sun
G. Wang
Tianwei Zhang
Jiwei Li
Eduard H. Hovy
38
0
0
28 Jan 2025
Decrypting the temperature field in flow boiling with latent diffusion models
UngJin Na
JunYoung Seo
Taeil Kim
ByongGuk Jeon
H. Jo
DiffM
AI4CE
45
0
0
27 Jan 2025
CGI: Identifying Conditional Generative Models with Example Images
Zhi-Hua Zhou
Hao-Zhe Tan
Peng-Xiao Song
Lan-Zhe Guo
DiffM
42
0
0
23 Jan 2025
SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language Modeling
Shengshi Yao
Jincheng Dai
Xiaoqi Qin
Sixian Wang
Siye Wang
K. Niu
Ping Zhang
38
0
0
22 Jan 2025
Taming Teacher Forcing for Masked Autoregressive Video Generation
Deyu Zhou
Quan Sun
Yuang Peng
Kun Yan
Runpei Dong
...
Zheng Ge
Nan Duan
Xiangyu Zhang
L. Ni
H. Shum
VGen
54
7
0
21 Jan 2025
Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving
Yu Yang
Jianbiao Mei
Yukai Ma
Siliang Du
Wenqing Chen
Yijie Qian
Yuxiang Feng
Yong-jin Liu
92
11
0
20 Jan 2025
Simplified and Generalized Masked Diffusion for Discrete Data
Jiaxin Shi
Kehang Han
Zehao Wang
Arnaud Doucet
Michalis K. Titsias
DiffM
85
63
0
17 Jan 2025
Patch-aware Vector Quantized Codebook Learning for Unsupervised Visual Defect Detection
Qisen Cheng
Shuhui Qu
Janghwan Lee
55
4
0
17 Jan 2025
Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
Dongwon Kim
Ju He
Qihang Yu
Chenglin Yang
Xiaohui Shen
Suha Kwak
Liang-Chieh Chen
VLM
54
6
0
13 Jan 2025
Synthetic Prior for Few-Shot Drivable Head Avatar Inversion
Wojciech Zielonka
Stephan Garbin
Alexandros Lattas
George Kopanas
Paulo F. U. Gotardo
Thabo Beeler
Justus Thies
Timo Bolkart
72
2
0
12 Jan 2025
AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
Samir Sadok
Simon Leglaive
Laurent Girin
Gaël Richard
Xavier Alameda-Pineda
55
1
0
10 Jan 2025
EditAR: Unified Conditional Generation with Autoregressive Models
Jiteng Mu
Nuno Vasconcelos
Xinyu Wang
DiffM
43
5
0
08 Jan 2025
ZSVC: Zero-shot Style Voice Conversion with Disentangled Latent Diffusion Models and Adversarial Training
Xinfa Zhu
Lei He
Yujia Xiao
Xi Wang
Xu Tan
Sheng Zhao
Lei Xie
DiffM
40
0
0
08 Jan 2025
Human Grasp Generation for Rigid and Deformable Objects with Decomposed VQ-VAE
Mengshi Qi
Zhe Zhao
Huadong Ma
46
1
0
08 Jan 2025
Learning the Language of Protein Structure
Benoit Gaujac
Jérémie Donà
Liviu Copoiu
Timothy Atkinson
Thomas Pierrot
Thomas D. Barrett
69
10
0
08 Jan 2025
Abstracted Shapes as Tokens -- A Generalizable and Interpretable Model for Time-series Classification
Yunshi Wen
Tengfei Ma
Tsui-Wei Weng
Lam M. Nguyen
A. Julius
AI4TS
45
1
0
08 Jan 2025
Remote Inference over Dynamic Links via Adaptive Rate Deep Task-Oriented Vector Quantization
Eyal Fishel
M. Malka
Shai Ginzach
Nir Shlezinger
39
0
0
07 Jan 2025