ResearchTrend.AI
Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
    BDL
    SSL
    OCL

Papers citing "Neural Discrete Representation Learning"

50 / 2,785 papers shown
Title
Learning Global Object-Centric Representations via Disentangled Slot Attention
Tonglin Chen
Yinxuan Huang
Zhimeng Shen
Jinghao Huang
Bin Li
Xiangyang Xue
OCL
46
1
0
24 Oct 2024
Beyond Color and Lines: Zero-Shot Style-Specific Image Variations with Coordinated Semantics
Jinghao Hu
Yuhe Zhang
Guohua Geng
Liuyuxin Yang
JiaRui Yan
Jingtao Cheng
YaDong Zhang
Kang Li
DiffM
43
0
0
24 Oct 2024
Structure Language Models for Protein Conformation Generation
Jiarui Lu
Xiaoyin Chen
Stephen Zhewen Lu
Chence Shi
Hongyu Guo
Yoshua Bengio
Xiangbo Shu
DiffM
44
2
0
24 Oct 2024
Bio2Token: All-atom tokenization of any biomolecular structure with Mamba
Andrew Liu
Axel Elaldi
Nathan Russell
Olivia Viessmann
Mamba
68
3
0
24 Oct 2024
Diffusion Attribution Score: Evaluating Training Data Influence in Diffusion Models
Jinxu Lin
Linwei Tao
Minjing Dong
Chang Xu
TDI
46
2
0
24 Oct 2024
Multi-Scale Diffusion: Enhancing Spatial Layout in High-Resolution Panoramic Image Generation
Xiaoyu Zhang
Teng Zhou
Xinlong Zhang
Jia Wei
Yongchuan Tang
52
1
0
24 Oct 2024
Augmenting Training Data with Vector-Quantized Variational Autoencoder for Classifying RF Signals
Srihari Kamesh Kompella
Kemal Davaslioglu
Y. Sagduyu
Sastry Kompella
21
1
0
23 Oct 2024
Deep Generative Models for 3D Medical Image Synthesis
Paul Friedrich
Yannik Frisch
P. Cattin
3DV
MedIm
39
3
0
23 Oct 2024
Conjuring Semantic Similarity
Tian Yu Liu
Stefano Soatto
DiffM
32
0
0
21 Oct 2024
Elucidating the design space of language models for image generation
Xuantong Liu
Shaozhe Hao
Xianbiao Qi
Tianyang Hu
Jun Wang
Rong Xiao
Yuan Yao
VLM
40
3
0
21 Oct 2024
SeisLM: a Foundation Model for Seismic Waveforms
Tianlin Liu
Jannes Münchmeyer
Laura Laurenti
C. Marone
Maarten V. de Hoop
Ivan Dokmanić
VLM
28
4
0
21 Oct 2024
Object-Centric Temporal Consistency via Conditional Autoregressive Inductive Biases
Cristian Meo
Akihiro Nakano
Mircea Lica
Aniket Didolkar
Masahiro Suzuki
Anirudh Goyal
Mengmi Zhang
Justin Dauwels
Y. Matsuo
Yoshua Bengio
OCL
46
2
0
21 Oct 2024
Residual vector quantization for KV cache compression in large language model
Ankur Kumar
MQ
36
0
0
21 Oct 2024
LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec
Yiwei Guo
Zhihan Li
Chenpeng Du
Hankun Wang
Xie Chen
Kai Yu
41
1
0
21 Oct 2024
Improving Voice Quality in Speech Anonymization With Just Perception-Informed Losses
Suhita Ghosh
Tim Thiele
Frederic Lorbeer
Frank Dreyer
Sebastian Stober
40
0
0
20 Oct 2024
BrainECHO: Semantic Brain Signal Decoding through Vector-Quantized Spectrogram Reconstruction for Whisper-Enhanced Text Generation
Juntao Li
Zhenxi Song
Jiaqi Wang
Meishan Zhang
Honghai Liu
Min Zhang
Zhiguo Zhang
40
1
0
19 Oct 2024
BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities
Shaozhe Hao
Xuantong Liu
Xianbiao Qi
Shihao Zhao
Bojia Zi
Rong Xiao
Kai Han
Kwan-Yee K. Wong
47
3
0
18 Oct 2024
LEAD: Latent Realignment for Human Motion Diffusion
Nefeli Andreou
Xi Wang
Victoria Fernandez-Abrevaya
Marie-Paule Cani
Y. Chrysanthou
Vicky Kalogeiton
VGen
DiffM
37
2
0
18 Oct 2024
SNAC: Multi-Scale Neural Audio Codec
Hubert Siuzdak
Florian Grötschla
Luca A. Lanzendörfer
27
12
0
18 Oct 2024
Assistive AI for Augmenting Human Decision-making
Natabara Máté Gyöngyössy
Bernát Török
Csilla Farkas
Laura Lucaj
Attila Menyhárd
Krisztina Menyhárd-Balázs
András Simonyi
Patrick van der Smagt
Zsolt Ződi
András Lőrincz
41
0
0
18 Oct 2024
HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image Generation
Bo Cheng
Yuhang Ma
Liebucha Wu
Shanyuan Liu
Ao Ma
Xiaoyu Wu
Dawei Leng
Yuhui Yin
DiffM
35
8
0
18 Oct 2024
A Complexity-Based Theory of Compositionality
Eric Elmoznino
Thomas Jiralerspong
Yoshua Bengio
Guillaume Lajoie
CoGe
66
5
0
18 Oct 2024
Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens
Lijie Fan
Tianhong Li
Siyang Qin
Yuanzhen Li
Chen Sun
Michael Rubinstein
Deqing Sun
Kaiming He
Yonglong Tian
VLM
DiffM
50
43
0
17 Oct 2024
MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations
Liang Xu
Shaoyang Hua
Zili Lin
Yifan Liu
Feipeng Ma
Yichao Yan
Xin Jin
Xiaokang Yang
Wenjun Zeng
VGen
44
3
0
17 Oct 2024
DPLM-2: A Multimodal Diffusion Protein Language Model
Xinze Wang
Zaixiang Zheng
Fei Ye
Dongyu Xue
Shujian Huang
Quanquan Gu
33
14
0
17 Oct 2024
L3DG: Latent 3D Gaussian Diffusion
Barbara Roessle
Norman Muller
Lorenzo Porzi
Samuel Rota Buló
Peter Kontschieder
Angela Dai
Matthias Nießner
3DGS
52
12
0
17 Oct 2024
DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech
J. Melechovský
Ambuj Mehrish
Berrak Sisman
Dorien Herremans
21
2
0
17 Oct 2024
The Latent Road to Atoms: Backmapping Coarse-grained Protein Structures with Latent Diffusion
Xu Han
Yuancheng Sun
Kai Chen
Kang Liu
Qiwei Ye
DiffM
AI4CE
36
0
0
17 Oct 2024
GeSubNet: Gene Interaction Inference for Disease Subtype Network Generation
Ziwei Yang
Zheng Chen
Xin Liu
Rikuto Kotoge
Peng Chen
Yasuko Matsubara
Yasushi Sakurai
Jimeng Sun
34
0
0
17 Oct 2024
Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
Yongxin Zhu
Bing Li
Hang Zhang
Xin Li
Linli Xu
Lidong Bing
DiffM
44
9
0
16 Oct 2024
FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio Generation
Huadai Liu
Jialei Wang
Rongjie Huang
Yang Liu
H. Lu
Wei Xue
Zhou Zhao
13
3
0
16 Oct 2024
Analysis and Benchmarking of Extending Blind Face Image Restoration to Videos
Zhouxia Wang
Jiawei Zhang
Xintao Wang
Tianshui Chen
Y. Shan
Wei Wang
Ping Luo
CVBM
42
0
0
15 Oct 2024
Simultaneous Diffusion Sampling for Conditional LiDAR Generation
Ryan Faulkner
Luke Haub
Simon Ratcliffe
Anh-Dzung Doan
Ian Reid
Tat-Jun Chin
35
0
0
15 Oct 2024
Latent Action Pretraining from Videos
Seonghyeon Ye
Joel Jang
Byeongguk Jeon
Sejune Joo
Jianwei Yang
...
Kimin Lee
J. Gao
Luke Zettlemoyer
Dieter Fox
Minjoon Seo
40
28
0
15 Oct 2024
Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling
Wenze Liu
Le Zhuo
Yi Xin
Sheng Xia
Peng Gao
Xiangyu Yue
42
7
0
14 Oct 2024
LADMIM: Logical Anomaly Detection with Masked Image Modeling in Discrete Latent Space
Shunsuke Sakai
Tatushito Hasegawa
Makoto Koshino
30
1
0
14 Oct 2024
Gaussian Mixture Vector Quantization with Aggregated Categorical Posterior
Mingyuan Yan
Jiawei Wu
Rushi Shah
Dianbo Liu
28
0
0
14 Oct 2024
Code Drift: Towards Idempotent Neural Audio Codecs
P. O'Reilly
Prem Seetharaman
Jiaqi Su
Zeyu Jin
Bryan Pardo
208
0
0
14 Oct 2024
VQ-CNMP: Neuro-Symbolic Skill Learning for Bi-Level Planning
Hakan Aktas
Emre Ugur
26
1
0
13 Oct 2024
EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models
Eungbean Lee
Somi Jeong
Kwanghoon Sohn
DiffM
35
1
0
13 Oct 2024
Towards Homogeneous Lexical Tone Decoding from Heterogeneous Intracranial Recordings
Di Wu
Siyuan Li
Chen Feng
Lu Cao
Yuyao Zhang
Jie Yang
Mohamad Sawan
33
0
0
13 Oct 2024
Bridging Text and Image for Artist Style Transfer via Contrastive Learning
Zhi-Song Liu
Li-Wen Wang
Jun Xiao
Vicky Kalogeiton
CLIP
VLM
38
0
0
12 Oct 2024
Towards Scalable Semantic Representation for Recommendation
Taolin Zhang
Junwei Pan
Jinqiao Wang
Yaohua Zha
Tao Dai
...
Xiaoxiang Deng
Yuan Wang
Ming Yue
Jie Jiang
Shu-Tao Xia
56
2
0
12 Oct 2024
Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment
Huayu Chen
Hang Su
Peize Sun
Jun Zhu
VLM
56
3
0
12 Oct 2024
Enhancing Motion Variation in Text-to-Motion Models via Pose and Video Conditioned Editing
Clayton Frederick Souza Leite
Yu Xiao
VGen
23
0
0
11 Oct 2024
CryoFM: A Flow-based Foundation Model for Cryo-EM Densities
Yi Zhou
Yilai Li
Jing Yuan
Quanquan Gu
35
1
0
11 Oct 2024
Score Neural Operator: A Generative Model for Learning and Generalizing Across Multiple Probability Distributions
Xinyu Liao
Aoyang Qin
Jacob H. Seidman
Junqi Wang
Wei Wang
P. Perdikaris
DiffM
33
0
0
11 Oct 2024
Distillation of Discrete Diffusion through Dimensional Correlations
Satoshi Hayakawa
Yuhta Takida
Masaaki Imaizumi
Hiromi Wakaki
Yuki Mitsufuji
DiffM
61
1
0
11 Oct 2024
Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient
Wenlong Wang
Ivana Dusparic
Yucheng Shi
Ke Zhang
Vinny Cahill
Mamba
236
0
0
11 Oct 2024
Generalizable autoregressive modeling of time series through functional narratives
Ran Liu
Wenrui Ma
Ellen L. Zippi
Hadi Pouransari
Jingyun Xiao
...
Behrooz Mahasseni
Juri Minxha
Erdrin Azemi
Eva L. Dyer
Ali Moin
AI4TS
48
1
0
10 Oct 2024