Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
    BDL
    SSL
    OCL

Papers citing "Neural Discrete Representation Learning"

50 / 2,814 papers shown
RangeLDM: Fast Realistic LiDAR Point Cloud Generation
Q. Hu
Zhimin Zhang
Wei Hu
DiffM
50
12
0
15 Mar 2024
Codebook Transfer with Part-of-Speech for Vector-Quantized Image Modeling
Baoquan Zhang
Huaibin Wang
Chuyao Luo
Xutao Li
Guotao Liang
Yunming Ye
Xiaochen Qi
Yao He
40
11
0
15 Mar 2024
AD3: Implicit Action is the Key for World Models to Distinguish the Diverse Visual Distractors
Yucen Wang
Shenghua Wan
Le Gan
Shuai Feng
De-Chuan Zhan
VGen
32
4
0
15 Mar 2024
Faceptor: A Generalist Model for Face Perception
Lixiong Qin
Mei Wang
Xuannan Liu
Yuhang Zhang
Weihong Deng
Xiaoshuai Song
Weiran Xu
Weihong Deng
CVBM
37
6
0
14 Mar 2024
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models
Zunnan Xu
Yukang Lin
Haonan Han
Sicheng Yang
Ronghui Li
Yachao Zhang
Xiu Li
Mamba
54
25
0
14 Mar 2024
InfoCon: Concept Discovery with Generative and Discriminative Informativeness
Ruizhe Liu
Qian Luo
Yanchao Yang
61
2
0
14 Mar 2024
GiT: Towards Generalist Vision Transformer through Universal Language Interface
Haiyang Wang
Hao Tang
Li Jiang
Shaoshuai Shi
Muhammad Ferjad Naeem
Hongsheng Li
Bernt Schiele
Liwei Wang
VLM
78
10
0
14 Mar 2024
Towards Faster Training of Diffusion Models: An Inspiration of A Consistency Phenomenon
Tianshuo Xu
Peng Mi
Ruilin Wang
Yingcong Chen
DiffM
49
6
0
14 Mar 2024
UniCode: Learning a Unified Codebook for Multimodal Large Language Models
Sipeng Zheng
Bohan Zhou
Yicheng Feng
Ye Wang
Zongqing Lu
VLM
MLLM
51
7
0
14 Mar 2024
Dyadic Interaction Modeling for Social Behavior Generation
Minh Tran
Di Chang
Maksim Siniukov
Mohammad Soleymani
VGen
47
7
0
14 Mar 2024
Masked Generative Story Transformer with Character Guidance and Caption Augmentation
Christos Papadimitriou
Giorgos Filandrianos
Maria Lymperaiou
Giorgos Stamou
DiffM
102
1
0
13 Mar 2024
VANP: Learning Where to See for Navigation with Self-Supervised Vision-Action Pre-Training
Mohammad Nazeri
Junzhe Wang
Amirreza Payandeh
Xuesu Xiao
SSL
ViT
52
6
0
12 Mar 2024
Beyond Text: Frozen Large Language Models in Visual Signal Comprehension
Lei Zhu
Fangyun Wei
Yanye Lu
MLLM
VLM
57
20
0
12 Mar 2024
Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM
Zeyu Zhang
Akide Liu
Ian Reid
Richard Hartley
Bohan Zhuang
Hao Tang
Mamba
53
63
0
12 Mar 2024
Vector Quantization for Deep-Learning-Based CSI Feedback in Massive MIMO Systems
Junyong Shin
Yujin Kang
Yo-Seb Jeon
19
4
0
12 Mar 2024
BID: Boundary-Interior Decoding for Unsupervised Temporal Action Localization Pre-Training
Qihang Fang
Chengcheng Tang
Shugao Ma
Yanchao Yang
56
1
0
12 Mar 2024
Approaching Rate-Distortion Limits in Neural Compression with Lattice Transform Coding
Eric Lei
Hamed Hassani
Shirin Saeedi Bidokhti
23
3
0
12 Mar 2024
AesopAgent: Agent-driven Evolutionary System on Story-to-Video Production
Jiuniu Wang
Zehua Du
Yuyuan Zhao
Bo Yuan
Kexiang Wang
...
Yihen Lu
Gengliang Li
Junlong Gao
Xin Tu
Zhenyu Guo
LLMAG
VGen
45
7
0
12 Mar 2024
FlowVQTalker: High-Quality Emotional Talking Face Generation through Normalizing Flow and Quantization
Shuai Tan
Bin Ji
Ye Pan
47
15
0
11 Mar 2024
Say Anything with Any Style
Shuai Tan
Bin Ji
Yu Ding
Ye Pan
VGen
DiffM
34
10
0
11 Mar 2024
MACE: Mass Concept Erasure in Diffusion Models
Shilin Lu
Zilan Wang
Leyang Li
Yanzhu Liu
A. Kong
DiffM
52
82
0
10 Mar 2024
HAM-TTS: Hierarchical Acoustic Modeling for Token-Based Zero-Shot Text-to-Speech with Model and Data Scaling
Chunhui Wang
Chang Zeng
Bowen Zhang
Ziyang Ma
Yefan Zhu
Zifeng Cai
Jian Zhao
Zhonglin Jiang
Yong Chen
SyDa
49
5
0
09 Mar 2024
Enhancing Expressiveness in Dance Generation via Integrating Frequency and Music Style Information
Qiaochu Huang
Xu He
Boshi Tang
Hao-Wen Zhuang
Liyang Chen
Shuochen Gao
Zhiyong Wu
Haozhi Huang
Helen M. Meng
50
4
0
09 Mar 2024
tsGT: Stochastic Time Series Modeling With Transformer
Lukasz Kuciński
Witold Drzewakowski
Mateusz Olko
Piotr Kozakowski
Lukasz Maziarka
Marta Emilia Nowakowska
Lukasz Kaiser
Piotr Miłoś
54
1
0
08 Mar 2024
OmniJet-α: The first cross-task foundation model for particle physics
Joschka Birk
Anna Hallin
Gregor Kasieczka
AI4CE
52
22
0
08 Mar 2024
Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation
Joseph Cho
Fachrina Dewi Puspitasari
Sheng Zheng
Jingyao Zheng
Lik-Hang Lee
Tae-Ho Kim
Choong Seon Hong
Chaoning Zhang
EGVM
VGen
46
41
0
08 Mar 2024
Face2Diffusion for Fast and Editable Face Personalization
Kaede Shiohara
Toshihiko Yamasaki
DiffM
24
11
0
08 Mar 2024
UniTable: Towards a Unified Framework for Table Recognition via Self-Supervised Pretraining
Sheng-Hsuan Peng
Aishwarya Chakravarthy
Seongmin Lee
Xiaojing Wang
Rajarajeswari Balasubramaniyan
Duen Horng Chau
LMTD
56
0
0
07 Mar 2024
Extreme Precipitation Nowcasting using Transformer-based Generative Models
Cristian Meo
Ankush Roy
Mircea Lica
Junzhe Yin
Zeineb Bou Che
Yanbo Wang
R. Imhoff
R. Uijlenhoet
Justin Dauwels
31
3
0
06 Mar 2024
Towards Controllable Time Series Generation
Yifan Bao
Yihao Ang
Qiang Huang
Anthony K. H. Tung
Zhiyong Huang
DiffM
59
4
0
06 Mar 2024
Behavior Generation with Latent Actions
Seungjae Lee
Yibin Wang
Haritheja Etukuru
H. J. Kim
Mahi Shafiullah
Lerrel Pinto
VGen
OffRL
40
66
0
05 Mar 2024
Deep-Learned Compression for Radio-Frequency Signal Classification
Armani Rodriguez
Yagna Kaasaragadda
S. Kokalj-Filipovic
36
1
0
05 Mar 2024
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Zeqian Ju
Yuancheng Wang
Kai Shen
Xu Tan
Detai Xin
...
Shikun Zhang
Jiang Bian
Lei He
Jinyu Li
Sheng Zhao
DiffM
54
150
0
05 Mar 2024
VQSynery: Robust Drug Synergy Prediction With Vector Quantization Mechanism
Jiawei Wu
Mingyuan Yan
Dianbo Liu
45
2
0
05 Mar 2024
World Models for Autonomous Driving: An Initial Survey
Yanchen Guan
Haicheng Liao
Zhenning Li
Jia Hu
Runze Yuan
Yunjian Li
Guohui Zhang
Chengzhong Xu
72
33
0
05 Mar 2024
UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control
Xuweiyi Chen
Tian Xia
Sihan Xu
VGen
DiffM
40
7
0
04 Mar 2024
ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models
Jiaxiang Cheng
Pan Xie
Xin Xia
Jiashi Li
Jie Wu
Yuxi Ren
Huixia Li
Xuefeng Xiao
Min Zheng
Lean Fu
51
12
0
04 Mar 2024
HyperSDFusion: Bridging Hierarchical Structures in Language and Geometry for Enhanced 3D Text2Shape Generation
Zhiying Leng
Tolga Birdal
Xiaohui Liang
Federico Tombari
81
3
0
01 Mar 2024
CustomListener: Text-guided Responsive Interaction for User-friendly Listening Head Generation
Xi Liu
Ying Guo
Cheng Zhen
Tong Li
Yingying Ao
Pengfei Yan
DiffM
81
4
0
01 Mar 2024
Large Convolutional Model Tuning via Filter Subspace
Wei Chen
Zichen Miao
Qiang Qiu
65
3
0
01 Mar 2024
A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation
Hanxi Li
Zhengxun Zhang
Hao Chen
Lin Wu
Bo Li
Deyin Liu
Mingwen Wang
57
2
0
29 Feb 2024
ProtoP-OD: Explainable Object Detection with Prototypical Parts
Pavlos Rath-Manakidis
Frederik Strothmann
Tobias Glasmachers
Laurenz Wiskott
ViT
45
1
0
29 Feb 2024
Uncertainty-Based Extensible Codebook for Discrete Federated Learning in Heterogeneous Data Silos
Tianyi Zhang
Yu Cao
Dianbo Liu
FedML
29
0
0
29 Feb 2024
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Hany Hamed
Subin Kim
Dongyeong Kim
Jaesik Yoon
Sungjin Ahn
64
4
0
29 Feb 2024
Generalizability Under Sensor Failure: Tokenization + Transformers Enable More Robust Latent Spaces
Geeling Chau
Yujin An
Ahamed Raffey Iqbal
Soon-Jo Chung
Yisong Yue
Sabera Talukder
OOD
51
4
0
28 Feb 2024
Latent Neural PDE Solver: a reduced-order modelling framework for partial differential equations
Zijie Li
Saurabh Patil
Francis Ogoke
Dule Shu
Wilson Zhen
Michael Schneier
John R. Buchanan
A. Farimani
AI4CE
45
5
0
27 Feb 2024
Rethinking Mutual Information for Language Conditioned Skill Discovery on Imitation Learning
Zhaoxun Ju
Chao Yang
Hongbo Wang
Yu Qiao
Gang Hua
LM&Ro
54
3
0
27 Feb 2024
BiVRec: Bidirectional View-based Multimodal Sequential Recommendation
Jiaxi Hu
Jingtong Gao
Xiangyu Zhao
Yuehong Hu
Yuxuan Liang
Yiqi Wang
Ming He
Zitao Liu
Hongzhi Yin
HAI
63
1
0
27 Feb 2024
Inpainting Computational Fluid Dynamics with Deep Learning
Dule Shu
Wilson Zhen
Zijie Li
A. Farimani
AI4CE
44
0
0
27 Feb 2024
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
Yixin Liu
Kai Zhang
Yuan Li
Zhiling Yan
Chujie Gao
...
Yue Huang
Hanchi Sun
Jianfeng Gao
Lifang He
Lichao Sun
VLM
VGen
EGVM
84
264
0
27 Feb 2024