ResearchTrend.AI
Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
    BDL
    SSL
    OCL

Papers citing "Neural Discrete Representation Learning"

50 / 2,751 papers shown
CAT: Content-Adaptive Image Tokenization
Junhong Shen
Kushal Tirumala
Michihiro Yasunaga
Ishan Misra
Luke Zettlemoyer
Lili Yu
Chunting Zhou
35
0
0
06 Jan 2025
Qinco2: Vector Compression and Search with Improved Implicit Neural Codebooks
Théophane Vallaeys
Matthew Muckley
Jakob Verbeek
Matthijs Douze
MQ
43
2
0
06 Jan 2025
Passive Non-Line-of-Sight Imaging with Light Transport Modulation
Jiarui Zhang
Ruixu Geng
Xiaolong Du
Yan Chen
Houqiang Li
Yang Hu
78
1
0
03 Jan 2025
TOTEM: TOkenized Time Series EMbeddings for General Time Series Analysis
Sabera Talukder
Yisong Yue
Georgia Gkioxari
AI4TS
51
12
0
03 Jan 2025
Disentangling data distribution for Federated Learning
Xinyuan Zhao
Hanlin Gu
Lixin Fan
Qiang Yang
Yuxing Han
OOD
FedML
44
0
0
31 Dec 2024
Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration
Wanglong Lu
Jikai Wang
Tao Wang
Kaihao Zhang
Xianta Jiang
Hanli Zhao
DiffM
52
1
0
31 Dec 2024
Stable-TTS: Stable Speaker-Adaptive Text-to-Speech Synthesis via Prosody Prompting
Wooseok Han
Minki Kang
Changhun Kim
Eunho Yang
43
0
0
31 Dec 2024
DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT
Xiaotao Hu
Wei Yin
Mingkai Jia
Junyuan Deng
Xiaoyang Guo
Qian Zhang
Xiaoxiao Long
Ping Tan
VGen
53
10
0
31 Dec 2024
Advanced Knowledge Transfer: Refined Feature Distillation for Zero-Shot Quantization in Edge Computing
Inpyo Hong
Youngwan Jo
Hyojeong Lee
Sunghyun Ahn
Sanghyun Park
MQ
49
1
0
26 Dec 2024
Bridging Interpretability and Robustness Using LIME-Guided Model Refinement
Navid Nayyem
Abdullah Rakin
Longwei Wang
AAML
FAtt
63
0
0
25 Dec 2024
DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers
Yuntao Chen
Yuqi Wang
Zhaoxiang Zhang
189
7
0
24 Dec 2024
Hierarchical Vector Quantization for Unsupervised Action Segmentation
Federico Spurio
Emad Bahrami
Gianpiero Francesca
Juergen Gall
44
0
0
23 Dec 2024
When Worse is Better: Navigating the compression-generation tradeoff in visual tokenization
Vivek Ramanujan
Kushal Tirumala
Armen Aghajanyan
Luke Zettlemoyer
Ali Farhadi
DiffM
76
2
0
20 Dec 2024
Next Patch Prediction for Autoregressive Visual Generation
Yatian Pang
Peng Jin
Shuo Yang
Bin Lin
Bin Zhu
...
Liuhan Chen
Francis E. H. Tay
Ser-Nam Lim
Harry Yang
Li Yuan
129
9
0
19 Dec 2024
Parallelized Autoregressive Visual Generation
Yanjie Wang
Shuhuai Ren
Zhijie Lin
Yujin Han
Haoyuan Guo
Zhenheng Yang
Difan Zou
Jiashi Feng
Xihui Liu
VGen
90
12
0
19 Dec 2024
Learning from Massive Human Videos for Universal Humanoid Pose Control
Jiageng Mao
Siheng Zhao
Siqi Song
Tianheng Shi
Junjie Ye
Mingtong Zhang
Haoran Geng
Jitendra Malik
Vitor Campagnolo Guizilini
Yue Wang
98
5
0
18 Dec 2024
Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation
J. Zhang
Li Zhang
Shijian Li
VLM
83
0
0
18 Dec 2024
Future Research Avenues for Artificial Intelligence in Digital Gaming: An Exploratory Report
Markus Dablander
82
0
0
18 Dec 2024
Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models
Xinghang Li
Peiyan Li
Minghuan Liu
Dong Wang
Jirong Liu
Bingyi Kang
Xiao Ma
Tao Kong
Hanbo Zhang
Huaping Liu
LM&Ro
99
18
0
18 Dec 2024
SongEditor: Adapting Zero-Shot Song Generation Language Model as a Multi-Task Editor
Chenyu Yang
Shuai Wang
Hangting Chen
Jianwei Yu
Wei Tan
Rongzhi Gu
Yongjun Xu
Yizhi Zhou
Haina Zhu
Yiming Li
KELM
197
1
0
18 Dec 2024
An Efficient Occupancy World Model via Decoupled Dynamic Flow and Image-assisted Training
Haiming Zhang
Ying Xue
Xu Yan
Jiacheng Zhang
Weichao Qiu
Dongfeng Bai
Bingbing Liu
Shuguang Cui
Zehan Li
78
5
0
18 Dec 2024
Model Decides How to Tokenize: Adaptive DNA Sequence Tokenization with MxDNA
Lifeng Qiao
Peng Ye
Yuchen Ren
Weiqiang Bai
Chaoqi Liang
Xinzhu Ma
Nanqing Dong
W. Ouyang
86
2
0
18 Dec 2024
Self-control: A Better Conditional Mechanism for Masked Autoregressive Model
Qiaoying Qu
Shiyu Shen
DiffM
81
0
0
18 Dec 2024
LaMI-GO: Latent Mixture Integration for Goal-Oriented Communications Achieving High Spectrum Efficiency
Achintha Wijesinghe
Suchinthaka Wanninayaka
Weiwei Wang
Yu-Chieh Chao
Songyang Zhang
Zhi Ding
47
1
0
18 Dec 2024
MeshArt: Generating Articulated Meshes with Structure-guided Transformers
Daoyi Gao
Yawar Siddiqui
Lei Li
Angela Dai
121
3
0
16 Dec 2024
CLIP-SR: Collaborative Linguistic and Image Processing for Super-Resolution
Bingwen Hu
Heng Liu
Zhedong Zheng
Ping Liu
SupR
86
0
0
16 Dec 2024
ViSymRe: Vision-guided Multimodal Symbolic Regression
Da Li
Junping Yin
Jin Xu
Xinxin Li
Juan Zhang
85
1
0
15 Dec 2024
SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation
Hang Zhang
Zhuoling Li
Jun Liu
LRM
100
1
0
15 Dec 2024
Diffusion Model from Scratch
Wang Zhen
Dong Yunyun
DiffM
70
0
0
14 Dec 2024
Optimizing Few-Step Sampler for Diffusion Probabilistic Model
Jen-Yuan Huang
DiffM
77
0
0
14 Dec 2024
SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer
Hongyu Chen
Zihan Wang
Xianrui Li
Xingchen Sun
Fangyi Chen
Jiang Liu
Jie Wang
Bhiksha Raj
Zicheng Liu
Emad Barsoum
VLM
114
7
0
14 Dec 2024
Sample-efficient Unsupervised Policy Cloning from Ensemble Self-supervised Labeled Videos
Xin Liu
Yaran Chen
Haoran Li
SSL
94
0
0
14 Dec 2024
Aspen Open Jets: Unlocking LHC Data for Foundation Models in Particle Physics
Oz Amram
Luca Anzalone
Joschka Birk
D. Faroughy
Anna Hallin
Gregor Kasieczka
Michael Krämer
Ian Pang
H. Reyes-González
David Shih
AI4CE
82
5
0
13 Dec 2024
T-SVG: Text-Driven Stereoscopic Video Generation
Qiao Jin
Xiaodong Chen
Wu Liu
Tao Mei
Yongdong Zhang
DiffM
VGen
97
1
0
12 Dec 2024
Motion Generation Review: Exploring Deep Learning for Lifelike Animation with Manifold
Jiayi Zhao
Dongdong Weng
Qiuxin Du
Zeyu Tian
83
0
0
12 Dec 2024
GPTDrawer: Enhancing Visual Synthesis through ChatGPT
Kun Li
Xinwei Chen
Tianyou Song
Hansong Zhang
Wenzhe Zhang
Qing Shan
85
7
0
11 Dec 2024
CoMA: Compositional Human Motion Generation with Multi-modal Agents
Shanlin Sun
Gabriel De Araujo
Jiaqi Xu
S. Kevin Zhou
Hanwen Zhang
Ziheng Huang
Chenyu You
Xiaohui Xie
97
4
0
10 Dec 2024
[MASK] is All You Need
Vincent Tao Hu
Bjorn Ommer
DiffM
137
2
0
09 Dec 2024
Sound2Vision: Generating Diverse Visuals from Audio through Cross-Modal Latent Alignment
Kim Sung-Bin
Arda Senocak
Hyunwoo Ha
Tae-Hyun Oh
DiffM
83
0
0
09 Dec 2024
Evaluating Hallucination in Text-to-Image Diffusion Models with Scene-Graph based Question-Answering Agent
Ziyuan Qin
D. Cheng
Haoyu Wang
Huahui Yi
Yuting Shao
Zhiyuan Fan
Kang Li
Qicheng Lao
EGVM
MLLM
220
0
0
07 Dec 2024
Training MLPs on Graphs without Supervision
Zehong Wang
Zheyuan Zhang
Chuxu Zhang
Yanfang Ye
75
5
0
05 Dec 2024
UTSD: Unified Time Series Diffusion Model
Xiangkai Ma
Xiaobin Hong
Wenzhong Li
Sanglu Lu
82
0
0
04 Dec 2024
SceneFactor: Factored Latent 3D Diffusion for Controllable 3D Scene Generation
Alexey Bokhovkin
Quan Meng
Shubham Tulsiani
Angela Dai
DiffM
83
5
0
02 Dec 2024
XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation
Xianrui Li
Kai Qiu
Hongyu Chen
Jason Kuen
Jiuxiang Gu
Jie Wang
Zhe-nan Lin
Bhiksha Raj
VLM
125
3
0
02 Dec 2024
HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving
Z. Wu
Jingcheng Ni
Xiaodong Wang
Yuxin Guo
Rui Chen
Lewei Lu
Jifeng Dai
Yuwen Xiong
85
6
0
02 Dec 2024
Hierarchical VAE with a Diffusion-based VampPrior
Anna Kuzina
Jakub M. Tomczak
DiffM
TPM
BDL
92
1
0
02 Dec 2024
MFTF: Mask-free Training-free Object Level Layout Control Diffusion Model
Shan Yang
DiffM
76
0
0
02 Dec 2024
A Semantic Communication System for Real-time 3D Reconstruction Tasks
Jiaxing Zhang
Luosong Guo
K. Zhu
Houming Qiu
67
0
0
02 Dec 2024
FreeCodec: A disentangled neural speech codec with fewer tokens
Youqiang Zheng
Weiping Tu
Yueteng Kang
Jie Chen
Yike Zhang
Li Xiao
Yuhong Yang
Long Ma
75
1
0
02 Dec 2024
Improving Detail in Pluralistic Image Inpainting with Feature Dequantization
Kyungri Park
Woohwan Jung
80
1
0
02 Dec 2024