ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.00937
  4. Cited By
Neural Discrete Representation Learning

Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
    BDL
    SSL
    OCL
ArXivPDFHTML

Papers citing "Neural Discrete Representation Learning"

50 / 2,785 papers shown
Title
HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven
  Generation
HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation
Shanyan Guan
Yanhao Ge
Ying Tai
Jian Yang
Wei Li
Mingyu You
DiffM
34
1
0
10 Oct 2024
DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Jiatao Gu
Yuyang Wang
Yizhe Zhang
Qihang Zhang
Dinghuai Zhang
Navdeep Jaitly
Josh Susskind
Shuangfei Zhai
DiffM
44
13
0
10 Oct 2024
Protect Before Generate: Error Correcting Codes within Discrete Deep
  Generative Models
Protect Before Generate: Error Correcting Codes within Discrete Deep Generative Models
María Martínez-García
Grace Villacrés
David Mitchell
Pablo Martínez Olmos
DRL
24
0
0
10 Oct 2024
MMHead: Towards Fine-grained Multi-modal 3D Facial Animation
MMHead: Towards Fine-grained Multi-modal 3D Facial Animation
Sijing Wu
Yunhao Li
Yichao Yan
Huiyu Duan
Ziwei Liu
Guangtao Zhai
3DH
VGen
44
4
0
10 Oct 2024
ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning
ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning
Yu-Chen Lin
Wei-Hua Li
Jun-Cheng Chen
Chu-Song Chen
37
1
0
10 Oct 2024
Imitation Learning with Limited Actions via Diffusion Planners and Deep Koopman Controllers
Imitation Learning with Limited Actions via Diffusion Planners and Deep Koopman Controllers
Jianxin Bi
Kelvin Lim
Kaiqi Chen
Yifei Huang
Harold Soh
43
0
0
10 Oct 2024
MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion
MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion
Onkar Susladkar
Jishu Sen Gupta
Chirag Sehgal
Sparsh Mittal
Rekha Singhal
DiffM
VGen
46
0
0
10 Oct 2024
ElasticTok: Adaptive Tokenization for Image and Video
ElasticTok: Adaptive Tokenization for Image and Video
Wilson Yan
Matei A. Zaharia
Volodymyr Mnih
Pieter Abbeel
Aleksandra Faust
Hao Liu
VGen
54
6
0
10 Oct 2024
Masked Generative Priors Improve World Models Sequence Modelling Capabilities
Masked Generative Priors Improve World Models Sequence Modelling Capabilities
Cristian Meo
Mircea Lica
Zarif Ikram
Akihiro Nakano
Vedant Shah
Aniket Didolkar
Dianbo Liu
Anirudh Goyal
Justin Dauwels
OffRL
90
0
0
10 Oct 2024
Towards Synergistic, Generalized, and Efficient Dual-System for Robotic Manipulation
Towards Synergistic, Generalized, and Efficient Dual-System for Robotic Manipulation
Qingwen Bu
Hongyang Li
Li Chen
Jisong Cai
Jia Zeng
Heming Cui
Maoqing Yao
Yu Qiao
60
4
0
10 Oct 2024
Language-Guided Joint Audio-Visual Editing via One-Shot Adaptation
Language-Guided Joint Audio-Visual Editing via One-Shot Adaptation
Susan Liang
Chao Huang
Yapeng Tian
Anurag Kumar
Chenliang Xu
DiffM
34
7
0
09 Oct 2024
Zero-Shot Generalization of Vision-Based RL Without Data Augmentation
Zero-Shot Generalization of Vision-Based RL Without Data Augmentation
Sumeet Batra
Gaurav Sukhatme
OffRL
DRL
41
2
0
09 Oct 2024
Trans4D: Realistic Geometry-Aware Transition for Compositional
  Text-to-4D Synthesis
Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis
Bohan Zeng
Ling Yang
Siyu Li
Jiaming Liu
Zixiang Zhang
...
Yongzhen Guo
Fu-Yun Wang
Minkai Xu
Stefano Ermon
Wentao Zhang
VGen
AI4CE
36
7
0
09 Oct 2024
ReinDiffuse: Crafting Physically Plausible Motions with Reinforced
  Diffusion Model
ReinDiffuse: Crafting Physically Plausible Motions with Reinforced Diffusion Model
Gaoge Han
Mingjiang Liang
Jinglei Tang
Yongkang Cheng
Wei Liu
Shaoli Huang
VGen
51
5
0
09 Oct 2024
DDRN:a Data Distribution Reconstruction Network for Occluded Person
  Re-Identification
DDRN:a Data Distribution Reconstruction Network for Occluded Person Re-Identification
Zhaoyong Wang
Yujie Liu
Mingyue Li
Wenxin Zhang
Zongmin Li
23
0
0
09 Oct 2024
G2D2: Gradient-guided Discrete Diffusion for image inverse problem
  solving
G2D2: Gradient-guided Discrete Diffusion for image inverse problem solving
Naoki Murata
Chieh-Hsin Lai
Yuhta Takida
Toshimitsu Uesaka
Bac Nguyen
Stefano Ermon
Yuki Mitsufuji
DiffM
65
1
0
09 Oct 2024
InstantIR: Blind Image Restoration with Instant Generative Reference
InstantIR: Blind Image Restoration with Instant Generative Reference
Jen-Yuan Huang
Haofan Wang
Qixun Wang
Xu Bai
Hao Ai
Peng-Fei Xing
Jen-tse Huang
30
1
0
09 Oct 2024
MotionRL: Align Text-to-Motion Generation to Human Preferences with
  Multi-Reward Reinforcement Learning
MotionRL: Align Text-to-Motion Generation to Human Preferences with Multi-Reward Reinforcement Learning
Xiaoyang Liu
Yunyao Mao
Wengang Zhou
Houqiang Li
42
2
0
09 Oct 2024
LaMP: Language-Motion Pretraining for Motion Generation, Retrieval, and Captioning
LaMP: Language-Motion Pretraining for Motion Generation, Retrieval, and Captioning
Zhe Li
Weihao Yuan
Yisheng He
Lingteng Qiu
Shenhao Zhu
Xiaodong Gu
Weichao Shen
Yuan Dong
Zilong Dong
Laurence T. Yang
33
8
0
09 Oct 2024
Vector Grimoire: Codebook-based Shape Generation under Raster Image
  Supervision
Vector Grimoire: Codebook-based Shape Generation under Raster Image Supervision
Moritz Feuerpfeil
Marco Cipriano
Gerard de Melo
34
0
0
08 Oct 2024
CodeUnlearn: Amortized Zero-Shot Machine Unlearning in Language Models
  Using Discrete Concept
CodeUnlearn: Amortized Zero-Shot Machine Unlearning in Language Models Using Discrete Concept
YuXuan Wu
Bonaventure F. P. Dossou
Dianbo Liu
MU
28
0
0
08 Oct 2024
Restructuring Vector Quantization with the Rotation Trick
Restructuring Vector Quantization with the Rotation Trick
Christopher Fifty
Ronald G. Junkins
Dennis Duan
Aniketh Iger
Jerry W. Liu
Ehsan Amid
Sebastian Thrun
Christopher Ré
LLMSV
50
11
0
08 Oct 2024
CAR: Controllable Autoregressive Modeling for Visual Generation
CAR: Controllable Autoregressive Modeling for Visual Generation
Ziyu Yao
Jialin Li
Yifeng Zhou
Yong Liu
Xi Jiang
Chengjie Wang
Feng Zheng
Yuexian Zou
Lei Li
DiffM
50
13
0
07 Oct 2024
Towards Unsupervised Blind Face Restoration using Diffusion Prior
Towards Unsupervised Blind Face Restoration using Diffusion Prior
Tianshu Kuai
Sina Honari
Igor Gilitschenski
Alex Levinshtein
DiffM
45
0
0
06 Oct 2024
Variational Language Concepts for Interpreting Foundation Language
  Models
Variational Language Concepts for Interpreting Foundation Language Models
Hengyi Wang
Shiwei Tan
Zhiqing Hong
Desheng Zhang
Hao Wang
39
3
0
04 Oct 2024
ShieldDiff: Suppressing Sexual Content Generation from Diffusion Models
  through Reinforcement Learning
ShieldDiff: Suppressing Sexual Content Generation from Diffusion Models through Reinforcement Learning
Dong Han
Salaheldin Mohamed
Yong Li
31
2
0
04 Oct 2024
Chain-of-Jailbreak Attack for Image Generation Models via Editing Step
  by Step
Chain-of-Jailbreak Attack for Image Generation Models via Editing Step by Step
Wenxuan Wang
Kuiyi Gao
Zihan Jia
Youliang Yuan
Jen-tse Huang
Qiuzhi Liu
Shuai Wang
Wenxiang Jiao
Zhaopeng Tu
231
2
0
04 Oct 2024
MO-DDN: A Coarse-to-Fine Attribute-based Exploration Agent for
  Multi-object Demand-driven Navigation
MO-DDN: A Coarse-to-Fine Attribute-based Exploration Agent for Multi-object Demand-driven Navigation
Hongcheng Wang
Peiqi Liu
Wenzhe Cai
Mingdong Wu
Zhengyu Qian
Hao Dong
31
0
0
04 Oct 2024
Zebra: In-Context and Generative Pretraining for Solving Parametric PDEs
Zebra: In-Context and Generative Pretraining for Solving Parametric PDEs
Louis Serrano
Armand K. Koupai
Thomas X. Wang
Pierre Erbacher
Patrick Gallinari
AI4CE
41
3
0
04 Oct 2024
Mitigating Adversarial Perturbations for Deep Reinforcement Learning via
  Vector Quantization
Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization
Tung M. Luu
Thanh Nguyen
Tee Joshua Tian Jin
Sungwoon Kim
Chang D. Yoo
AAML
30
0
0
04 Oct 2024
MultiVerse: Efficient and Expressive Zero-Shot Multi-Task Text-to-Speech
MultiVerse: Efficient and Expressive Zero-Shot Multi-Task Text-to-Speech
Taejun Bak
Youngsik Eom
SeungJae Choi
Young-Sun Joo
43
0
0
04 Oct 2024
ECHOPulse: ECG controlled echocardio-grams video generation
ECHOPulse: ECG controlled echocardio-grams video generation
Yiwei Li
Sekeun Kim
Zihao Wu
Hanqi Jiang
Yi Pan
...
Sifan Song
Yucheng Shi
Tianming Liu
Quanzheng Li
Xiang Li
VGen
37
1
0
04 Oct 2024
Geometric Representation Condition Improves Equivariant Molecule Generation
Geometric Representation Condition Improves Equivariant Molecule Generation
Zian Li
Cai Zhou
Xiyuan Wang
Xingang Peng
Muhan Zhang
50
2
0
04 Oct 2024
LANTERN: Accelerating Visual Autoregressive Models with Relaxed Speculative Decoding
LANTERN: Accelerating Visual Autoregressive Models with Relaxed Speculative Decoding
Doohyuk Jang
Sihwan Park
J. Yang
Yeonsung Jung
Jihun Yun
Souvik Kundu
Sung-Yub Kim
Eunho Yang
56
7
0
04 Oct 2024
Scaling Large Motion Models with Million-Level Human Motions
Scaling Large Motion Models with Million-Level Human Motions
Ye Wang
Sipeng Zheng
Bin Cao
Qianshan Wei
Qin Jin
Qin Jin
Zongqing Lu
VGen
47
0
0
04 Oct 2024
Disentangling Textual and Acoustic Features of Neural Speech
  Representations
Disentangling Textual and Acoustic Features of Neural Speech Representations
Hosein Mohebbi
Grzegorz Chrupała
Willem H. Zuidema
Afra Alishahi
Ivan Titov
CoGe
36
0
0
03 Oct 2024
Grounded Answers for Multi-agent Decision-making Problem through
  Generative World Model
Grounded Answers for Multi-agent Decision-making Problem through Generative World Model
Zeyang Liu
Xinrui Yang
Shiguang Sun
Long Qian
Lipeng Wan
Xingyu Chen
Xuguang Lan
44
3
0
03 Oct 2024
Convolutional Variational Autoencoders for Spectrogram Compression in
  Automatic Speech Recognition
Convolutional Variational Autoencoders for Spectrogram Compression in Automatic Speech Recognition
Olga Iakovenko
Ivan Bondarenko
32
0
0
03 Oct 2024
SGW-based Multi-Task Learning in Vision Tasks
SGW-based Multi-Task Learning in Vision Tasks
Ruiyuan Zhang
Yuyao Chen
Yuchi Huo
Jiaxiang Liu
Dianbing Xi
Jie Liu
Chao Wu
33
1
0
03 Oct 2024
SEAL: SEmantic-Augmented Imitation Learning via Language Model
SEAL: SEmantic-Augmented Imitation Learning via Language Model
Chengyang Gu
Yuxin Pan
Haotian Bai
Hui Xiong
Yize Chen
37
0
0
03 Oct 2024
CaLMFlow: Volterra Flow Matching using Causal Language Models
CaLMFlow: Volterra Flow Matching using Causal Language Models
Shiyang Zhang
Daniel Levine
Ivan Vrkic
Marco Francesco Bressana
David Zhang
S. Rizvi
Yangtian Zhang
E. Zappala
David van Dijk
27
0
0
03 Oct 2024
Remember and Recall: Associative-Memory-based Trajectory Prediction
Remember and Recall: Associative-Memory-based Trajectory Prediction
Hang Guo
Yuzhen Zhang
Tianci Gao
Junning Su
Pei Lv
Mingliang Xu
37
0
0
03 Oct 2024
Plug-and-Play Controllable Generation for Discrete Masked Models
Plug-and-Play Controllable Generation for Discrete Masked Models
Wei Guo
Yuchen Zhu
Molei Tao
Yongxin Chen
45
1
0
03 Oct 2024
From Pixels to Tokens: Byte-Pair Encoding on Quantized Visual Modalities
From Pixels to Tokens: Byte-Pair Encoding on Quantized Visual Modalities
Wanpeng Zhang
Zilong Xie
Yicheng Feng
Yijiang Li
Xingrun Xing
Sipeng Zheng
Zongqing Lu
MLLM
30
0
0
03 Oct 2024
Loong: Generating Minute-level Long Videos with Autoregressive Language Models
Loong: Generating Minute-level Long Videos with Autoregressive Language Models
Yuqing Wang
Tianwei Xiong
Daquan Zhou
Zhijie Lin
Yang Zhao
Bingyi Kang
Jiashi Feng
Xihui Liu
VGen
61
23
0
03 Oct 2024
A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive
  Transformer for Efficient Finegrained Image Generation
A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation
Liang Chen
Sinan Tan
Zefan Cai
Weichu Xie
Haozhe Zhao
Yichi Zhang
Junyang Lin
Jinze Bai
Tianyu Liu
Baobao Chang
ViT
58
3
0
02 Oct 2024
ImageFolder: Autoregressive Image Generation with Folded Tokens
ImageFolder: Autoregressive Image Generation with Folded Tokens
Xiang Li
Kai Qiu
Hao Chen
Jason Kuen
Jiuxiang Gu
Bhiksha Raj
Zhe Lin
VLM
47
18
0
02 Oct 2024
Boosting Weakly-Supervised Referring Image Segmentation via Progressive
  Comprehension
Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension
Zaiquan Yang
Yuhao Liu
Jiaying Lin
Gerhard Hancke
Rynson W. H. Lau
36
1
0
02 Oct 2024
Denoising with a Joint-Embedding Predictive Architecture
Denoising with a Joint-Embedding Predictive Architecture
Dengsheng Chen
Jie Hu
Xiaoming Wei
Enhua Wu
DiffM
57
2
0
02 Oct 2024
Multi-Scale Fusion for Object Representation
Multi-Scale Fusion for Object Representation
Rongzhen Zhao
V. Wang
Arno Solin
Joni Pajarinen
OCL
VOS
68
1
0
02 Oct 2024
Previous
123...111213...545556
Next