ResearchTrend.AI
Neural Discrete Representation Learning

2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
    BDL
    SSL
    OCL

Papers citing "Neural Discrete Representation Learning"

50 / 2,845 papers shown
Masked Particle Modeling on Sets: Towards Self-Supervised High Energy Physics Foundation Models
T. Golling
Lukas Heinrich
Michael Kagan
Samuel Klein
Matthew Leigh
Margarita Osadchy
J. A. Raine
47
24
0
24 Jan 2024
Generative Human Motion Stylization in Latent Space
Chuan Guo
Yuxuan Mu
Wei Ji
Peng Dai
Youliang Yan
Juwei Lu
Li Cheng
VGen
55
11
0
24 Jan 2024
PA-SAM: Prompt Adapter SAM for High-Quality Image Segmentation
Zhaozhi Xie
Bochen Guan
Weihao Jiang
Muyang Yi
Yue Ding
Hongtao Lu
Lei Zhang
VLM
46
13
0
23 Jan 2024
CloSe: A 3D Clothing Segmentation Dataset and Model
Dimitrije Antic
Garvita Tiwari
Batuhan Ozcomlekci
R. Marin
Gerard Pons-Moll
3DPC
3DH
27
9
0
22 Jan 2024
Empowering Communication: Speech Technology for Indian and Western Accents through AI-powered Speech Synthesis
R. Vinotha
D. Hepsiba
L. D. V. Anand
Deepak John Reji
15
1
0
22 Jan 2024
Text-to-Image Cross-Modal Generation: A Systematic Review
Maciej Żelaszczyk
Jacek Mańdziuk
60
4
0
21 Jan 2024
Diffusion Model Conditioning on Gaussian Mixture Model and Negative Gaussian Mixture Gradient
Weiguo Lu
Xuan Wu
Deng Ding
Jinqiao Duan
Jirong Zhuang
Gangnan Yuan
DiffM
VLM
68
0
0
20 Jan 2024
Dream360: Diverse and Immersive Outdoor Virtual Scene Creation via Transformer-Based 360 Image Outpainting
Hao Ai
Zidong Cao
H. Lu
Chen Chen
Jiancang Ma
Pengyuan Zhou
Tae-Kyun Kim
Pan Hui
Lin Wang
43
4
0
19 Jan 2024
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting
Wouter Van Gansbeke
Bert De Brabandere
DiffM
65
11
0
18 Jan 2024
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer
Changyao Tian
Xizhou Zhu
Yuwen Xiong
Weiyun Wang
Zhe Chen
...
Tong Lu
Jie Zhou
Hongsheng Li
Yu Qiao
Jifeng Dai
AuLLM
85
43
0
18 Jan 2024
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens
Xiaofeng Wang
Zheng Zhu
Guan Huang
Boyuan Wang
Xinze Chen
Jiwen Lu
VGen
45
34
0
18 Jan 2024
CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects
Zhao Wang
Aoxue Li
Lingting Zhu
Yong Guo
Qi Dou
Zhenguo Li
VGen
DiffM
54
41
0
18 Jan 2024
A New Creative Generation Pipeline for Click-Through Rate with Stable Diffusion Model
Hao Yang
Jianxin Yuan
Shuai Yang
Linhe Xu
Shuo Yuan
Yifan Zeng
28
11
0
17 Jan 2024
Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image Models
Dan G. Jacobellis
Daniel Cummings
N. Yadwadkar
31
2
0
15 Jan 2024
Optimization of Discrete Parameters Using the Adaptive Gradient Method and Directed Evolution
Andrei Beinarovich
Sergey Stepanov
Alexander Zaslavsky
56
0
0
12 Jan 2024
360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model
Qian Wang
Weiqi Li
Chong Mou
Xinhua Cheng
Jian Zhang
VGen
66
18
0
12 Jan 2024
EEGFormer: Towards Transferable and Interpretable Large-Scale EEG Foundation Model
Yuqi Chen
Kan Ren
Kaitao Song
Yansen Wang
Yifan Wang
Dongsheng Li
Lili Qiu
56
15
0
11 Jan 2024
Learning Cognitive Maps from Transformer Representations for Efficient Planning in Partially Observed Environments
Antoine Dedieu
Wolfgang Lehrach
Guangyao Zhou
Dileep George
Miguel Lazaro-Gredilla
52
2
0
11 Jan 2024
Source-Free Cross-Modal Knowledge Transfer by Unleashing the Potential of Task-Irrelevant Data
Jinjin Zhu
Yucheng Chen
Lin Wang
43
2
0
10 Jan 2024
Exploratory Evaluation of Speech Content Masking
Jennifer Williams
Karla Pizzi
Paul-Gauthier Noé
Sneha Das
49
3
0
08 Jan 2024
A Survey on 3D Gaussian Splatting
Guikun Chen
Wenguan Wang
3DGS
78
181
0
08 Jan 2024
PIXAR: Auto-Regressive Language Modeling in Pixel Space
Yintao Tai
Xiyang Liao
Alessandro Suglia
Antonio Vergari
MLLM
33
8
0
06 Jan 2024
Bring Metric Functions into Diffusion Models
Jie An
Zhengyuan Yang
Jianfeng Wang
Linjie Li
Zicheng Liu
Lijuan Wang
Jiebo Luo
DiffM
50
3
0
04 Jan 2024
Improving Diffusion-Based Image Synthesis with Context Prediction
Ling Yang
Jingwei Liu
Shenda Hong
Zhilong Zhang
Zhilin Huang
Zheming Cai
Wentao Zhang
Tengjiao Wang
DiffM
56
34
0
04 Jan 2024
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Evonne Ng
Javier Romero
Timur M. Bagautdinov
Shaojie Bai
Trevor Darrell
Angjoo Kanazawa
Alexander Richard
VGen
30
41
0
03 Jan 2024
ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and Text
Dingkun Yan
Liang Yuan
Erwin Wu
Yuma Nishioka
I. Fujishiro
Suguru Saito
DiffM
26
5
0
02 Jan 2024
MOC-RVQ: Multilevel Codebook-assisted Digital Generative Semantic Communication
Yingbin Zhou
Yaping Sun
Guanying Chen
Xiaodong Xu
Hao Chen
Binhong Huang
Shuguang Cui
Ping Zhang
35
6
0
02 Jan 2024
Utilizing Autoregressive Networks for Full Lifecycle Data Generation of Rolling Bearings for RUL Prediction
Junliang Wang
Qinghua Zhang
Guanhua Zhu
Guoxi Sun
29
0
0
02 Jan 2024
Efficient Parallel Audio Generation using Group Masked Language Modeling
Myeonghun Jeong
Minchan Kim
Joun Yeop Lee
Nam Soo Kim
30
5
0
02 Jan 2024
Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation
Jinlong Xue
Yayue Deng
Yingming Gao
Ya Li
DiffM
36
29
0
02 Jan 2024
New Job, New Gender? Measuring the Social Bias in Image Generation Models
Wenxuan Wang
Haonan Bai
Jen-tse Huang
Yuxuan Wan
Youliang Yuan
Haoyi Qiu
Nanyun Peng
Michael R. Lyu
56
21
0
01 Jan 2024
Masked Modeling for Self-supervised Representation Learning on Vision and Beyond
Siyuan Li
Luyuan Zhang
Zedong Wang
Di Wu
Lirong Wu
...
Jun Xia
Cheng Tan
Yang Liu
Baigui Sun
Stan Z. Li
SSL
57
14
0
31 Dec 2023
Brain-Conditional Multimodal Synthesis: A Survey and Taxonomy
Weijian Mai
Jian Zhang
Pengfei Fang
Zhijun Zhang
83
9
0
31 Dec 2023
EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling
Haiyang Liu
Zihao Zhu
Giorgio Becherini
Yichen Peng
Mingyang Su
You Zhou
Xuefei Zhe
Naoya Iwamoto
Bo Zheng
Michael J. Black
SLR
63
31
0
31 Dec 2023
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes
Yuhta Takida
Yukara Ikemiya
Takashi Shibuya
Kazuki Shimada
Woosung Choi
...
Naoki Murata
Toshimitsu Uesaka
Kengo Uchida
Wei-Hsiang Liao
Yuki Mitsufuji
BDL
51
12
0
31 Dec 2023
FlashVideo: A Framework for Swift Inference in Text-to-Video Generation
Bin Lei
Le Chen
Caiwen Ding
VGen
33
1
0
30 Dec 2023
Discrete Distribution Networks
Lei Yang
50
1
0
29 Dec 2023
Compact Neural Graphics Primitives with Learned Hash Probing
Towaki Takikawa
Thomas Müller
Merlin Nimier-David
Alex Evans
Sanja Fidler
Alec Jacobson
Alexander Keller
37
20
0
28 Dec 2023
Fast gradient-free activation maximization for neurons in spiking neural networks
N. Pospelov
Andrei Chertkov
Maxim Beketov
Ivan Oseledets
Konstantin Anokhin
42
2
0
28 Dec 2023
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action
Jiasen Lu
Christopher Clark
Sangho Lee
Zichen Zhang
Savya Khosla
Ryan Marten
Derek Hoiem
Aniruddha Kembhavi
VLM
MLLM
47
155
0
28 Dec 2023
Discrete Messages Improve Communication Efficiency among Isolated Intelligent Agents
Hang Chen
Yuchuan Jang
Weijie Zhou
Cristian Meo
Ziwei Chen
Dianbo Liu
40
0
0
26 Dec 2023
Explicit-Implicit Subgoal Planning for Long-Horizon Tasks with Sparse Reward
Fangyuan Wang
Anqing Duan
Peng Zhou
Shengzeng Huo
Guodong Guo
Chenguang Yang
D. Navarro-Alarcon
OffRL
VLM
48
0
0
25 Dec 2023
Fréchet Wavelet Distance: A Domain-Agnostic Metric for Image Generation
Lokesh Veeramacheneni
Moritz Wolter
Hildegard Kuehne
Juergen Gall
EGVM
41
3
0
23 Dec 2023
BrainVis: Exploring the Bridge between Brain and Visual Signals via Image Reconstruction
Honghao Fu
Zhiqi Shen
Jing Jih Chin
Hao Wang
DiffM
60
6
0
22 Dec 2023
Emage: Non-Autoregressive Text-to-Image Generation
Zhangyin Feng
Runyi Hu
Liangxin Liu
Fan Zhang
Duyu Tang
Yong Dai
Xiaocheng Feng
Jiwei Li
Bing Qin
Shuming Shi
DiffM
VLM
47
0
0
22 Dec 2023
HeadCraft: Modeling High-Detail Shape Variations for Animated 3DMMs
Artem Sevastopolsky
Philip-William Grassal
Simon Giebenhain
ShahRukh Athar
Luisa Verdoliva
Matthias Niessner
3DH
73
4
0
21 Dec 2023
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
Hayk Manukyan
Andranik Sargsyan
Barsegh Atanyan
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
DiffM
54
28
0
21 Dec 2023
DreamTuner: Single Image is Enough for Subject-Driven Generation
Miao Hua
Jiawei Liu
Fei Ding
Wei Liu
Jie Wu
Qian He
33
28
0
21 Dec 2023
Sign Language Production with Latent Motion Transformer
Pan Xie
Taiying Peng
Yao Du
Qipeng Zhang
SLR
32
3
0
20 Dec 2023
All but One: Surgical Concept Erasing with Model Preservation in Text-to-Image Diffusion Models
Seunghoo Hong
Juhun Lee
Simon S. Woo
51
20
0
20 Dec 2023