ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.00446
  4. Cited By
Generating Diverse High-Fidelity Images with VQ-VAE-2

Generating Diverse High-Fidelity Images with VQ-VAE-2

2 June 2019
Ali Razavi
Aaron van den Oord
Oriol Vinyals
    DRL
    BDL
ArXivPDFHTML

Papers citing "Generating Diverse High-Fidelity Images with VQ-VAE-2"

50 / 1,107 papers shown
Title
A Demographic-Conditioned Variational Autoencoder for fMRI Distribution
  Sampling and Removal of Confounds
A Demographic-Conditioned Variational Autoencoder for fMRI Distribution Sampling and Removal of Confounds
Anton Orlichenko
Gang Qu
Ziyu Zhou
Anqi Liu
Hong-Wen Deng
Zhengming Ding
Julia M. Stephen
Tony W. Wilson
Vince D. Calhoun
Yu-Ping Wang
22
0
0
13 May 2024
Stable Diffusion-based Data Augmentation for Federated Learning with
  Non-IID Data
Stable Diffusion-based Data Augmentation for Federated Learning with Non-IID Data
Mahdi Morafah
M. Reisser
Bill Lin
Christos Louizos
FedML
34
5
0
13 May 2024
Generating Human Motion in 3D Scenes from Text Descriptions
Generating Human Motion in 3D Scenes from Text Descriptions
Zhi Cen
Huaijin Pi
Sida Peng
Zehong Shen
Minghui Yang
Shuai Zhu
Hujun Bao
Xiaowei Zhou
55
19
0
13 May 2024
MAxPrototyper: A Multi-Agent Generation System for Interactive User
  Interface Prototyping
MAxPrototyper: A Multi-Agent Generation System for Interactive User Interface Prototyping
Mingyue Yuan
Jieshan Chen
Aaron Quigley
LLMAG
51
5
0
12 May 2024
Training-free Subject-Enhanced Attention Guidance for Compositional
  Text-to-image Generation
Training-free Subject-Enhanced Attention Guidance for Compositional Text-to-image Generation
Shengyuan Liu
Bo Wang
Ye Ma
Te Yang
Xipeng Cao
Quan Chen
Han Li
Di Dong
Peng Jiang
EGVM
44
2
0
11 May 2024
Controllable Image Generation With Composed Parallel Token Prediction
Controllable Image Generation With Composed Parallel Token Prediction
Jamie Stirling
Noura Al-Moubayed
38
0
0
10 May 2024
Detecting music deepfakes is easy but actually hard
Detecting music deepfakes is easy but actually hard
Darius Afchar
Gabriel Meseguer-Brocal
Romain Hennequin
63
6
0
07 May 2024
MVDiff: Scalable and Flexible Multi-View Diffusion for 3D Object
  Reconstruction from Single-View
MVDiff: Scalable and Flexible Multi-View Diffusion for 3D Object Reconstruction from Single-View
Emmanuelle Bourigault
Pauline Bourigault
37
2
0
06 May 2024
Generated Contents Enrichment
Generated Contents Enrichment
Mahdi Naseri
Jiayan Qiu
Zhou Wang
37
0
0
06 May 2024
Towards Real-world Video Face Restoration: A New Benchmark
Towards Real-world Video Face Restoration: A New Benchmark
Ziyan Chen
Jingwen He
Xinqi Lin
Yu Qiao
Chao Dong
48
4
0
30 Apr 2024
Assessing Image Quality Using a Simple Generative Representation
Assessing Image Quality Using a Simple Generative Representation
Simon Raviv
Gal Chechik
55
0
0
28 Apr 2024
TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose
  Representation
TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation
Sai Kumar Dwivedi
Yu Sun
Priyanka Patel
Yao Feng
Michael J. Black
3DH
46
28
0
25 Apr 2024
HybridFlow: Infusing Continuity into Masked Codebook for Extreme
  Low-Bitrate Image Compression
HybridFlow: Infusing Continuity into Masked Codebook for Extreme Low-Bitrate Image Compression
Lei Lu
Yanyue Xie
Wei Jiang
Wei Wang
Xue Lin
Yanzhi Wang
45
4
0
20 Apr 2024
Lazy Diffusion Transformer for Interactive Image Editing
Lazy Diffusion Transformer for Interactive Image Editing
Yotam Nitzan
Zongze Wu
Richard Zhang
Eli Shechtman
Daniel Cohen-Or
Taesung Park
Michael Gharbi
43
9
0
18 Apr 2024
MIDGET: Music Conditioned 3D Dance Generation
MIDGET: Music Conditioned 3D Dance Generation
Jinwu Wang
Wei Mao
Miaomiao Liu
40
0
0
18 Apr 2024
Large Language Models: From Notes to Musical Form
Large Language Models: From Notes to Musical Form
Lilac Atassi
35
0
0
18 Apr 2024
Octopus v3: Technical Report for On-device Sub-billion Multimodal AI
  Agent
Octopus v3: Technical Report for On-device Sub-billion Multimodal AI Agent
Wei Chen
Zhiyuan Li
LLMAG
30
5
0
17 Apr 2024
Closely Interactive Human Reconstruction with Proxemics and
  Physics-Guided Adaption
Closely Interactive Human Reconstruction with Proxemics and Physics-Guided Adaption
Buzhen Huang
Chen Li
Chongyang Xu
Liang Pan
Yangang Wang
Gim Hee Lee
33
5
0
17 Apr 2024
Personalized Heart Disease Detection via ECG Digital Twin Generation
Personalized Heart Disease Detection via ECG Digital Twin Generation
Yaojun Hu
Jintai Chen
Lianting Hu
Dantong Li
Jiahuan Yan
Haochao Ying
Huiying Liang
Jian Wu
36
4
0
17 Apr 2024
MaSkel: A Model for Human Whole-body X-rays Generation from Human
  Masking Images
MaSkel: A Model for Human Whole-body X-rays Generation from Human Masking Images
Yingjie Xi
Boyuan Cheng
Jingyao Cai
Jian Jun Zhang
Xiaosong Yang
MedIm
42
0
0
13 Apr 2024
Adapting LLaMA Decoder to Vision Transformer
Adapting LLaMA Decoder to Vision Transformer
Jiahao Wang
Wenqi Shao
Yonghong Tian
Chengyue Wu
Yong Liu
Taiqiang Wu
Kaipeng Zhang
Songyang Zhang
Kai-xiang Chen
Ping Luo
MLLM
40
4
0
10 Apr 2024
Rethinking the Spatial Inconsistency in Classifier-Free Diffusion
  Guidance
Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance
Dazhong Shen
Guanglu Song
Zeyue Xue
Fu-Yun Wang
Yu Liu
DiffM
38
11
0
08 Apr 2024
Gull: A Generative Multifunctional Audio Codec
Gull: A Generative Multifunctional Audio Codec
Yi Luo
Jianwei Yu
Hangting Chen
Rongzhi Gu
Chao Weng
AuLLM
46
3
0
07 Apr 2024
Do We Really Need a Complex Agent System? Distill Embodied Agent into a
  Single Model
Do We Really Need a Complex Agent System? Distill Embodied Agent into a Single Model
Zhonghan Zhao
Ke Ma
Wenhao Chai
Xuan Wang
Kewei Chen
Dongxu Guo
Yanting Zhang
Hongwei Wang
Gaoang Wang
45
16
0
06 Apr 2024
SemGrasp: Semantic Grasp Generation via Language Aligned Discretization
SemGrasp: Semantic Grasp Generation via Language Aligned Discretization
Kailin Li
Jingbo Wang
Lixin Yang
Cewu Lu
Bo Dai
48
16
0
04 Apr 2024
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale
  Prediction
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction
Keyu Tian
Yi-Xin Jiang
Zehuan Yuan
Bingyue Peng
Liwei Wang
VGen
55
260
0
03 Apr 2024
CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot
  Text-to-Speech
CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech
Jaehyeon Kim
Keon Lee
Seungjun Chung
Jaewoong Cho
74
41
0
03 Apr 2024
MotionChain: Conversational Motion Controllers via Multimodal Prompts
MotionChain: Conversational Motion Controllers via Multimodal Prompts
Biao Jiang
Xin Chen
C. Zhang
Fukun Yin
Zhuoyuan Li
Gang Yu
Jiayuan Fan
VGen
LRM
35
10
0
02 Apr 2024
Variational Autoencoders for exteroceptive perception in reinforcement
  learning-based collision avoidance
Variational Autoencoders for exteroceptive perception in reinforcement learning-based collision avoidance
T. N. Larsen
Eirik Runde Barlaug
Adil Rasheed
DRL
28
1
0
31 Mar 2024
Transformer based Pluralistic Image Completion with Reduced Information
  Loss
Transformer based Pluralistic Image Completion with Reduced Information Loss
Qiankun Liu
Yuqi Jiang
Zhentao Tan
DongDong Chen
Ying Fu
Qi Chu
Gang Hua
Nenghai Yu
ViT
73
11
0
31 Mar 2024
Towards Variable and Coordinated Holistic Co-Speech Motion Generation
Towards Variable and Coordinated Holistic Co-Speech Motion Generation
Yifei Liu
Qiong Cao
Yandong Wen
Huaiguang Jiang
Changxing Ding
SLR
71
14
0
30 Mar 2024
Don't Look into the Dark: Latent Codes for Pluralistic Image Inpainting
Don't Look into the Dark: Latent Codes for Pluralistic Image Inpainting
Haiwei Chen
Yajie Zhao
DiffM
24
2
0
27 Mar 2024
Pose-Guided Self-Training with Two-Stage Clustering for Unsupervised
  Landmark Discovery
Pose-Guided Self-Training with Two-Stage Clustering for Unsupervised Landmark Discovery
Siddharth Tourani
Ahmed Alwheibi
Arif Mahmood
Muhammad Haris Khan
DiffM
46
1
0
24 Mar 2024
BEND: Bagging Deep Learning Training Based on Efficient Neural Network
  Diffusion
BEND: Bagging Deep Learning Training Based on Efficient Neural Network Diffusion
Jia Wei
Xingjun Zhang
Witold Pedrycz
DiffM
26
0
0
23 Mar 2024
Denoising Diffusion Models for 3D Healthy Brain Tissue Inpainting
Denoising Diffusion Models for 3D Healthy Brain Tissue Inpainting
Alicia Durrer
J. Wolleb
Florentin Bieder
Paul Friedrich
L. Melie-García
...
Ozgur Yaldizli
Cristina Granziera
Bjoern H. Menze
Philippe C. Cattin
Florian Kofler
MedIm
21
6
0
21 Mar 2024
Ultra-High-Resolution Image Synthesis with Pyramid Diffusion Model
Ultra-High-Resolution Image Synthesis with Pyramid Diffusion Model
Jiajie Yang
40
0
0
19 Mar 2024
SC-Diff: 3D Shape Completion with Latent Diffusion Models
SC-Diff: 3D Shape Completion with Latent Diffusion Models
Juan D. Galvis
Xingxing Zuo
Simon Schaefer
Stefan Leutengger
DiffM
44
3
0
19 Mar 2024
GenView: Enhancing View Quality with Pretrained Generative Model for
  Self-Supervised Learning
GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning
Xiaojie Li
Yibo Yang
Hefei Ling
Jianlong Wu
Yue Yu
Guohao Li
Min Zhang
SSL
36
6
0
18 Mar 2024
Artifact Feature Purification for Cross-domain Detection of AI-generated
  Images
Artifact Feature Purification for Cross-domain Detection of AI-generated Images
Zheling Meng
Bo Peng
Jing Dong
Tieniu Tan
89
4
0
17 Mar 2024
Boosting Flow-based Generative Super-Resolution Models via Learned Prior
Boosting Flow-based Generative Super-Resolution Models via Learned Prior
Li-Yuan Tsao
Yi-Chen Lo
Chia-Che Chang
Hao-Wei Chen
Roy Tseng
Chien Feng
Chun-Yi Lee
SupR
34
4
0
16 Mar 2024
Codebook Transfer with Part-of-Speech for Vector-Quantized Image
  Modeling
Codebook Transfer with Part-of-Speech for Vector-Quantized Image Modeling
Baoquan Zhang
Huaibin Wang
Chuyao Luo
Xutao Li
Guotao Liang
Yunming Ye
Xiaochen Qi
Yao He
40
11
0
15 Mar 2024
UniCode: Learning a Unified Codebook for Multimodal Large Language
  Models
UniCode: Learning a Unified Codebook for Multimodal Large Language Models
Sipeng Zheng
Bohan Zhou
Yicheng Feng
Ye Wang
Zongqing Lu
VLM
MLLM
46
7
0
14 Mar 2024
Beyond Text: Frozen Large Language Models in Visual Signal Comprehension
Beyond Text: Frozen Large Language Models in Visual Signal Comprehension
Lei Zhu
Fangyun Wei
Yanye Lu
MLLM
VLM
52
18
0
12 Mar 2024
Discriminative Probing and Tuning for Text-to-Image Generation
Discriminative Probing and Tuning for Text-to-Image Generation
Leigang Qu
Wenjie Wang
Yongqi Li
Hanwang Zhang
Liqiang Nie
Tat-Seng Chua
46
7
0
07 Mar 2024
Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference
Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference
Benjamin Eysenbach
Vivek Myers
Ruslan Salakhutdinov
Sergey Levine
AI4TS
43
8
0
06 Mar 2024
Shapley Values-Powered Framework for Fair Reward Split in Content
  Produced by GenAI
Shapley Values-Powered Framework for Fair Reward Split in Content Produced by GenAI
Alex Glinsky
Alexey Sokolsky
35
0
0
05 Mar 2024
Large Convolutional Model Tuning via Filter Subspace
Large Convolutional Model Tuning via Filter Subspace
Wei Chen
Zichen Miao
Qiang Qiu
59
3
0
01 Mar 2024
Unified Generation, Reconstruction, and Representation: Generalized
  Diffusion with Adaptive Latent Encoding-Decoding
Unified Generation, Reconstruction, and Representation: Generalized Diffusion with Adaptive Latent Encoding-Decoding
Guangyi Liu
Yu Wang
Zeyu Feng
Qiyu Wu
Liping Tang
...
Shuguang Cui
Julian McAuley
Zichao Yang
Eric P. Xing
Zhiting Hu
DiffM
80
3
0
29 Feb 2024
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Hany Hamed
Subin Kim
Dongyeong Kim
Jaesik Yoon
Sungjin Ahn
47
4
0
29 Feb 2024
Latent Neural PDE Solver: a reduced-order modelling framework for partial differential equations
Latent Neural PDE Solver: a reduced-order modelling framework for partial differential equations
Zijie Li
Saurabh Patil
Francis Ogoke
Dule Shu
Wilson Zhen
Michael Schneier
John R. Buchanan
A. Farimani
AI4CE
45
5
0
27 Feb 2024
Previous
123...567...212223
Next