ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.14822
  4. Cited By
Vector Quantized Diffusion Model for Text-to-Image Synthesis

Vector Quantized Diffusion Model for Text-to-Image Synthesis

29 November 2021
Shuyang Gu
Dong Chen
Jianmin Bao
Fang Wen
Bo Zhang
Dongdong Chen
Lu Yuan
B. Guo
    DiffM
ArXivPDFHTML

Papers citing "Vector Quantized Diffusion Model for Text-to-Image Synthesis"

50 / 566 papers shown
Title
LayerDiff: Exploring Text-guided Multi-layered Composable Image
  Synthesis via Layer-Collaborative Diffusion Model
LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model
Runhu Huang
Kaixin Cai
Jianhua Han
Xiaodan Liang
Renjing Pei
Guansong Lu
Songcen Xu
Wei Zhang
Hang Xu
DiffM
31
4
0
18 Mar 2024
LogicalDefender: Discovering, Extracting, and Utilizing Common-Sense
  Knowledge
LogicalDefender: Discovering, Extracting, and Utilizing Common-Sense Knowledge
Yuhe Liu
Mengxue Kang
Zengchang Qin
Xiangxiang Chu
NAI
VLM
38
0
0
18 Mar 2024
HyperVQ: MLR-based Vector Quantization in Hyperbolic Space
HyperVQ: MLR-based Vector Quantization in Hyperbolic Space
Nabarun Goswami
Yusuke Mukuta
Tatsuya Harada
40
3
0
18 Mar 2024
Artifact Feature Purification for Cross-domain Detection of AI-generated
  Images
Artifact Feature Purification for Cross-domain Detection of AI-generated Images
Zheling Meng
Bo Peng
Jing Dong
Tieniu Tan
86
2
0
17 Mar 2024
Codebook Transfer with Part-of-Speech for Vector-Quantized Image
  Modeling
Codebook Transfer with Part-of-Speech for Vector-Quantized Image Modeling
Baoquan Zhang
Huaibin Wang
Chuyao Luo
Xutao Li
Guotao Liang
Yunming Ye
Xiaochen Qi
Yao He
37
11
0
15 Mar 2024
UniCode: Learning a Unified Codebook for Multimodal Large Language
  Models
UniCode: Learning a Unified Codebook for Multimodal Large Language Models
Sipeng Zheng
Bohan Zhou
Yicheng Feng
Ye Wang
Zongqing Lu
VLM
MLLM
46
7
0
14 Mar 2024
Data-Independent Operator: A Training-Free Artifact Representation
  Extractor for Generalizable Deepfake Detection
Data-Independent Operator: A Training-Free Artifact Representation Extractor for Generalizable Deepfake Detection
Chuangchuang Tan
Ping Liu
Renshuai Tao
Huan Liu
Yao-Min Zhao
Baoyuan Wu
Yunchao Wei
36
9
0
11 Mar 2024
DiffuMatting: Synthesizing Arbitrary Objects with Matting-level
  Annotation
DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation
Xiaobin Hu
Xu Peng
Donghao Luo
Xiaozhong Ji
Jinlong Peng
Zhengkai Jiang
Jiangning Zhang
Taisong Jin
Chengjie Wang
Rongrong Ji
DiffM
34
5
0
10 Mar 2024
Towards In-Vehicle Multi-Task Facial Attribute Recognition:
  Investigating Synthetic Data and Vision Foundation Models
Towards In-Vehicle Multi-Task Facial Attribute Recognition: Investigating Synthetic Data and Vision Foundation Models
Esmaeil Seraj
Walter Talamonti
30
0
0
10 Mar 2024
StableDrag: Stable Dragging for Point-based Image Editing
StableDrag: Stable Dragging for Point-based Image Editing
Yutao Cui
Xiaotong Zhao
Guozhen Zhang
Shengming Cao
Kai Ma
Limin Wang
36
10
0
07 Mar 2024
Deep-Learned Compression for Radio-Frequency Signal Classification
Deep-Learned Compression for Radio-Frequency Signal Classification
Armani Rodriguez
Yagna Kaasaragadda
S. Kokalj-Filipovic
26
1
0
05 Mar 2024
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and
  Diffusion Models
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Zeqian Ju
Yuancheng Wang
Kai Shen
Xu Tan
Detai Xin
...
Shikun Zhang
Jiang Bian
Lei He
Jinyu Li
Sheng Zhao
DiffM
46
144
0
05 Mar 2024
HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances
HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances
Supreeth Narasimhaswamy
Uttaran Bhattacharya
Xiang Chen
Ishita Dasgupta
Saayan Mitra
Minh Hoai
DiffM
26
24
0
04 Mar 2024
DiffSal: Joint Audio and Video Learning for Diffusion Saliency
  Prediction
DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction
Jun Xiong
Peng Zhang
Tao You
Chuanyue Li
Wei Huang
Yufei Zha
DiffM
32
5
0
02 Mar 2024
Text-guided Explorable Image Super-resolution
Text-guided Explorable Image Super-resolution
Kanchana Vaishnavi Gandikota
Paramanand Chandramouli
42
7
0
02 Mar 2024
Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning
Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning
Xiaoyu Zhang
Matthew Chang
Pranav Kumar
Saurabh Gupta
DiffM
OffRL
45
13
0
27 Feb 2024
DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized
  Diffusion Models
DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models
Shyam Marjit
Harshit Singh
Nityanand Mathur
Sayak Paul
Chia-Mu Yu
Pin-Yu Chen
DiffM
39
6
0
27 Feb 2024
Diffusion Model-Based Image Editing: A Survey
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
Liangliang Cao
EGVM
66
85
0
27 Feb 2024
Generative AI in Vision: A Survey on Models, Metrics and Applications
Generative AI in Vision: A Survey on Models, Metrics and Applications
Gaurav Raut
Apoorv Singh
VLM
MedIm
43
6
0
26 Feb 2024
Gen4Gen: Generative Data Pipeline for Generative Multi-Concept
  Composition
Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition
Chun-Hsiao Yeh
Ta-Ying Cheng
He-Yen Hsieh
Chuan-En Lin
Yi Ma
Andrew Markham
Niki Trigoni
H. T. Kung
Yubei Chen
DiffM
25
3
0
23 Feb 2024
Hierarchical Invariance for Robust and Interpretable Vision Tasks at
  Larger Scales
Hierarchical Invariance for Robust and Interpretable Vision Tasks at Larger Scales
Shuren Qi
Yushu Zhang
Chao Wang
Zhihua Xia
Xiaochun Cao
Jian Weng
21
1
0
23 Feb 2024
Human Video Translation via Query Warping
Human Video Translation via Query Warping
Haiming Zhu
Yangyang Xu
Shengfeng He
DiffM
49
0
0
19 Feb 2024
ComFusion: Personalized Subject Generation in Multiple Specific Scenes
  From Single Image
ComFusion: Personalized Subject Generation in Multiple Specific Scenes From Single Image
Yan Hong
Jianfu Zhang
DiffM
30
3
0
19 Feb 2024
WildFake: A Large-scale Challenging Dataset for AI-Generated Images
  Detection
WildFake: A Large-scale Challenging Dataset for AI-Generated Images Detection
Yan Hong
Jianfu Zhang
74
9
0
19 Feb 2024
Visual Concept-driven Image Generation with Text-to-Image Diffusion Model
Visual Concept-driven Image Generation with Text-to-Image Diffusion Model
Tanzila Rahman
Shweta Mahajan
Hsin-Ying Lee
Jian Ren
Sergey Tulyakov
Leonid Sigal
88
4
0
18 Feb 2024
Make a Cheap Scaling: A Self-Cascade Diffusion Model for
  Higher-Resolution Adaptation
Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
Lanqing Guo
Yin-Yin He
Haoxin Chen
Menghan Xia
Xiaodong Cun
...
Yong Zhang
Xintao Wang
Qifeng Chen
Ying Shan
Bihan Wen
35
23
0
16 Feb 2024
GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object
  with Gaussian Splatting
GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting
Chen Yang
Sikuang Li
Jiemin Fang
Ruofan Liang
Lingxi Xie
Xiaopeng Zhang
Wei Shen
Qi Tian
3DGS
19
19
0
15 Feb 2024
Quantized Embedding Vectors for Controllable Diffusion Language Models
Quantized Embedding Vectors for Controllable Diffusion Language Models
Cheng Kang
Xinye Chen
Yong Hu
Daniel Novak
25
0
0
15 Feb 2024
Textual Localization: Decomposing Multi-concept Images for
  Subject-Driven Text-to-Image Generation
Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation
Junjie Shentu
Matthew Watson
Noura Al Moubayed
15
0
0
15 Feb 2024
Trustworthy SR: Resolving Ambiguity in Image Super-resolution via
  Diffusion Models and Human Feedback
Trustworthy SR: Resolving Ambiguity in Image Super-resolution via Diffusion Models and Human Feedback
Cansu Korkmaz
Ege Çirakman
A. Murat Tekalp
Zafer Do˘gan
21
0
0
12 Feb 2024
Diff-RNTraj: A Structure-aware Diffusion Model for Road
  Network-constrained Trajectory Generation
Diff-RNTraj: A Structure-aware Diffusion Model for Road Network-constrained Trajectory Generation
Tonglong Wei
Youfang Lin
S. Guo
Yan Lin
Yiheng Huang
Chenyang Xiang
Yuqing Bai
Menglu Ya
Huaiyu Wan
31
11
0
12 Feb 2024
Scalable Diffusion Models with State Space Backbone
Scalable Diffusion Models with State Space Backbone
Zhengcong Fei
Mingyuan Fan
Changqian Yu
Junshi Huang
62
34
0
08 Feb 2024
Towards Aligned Layout Generation via Diffusion Model with Aesthetic
  Constraints
Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints
Jian Chen
Ruiyi Zhang
Yufan Zhou
Rajiv Jain
Zhiqiang Xu
Ryan A. Rossi
Changyou Chen
DiffM
47
12
0
07 Feb 2024
InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with
  Semantic Graph Prior
InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior
Chenguo Lin
Yadong Mu
3DV
14
32
0
07 Feb 2024
DisDet: Exploring Detectability of Backdoor Attack on Diffusion Models
DisDet: Exploring Detectability of Backdoor Attack on Diffusion Models
Yang Sui
Huy Phan
Jinqi Xiao
Tian-Di Zhang
Zijie Tang
Cong Shi
Yan Wang
Yingying Chen
Bo Yuan
DiffM
AAML
22
12
0
05 Feb 2024
Separable Multi-Concept Erasure from Diffusion Models
Separable Multi-Concept Erasure from Diffusion Models
Mengnan Zhao
Lihe Zhang
Tianhang Zheng
Yuqiu Kong
Baocai Yin
50
9
0
03 Feb 2024
A Single Simple Patch is All You Need for AI-generated Image Detection
A Single Simple Patch is All You Need for AI-generated Image Detection
Jiaxuan Chen
Jieteng Yao
Li Niu
18
22
0
02 Feb 2024
Diffusion Facial Forgery Detection
Diffusion Facial Forgery Detection
Harry Cheng
Yangyang Guo
Tianyi Wang
L. Nie
Mohan S. Kankanhalli
61
16
0
29 Jan 2024
CCA: Collaborative Competitive Agents for Image Editing
CCA: Collaborative Competitive Agents for Image Editing
Tiankai Hang
Shuyang Gu
Dong Chen
Xin Geng
Baining Guo
33
5
0
23 Jan 2024
Detecting Multimedia Generated by Large AI Models: A Survey
Detecting Multimedia Generated by Large AI Models: A Survey
Li Lin
Neeraj Gupta
Yue Zhang
Hainan Ren
Chun-Hao Liu
Feng Ding
Xin Wang
Xin Li
Luisa Verdoliva
Shu Hu
86
57
0
22 Jan 2024
Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation
  with Deterministic Sampling Prior
Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling Prior
Zike Wu
Pan Zhou
Xuanyu Yi
Xiaoding Yuan
Hanwang Zhang
DiffM
31
36
0
17 Jan 2024
VideoCrafter2: Overcoming Data Limitations for High-Quality Video
  Diffusion Models
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Haoxin Chen
Yong Zhang
Xiaodong Cun
Menghan Xia
Xintao Wang
Chao-Liang Weng
Ying Shan
VGen
DiffM
120
275
0
17 Jan 2024
Revealing Vulnerabilities in Stable Diffusion via Targeted Attacks
Revealing Vulnerabilities in Stable Diffusion via Targeted Attacks
Chenyu Zhang
Lanjun Wang
Anan Liu
24
6
0
16 Jan 2024
Multi-scale 2D Temporal Map Diffusion Models for Natural Language Video
  Localization
Multi-scale 2D Temporal Map Diffusion Models for Natural Language Video Localization
Chongzhi Zhang
Mingyuan Zhang
Zhiyang Teng
Jiayi Li
Xizhou Zhu
Lewei Lu
Ziwei Liu
Aixin Sun
DiffM
VGen
18
0
0
16 Jan 2024
Improving Diffusion-Based Image Synthesis with Context Prediction
Improving Diffusion-Based Image Synthesis with Context Prediction
Ling Yang
Jingwei Liu
Shenda Hong
Zhilong Zhang
Zhilin Huang
Zheming Cai
Wentao Zhang
Bin Cui
DiffM
43
33
0
04 Jan 2024
HQ-VAE: Hierarchical Discrete Representation Learning with Variational
  Bayes
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes
Yuhta Takida
Yukara Ikemiya
Takashi Shibuya
Kazuki Shimada
Woosung Choi
...
Naoki Murata
Toshimitsu Uesaka
Kengo Uchida
Wei-Hsiang Liao
Yuki Mitsufuji
BDL
43
11
0
31 Dec 2023
FlashVideo: A Framework for Swift Inference in Text-to-Video Generation
FlashVideo: A Framework for Swift Inference in Text-to-Video Generation
Bin Lei
Le Chen
Caiwen Ding
VGen
20
1
0
30 Dec 2023
Classifier-free graph diffusion for molecular property targeting
Classifier-free graph diffusion for molecular property targeting
Matteo Ninniri
Marco Podda
Davide Bacciu
35
5
0
28 Dec 2023
Forgery-aware Adaptive Transformer for Generalizable Synthetic Image
  Detection
Forgery-aware Adaptive Transformer for Generalizable Synthetic Image Detection
Huan Liu
Zichang Tan
Chuangchuang Tan
Yunchao Wei
Yao-Min Zhao
Jingdong Wang
ViT
28
42
0
27 Dec 2023
Towards Flexible, Scalable, and Adaptive Multi-Modal Conditioned Face
  Synthesis
Towards Flexible, Scalable, and Adaptive Multi-Modal Conditioned Face Synthesis
Jingjing Ren
Cheng Xu
Haoyu Chen
Xinran Qin
Lei Zhu
CVBM
DiffM
26
4
0
26 Dec 2023
Previous
123456...101112
Next