ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.14822
  4. Cited By
Vector Quantized Diffusion Model for Text-to-Image Synthesis

Vector Quantized Diffusion Model for Text-to-Image Synthesis

29 November 2021
Shuyang Gu
Dong Chen
Jianmin Bao
Fang Wen
Bo Zhang
Dongdong Chen
Lu Yuan
B. Guo
    DiffM
ArXivPDFHTML

Papers citing "Vector Quantized Diffusion Model for Text-to-Image Synthesis"

50 / 566 papers shown
Title
Text-to-image Diffusion Models in Generative AI: A Survey
Text-to-image Diffusion Models in Generative AI: A Survey
Chenshuang Zhang
Chaoning Zhang
Mengchun Zhang
In So Kweon
VLM
51
265
0
14 Mar 2023
Diffusion Models in NLP: A Survey
Diffusion Models in NLP: A Survey
Yuansong Zhu
Yu Zhao
DiffM
VLM
MedIm
29
23
0
14 Mar 2023
DDS2M: Self-Supervised Denoising Diffusion Spatio-Spectral Model for
  Hyperspectral Image Restoration
DDS2M: Self-Supervised Denoising Diffusion Spatio-Spectral Model for Hyperspectral Image Restoration
Yuchun Miao
Lefei Zhang
L. Zhang
Dacheng Tao
DiffM
8
38
0
12 Mar 2023
One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale
One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale
Fan Bao
Shen Nie
Kaiwen Xue
Chongxuan Li
Shiliang Pu
Yaole Wang
Gang Yue
Yue Cao
Hang Su
Jun Zhu
DiffM
207
149
0
12 Mar 2023
PARASOL: Parametric Style Control for Diffusion Image Synthesis
PARASOL: Parametric Style Control for Diffusion Image Synthesis
Gemma Canet Tarrés
Dan Ruta
Tu Bui
John Collomosse
DiffM
34
6
0
11 Mar 2023
Regularized Vector Quantization for Tokenized Image Synthesis
Regularized Vector Quantization for Tokenized Image Synthesis
Jiahui Zhang
Fangneng Zhan
Christian Theobalt
Shijian Lu
DiffM
MQ
41
30
0
11 Mar 2023
MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model
  for Few-Shot Instance Segmentation
MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model for Few-Shot Instance Segmentation
Minh-Quan Le
Tam V. Nguyen
Trung-Nghia Le
Thanh-Toan Do
Minh N. Do
M. Tran
DiffM
45
13
0
09 Mar 2023
Unifying Layout Generation with a Decoupled Diffusion Model
Unifying Layout Generation with a Decoupled Diffusion Model
Mude Hui
Zhizheng Zhang
Xiaoyi Zhang
Wenxuan Xie
Yuwang Wang
Yan Lu
DiffM
15
39
0
09 Mar 2023
Neural Vector Fields: Implicit Representation by Explicit Learning
Neural Vector Fields: Implicit Representation by Explicit Learning
Xianghui Yang
Guosheng Lin
Zhenghao Chen
Luping Zhou
AI4CE
49
17
0
08 Mar 2023
Lformer: Text-to-Image Generation with L-shape Block Parallel Decoding
Lformer: Text-to-Image Generation with L-shape Block Parallel Decoding
Jiacheng Li
Longhui Wei
Zongyuan Zhan
Xinfu He
Siliang Tang
Qi Tian
Yueting Zhuang
24
4
0
07 Mar 2023
An investigation into the adaptability of a diffusion-based TTS model
An investigation into the adaptability of a diffusion-based TTS model
Haolin Chen
Philip N. Garner
DiffM
31
1
0
03 Mar 2023
StraIT: Non-autoregressive Generation with Stratified Image Transformer
StraIT: Non-autoregressive Generation with Stratified Image Transformer
Shengju Qian
Huiwen Chang
Yuanzhen Li
Zizhao Zhang
Jiaya Jia
Han Zhang
39
10
0
01 Mar 2023
Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few
  Labels
Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels
Zebin You
Yong Zhong
Fan Bao
Jiacheng Sun
Chongxuan Li
Jun Zhu
DiffM
VLM
206
36
0
21 Feb 2023
Boundary Guided Learning-Free Semantic Control with Diffusion Models
Boundary Guided Learning-Free Semantic Control with Diffusion Models
Ye Zhu
Yuehua Wu
Zhiwei Deng
Olga Russakovsky
Yan Yan
DiffM
17
23
0
16 Feb 2023
Speech Enhancement with Multi-granularity Vector Quantization
Speech Enhancement with Multi-granularity Vector Quantization
Xiaokang Zhao
Qiu-shi Zhu
Jie Zhang
23
0
0
16 Feb 2023
DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization
DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization
Zhiqing Sun
Yiming Yang
DiffM
33
118
0
16 Feb 2023
Video Probabilistic Diffusion Models in Projected Latent Space
Video Probabilistic Diffusion Models in Projected Latent Space
Sihyun Yu
Kihyuk Sohn
Subin Kim
Jinwoo Shin
VGen
DiffM
37
160
0
15 Feb 2023
A Reparameterized Discrete Diffusion Model for Text Generation
A Reparameterized Discrete Diffusion Model for Text Generation
Lin Zheng
Jianbo Yuan
Lei Yu
Lingpeng Kong
DiffM
38
57
0
11 Feb 2023
3D Colored Shape Reconstruction from a Single RGB Image through
  Diffusion
3D Colored Shape Reconstruction from a Single RGB Image through Diffusion
Bo Li
Xiaolin K. Wei
F. Chen
Bin Liu
DiffM
20
1
0
11 Feb 2023
MaskSketch: Unpaired Structure-guided Masked Image Generation
MaskSketch: Unpaired Structure-guided Masked Image Generation
D. Bashkirova
José Lezama
Kihyuk Sohn
Kate Saenko
Irfan Essa
DiffM
30
25
0
10 Feb 2023
UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of
  Diffusion Models
UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models
Wenliang Zhao
Lujia Bai
Yongming Rao
Jie Zhou
Jiwen Lu
DiffM
27
198
0
09 Feb 2023
Information-Theoretic Diffusion
Information-Theoretic Diffusion
Xianghao Kong
Rob Brekelmans
Greg Ver Steeg
DiffM
14
14
0
07 Feb 2023
InstructTTS: Modelling Expressive TTS in Discrete Latent Space with
  Natural Language Style Prompt
InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style Prompt
Dongchao Yang
Songxiang Liu
Rongjie Huang
Chao Weng
Helen Meng
DiffM
VLM
31
85
0
31 Jan 2023
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis
Ming Tao
Bingkun Bao
Hao Tang
Changsheng Xu
DiffM
VLM
65
101
0
30 Jan 2023
ERA-Solver: Error-Robust Adams Solver for Fast Sampling of Diffusion
  Probabilistic Models
ERA-Solver: Error-Robust Adams Solver for Fast Sampling of Diffusion Probabilistic Models
Shengmeng Li
Luping Liu
Zenghao Chai
Runnan Li
Xuejiao Tan
DiffM
32
11
0
30 Jan 2023
Input Perturbation Reduces Exposure Bias in Diffusion Models
Input Perturbation Reduces Exposure Bias in Diffusion Models
Mang Ning
E. Sangineto
Angelo Porrello
Simone Calderara
Rita Cucchiara
DiffM
21
64
0
27 Jan 2023
A Denoising Diffusion Model for Fluid Field Prediction
A Denoising Diffusion Model for Fluid Field Prediction
Gefan Yang
Stefan Sommer
DiffM
AI4CE
28
27
0
27 Jan 2023
Dif-Fusion: Towards High Color Fidelity in Infrared and Visible Image
  Fusion with Diffusion Models
Dif-Fusion: Towards High Color Fidelity in Infrared and Visible Image Fusion with Diffusion Models
Jun Yue
Leyuan Fang
Shaobo Xia
Yue Deng
Jiayi Ma
DiffM
21
92
0
19 Jan 2023
Speech Driven Video Editing via an Audio-Conditioned Diffusion Model
Speech Driven Video Editing via an Audio-Conditioned Diffusion Model
Dan Bigioi
Shubhajit Basak
Michał Stypułkowski
Maciej Ziȩba
H. Jordan
R. Mcdonnell
Peter Corcoran
DiffM
VGen
24
34
0
10 Jan 2023
CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior
CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior
Jinbo Xing
Menghan Xia
Yuechen Zhang
Xiaodong Cun
Jue Wang
T. Wong
24
141
0
06 Jan 2023
All in Tokens: Unifying Output Space of Visual Tasks via Soft Token
All in Tokens: Unifying Output Space of Visual Tasks via Soft Token
Jia Ning
Chen Li
Zheng-Wei Zhang
Zigang Geng
Qi Dai
Kun He
Han Hu
33
44
0
05 Jan 2023
Attribute-Centric Compositional Text-to-Image Generation
Attribute-Centric Compositional Text-to-Image Generation
Yuren Cong
Martin Renqiang Min
Erran L. Li
Bodo Rosenhahn
M. Yang
68
11
0
04 Jan 2023
Muse: Text-To-Image Generation via Masked Generative Transformers
Muse: Text-To-Image Generation via Masked Generative Transformers
Huiwen Chang
Han Zhang
Jarred Barber
AJ Maschinot
José Lezama
...
Kevin Patrick Murphy
William T. Freeman
Michael Rubinstein
Yuanzhen Li
Dilip Krishnan
DiffM
197
519
0
02 Jan 2023
Diffusion Probabilistic Models for Scene-Scale 3D Categorical Data
Diffusion Probabilistic Models for Scene-Scale 3D Categorical Data
Jumin Lee
Woobin Im
Sebin Lee
Sung-eui Yoon
DiffM
27
15
0
02 Jan 2023
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for
  Text-to-Video Generation
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Jay Zhangjie Wu
Yixiao Ge
Xintao Wang
Weixian Lei
Yuchao Gu
Yufei Shi
W. Hsu
Ying Shan
Xiaohu Qie
Mike Zheng Shou
VGen
29
691
0
22 Dec 2022
Scalable Diffusion Models with Transformers
Scalable Diffusion Models with Transformers
William S. Peebles
Saining Xie
GNN
40
2,008
0
19 Dec 2022
Optimizing Prompts for Text-to-Image Generation
Optimizing Prompts for Text-to-Image Generation
Y. Hao
Zewen Chi
Li Dong
Furu Wei
27
140
0
19 Dec 2022
DAG: Depth-Aware Guidance with Denoising Diffusion Probabilistic Models
DAG: Depth-Aware Guidance with Denoising Diffusion Probabilistic Models
Gyeongnyeon Kim
Wooseok Jang
Gyuseong Lee
Susung Hong
Junyoung Seo
Seung Wook Kim
VLM
DiffM
37
11
0
17 Dec 2022
Rodin: A Generative Model for Sculpting 3D Digital Avatars Using
  Diffusion
Rodin: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion
Tengfei Wang
Bo Zhang
Ting Zhang
Shuyang Gu
Jianmin Bao
...
Jingjing Shen
Dong Chen
Fang Wen
Qifeng Chen
B. Guo
35
279
0
12 Dec 2022
Towards Practical Plug-and-Play Diffusion Models
Towards Practical Plug-and-Play Diffusion Models
Hyojun Go
Yunsung Lee
Jin-Young Kim
Seunghyun Lee
Myeongho Jeong
Hyun Seung Lee
Seungtaek Choi
DiffM
32
16
0
12 Dec 2022
MAGVIT: Masked Generative Video Transformer
MAGVIT: Masked Generative Video Transformer
Lijun Yu
Yong Cheng
Kihyuk Sohn
José Lezama
Han Zhang
...
Alexander G. Hauptmann
Ming-Hsuan Yang
Yuan Hao
Irfan Essa
Lu Jiang
DiffM
VGen
32
224
0
10 Dec 2022
Training-Free Structured Diffusion Guidance for Compositional
  Text-to-Image Synthesis
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
Weixi Feng
Xuehai He
Tsu-jui Fu
Varun Jampani
Arjun Reddy Akula
P. Narayana
Sugato Basu
Qing Guo
William Yang Wang
CoGe
33
299
0
09 Dec 2022
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using
  CLIP and StableDiffusion
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion
Hanqing Zhao
Dianmo Sheng
Jianmin Bao
Dongdong Chen
Dong Chen
...
Ce Liu
Wenbo Zhou
Qi Chu
Weiming Zhang
Neng H. Yu
VLM
DiffM
38
39
0
07 Dec 2022
Rethinking the Objectives of Vector-Quantized Tokenizers for Image
  Synthesis
Rethinking the Objectives of Vector-Quantized Tokenizers for Image Synthesis
Yuchao Gu
Xintao Wang
Yixiao Ge
Ying Shan
Xiaohu Qie
Mike Zheng Shou
DiffM
32
20
0
06 Dec 2022
Towards Cross Domain Generalization of Hamiltonian Representation via
  Meta Learning
Towards Cross Domain Generalization of Hamiltonian Representation via Meta Learning
Yeongwoo Song
Hawoong Jeong
OOD
AI4CE
24
1
0
02 Dec 2022
Score-based Continuous-time Discrete Diffusion Models
Score-based Continuous-time Discrete Diffusion Models
Haoran Sun
Lijun Yu
Bo Dai
Dale Schuurmans
H. Dai
DiffM
18
69
0
30 Nov 2022
Compressing Volumetric Radiance Fields to 1 MB
Compressing Volumetric Radiance Fields to 1 MB
Lingzhi Li
Zhen Shen
Zhongshu Wang
Li Shen
Liefeng Bo
25
65
0
29 Nov 2022
Diffusion Probabilistic Model Made Slim
Diffusion Probabilistic Model Made Slim
Xingyi Yang
Daquan Zhou
Jiashi Feng
Xinchao Wang
DiffM
27
102
0
27 Nov 2022
Unified Discrete Diffusion for Simultaneous Vision-Language Generation
Unified Discrete Diffusion for Simultaneous Vision-Language Generation
Minghui Hu
Chuanxia Zheng
Heliang Zheng
Tat-Jen Cham
Chaoyue Wang
Zuopeng Yang
Dacheng Tao
Ponnuthurai Nagaratnam Suganthan
DiffM
20
23
0
27 Nov 2022
Shifted Diffusion for Text-to-image Generation
Shifted Diffusion for Text-to-image Generation
Yufan Zhou
Bingchen Liu
Yizhe Zhu
Xiao Yang
Changyou Chen
Jinhui Xu
DiffM
24
40
0
24 Nov 2022
Previous
123...1011129
Next