ResearchTrend.AI

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 1,371 papers shown
Title
Cross Initialization for Personalized Text-to-Image Generation
Lianyu Pang
Jian Yin
Haoran Xie
Qiping Wang
Qing Li
Xudong Mao
DiffM
100
7
0
26 Dec 2023
Prompt-Propose-Verify: A Reliable Hand-Object-Interaction Data Generation Framework using Foundational Models
Gurusha Juneja
Sukrit Kumar
DiffM
33
0
0
23 Dec 2023
Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training
Xinyan Chen
Jiaxin Ge
Tianjun Zhang
Jiaming Liu
Shanghang Zhang
VLM EGVM
190
0
0
23 Dec 2023
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models
Huan Ling
Seung Wook Kim
Antonio Torralba
Sanja Fidler
Karsten Kreis
DiffM 3DGS
89
123
0
21 Dec 2023
RealCraft: Attention Control as A Tool for Zero-Shot Consistent Video Editing
Shutong Jin
Ruiyu Wang
Florian T. Pokorny
DiffM VGen
212
1
0
19 Dec 2023
Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models
Angela Castillo
Jonas Kohler
Juan C. Pérez
Juan Pablo Pérez
Albert Pumarola
Guohao Li
Pablo Arbelaez
Ali K. Thabet
111
13
0
19 Dec 2023
MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers
Haoyu Ma
Shahin Mahdizadehaghdam
Bichen Wu
Zhipeng Fan
Yuchao Gu
Wenliang Zhao
Lior Shapira
Xiaohui Xie
DiffM VGen
66
4
0
19 Dec 2023
Scene-Conditional 3D Object Stylization and Composition
Jinghao Zhou
Tomas Jakab
Philip Torr
Christian Rupprecht
DiffM
139
3
0
19 Dec 2023
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Yu Zhou
DiffM
183
1
0
19 Dec 2023
HAAR: Text-Conditioned Generative Model of 3D Strand-based Human Hairstyles
V. Sklyarova
Egor Zakharov
Otmar Hilliges
Michael J. Black
Justus Thies
3DH
64
14
0
18 Dec 2023
GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning
Ye Yuan
Xueting Li
Yangyi Huang
Shalini De Mello
Koki Nagano
Jan Kautz
Umar Iqbal
3DGS
93
47
0
18 Dec 2023
Diffusion-Based Particle-DETR for BEV Perception
Asen Nachkov
Martin Danelljan
D. Paudel
Luc Van Gool
DiffM
122
3
0
18 Dec 2023
Towards Detailed Text-to-Motion Synthesis via Basic-to-Advanced Hierarchical Diffusion Model
Zhenyu Xie
Yang Wu
Xuehao Gao
Zhongqian Sun
Wei Yang
Xiaodan Liang
DiffM
81
11
0
18 Dec 2023
Lecture Notes in Probabilistic Diffusion Models
Inga Strümke
H. Langseth
DiffM
31
0
0
16 Dec 2023
Tell Me What You See: Text-Guided Real-World Image Denoising
E. Yosef
Raja Giryes
DiffM
151
2
0
15 Dec 2023
SEEAvatar: Photorealistic Text-to-3D Avatar Generation with Constrained Geometry and Appearance
Yuanyou Xu
Zongxin Yang
Yi Yang
75
27
0
13 Dec 2023
Diffusion-based Blind Text Image Super-Resolution
Yuzhe Zhang
Jiawei Zhang
Hao Li
Zhouxia Wang
Luwei Hou
Dongqing Zou
Liheng Bian
89
14
0
13 Dec 2023
Denoising diffusion-based synthetic generation of three-dimensional (3D) anisotropic microstructures from two-dimensional (2D) micrographs
Kang-Hyun Lee
G. Yun
DiffM
48
3
0
13 Dec 2023
Individualized Deepfake Detection Exploiting Traces Due to Double Neural-Network Operations
Mushfiqur Rahman
Runze Liu
Chau-Wai Wong
Huaiyu Dai
118
0
0
13 Dec 2023
Fast Sampling Through The Reuse Of Attention Maps In Diffusion Models
Rosco Hunter
Łukasz Dudziak
Mohamed S. Abdelfattah
Abhinav Mehrotra
Sourav Bhattacharya
Hongkai Wen
111
1
0
13 Dec 2023
Stellar: Systematic Evaluation of Human-Centric Personalized Text-to-Image Methods
Panos Achlioptas
Alexandros Benetatos
Iordanis Fostiropoulos
Dimitris Skourtis
119
9
0
11 Dec 2023
HOI-Diff: Text-Driven Synthesis of 3D Human-Object Interactions using Diffusion Models
Xiaogang Peng
Yiming Xie
Zizhao Wu
Varun Jampani
Deqing Sun
Huaizu Jiang
DiffM
115
52
0
11 Dec 2023
Optimized View and Geometry Distillation from Multi-view Diffuser
Youjia Zhang
Zikai Song
Junqing Yu
Yawei Luo
Wei Yang
169
0
0
11 Dec 2023
Generative Network Layer for Communication Systems with Artificial Intelligence
Mathias D. Thorsager
Israel Leyva Mayorga
B. Soret
P. Popovski
GAN GNN
23
3
0
08 Dec 2023
PrimDiffusion: Volumetric Primitives Diffusion for 3D Human Generation
Zhaoxi Chen
Fangzhou Hong
Haiyi Mei
Guangcong Wang
Lei Yang
Ziwei Liu
86
26
0
07 Dec 2023
Free3D: Consistent Novel View Synthesis without 3D Representation
Chuanxia Zheng
Andrea Vedaldi
3DV
136
50
0
07 Dec 2023
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
Zhen Li
Mingdeng Cao
Xintao Wang
Zhongang Qi
Ming-Ming Cheng
Ying Shan
DiffM
138
201
0
07 Dec 2023
DemoCaricature: Democratising Caricature Generation with a Rough Sketch
Dar-Yen Chen
A. Bhunia
Subhadeep Koley
Aneeshan Sain
Pinaki Nath Chowdhury
Yi-Zhe Song
94
8
0
07 Dec 2023
MEVG: Multi-event Video Generation with Text-to-Video Models
Gyeongrok Oh
Jaehwan Jeong
Sieun Kim
Wonmin Byeon
Jinkyu Kim
Sungwoong Kim
Sangpil Kim
VGen DiffM
112
23
0
07 Dec 2023
MMM: Generative Masked Motion Model
Ekkasit Pinyoanuntapong
Pu Wang
Minwoo Lee
Chong Chen
DiffM VGen
107
53
0
06 Dec 2023
Make-A-Storyboard: A General Framework for Storyboard with Disentangled and Merged Control
Sitong Su
Litao Guo
Lianli Gao
Hengtao Shen
Jingkuan Song
DiffM
68
3
0
06 Dec 2023
WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation
Jiachen Lu
Ze Huang
Zeyu Yang
Jiahui Zhang
Li Zhang
VGen
94
46
0
05 Dec 2023
Diversified in-domain synthesis with efficient fine-tuning for few-shot classification
Victor G. Turrisi da Costa
Nicola Dall’Asen
Yiming Wang
N. Sebe
Elisa Ricci
89
5
0
05 Dec 2023
Navigating the Synthetic Realm: Harnessing Diffusion-based Models for Laparoscopic Text-to-Image Generation
Simeon Allmendinger
Patrick Hemmer
Moritz Queisner
Igor Sauer
Leopold Muller
Johannes Jakubik
Michael Vossing
Niklas Kühl
MedIm
77
6
0
05 Dec 2023
Training on Synthetic Data Beats Real Data in Multimodal Relation Extraction
Zilin Du
Haoxin Li
Xu Guo
Boyang Li
91
1
0
05 Dec 2023
DreamVideo: High-Fidelity Image-to-Video Generation with Image Retention and Text Guidance
Cong Wang
Jiaxi Gu
Panwen Hu
Songcen Xu
Hang Xu
Xiaodan Liang
VGen
109
16
0
05 Dec 2023
PartSLIP++: Enhancing Low-Shot 3D Part Segmentation via Multi-View Instance Segmentation and Maximum Likelihood Estimation
Yuchen Zhou
Jiayuan Gu
Xuanlin Li
Minghua Liu
Yunhao Fang
Hao Su
VLM
95
17
0
05 Dec 2023
Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images
Zhuoran Yu
Chenchen Zhu
Sean Culatana
Raghuraman Krishnamoorthi
Fanyi Xiao
Yong Jae Lee
177
15
0
04 Dec 2023
Latent Feature-Guided Diffusion Models for Shadow Removal
Kangfu Mei
Luis Figueroa
Zhe Lin
Zhihong Ding
Scott D. Cohen
Vishal M. Patel
DiffM
133
19
0
04 Dec 2023
StoryGPT-V: Large Language Models as Consistent Story Visualizers
Xiaoqian Shen
Mohamed Elhoseiny
VLM
202
11
0
04 Dec 2023
DiverseDream: Diverse Text-to-3D Synthesis with Augmented Text Embedding
Uy Dieu Tran
Minh Luu
P. Nguyen
K. Nguyen
Binh-Son Hua
98
1
0
02 Dec 2023
Text-Guided 3D Face Synthesis -- From Generation to Editing
Yunjie Wu
Yapeng Meng
Zhipeng Hu
Lincheng Li
Haoqian Wu
Kun Zhou
Weiwei Xu
Xin Yu
DiffM
130
10
0
01 Dec 2023
LucidDreaming: Controllable Object-Centric 3D Generation
Zhaoning Wang
Ming Li
Chong Chen
132
10
0
30 Nov 2023
SMaRt: Improving GANs with Score Matching Regularity
Mengfei Xia
Yujun Shen
Ceyuan Yang
Ran Yi
Wenping Wang
Yong Liu
100
5
0
30 Nov 2023
ElasticDiffusion: Training-free Arbitrary Size Image Generation through Global-Local Content Separation
Moayed Haji-Ali
Guha Balakrishnan
Vicente Ordonez
177
27
0
30 Nov 2023
4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling
Sherwin Bahmani
Ivan Skorokhodov
Victor Rong
Gordon Wetzstein
Leonidas Guibas
Peter Wonka
Sergey Tulyakov
Jeong Joon Park
Andrea Tagliasacchi
David B. Lindell
DiffM
143
112
0
29 Nov 2023
Rethinking Image Editing Detection in the Era of Generative AI Revolution
Zhihao Sun
Haipeng Fang
Xinying Zhao
Danding Wang
Juan Cao
91
10
0
29 Nov 2023
HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting
Xian Liu
Xiaohang Zhan
Jiaxiang Tang
Ying Shan
Gang Zeng
Dahua Lin
Xihui Liu
Ziwei Liu
3DGS
125
77
0
28 Nov 2023
A Unified Approach for Text- and Image-guided 4D Scene Generation
Yufeng Zheng
Xueting Li
Koki Nagano
Sifei Liu
Karsten Kreis
Otmar Hilliges
Shalini De Mello
108
49
0
28 Nov 2023
TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering
Jingye Chen
Yupan Huang
Tengchao Lv
Lei Cui
Qifeng Chen
Furu Wei
DiffM
128
70
0
28 Nov 2023