ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.11487
  4. Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM
ArXiv (abs)PDFHTML

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 1,364 papers shown
Title
DualNeRF: Text-Driven 3D Scene Editing via Dual-Field Representation
DualNeRF: Text-Driven 3D Scene Editing via Dual-Field Representation
Yuxuan Xiong
Yue Shi
Yishun Dou
Bingbing Ni
DiffM
69
0
0
22 Feb 2025
Data Attribution for Text-to-Image Models by Unlearning Synthesized Images
Data Attribution for Text-to-Image Models by Unlearning Synthesized Images
Sheng-Yu Wang
Aaron Hertzmann
Alexei A. Efros
Jun-Yan Zhu
Richard Zhang
TDI
209
3
0
21 Feb 2025
Text-to-Image Rectified Flow as Plug-and-Play Priors
Text-to-Image Rectified Flow as Plug-and-Play Priors
Xiaofeng Yang
Cheng Chen
Xulei Yang
Fayao Liu
Guosheng Lin
DiffM
133
7
0
21 Feb 2025
CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Donghao Luo
Yujie Liang
Xu Peng
Xiaobin Hu
Boyuan Jiang
C. Xu
Taisong Jin
Chengjie Wang
Yanwei Fu
113
2
0
21 Feb 2025
FreqPrior: Improving Video Diffusion Models with Frequency Filtering Gaussian Noise
FreqPrior: Improving Video Diffusion Models with Frequency Filtering Gaussian Noise
Yunlong Yuan
Yuanfan Guo
Chunwei Wang
Wei Zhang
Hang Xu
L. Zhang
DiffMVGen
207
3
0
20 Feb 2025
Controllable Unlearning for Image-to-Image Generative Models via $\varepsilon$-Constrained Optimization
Controllable Unlearning for Image-to-Image Generative Models via ε\varepsilonε-Constrained Optimization
Xiaohua Feng
Chao-Jun Chen
Yuyuan Li
Lulu Zhang
Longfei Li
Jun Zhou
Xiaolin Zheng
MU
167
0
0
20 Feb 2025
SMITE: Segment Me In TimE
SMITE: Segment Me In TimE
Amirhossein Alimohammadi
Sauradip Nag
Saeid Asgari Taghanaki
Andrea Tagliasacchi
Ghassan Hamarneh
Ali Mahdavi-Amiri
VLMVOS
531
3
0
20 Feb 2025
Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration
Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration
Kang Liao
Zongsheng Yue
Zhouxia Wang
Chen Change Loy
194
4
0
20 Feb 2025
A Transfer Attack to Image Watermarks
A Transfer Attack to Image Watermarks
Yuepeng Hu
Zhengyuan Jiang
Moyang Guo
Neil Zhenqiang Gong
151
14
0
20 Feb 2025
Robust Optimization with Diffusion Models for Green Security
Robust Optimization with Diffusion Models for Green Security
Lingkai Kong
Haichuan Wang
Yuqi Pan
Cheol Woo Kim
Mingxiao Song
Alayna Nguyen
Tonghan Wang
Haifeng Xu
Milind Tambe
93
1
0
19 Feb 2025
Object-centric Binding in Contrastive Language-Image Pretraining
Object-centric Binding in Contrastive Language-Image Pretraining
Rim Assouel
Pietro Astolfi
Florian Bordes
M. Drozdzal
Adriana Romero Soriano
OCLVLMCoGe
161
3
0
19 Feb 2025
MotionMatcher: Motion Customization of Text-to-Video Diffusion Models via Motion Feature Matching
MotionMatcher: Motion Customization of Text-to-Video Diffusion Models via Motion Feature Matching
Yen-Siang Wu
Chi-Pin Huang
Fu-En Yang
Yu-Jie Wang
DiffMVGen
125
1
0
18 Feb 2025
Precise Parameter Localization for Textual Generation in Diffusion Models
Precise Parameter Localization for Textual Generation in Diffusion Models
Łukasz Staniszewski
Bartosz Cywiński
Franziska Boenisch
Kamil Deja
Adam Dziedzic
DiffM
471
1
0
17 Feb 2025
Diffusion Models without Classifier-free Guidance
Diffusion Models without Classifier-free Guidance
Zhicong Tang
Jianmin Bao
Dong Chen
Baining Guo
VLM
79
5
0
17 Feb 2025
Maximum Entropy Reinforcement Learning with Diffusion Policy
Maximum Entropy Reinforcement Learning with Diffusion Policy
Xiaoyi Dong
Jian Cheng
Xinsong Zhang
129
2
0
17 Feb 2025
Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation
Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation
Taeyoung Yun
Dinghuai Zhang
Jinkyoo Park
Ling Pan
DiffM
108
6
0
17 Feb 2025
GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text
GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text
Gyumin Shim
Sangmin Lee
Jaegul Choo
3DGS
106
0
0
17 Feb 2025
ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotations
ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotations
Bowen Jiang
Yuan Yuan
Xinyi Bai
Zhuoqun Hao
Alyson Yin
Yaojie Hu
Wenyu Liao
Lyle Ungar
Camillo J Taylor
DiffM
127
2
0
16 Feb 2025
Occlusion-aware Text-Image-Point Cloud Pretraining for Open-World 3D Object Recognition
Occlusion-aware Text-Image-Point Cloud Pretraining for Open-World 3D Object Recognition
Khanh Nguyen
Ghulam Mubashar Hassan
Ajmal Mian
3DPC
90
0
0
15 Feb 2025
MuDoC: An Interactive Multimodal Document-grounded Conversational AI System
MuDoC: An Interactive Multimodal Document-grounded Conversational AI System
Karan Taneja
Ashok K. Goel
173
2
0
14 Feb 2025
Diffusion Models Through a Global Lens: Are They Culturally Inclusive?
Diffusion Models Through a Global Lens: Are They Culturally Inclusive?
Zahra Bayramli
Ayhan Suleymanzade
Na Min An
Huzama Ahmad
Eunsu Kim
Junyeong Park
James Thorne
Alice Oh
137
4
0
13 Feb 2025
PoGDiff: Product-of-Gaussians Diffusion Models for Imbalanced Text-to-Image Generation
PoGDiff: Product-of-Gaussians Diffusion Models for Imbalanced Text-to-Image Generation
Ziyan Wang
Sizhe Wei
Xiaoming Huo
Hao Wang
DiffM
275
0
0
12 Feb 2025
I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models
I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models
Zhenxing Mi
Kuan-Chieh Wang
Guocheng Qian
Hanrong Ye
Runtao Liu
Sergey Tulyakov
Kfir Aberman
Dan Xu
LRM
97
2
0
12 Feb 2025
Guidance-base Diffusion Models for Improving Photoacoustic Image Quality
Tatsuhiro Eguchi
Shumpei Takezaki
Mihoko Shimano
Takayuki Yagi
Ryoma Bise
MedIm
90
0
0
10 Feb 2025
Unleashing the Potential of Pre-Trained Diffusion Models for Generalizable Person Re-Identification
Unleashing the Potential of Pre-Trained Diffusion Models for Generalizable Person Re-Identification
Jiachen Li
Xiaojin Gong
DiffM
206
1
0
10 Feb 2025
Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene
Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene
Tai-Yu Pan
Sooyoung Jeon
Mengdi Fan
Jinsu Yoo
Zhenyang Feng
Mark E. Campbell
Kilian Q. Weinberger
Bharath Hariharan
Wei-Lun Chao
254
0
0
10 Feb 2025
Solving Linear-Gaussian Bayesian Inverse Problems with Decoupled Diffusion Sequential Monte Carlo
Solving Linear-Gaussian Bayesian Inverse Problems with Decoupled Diffusion Sequential Monte Carlo
Filip Ekstrom Kelvinius
Zheng Zhao
Fredrik Lindsten
DiffM
94
3
0
10 Feb 2025
UniMoD: Efficient Unified Multimodal Transformers with Mixture-of-Depths
Weijia Mao
Zhiyong Yang
Mike Zheng Shou
MoE
194
1
0
10 Feb 2025
Decoder-Only LLMs are Better Controllers for Diffusion Models
Decoder-Only LLMs are Better Controllers for Diffusion Models
Ziyi Dong
Yao Xiao
Pengxu Wei
Liang Lin
DiffM
214
0
0
06 Feb 2025
FairT2I: Mitigating Social Bias in Text-to-Image Generation via Large Language Model-Assisted Detection and Attribute Rebalancing
FairT2I: Mitigating Social Bias in Text-to-Image Generation via Large Language Model-Assisted Detection and Attribute Rebalancing
Jinya Sakurai
Issei Sato
149
1
0
06 Feb 2025
TabPFN Unleashed: A Scalable and Effective Solution to Tabular Classification Problems
TabPFN Unleashed: A Scalable and Effective Solution to Tabular Classification Problems
Si-Yang Liu
Han-Jia Ye
179
12
0
04 Feb 2025
Open Materials Generation with Stochastic Interpolants
Open Materials Generation with Stochastic Interpolants
Philipp Hoellmer
Thomas Egg
Maya M. Martirossyan
Eric Fuemmeler
Amit Gupta
...
George Karypis
Mark K. Transtrum
Richard G. Hennig
E. Tadmor
Stefano Martiniani
AI4CE
155
2
0
04 Feb 2025
Information-Theoretic Proofs for Diffusion Sampling
Information-Theoretic Proofs for Diffusion Sampling
Galen Reeves
H. Pfister
DiffM
150
0
0
04 Feb 2025
Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
Hanyang Zhao
Haoxian Chen
Ji Zhang
D. Yao
Wenpin Tang
150
1
0
03 Feb 2025
HuViDPO:Enhancing Video Generation through Direct Preference Optimization for Human-Centric Alignment
HuViDPO:Enhancing Video Generation through Direct Preference Optimization for Human-Centric Alignment
Lifan Jiang
Boxi Wu
Jiahui Zhang
Xiaotong Guan
Shuang Chen
VGen
93
1
0
02 Feb 2025
Shape from Semantics: 3D Shape Generation from Multi-View Semantics
Shape from Semantics: 3D Shape Generation from Multi-View Semantics
Liangchen Li
Caoliwen Wang
Yuqi Zhou
Bailin Deng
Juyong Zhang
3DV
129
0
0
01 Feb 2025
Video Latent Flow Matching: Optimal Polynomial Projections for Video Interpolation and Extrapolation
Video Latent Flow Matching: Optimal Polynomial Projections for Video Interpolation and Extrapolation
Yang Cao
Zhao Song
Chiwun Yang
VGen
144
3
0
01 Feb 2025
Data-Free Model-Related Attacks: Unleashing the Potential of Generative AI
Data-Free Model-Related Attacks: Unleashing the Potential of Generative AI
Dayong Ye
Tianqing Zhu
Shang Wang
B. Liu
Lefei Zhang
Wanlei Zhou
Yanmei Zhang
AAMLSILM
110
0
0
28 Jan 2025
MM-NeRF: Multimodal-Guided 3D Multi-Style Transfer of Neural Radiance Field
MM-NeRF: Multimodal-Guided 3D Multi-Style Transfer of Neural Radiance Field
Zijian Győző Yang
Zhongwei Qiu
Chang Xu
Dongmei Fu
167
2
0
28 Jan 2025
CE-SDWV: Effective and Efficient Concept Erasure for Text-to-Image Diffusion Models via a Semantic-Driven Word Vocabulary
Jiahang Tu
Qian Feng
Chufan Chen
Jiahua Dong
Hanbin Zhao
Chao Zhang
Hui Qian
114
4
0
28 Jan 2025
Turn That Frown Upside Down: FaceID Customization via Cross-Training Data
Shuhe Wang
Xiaoya Li
Xiaofei Sun
G. Wang
Tianwei Zhang
Jiwei Li
Eduard H. Hovy
119
1
0
28 Jan 2025
Do Existing Testing Tools Really Uncover Gender Bias in Text-to-Image Models?
Yunbo Lyu
Zhou Yang
Yuqing Niu
Jing Jiang
David Lo
134
1
0
28 Jan 2025
Visual Generation Without Guidance
Huayu Chen
Kai Jiang
Kaiwen Zheng
Jianfei Chen
Hang Su
Jun Zhu
166
2
0
28 Jan 2025
Slot-Guided Adaptation of Pre-trained Diffusion Models for Object-Centric Learning and Compositional Generation
Slot-Guided Adaptation of Pre-trained Diffusion Models for Object-Centric Learning and Compositional Generation
Adil Kaan Akan
Yucel Yemez
DiffMOCL
84
0
0
27 Jan 2025
VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking
VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking
Runyi Hu
Jing Zhang
You Li
Jiwei Li
Qing Guo
Han Qiu
Tianwei Zhang
WIGMVGen
186
8
0
24 Jan 2025
LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps
LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps
Andrey Palaev
Adil Mehmood Khan
S. M. Ahsan Kazmi
DiffM
120
0
0
23 Jan 2025
MultiDreamer3D: Multi-concept 3D Customization with Concept-Aware Diffusion Guidance
MultiDreamer3D: Multi-concept 3D Customization with Concept-Aware Diffusion Guidance
Wooseok Song
Seunggyu Chang
Jaejun Yoo
DiffM
115
0
0
23 Jan 2025
Neural Radiance Fields for the Real World: A Survey
Neural Radiance Fields for the Real World: A Survey
Wenhui Xiao
Remi Chierchia
Rodrigo Santa Cruz
Xuesong Li
David Ahmedt-Aristizabal
Olivier Salvado
Clinton Fookes
Léo Lebrat
AI4CE
177
0
0
22 Jan 2025
PreciseCam: Precise Camera Control for Text-to-Image Generation
PreciseCam: Precise Camera Control for Text-to-Image Generation
Edurne Bernal-Berdun
Ana Serrano
B. Masiá
Matheus Gadelha
Yannick Hold-Geoffroy
Xin Sun
Diego F. F. Gutierrez
DiffMVGen
102
1
0
22 Jan 2025
Accelerate High-Quality Diffusion Models with Inner Loop Feedback
Accelerate High-Quality Diffusion Models with Inner Loop Feedback
M. Gwilliam
Han Cai
Di Wu
Abhinav Shrivastava
Zhiyu Cheng
224
1
0
22 Jan 2025
Previous
123...567...262728
Next