2205.11487
Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
Papers citing
"Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"
50 / 4,338 papers shown
Title
Dynamic Concepts Personalization from Single Videos
Rameen Abdal
Or Patashnik
Ivan Skorokhodov
Willi Menapace
Aliaksandr Siarohin
Sergey Tulyakov
Daniel Cohen-Or
Kfir Aberman
DiffM
VGen
57
0
0
21 Feb 2025
T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation
Lijun Li
Zhelun Shi
Xuhao Hu
Bowen Dong
Yiran Qin
Xihui Liu
Lu Sheng
Jing Shao
114
1
0
21 Feb 2025
FreqPrior: Improving Video Diffusion Models with Frequency Filtering Gaussian Noise
Yunlong Yuan
Yuanfan Guo
Chunwei Wang
Wei Zhang
Hang Xu
L. Zhang
DiffM
VGen
119
2
0
20 Feb 2025
Controllable Unlearning for Image-to-Image Generative Models via ε-Constrained Optimization
Xiaohua Feng
Chao-Jun Chen
Yuyuan Li
L. Zhang
Longfei Li
Jun Zhou
Xiaolin Zheng
MU
73
0
0
20 Feb 2025
PoGDiff: Product-of-Gaussians Diffusion Models for Imbalanced Text-to-Image Generation
Ziyan Wang
Sizhe Wei
Xiaoming Huo
Hao Wang
DiffM
102
0
0
20 Feb 2025
SMITE: Segment Me In TimE
Amirhossein Alimohammadi
Sauradip Nag
Saeid Asgari Taghanaki
Andrea Tagliasacchi
Ghassan Hamarneh
Ali Mahdavi-Amiri
VLM
VOS
191
2
0
20 Feb 2025
Generative Video Semantic Communication via Multimodal Semantic Fusion with Large Model
Hang Yin
Li Qiao
Yu Ma
Shuo Sun
Kan Li
Zhen Gao
Dusit Niyato
DiffM
VGen
237
0
0
20 Feb 2025
Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration
Kang Liao
Zongsheng Yue
Zhouxia Wang
Chen Change Loy
95
3
0
20 Feb 2025
A Transfer Attack to Image Watermarks
Yuepeng Hu
Zhengyuan Jiang
Moyang Guo
Neil Zhenqiang Gong
77
10
0
20 Feb 2025
Object-centric Binding in Contrastive Language-Image Pretraining
Rim Assouel
Pietro Astolfi
Florian Bordes
M. Drozdzal
Adriana Romero Soriano
OCL
VLM
CoGe
103
0
0
19 Feb 2025
Robust Optimization with Diffusion Models for Green Security
Lingkai Kong
Haichuan Wang
Yuqi Pan
Cheol Woo Kim
Mingxiao Song
Alayna Nguyen
Tonghan Wang
Haifeng Xu
Milind Tambe
42
0
0
19 Feb 2025
MotionMatcher: Motion Customization of Text-to-Video Diffusion Models via Motion Feature Matching
Yen-Siang Wu
Chi-Pin Huang
Fu-En Yang
Yu-Jie Wang
DiffM
VGen
56
1
0
18 Feb 2025
Diffusion Models without Classifier-free Guidance
Zhicong Tang
Jianmin Bao
Dong Chen
Baining Guo
VLM
60
2
0
17 Feb 2025
Maximum Entropy Reinforcement Learning with Diffusion Policy
Xiaoyi Dong
Jian Cheng
Xinsong Zhang
46
0
0
17 Feb 2025
Precise Parameter Localization for Textual Generation in Diffusion Models
Łukasz Staniszewski
Bartosz Cywiński
Franziska Boenisch
Kamil Deja
Adam Dziedzic
DiffM
229
0
0
17 Feb 2025
Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening
Ye Tian
L. Yang
Xinchen Zhang
Yunhai Tong
Mengdi Wang
Bin Cui
67
2
0
17 Feb 2025
GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text
Gyumin Shim
Sangmin Lee
Jaegul Choo
3DGS
66
0
0
17 Feb 2025
Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation
Taeyoung Yun
Dinghuai Zhang
Jinkyoo Park
Ling Pan
DiffM
84
2
0
17 Feb 2025
ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotations
Bowen Jiang
Yuan Yuan
Xinyi Bai
Zhuoqun Hao
Alyson Yin
Yaojie Hu
Wenyu Liao
Lyle Ungar
Camillo J Taylor
DiffM
55
1
0
16 Feb 2025
Occlusion-aware Text-Image-Point Cloud Pretraining for Open-World 3D Object Recognition
Khanh Nguyen
Ghulam Mubashar Hassan
Ajmal Mian
3DPC
54
0
0
15 Feb 2025
MuDoC: An Interactive Multimodal Document-grounded Conversational AI System
Karan Taneja
Ashok K. Goel
58
2
0
14 Feb 2025
Diffusion Models Through a Global Lens: Are They Culturally Inclusive?
Zahra Bayramli
Ayhan Suleymanzade
Na Min An
Huzama Ahmad
Eunsu Kim
Junyeong Park
James Thorne
Alice Oh
91
0
0
13 Feb 2025
I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models
Zhenxing Mi
Kuan-Chieh Jackson Wang
Guocheng Qian
Hanrong Ye
Runtao Liu
Sergey Tulyakov
Kfir Aberman
Dan Xu
LRM
47
0
0
12 Feb 2025
Understanding Classifier-Free Guidance: High-Dimensional Theory and Non-Linear Generalizations
Krunoslav Lehman Pavasovic
Jakob Verbeek
Giulio Biroli
Marc Mézard
64
0
0
11 Feb 2025
SketchFlex: Facilitating Spatial-Semantic Coherence in Text-to-Image Generation with Region-Based Sketches
Haichuan Lin
Yilin Ye
Jiazhi Xia
Wei Zeng
DiffM
72
0
0
11 Feb 2025
Guidance-base Diffusion Models for Improving Photoacoustic Image Quality
Tatsuhiro Eguchi
Shumpei Takezaki
Mihoko Shimano
Takayuki Yagi
Ryoma Bise
MedIm
58
0
0
10 Feb 2025
Solving Linear-Gaussian Bayesian Inverse Problems with Decoupled Diffusion Sequential Monte Carlo
Filip Ekstrom Kelvinius
Zheng Zhao
Fredrik Lindsten
DiffM
52
0
0
10 Feb 2025
Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene
Tai-Yu Pan
Sooyoung Jeon
Mengdi Fan
Jinsu Yoo
Zhenyang Feng
Mark E. Campbell
Kilian Q. Weinberger
Bharath Hariharan
Wei-Lun Chao
106
0
0
10 Feb 2025
UniMoD: Efficient Unified Multimodal Transformers with Mixture-of-Depths
Weijia Mao
Zhiyong Yang
Mike Zheng Shou
MoE
80
0
0
10 Feb 2025
Unleashing the Potential of Pre-Trained Diffusion Models for Generalizable Person Re-Identification
Jiachen Li
Xiaojin Gong
DiffM
84
0
0
10 Feb 2025
Preventing Rogue Agents Improves Multi-Agent Collaboration
Ohav Barbi
Ori Yoran
Mor Geva
55
1
0
09 Feb 2025
Beyond Fine-Tuning: A Systematic Study of Sampling Techniques in Personalized Image Generation
Vera Soboleva
M. Nakhodnov
Aibek Alanov
52
0
0
09 Feb 2025
Revisiting Gradient-based Uncertainty for Monocular Depth Estimation
Julia Hornauer
Amir El-Ghoussani
Vasileios Belagiannis
UQCV
59
0
0
09 Feb 2025
Beyond and Free from Diffusion: Invertible Guided Consistency Training
Chia-Hong Hsu
Shiu-hong Kao
Randall Balestriero
3DV
82
0
0
08 Feb 2025
Decoder-Only LLMs are Better Controllers for Diffusion Models
Ziyi Dong
Yao Xiao
Pengxu Wei
Liang Lin
DiffM
86
0
0
06 Feb 2025
FairT2I: Mitigating Social Bias in Text-to-Image Generation via Large Language Model-Assisted Detection and Attribute Rebalancing
Jinya Sakurai
Issei Sato
76
0
0
06 Feb 2025
Information-Theoretic Proofs for Diffusion Sampling
Galen Reeves
H. Pfister
DiffM
100
0
0
04 Feb 2025
Open Materials Generation with Stochastic Interpolants
Philipp Hoellmer
Thomas Egg
Maya M. Martirossyan
Eric Fuemmeler
Amit Gupta
...
George Karypis
Mark K. Transtrum
Richard G. Hennig
E. Tadmor
Stefano Martiniani
AI4CE
102
1
0
04 Feb 2025
TabPFN Unleashed: A Scalable and Effective Solution to Tabular Classification Problems
Si-Yang Liu
Han-Jia Ye
70
7
0
04 Feb 2025
Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
Hanyang Zhao
Haoxian Chen
Ji Zhang
D. Yao
Wenpin Tang
60
0
0
03 Feb 2025
Assessing the use of Diffusion models for motion artifact correction in brain MRI
Paolo Angella
Vito Paolo Pastore
Matteo Santacesaria
MedIm
DiffM
67
1
0
03 Feb 2025
HuViDPO: Enhancing Video Generation through Direct Preference Optimization for Human-Centric Alignment
Lifan Jiang
Boxi Wu
Jiahui Zhang
Xiaotong Guan
Shuang Chen
VGen
71
1
0
02 Feb 2025
Video Latent Flow Matching: Optimal Polynomial Projections for Video Interpolation and Extrapolation
Yang Cao
Zhao Song
Chiwun Yang
VGen
53
2
0
01 Feb 2025
Shape from Semantics: 3D Shape Generation from Multi-View Semantics
Liangchen Li
Caoliwen Wang
Yuqi Zhou
Bailin Deng
Juyong Zhang
3DV
37
0
0
01 Feb 2025
Text-to-Image Generation for Vocabulary Learning Using the Keyword Method
Nuwan T. Attygalle
M. Kljun
Aaron Quigley
Klen Copic Pucihar
Jens Grubert
...
Juri Yoneyama
Alice Toniolo
Angela Miguel
Hirokazu Kato
M. Weerasinghe
DiffM
83
0
0
28 Jan 2025
Visual Generation Without Guidance
Huayu Chen
Kai Jiang
Kaiwen Zheng
Jianfei Chen
Hang Su
Jun Zhu
57
0
0
28 Jan 2025
RDEIC: Accelerating Diffusion-Based Extreme Image Compression with Relay Residual Diffusion
Zhiyuan Li
Yanhui Zhou
Hao Wei
Chenyang Ge
Ajmal Mian
DiffM
44
0
0
28 Jan 2025
MM-NeRF: Multimodal-Guided 3D Multi-Style Transfer of Neural Radiance Field
Zijian Győző Yang
Zhongwei Qiu
Chang Xu
Dongmei Fu
50
2
0
28 Jan 2025
CE-SDWV: Effective and Efficient Concept Erasure for Text-to-Image Diffusion Models via a Semantic-Driven Word Vocabulary
Jiahang Tu
Qian Feng
Chufan Chen
Jiahua Dong
Hanbin Zhao
Chao Zhang
Hui Qian
72
2
0
28 Jan 2025
Do Existing Testing Tools Really Uncover Gender Bias in Text-to-Image Models?
Yunbo Lyu
Zhou Yang
Yuqing Niu
Jing Jiang
David Lo
44
1
0
28 Jan 2025