Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.11487
Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"
50 / 1,364 papers shown
Title
HumanGif: Single-View Human Diffusion with Generative Prior
Shoukang Hu
Takuya Narihira
Kazumi Fukuda
Ryosuke Sawata
Takashi Shibuya
Yuki Mitsufuji
205
2
0
01 Jul 2025
Edit360: 2D Image Edits to 3D Assets from Any Angle
Junchao Huang
Xinting Hu
Zhuotao Tian
Shaoshuai Shi
Li Jiang
VGen
118
0
0
01 Jul 2025
Noise-Informed Diffusion-Generated Image Detection with Anomaly Attention
Weinan Guan
Wei Wang
Bo Peng
Ziwen He
Jing Dong
Haonan Cheng
DiffM
19
0
0
20 Jun 2025
A Common Pool of Privacy Problems: Legal and Technical Lessons from a Large-Scale Web-Scraped Machine Learning Dataset
Rachel Hong
Jevan Hutson
William Agnew
Imaad Huda
Tadayoshi Kohno
Jamie Morgenstern
AILaw
26
0
0
20 Jun 2025
How to Train your Text-to-Image Model: Evaluating Design Choices for Synthetic Training Captions
Manuel Brack
Sudeep Katakol
Felix Friedrich
P. Schramowski
Hareesh Ravi
Kristian Kersting
Ajinkya Kale
17
0
0
20 Jun 2025
Visual-Instructed Degradation Diffusion for All-in-One Image Restoration
Wenyang Luo
Haina Qin
Zewen Chen
L. xilinx Wang
Dandan Zheng
Yuming Li
Yufan Liu
B. Li
Weiming Hu
22
0
0
20 Jun 2025
Watermarking Autoregressive Image Generation
Nikola Jovanović
Ismail Labiad
Tomáš Souček
Martin Vechev
Pierre Fernandez
WIGM
34
0
0
19 Jun 2025
Improving Rectified Flow with Boundary Conditions
Xixi Hu
Runlong Liao
Keyang Xu
B. Liu
Yeqing Li
Eugene Ie
Hongliang Fei
Qiang Liu
15
0
0
18 Jun 2025
When Model Knowledge meets Diffusion Model: Diffusion-assisted Data-free Image Synthesis with Alignment of Domain and Class
Yujin Kim
H. Kim
Hyunwoo J.Kim
S. Kim
15
0
0
18 Jun 2025
Control and Realism: Best of Both Worlds in Layout-to-Image without Training
Bonan li
Yinhan Hu
Songhua Liu
Xinchao Wang
DiffM
38
0
0
18 Jun 2025
Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model
Anirud Aggarwal
Abhinav Shrivastava
M. Gwilliam
50
0
0
18 Jun 2025
FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space
Black Forest Labs
Stephen Batifol
A. Blattmann
Frederic Boesel
Saksham Consul
...
Dustin Podell
Robin Rombach
Harry Saini
Axel Sauer
Luke Smith
DiffM
25
0
0
17 Jun 2025
ASMR: Augmenting Life Scenario using Large Generative Models for Robotic Action Reflection
Shang-Chi Tsai
Seiya Kawano
Angel García Contreras
Koichiro Yoshino
Yun-Nung Chen
LM&Ro
29
2
0
16 Jun 2025
DiffS-NOCS: 3D Point Cloud Reconstruction through Coloring Sketches to NOCS Maps Using Diffusion Models
Di Kong
Qianhui Wan
DiffM
16
0
0
15 Jun 2025
ViSTA: Visual Storytelling using Multi-modal Adapters for Text-to-Image Diffusion Models
Sibo Dong
Ismail Shaheen
Maggie Shen
Rupayan Mallick
Sarah Adel Bargal
DiffM
31
0
0
13 Jun 2025
Auditing Data Provenance in Real-world Text-to-Image Diffusion Models for Privacy and Copyright Protection
Jie Zhu
Leye Wang
18
0
0
13 Jun 2025
TexTailor: Customized Text-aligned Texturing via Effective Resampling
Suin Lee
Dae-Shik Kim
DiffM
114
0
0
12 Jun 2025
Text to Image for Multi-Label Image Recognition with Joint Prompt-Adapter Learning
Chun-Mei Feng
Kai-An Yu
Xinxing Xu
Salman Khan
Rick Siow Mong Goh
Wangmeng Zuo
Yong Liu
VLM
138
0
0
12 Jun 2025
Fine-Grained Perturbation Guidance via Attention Head Selection
Donghoon Ahn
Jiwon Kang
Sanghyun Lee
Minjae Kim
Jaewon Min
Wooseok Jang
Saungwu Lee
Sayak Paul
S. Hong
Seungryong Kim
DiffM
AAML
123
0
0
12 Jun 2025
Build the web for agents, not agents for the web
Xing Han Lù
Gaurav Kamath
Marius Mosbach
Siva Reddy
LLMAG
LM&Ro
115
0
0
12 Jun 2025
Stroke-based Cyclic Amplifier: Image Super-Resolution at Arbitrary Ultra-Large Scales
Wenhao Guo
Peng Lu
Xujun Peng
Zhaoran Zhao
Sheng Li
119
0
0
12 Jun 2025
Geometric Regularity in Deterministic Sampling of Diffusion-based Generative Models
Defang Chen
Zhenyu Zhou
C. Wang
Siwei Lyu
DiffM
60
0
0
11 Jun 2025
NnD: Diffusion-based Generation of Physically-Nonnegative Objects
Nadav Torem
Tamar Sde-Chen
Y. Schechner
DiffM
64
0
0
11 Jun 2025
Only-Style: Stylistic Consistency in Image Generation without Content Leakage
Tilemachos Aravanis
P. Filntisis
Petros Maragos
George Retsinas
72
0
0
11 Jun 2025
How Much To Guide: Revisiting Adaptive Guidance in Classifier-Free Guidance Text-to-Vision Diffusion Models
Huixuan Zhang
Junzhe Zhang
Xiaojun Wan
35
0
0
10 Jun 2025
CulturalFrames: Assessing Cultural Expectation Alignment in Text-to-Image Models and Evaluation Metrics
Shravan Nayak
Mehar Bhatia
Xiaofeng Zhang
Verena Rieser
Lisa Anne Hendricks
Sjoerd van Steenkiste
Yash Goyal
Karolina Stañczak
Aishwarya Agrawal
EGVM
27
0
0
10 Jun 2025
MagCache: Fast Video Generation with Magnitude-Aware Cache
Zehong Ma
Longhui Wei
Feng Wang
Shiliang Zhang
Q. Tian
40
0
0
10 Jun 2025
CuRe: Cultural Gaps in the Long Tail of Text-to-Image Systems
Aniket Rege
Zinnia Nie
Mahesh Ramesh
Unmesh Raskar
Zhuoran Yu
Aditya Kusupati
Yong Jae Lee
Ramya Korlakai Vinayak
24
0
0
09 Jun 2025
FunDiff: Diffusion Models over Function Spaces for Physics-Informed Generative Modeling
Sifan Wang
Zehao Dou
Tong-Rui Liu
Lu Lu
DiffM
33
0
0
09 Jun 2025
OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation
Jingjing Chang
Yixiao Fang
Peng Xing
Shuhan Wu
Wei Cheng
Rui Wang
Xianfang Zeng
Gang Yu
H. Chen
EGVM
VLM
30
0
0
09 Jun 2025
Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces
Kevin Rojas
Yuchen Zhu
Sichen Zhu
Felix X.-F. Ye
Molei Tao
DiffM
19
0
0
09 Jun 2025
Difference Inversion: Interpolate and Isolate the Difference with Token Consistency for Image Analogy Generation
H. Kim
Donghyun Kim
Suhyun Kim
DiffM
29
1
0
09 Jun 2025
Diffusion Counterfactual Generation with Semantic Abduction
Rajat Rasal
Avinash Kori
Fabio De Sousa Ribeiro
Tian Xia
Ben Glocker
DiffM
22
0
0
09 Jun 2025
R3D2: Realistic 3D Asset Insertion via Diffusion for Autonomous Driving Simulation
William Ljungbergh
Bernardo Taveira
Wenzhao Zheng
Adam Tonderski
Chensheng Peng
...
Christoffer Petersson
Michael Felsberg
Kurt Keutzer
Masayoshi Tomizuka
Wei Zhan
19
0
0
09 Jun 2025
Inverse Design of Metamaterials with Manufacturing-Guiding Spectrum-to-Structure Conditional Diffusion Model
Jiawen Li
Jiang Guo
Yuanzhe Li
Zetian Mao
Jiaxing Shen
...
Jinming He
Run Hu
Yaerim Lee
Koji Tsuda
Junichiro Shiomi
DiffM
18
0
0
08 Jun 2025
Controllable Coupled Image Generation via Diffusion Models
Chenfei Yuan
Nanshan Jia
Hangqi Li
Peter W. Glynn
Zeyu Zheng
DiffM
23
0
0
07 Jun 2025
Noise Consistency Regularization for Improved Subject-Driven Image Synthesis
Yao Ni
Song Wen
Piotr Koniusz
A. Cherian
17
0
0
06 Jun 2025
FontAdapter: Instant Font Adaptation in Visual Text Generation
Myungkyu Koo
Subin Kim
Sangkyung Kwak
Jaehyun Nam
Seojin Kim
Jinwoo Shin
DiffM
VLM
56
0
0
06 Jun 2025
Learning to Weight Parameters for Data Attribution
Shuangqi Li
Hieu M. Le
Jingyi Xu
Mathieu Salzmann
TDI
DiffM
66
0
0
06 Jun 2025
SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents
Alexander Huang-Menders
Xinhang Liu
Andy Xu
Yuyao Zhang
Chi-Keung Tang
Yu-Wing Tai
DiffM
111
0
0
05 Jun 2025
FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing
Guangzhao Li
Yanming Yang
Chenxi Song
Chi Zhang
DiffM
VGen
107
0
0
05 Jun 2025
Towards Reliable Identification of Diffusion-based Image Manipulations
Alex Costanzino
Woody Bayliss
Juil Sock
Marc Gorriz Blanch
Danijela Horak
Ivan Laptev
Philip Torr
Fabio Pizzati
DiffM
44
0
0
05 Jun 2025
Progressive Tempering Sampler with Diffusion
Severi Rissanen
RuiKang OuYang
Jiajun He
Wenlin Chen
Markus Heinonen
Arno Solin
José Miguel Hernández-Lobato
DiffM
110
1
0
05 Jun 2025
ContentV: Efficient Training of Video Generation Models with Limited Compute
Wenfeng Lin
Renjie Chen
Boyuan Liu
Shiyue Yan
Ruoyu Feng
...
Chao Feng
Jiao Ran
Qi Wu
Zuotao Liu
Mingyu Guo
VGen
111
0
0
05 Jun 2025
Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector
Boyong He
Yuxiang Ji
Zhuoyue Tan
Liaoni Wu
DiffM
106
2
0
04 Jun 2025
Learning Monotonic Probabilities with a Generative Cost Model
Yongxiang Tang
Yanhua Cheng
Xiaocheng Liu
Chenchen Jiao
Yanxiang Zeng
Ning Luo
Pengjia Yuan
Xialong Liu
Peng Jiang
59
0
0
04 Jun 2025
RAID: A Dataset for Testing the Adversarial Robustness of AI-Generated Image Detectors
Hicham Eddoubi
Jonas Ricker
Federico Cocchi
Lorenzo Baraldi
Angelo Sotgiu
...
Marcella Cornia
Lorenzo Baraldi
Asja Fischer
Rita Cucchiara
Battista Biggio
AAML
145
0
0
04 Jun 2025
EmoArt: A Multidimensional Dataset for Emotion-Aware Artistic Generation
Cheng Zhang
Hongxia Xie
Bin Wen
Songhan Zuo
Ruoxuan Zhang
Wen-Huang Cheng
101
0
0
04 Jun 2025
SViMo: Synchronized Diffusion for Video and Motion Generation in Hand-object Interaction Scenarios
Lingwei Dang
Ruizhi Shao
Hongwen Zhang
Wei Min
Yebin Liu
Qingyao Wu
DiffM
VGen
82
0
0
03 Jun 2025
Rectified Flows for Fast Multiscale Fluid Flow Modeling
Victor Armegioiu
Yannick Ramic
Siddhartha Mishra
DiffM
AI4CE
52
0
0
03 Jun 2025
1
2
3
4
...
26
27
28
Next