ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.11487
  4. Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM
ArXivPDFHTML

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 4,326 papers shown
Title
Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models
Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models
Jinho Jeong
Sangmin Han
Jinwoo Kim
Seon Joo Kim
42
0
0
24 Mar 2025
From Fragment to One Piece: A Survey on AI-Driven Graphic Design
From Fragment to One Piece: A Survey on AI-Driven Graphic Design
Xingxing Zou
Wen Zhang
Nanxuan Zhao
59
0
0
24 Mar 2025
FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing
FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing
Yufan Ren
Zicong Jiang
Tong Zhang
Søren Forchhammer
Sabine Süsstrunk
DiffM
61
0
0
24 Mar 2025
InPO: Inversion Preference Optimization with Reparametrized DDIM for Efficient Diffusion Model Alignment
InPO: Inversion Preference Optimization with Reparametrized DDIM for Efficient Diffusion Model Alignment
Yunfan LU
Qichao Wang
H. Cao
Xierui Wang
Xiaoyin Xu
Min Zhang
64
0
0
24 Mar 2025
Training-free Diffusion Acceleration with Bottleneck Sampling
Training-free Diffusion Acceleration with Bottleneck Sampling
Ye Tian
Xin Xia
Yuxi Ren
Shanchuan Lin
Xing Wang
Xuefeng Xiao
Yunhai Tong
L. Yang
Bin Cui
60
0
0
24 Mar 2025
Resource-Efficient Motion Control for Video Generation via Dynamic Mask Guidance
Resource-Efficient Motion Control for Video Generation via Dynamic Mask Guidance
Sicong Feng
Jielong Yang
Li Peng
DiffM
VGen
56
0
0
24 Mar 2025
Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models
Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models
Jinjin Zhang
Qiuyu Huang
Junjie Liu
Xiefan Guo
Di Huang
62
2
0
24 Mar 2025
DiffusedWrinkles: A Diffusion-Based Model for Data-Driven Garment Animation
DiffusedWrinkles: A Diffusion-Based Model for Data-Driven Garment Animation
R. Vidaurre
Elena Garces
Dan Casas
DiffM
AI4CE
84
1
0
24 Mar 2025
Unified Geometry and Color Compression Framework for Point Clouds via Generative Diffusion Priors
Unified Geometry and Color Compression Framework for Point Clouds via Generative Diffusion Priors
Tianxin Huang
Gim Hee Lee
47
0
0
23 Mar 2025
OmnimatteZero: Training-free Real-time Omnimatte with Pre-trained Video Diffusion Models
OmnimatteZero: Training-free Real-time Omnimatte with Pre-trained Video Diffusion Models
Dvir Samuel
Matan Levy
N. Darshan
Gal Chechik
Rami Ben-Ari
DiffM
67
0
0
23 Mar 2025
Self-Attention Diffusion Models for Zero-Shot Biomedical Image Segmentation: Unlocking New Frontiers in Medical Imaging
Self-Attention Diffusion Models for Zero-Shot Biomedical Image Segmentation: Unlocking New Frontiers in Medical Imaging
Abderrachid Hamrani
Anuradha Godavarty
MedIm
44
0
0
23 Mar 2025
Payload-Aware Intrusion Detection with CMAE and Large Language Models
Payload-Aware Intrusion Detection with CMAE and Large Language Models
Yongcheol Kim
Chanjae Lee
Young Yoon
44
0
0
23 Mar 2025
LongDiff: Training-Free Long Video Generation in One Go
LongDiff: Training-Free Long Video Generation in One Go
Zhuoling Li
Hossein Rahmani
Qiuhong Ke
Xiaozhong Liu
DiffM
VGen
VLM
61
0
0
23 Mar 2025
TCFG: Tangential Damping Classifier-free Guidance
TCFG: Tangential Damping Classifier-free Guidance
Mingi Kwon
Shin seong Kim
Jaeseok Jeong. Yi Ting Hsiao
Youngjung Uh
DiffM
65
0
0
23 Mar 2025
Adoption of Watermarking for Generative AI Systems in Practice and Implications under the new EU AI Act
Adoption of Watermarking for Generative AI Systems in Practice and Implications under the new EU AI Act
Bram Rijsbosch
Gijs van Dijck
Konrad Kollnig
48
1
0
23 Mar 2025
Towards Transformer-Based Aligned Generation with Self-Coherence Guidance
Towards Transformer-Based Aligned Generation with Self-Coherence Guidance
Shulei Wang
Wang Lin
Hai Huang
Hanting Wang
Sihang Cai
...
Tao Jin
Jingyuan Chen
Jiacheng Sun
Jieming Zhu
Zhou Zhao
DiffM
55
2
0
22 Mar 2025
DynASyn: Multi-Subject Personalization Enabling Dynamic Action Synthesis
DynASyn: Multi-Subject Personalization Enabling Dynamic Action Synthesis
Yongjin Choi
Chanhun Park
Seung Jun Baek
DiffM
51
0
0
22 Mar 2025
RDTF: Resource-efficient Dual-mask Training Framework for Multi-frame Animated Sticker Generation
RDTF: Resource-efficient Dual-mask Training Framework for Multi-frame Animated Sticker Generation
Zhiqiang Yuan
Ting Zhang
Ying Deng
Jiapei Zhang
Yeshuang Zhu
Zexi Jia
Jie Zhou
Jinchao Zhang
VGen
41
0
0
22 Mar 2025
ComfyGPT: A Self-Optimizing Multi-Agent System for Comprehensive ComfyUI Workflow Generation
ComfyGPT: A Self-Optimizing Multi-Agent System for Comprehensive ComfyUI Workflow Generation
Oucheng Huang
Yuhang Ma
Zeng Zhao
Mingrui Wu
Jiayi Ji
Rongsheng Zhang
Zhibo Hu
Xiaoshuai Sun
Rongrong Ji
46
0
0
22 Mar 2025
TDRI: Two-Phase Dialogue Refinement and Co-Adaptation for Interactive Image Generation
TDRI: Two-Phase Dialogue Refinement and Co-Adaptation for Interactive Image Generation
Yuheng Feng
Jianhui Wang
Kun Li
Sida Li
Tianyu Shi
Haoyue Han
Miao Zhang
Xueqian Wang
DiffM
164
0
0
22 Mar 2025
Progressive Prompt Detailing for Improved Alignment in Text-to-Image Generative Models
Progressive Prompt Detailing for Improved Alignment in Text-to-Image Generative Models
Ketan Suhaas Saichandran
Xavier Thomas
Prakhar Kaushik
Deepti Ghadiyaram
DiffM
78
0
0
22 Mar 2025
ARFlow: Human Action-Reaction Flow Matching with Physical Guidance
ARFlow: Human Action-Reaction Flow Matching with Physical Guidance
Wentao Jiang
Jingya Wang
Haotao Lu
Kaiyang Ji
Baoxiong Jia
Siyuan Huang
Ye-ling Shi
44
0
0
21 Mar 2025
R2LDM: An Efficient 4D Radar Super-Resolution Framework Leveraging Diffusion Model
R2LDM: An Efficient 4D Radar Super-Resolution Framework Leveraging Diffusion Model
Boyuan Zheng
Shouyi Lu
Renbo Huang
Minqing Huang
Fan Lu
Wei Tian
Guirong Zhuo
Lu Xiong
70
1
0
21 Mar 2025
AnimatePainter: A Self-Supervised Rendering Framework for Reconstructing Painting Process
AnimatePainter: A Self-Supervised Rendering Framework for Reconstructing Painting Process
J. Hu
Shuyong Gao
Qianyu Guo
Yan Wang
Qishan Wang
Yuang Feng
Wenqiang Zhang
DiffM
VGen
47
0
0
21 Mar 2025
UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models
UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models
Fanghua Yu
Jinjin Gu
Jinfan Hu
Zheyuan Li
Chao Dong
DiffM
55
0
0
21 Mar 2025
Is there anything left? Measuring semantic residuals of objects removed from 3D Gaussian Splatting
Is there anything left? Measuring semantic residuals of objects removed from 3D Gaussian Splatting
Simona Kocour
Assia Benbihi
Aikaterini Adam
Torsten Sattler
3DPC
41
0
0
21 Mar 2025
What's Producible May Not Be Reachable: Measuring the Steerability of Generative Models
What's Producible May Not Be Reachable: Measuring the Steerability of Generative Models
Keyon Vafa
Sarah Bentley
Jon M. Kleinberg
S. Mullainathan
43
0
0
21 Mar 2025
FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields
FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields
Kwan Yun
Chaelin Kim
Hangyeul Shin
Junyong Noh
CVBM
51
0
0
21 Mar 2025
Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts
Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts
Yu Cao
Zengqun Zhao
Ioannis Patras
Shaogang Gong
DiffM
50
0
0
20 Mar 2025
Shining Yourself: High-Fidelity Ornaments Virtual Try-on with Diffusion Model
Shining Yourself: High-Fidelity Ornaments Virtual Try-on with Diffusion Model
Yingmao Miao
Zhanpeng Huang
Rui Han
Zibin Wang
Chenhao Lin
Chao Shen
DiffM
49
0
0
20 Mar 2025
Controllable Segmentation-Based Text-Guided Style Editing
Controllable Segmentation-Based Text-Guided Style Editing
Jingwen Li
Aravind Chandrasekar
Mariana Rocha
Chao Li
Yuqing Chen
53
0
0
20 Mar 2025
FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing
FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing
Tianyi Wei
Yifan Zhou
Dongdong Chen
Xingang Pan
77
0
0
20 Mar 2025
Scale-wise Distillation of Diffusion Models
Scale-wise Distillation of Diffusion Models
Nikita Starodubcev
Denis Kuznedelev
Artem Babenko
Dmitry Baranchuk
DiffM
53
0
0
20 Mar 2025
PoseTraj: Pose-Aware Trajectory Control in Video Diffusion
PoseTraj: Pose-Aware Trajectory Control in Video Diffusion
Longbin Ji
Lei Zhong
Pengfei Wei
Changjian Li
DiffM
VGen
46
0
0
20 Mar 2025
MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance
MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance
Quanhao Li
Zhen Xing
Rui Wang
Hui Zhang
Qi Dai
Zuxuan Wu
VGen
66
0
0
20 Mar 2025
Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation
Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation
Yanjie Wang
Zhijie Lin
Yao Teng
Yuanzhi Zhu
Shuhuai Ren
Jiashi Feng
Xihui Liu
53
0
0
20 Mar 2025
Bezier Distillation
Bezier Distillation
Ling Feng
SK Yang
44
0
0
20 Mar 2025
World Knowledge from AI Image Generation for Robot Control
World Knowledge from AI Image Generation for Robot Control
Jonas Krumme
C. Zetzsche
LM&Ro
55
0
0
20 Mar 2025
Visual Persona: Foundation Model for Full-Body Human Customization
Visual Persona: Foundation Model for Full-Body Human Customization
Jisu Nam
Soowon Son
Zhan Xu
Jing Shi
Difan Liu
Feng Liu
Aashish Misraa
Seungryong Kim
Yang Zhou
DiffM
51
0
0
19 Mar 2025
Multi-focal Conditioned Latent Diffusion for Person Image Synthesis
Multi-focal Conditioned Latent Diffusion for Person Image Synthesis
Jiaqi Liu
Jichao Zahng
Paolo Rota
N. Sebe
DiffM
53
0
0
19 Mar 2025
How to Train Your Dragon: Automatic Diffusion-Based Rigging for Characters with Diverse Topologies
How to Train Your Dragon: Automatic Diffusion-Based Rigging for Characters with Diverse Topologies
Zeqi Gu
Difan Liu
Timothy Langlois
Matthew Fisher
Abe Davis
DiffM
3DH
62
0
0
19 Mar 2025
Decompositional Neural Scene Reconstruction with Generative Diffusion Prior
Decompositional Neural Scene Reconstruction with Generative Diffusion Prior
Junfeng Ni
Yu Liu
Ruijie Lu
Zirui Zhou
Song-Chun Zhu
Yixin Chen
Siyuan Huang
DiffM
72
4
0
19 Mar 2025
The Power of Context: How Multimodality Improves Image Super-Resolution
The Power of Context: How Multimodality Improves Image Super-Resolution
Kangfu Mei
Hossein Talebi
Mojtaba Ardakani
Vishal M. Patel
P. Milanfar
M. Delbracio
DiffM
85
1
0
18 Mar 2025
CRCE: Coreference-Retention Concept Erasure in Text-to-Image Diffusion Models
CRCE: Coreference-Retention Concept Erasure in Text-to-Image Diffusion Models
Yuyang Xue
Edward Moroshko
Feng Chen
Jingyu Sun
Steven McDonagh
Sotirios A. Tsaftaris
56
1
0
18 Mar 2025
Advances in 4D Generation: A Survey
Advances in 4D Generation: A Survey
Qiaowei Miao
Kehan Li
Jinsheng Quan
Zhiyuan Min
Shaojie Ma
Yichao Xu
Yi Yang
Yawei Luo
56
0
0
18 Mar 2025
ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing
ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing
Yulin Pan
Xiangteng He
Chaojie Mao
Zhen Han
Zeyinzi Jiang
J. Zhang
Yu Liu
EGVM
VLM
78
1
0
18 Mar 2025
MusicInfuser: Making Video Diffusion Listen and Dance
MusicInfuser: Making Video Diffusion Listen and Dance
Susung Hong
Ira Kemelmacher-Shlizerman
Brian L. Curless
Steven M. Seitz
VGen
52
0
0
18 Mar 2025
Multi-view Reconstruction via SfM-guided Monocular Depth Estimation
Multi-view Reconstruction via SfM-guided Monocular Depth Estimation
Haoyu Guo
He Zhu
Sida Peng
Haotong Lin
Yunzhi Yan
Tao Xie
Wenguan Wang
Xiaowei Zhou
Hujun Bao
3DV
MDE
87
1
0
18 Mar 2025
Diffusion-based G-buffer generation and rendering
Diffusion-based G-buffer generation and rendering
Bowen Xue
G. C. Guarnera
Shuang Zhao
Zahra Montazeri
DiffM
53
0
0
18 Mar 2025
A Comprehensive Survey on Visual Concept Mining in Text-to-image Diffusion Models
A Comprehensive Survey on Visual Concept Mining in Text-to-image Diffusion Models
Ziqiang Li
Jun Li
Lizhi Xiong
Zhangjie Fu
Zechao Li
VLM
59
0
0
17 Mar 2025
Previous
123456...858687
Next