ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.11487
  4. Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM
ArXiv (abs)PDFHTML

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 1,382 papers shown
Title
Rethinking Image Editing Detection in the Era of Generative AI
  Revolution
Rethinking Image Editing Detection in the Era of Generative AI Revolution
Zhihao Sun
Haipeng Fang
Xinying Zhao
Danding Wang
Juan Cao
91
10
0
29 Nov 2023
HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting
HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting
Xian Liu
Xiaohang Zhan
Jiaxiang Tang
Ying Shan
Gang Zeng
Dahua Lin
Xihui Liu
Ziwei Liu
3DGS
128
77
0
28 Nov 2023
A Unified Approach for Text- and Image-guided 4D Scene Generation
A Unified Approach for Text- and Image-guided 4D Scene Generation
Yufeng Zheng
Xueting Li
Koki Nagano
Sifei Liu
Karsten Kreis
Otmar Hilliges
Shalini De Mello
108
49
0
28 Nov 2023
TextDiffuser-2: Unleashing the Power of Language Models for Text
  Rendering
TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering
Jingye Chen
Yupan Huang
Tengchao Lv
Lei Cui
Qifeng Chen
Furu Wei
DiffM
128
70
0
28 Nov 2023
DreamPropeller: Supercharge Text-to-3D Generation with Parallel Sampling
DreamPropeller: Supercharge Text-to-3D Generation with Parallel Sampling
Linqi Zhou
Andy Shih
Minh-Tuan Tran
Dinh Q. Phung
DiffM
101
14
0
28 Nov 2023
IG Captioner: Information Gain Captioners are Strong Zero-shot
  Classifiers
IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers
Chenglin Yang
Siyuan Qiao
Yuan Cao
Yu Zhang
Tao Zhu
Alan Yuille
Jiahui Yu
VLM
54
3
0
27 Nov 2023
GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions
GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions
Jiemin Fang
Junjie Wang
Xiaopeng Zhang
Lingxi Xie
Qi Tian
3DGSDiffM
130
117
0
27 Nov 2023
Tell2Design: A Dataset for Language-Guided Floor Plan Generation
Tell2Design: A Dataset for Language-Guided Floor Plan Generation
Sicong Leng
Yangqiaoyu Zhou
Mohammed Haroon Dupty
W. Lee
Sam Joyce
Wei Lu
3DV
67
15
0
27 Nov 2023
Image Super-Resolution with Text Prompt Diffusion
Image Super-Resolution with Text Prompt Diffusion
Zheng Chen
Yulun Zhang
Jinjin Gu
Xin Yuan
Linghe Kong
Guihai Chen
Xiaokang Yang
DiffM
152
21
0
24 Nov 2023
Paragraph-to-Image Generation with Information-Enriched Diffusion Model
Paragraph-to-Image Generation with Information-Enriched Diffusion Model
Weijia Wu
Zhuang Li
Yefei He
Mike Zheng Shou
Chunhua Shen
Lele Cheng
Yan Li
Yan Li
Di Zhang
VLM
230
25
0
24 Nov 2023
Using Human Feedback to Fine-tune Diffusion Models without Any Reward
  Model
Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model
Kai Yang
Jian Tao
Jiafei Lyu
Chunjiang Ge
Jiaxin Chen
Qimai Li
Weihan Shen
Xiaolong Zhu
Xiu Li
EGVM
126
109
0
22 Nov 2023
Steal My Artworks for Fine-tuning? A Watermarking Framework for
  Detecting Art Theft Mimicry in Text-to-Image Models
Steal My Artworks for Fine-tuning? A Watermarking Framework for Detecting Art Theft Mimicry in Text-to-Image Models
Ge Luo
Junqiang Huang
Manman Zhang
Zhenxing Qian
Sheng Li
Xinpeng Zhang
WIGM
65
9
0
22 Nov 2023
FusionFrames: Efficient Architectural Aspects for Text-to-Video
  Generation Pipeline
FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline
V.Ya. Arkhipkin
Zein Shaheen
Viacheslav Vasilev
E. Dakhova
Andrey Kuznetsov
Denis Dimitrov
DiffMVGen
93
5
0
22 Nov 2023
GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via
  Blender-Oriented GPT Planning
GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning
Jiaxi Lv
Yi Huang
Mingfu Yan
Jiancheng Huang
Jianzhuang Liu
Yifan Liu
Yafei Wen
Xiaoxin Chen
Shifeng Chen
VGenDiffM
119
25
0
21 Nov 2023
What's left can't be right -- The remaining positional incompetence of
  contrastive vision-language models
What's left can't be right -- The remaining positional incompetence of contrastive vision-language models
Nils Hoehing
Ellen Rushe
Anthony Ventresque
VLM
77
3
0
20 Nov 2023
MoVideo: Motion-Aware Video Generation with Diffusion Models
MoVideo: Motion-Aware Video Generation with Diffusion Models
Christos Sakaridis
Yuchen Fan
Kai Zhang
Radu Timofte
Luc Van Gool
Rakesh Ranjan
DiffMVGen
85
10
0
19 Nov 2023
Contrastive Transformer Learning with Proximity Data Generation for
  Text-Based Person Search
Contrastive Transformer Learning with Proximity Data Generation for Text-Based Person Search
Hefeng Wu
Weifeng Chen
Zhibin Liu
Tianshui Chen
Zhiguang Chen
Liang Lin
86
13
0
15 Nov 2023
SceneScore: Learning a Cost Function for Object Arrangement
SceneScore: Learning a Cost Function for Object Arrangement
Ivan Kapelyukh
Edward Johns
OffRLDiffMOCL
106
4
0
14 Nov 2023
3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with
  2D Diffusion Models
3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with 2D Diffusion Models
Haibo Yang
Yang Chen
Yingwei Pan
Ting Yao
Zhineng Chen
Tao Mei
76
20
0
09 Nov 2023
ControlStyle: Text-Driven Stylized Image Generation Using Diffusion
  Priors
ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors
Jingwen Chen
Yingwei Pan
Ting Yao
Tao Mei
DiffM
105
42
0
09 Nov 2023
Control3D: Towards Controllable Text-to-3D Generation
Control3D: Towards Controllable Text-to-3D Generation
Yang Chen
Yingwei Pan
Yehao Li
Ting Yao
Tao Mei
DiffM
97
49
0
09 Nov 2023
3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features
3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features
Chenfeng Xu
Huan Ling
Sanja Fidler
Or Litany
103
15
0
07 Nov 2023
A Data Perspective on Enhanced Identity Preservation for Diffusion
  Personalization
A Data Perspective on Enhanced Identity Preservation for Diffusion Personalization
Xingzhe He
Zhiwen Cao
Nicholas I. Kolkin
Lantao Yu
Kun Wan
Helge Rhodin
Ratheesh Kalarot
93
14
0
07 Nov 2023
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion
  Models
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
Shiwei Zhang
Jiayu Wang
Yingya Zhang
Kang Zhao
Hangjie Yuan
Zhan Qin
Xiang Wang
Deli Zhao
Jingren Zhou
DiffMVGen
135
231
0
07 Nov 2023
Leveraging Large Language Models for Collective Decision-Making
Leveraging Large Language Models for Collective Decision-Making
Marios Papachristou
Longqi Yang
Chin-Chia Hsu
LLMAG
89
3
0
03 Nov 2023
E3 TTS: Easy End-to-End Diffusion-based Text to Speech
E3 TTS: Easy End-to-End Diffusion-based Text to Speech
Yuan Gao
Nobuyuki Morioka
Yu Zhang
Nanxin Chen
DiffM
85
33
0
02 Nov 2023
De-Diffusion Makes Text a Strong Cross-Modal Interface
De-Diffusion Makes Text a Strong Cross-Modal Interface
Chen Wei
Chenxi Liu
Siyuan Qiao
Zhishuai Zhang
Alan Yuille
Jiahui Yu
VLMDiffM
103
11
0
01 Nov 2023
LatentWarp: Consistent Diffusion Latents for Zero-Shot Video-to-Video
  Translation
LatentWarp: Consistent Diffusion Latents for Zero-Shot Video-to-Video Translation
Yuxiang Bao
Di Qiu
Guoliang Kang
Baochang Zhang
Bo Jin
Kaiye Wang
Pengfei Yan
VGenDiffM
74
7
0
01 Nov 2023
SEINE: Short-to-Long Video Diffusion Model for Generative Transition and
  Prediction
SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction
Xinyuan Chen
Yaohui Wang
Lingjun Zhang
Shaobin Zhuang
Xin Ma
Jiashuo Yu
Yali Wang
Dahua Lin
Yu Qiao
Ziwei Liu
VGenDiffM
79
146
0
31 Oct 2023
Davidsonian Scene Graph: Improving Reliability in Fine-grained
  Evaluation for Text-to-Image Generation
Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation
Jaemin Cho
Yushi Hu
Roopal Garg
Peter Anderson
Ranjay Krishna
Jason Baldridge
Mohit Bansal
Jordi Pont-Tuset
Su Wang
EGVM
84
81
0
27 Oct 2023
Uncovering Meanings of Embeddings via Partial Orthogonality
Uncovering Meanings of Embeddings via Partial Orthogonality
Yibo Jiang
Bryon Aragam
Victor Veitch
89
15
0
26 Oct 2023
Noise-Free Score Distillation
Noise-Free Score Distillation
Oren Katzir
Or Patashnik
Daniel Cohen-Or
Dani Lischinski
DiffM
112
71
0
26 Oct 2023
HyperFields: Towards Zero-Shot Generation of NeRFs from Text
HyperFields: Towards Zero-Shot Generation of NeRFs from Text
Sudarshan Babu
Richard Liu
Avery Zhou
Michael Maire
Greg Shakhnarovich
Rana Hanocka
AI4CE
111
11
0
26 Oct 2023
Local Statistics for Generative Image Detection
Local Statistics for Generative Image Detection
Yung Jer Wong
Teck Khim Ng
DiffM
51
2
0
25 Oct 2023
Online Detection of AI-Generated Images
Online Detection of AI-Generated Images
David C. Epstein
Ishan Jain
Oliver Wang
Richard Y. Zhang
72
60
0
23 Oct 2023
Understanding Generative AI in Art: An Interview Study with Artists on
  G-AI from an HCI Perspective
Understanding Generative AI in Art: An Interview Study with Artists on G-AI from an HCI Perspective
Jingyu Shi
Rahul Jain
Runlin Duan
Karthik Ramani
64
8
0
19 Oct 2023
Scalable Diffusion for Materials Generation
Scalable Diffusion for Materials Generation
Mengjiao Yang
KwangHwan Cho
Amil Merchant
Pieter Abbeel
Dale Schuurmans
Igor Mordatch
E. D. Cubuk
85
43
0
18 Oct 2023
To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still
  Easy To Generate Unsafe Images ... For Now
To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now
Yimeng Zhang
Jinghan Jia
Xin Chen
Aochuan Chen
Yihua Zhang
Jiancheng Liu
Ke Ding
Sijia Liu
DiffM
177
101
0
18 Oct 2023
ForceGen: End-to-end de novo protein generation based on nonlinear
  mechanical unfolding responses using a protein language diffusion model
ForceGen: End-to-end de novo protein generation based on nonlinear mechanical unfolding responses using a protein language diffusion model
Bo Ni
David L. Kaplan
Markus J. Buehler
DiffM
84
5
0
16 Oct 2023
DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing
DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing
Yueming Lyu
Kang Zhao
Bo Peng
H. Chen
Yue Jiang
Yingya Zhang
Jing Dong
Caifeng Shan
78
2
0
12 Oct 2023
DrivingDiffusion: Layout-Guided multi-view driving scene video
  generation with latent diffusion model
DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model
Xiaofan Li
Yifu Zhang
Xiaoqing Ye
VGen
125
78
0
11 Oct 2023
An HCI-Centric Survey and Taxonomy of Human-Generative-AI Interactions
An HCI-Centric Survey and Taxonomy of Human-Generative-AI Interactions
Jingyu Shi
Rahul Jain
Hyungjun Doh
Ryo Suzuki
Karthik Ramani
3DV
80
24
0
11 Oct 2023
Improving Discriminative Multi-Modal Learning with Large-Scale
  Pre-Trained Models
Improving Discriminative Multi-Modal Learning with Large-Scale Pre-Trained Models
Chenzhuang Du
Yue Zhao
Chonghua Liao
Jiacheng You
Jie Fu
Hang Zhao
86
2
0
08 Oct 2023
Latent Consistency Models: Synthesizing High-Resolution Images with
  Few-Step Inference
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Simian Luo
Yiqin Tan
Longbo Huang
Jian Li
Hang Zhao
DiffM
126
479
0
06 Oct 2023
MedSyn: Text-guided Anatomy-aware Synthesis of High-Fidelity 3D CT
  Images
MedSyn: Text-guided Anatomy-aware Synthesis of High-Fidelity 3D CT Images
Yanwu Xu
Li Sun
Wei Peng
Shyam Visweswaran
Kayhan Batmanghelich
MedImDiffM
111
23
0
05 Oct 2023
Realistic Speech-to-Face Generation with Speech-Conditioned Latent
  Diffusion Model with Face Prior
Realistic Speech-to-Face Generation with Speech-Conditioned Latent Diffusion Model with Face Prior
Jinting Wang
Li Liu
Jun Wang
Hei Victor Cheng
DiffM
57
2
0
05 Oct 2023
Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints
Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints
Chuan Fang
Yuan Dong
Kunming Luo
Xiaotao Hu
Rakesh Shrestha
Ping Tan
DiffM
152
37
0
05 Oct 2023
Kosmos-G: Generating Images in Context with Multimodal Large Language
  Models
Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Xichen Pan
Li Dong
Shaohan Huang
Zhiliang Peng
Wenhu Chen
Furu Wei
VLM
152
68
0
04 Oct 2023
Predicated Diffusion: Predicate Logic-Based Attention Guidance for
  Text-to-Image Diffusion Models
Predicated Diffusion: Predicate Logic-Based Attention Guidance for Text-to-Image Diffusion Models
Kota Sueyoshi
Takashi Matsubara
DiffM
98
8
0
03 Oct 2023
Prompt-tuning latent diffusion models for inverse problems
Prompt-tuning latent diffusion models for inverse problems
Hyungjin Chung
Jong Chul Ye
P. Milanfar
M. Delbracio
DiffM
108
44
0
02 Oct 2023
Previous
123...151617...262728
Next