ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.11487
  4. Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM
ArXivPDFHTML

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 4,363 papers shown
Title
Evaluating Data Attribution for Text-to-Image Models
Evaluating Data Attribution for Text-to-Image Models
Sheng-Yu Wang
Alexei A. Efros
Jun-Yan Zhu
Richard Y. Zhang
TDI
57
32
0
15 Jun 2023
DreamSim: Learning New Dimensions of Human Visual Similarity using
  Synthetic Data
DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data
Stephanie Fu
Netanel Y. Tamir
Shobhita Sundaram
Lucy Chai
Richard Y. Zhang
Tali Dekel
Phillip Isola
EGVM
57
105
0
15 Jun 2023
Human Preference Score v2: A Solid Benchmark for Evaluating Human
  Preferences of Text-to-Image Synthesis
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
Xiaoshi Wu
Yiming Hao
Keqiang Sun
Yixiong Chen
Feng Zhu
Rui Zhao
Hongsheng Li
51
268
0
15 Jun 2023
Understanding Optimization of Deep Learning via Jacobian Matrix and
  Lipschitz Constant
Understanding Optimization of Deep Learning via Jacobian Matrix and Lipschitz Constant
Xianbiao Qi
Jianan Wang
Lei Zhang
31
0
0
15 Jun 2023
Generative Proxemics: A Prior for 3D Social Interaction from Images
Generative Proxemics: A Prior for 3D Social Interaction from Images
Lea Müller
Vickie Ye
Georgios Pavlakos
Michael J. Black
Angjoo Kanazawa
DiffM
46
29
0
15 Jun 2023
DreamHuman: Animatable 3D Avatars from Text
DreamHuman: Animatable 3D Avatars from Text
Nikos Kolotouros
Thiemo Alldieck
Andrei Zanfir
Eduard Gabriel Bazavan
Mihai Fieraru
C. Sminchisescu
70
94
0
15 Jun 2023
Fast Training of Diffusion Models with Masked Transformers
Fast Training of Diffusion Models with Masked Transformers
Hongkai Zheng
Weili Nie
Arash Vahdat
Anima Anandkumar
DiffM
56
69
0
15 Jun 2023
Towards Faster Non-Asymptotic Convergence for Diffusion-Based Generative
  Models
Towards Faster Non-Asymptotic Convergence for Diffusion-Based Generative Models
Gen Li
Yuting Wei
Yuxin Chen
Yuejie Chi
DiffM
41
60
0
15 Jun 2023
DiffAug: A Diffuse-and-Denoise Augmentation for Training Robust
  Classifiers
DiffAug: A Diffuse-and-Denoise Augmentation for Training Robust Classifiers
Chandramouli Shama Sastry
Sri Harsha Dumpala
Sageev Oore
35
0
0
15 Jun 2023
Unbalanced Diffusion Schrödinger Bridge
Unbalanced Diffusion Schrödinger Bridge
Matteo Pariset
Ya-Ping Hsieh
Charlotte Bunne
Andreas Krause
Valentin De Bortoli
DiffM
OT
49
6
0
15 Jun 2023
Training Multimedia Event Extraction With Generated Images and Captions
Training Multimedia Event Extraction With Generated Images and Captions
Zilin Du
Yunxin Li
Xu Guo
Yidan Sun
Boyang Albert Li
DiffM
46
7
0
15 Jun 2023
RecFusion: A Binomial Diffusion Process for 1D Data for Recommendation
RecFusion: A Binomial Diffusion Process for 1D Data for Recommendation
Gabriel Bénédict
Olivier Jeunen
Samuele Papa
Samarth Bhargav
Daan Odijk
Maarten de Rijke
DiffM
47
9
0
15 Jun 2023
Linguistic Binding in Diffusion Models: Enhancing Attribute
  Correspondence through Attention Map Alignment
Linguistic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment
Royi Rassin
Eran Hirsch
Daniel Glickman
Shauli Ravfogel
Yoav Goldberg
Gal Chechik
DiffM
48
103
0
15 Jun 2023
Taming Diffusion Models for Music-driven Conducting Motion Generation
Taming Diffusion Models for Music-driven Conducting Motion Generation
Zhuoran Zhao
Jinbin Bai
Delong Chen
Debang Wang
Yubo Pan
DiffM
85
12
0
15 Jun 2023
InfoDiffusion: Representation Learning Using Information Maximizing
  Diffusion Models
InfoDiffusion: Representation Learning Using Information Maximizing Diffusion Models
Yingheng Wang
Yair Schiff
Aaron Gokaslan
Weishen Pan
Fei Wang
Chris De Sa
Volodymyr Kuleshov
DiffM
60
40
0
14 Jun 2023
Norm-guided latent space exploration for text-to-image generation
Norm-guided latent space exploration for text-to-image generation
Dvir Samuel
Rami Ben-Ari
N. Darshan
Haggai Maron
Gal Chechik
DiffM
42
25
0
14 Jun 2023
Training-free Diffusion Model Adaptation for Variable-Sized
  Text-to-Image Synthesis
Training-free Diffusion Model Adaptation for Variable-Sized Text-to-Image Synthesis
Zhiyu Jin
Xuli Shen
Bin Li
Xiangyang Xue
44
36
0
14 Jun 2023
Towards AGI in Computer Vision: Lessons Learned from GPT and Large
  Language Models
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Lingxi Xie
Longhui Wei
Xiaopeng Zhang
Kaifeng Bi
Xiaotao Gu
Jianlong Chang
Qi Tian
67
7
0
14 Jun 2023
TAPIR: Tracking Any Point with per-frame Initialization and temporal
  Refinement
TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement
Carl Doersch
Yi Yang
Mel Vecerík
Dilara Gokay
Ankush Gupta
Y. Aytar
João Carreira
Andrew Zisserman
44
150
0
14 Jun 2023
Fine-Tuned but Zero-Shot 3D Shape Sketch View Similarity and Retrieval
Fine-Tuned but Zero-Shot 3D Shape Sketch View Similarity and Retrieval
G. Berardi
Yulia Gryaditskaya
3DV
60
2
0
14 Jun 2023
Distribution Shift Inversion for Out-of-Distribution Prediction
Distribution Shift Inversion for Out-of-Distribution Prediction
Runpeng Yu
Songhua Liu
Xingyi Yang
Xinchao Wang
OODD
33
19
0
14 Jun 2023
TryOnDiffusion: A Tale of Two UNets
TryOnDiffusion: A Tale of Two UNets
Luyang Zhu
Dawei Yang
Tyler Lixuan Zhu
F. Reda
William Chan
Chitwan Saharia
Mohammad Norouzi
Ira Kemelmacher-Shlizerman
DiffM
53
106
0
14 Jun 2023
On the Robustness of Latent Diffusion Models
On the Robustness of Latent Diffusion Models
Jianping Zhang
Zhuoer Xu
Shiwen Cui
Changhua Meng
Weibin Wu
Michael R. Lyu
AAML
32
20
0
14 Jun 2023
GBSD: Generative Bokeh with Stage Diffusion
GBSD: Generative Bokeh with Stage Diffusion
Jieren Deng
Xiaoxia Zhou
Hao Tian
Zhihong Pan
Derek Aguiar
DiffM
37
1
0
14 Jun 2023
Diffusion in Diffusion: Cyclic One-Way Diffusion for
  Text-Vision-Conditioned Generation
Diffusion in Diffusion: Cyclic One-Way Diffusion for Text-Vision-Conditioned Generation
Ruoyu Wang
Yongqi Yang
Zhihao Qian
Ye Zhu
Yuehua Wu
DiffM
38
13
0
14 Jun 2023
CLIPXPlore: Coupled CLIP and Shape Spaces for 3D Shape Exploration
CLIPXPlore: Coupled CLIP and Shape Spaces for 3D Shape Exploration
Jingyu Hu
Ka-Hei Hui
Zhengzhe Liu
Haotong Zhang
Chi-Wing Fu
31
2
0
14 Jun 2023
DORSal: Diffusion for Object-centric Representations of Scenes et al
DORSal: Diffusion for Object-centric Representations of Scenes et al
Allan Jabri
Sjoerd van Steenkiste
Emiel Hoogeboom
Mehdi S. M. Sajjadi
Thomas Kipf
37
16
0
13 Jun 2023
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Shuai Yang
Yifan Zhou
Ziwei Liu
Chen Change Loy
VGen
DiffM
54
209
0
13 Jun 2023
Generative Watermarking Against Unauthorized Subject-Driven Image
  Synthesis
Generative Watermarking Against Unauthorized Subject-Driven Image Synthesis
Yi Ma
Zhengyu Zhao
Xinlei He
Zheng Li
Michael Backes
Yang Zhang
AAML
WIGM
45
21
0
13 Jun 2023
Dynamically Masked Discriminator for Generative Adversarial Networks
Dynamically Masked Discriminator for Generative Adversarial Networks
Wentian Zhang
Haozhe Liu
Bing Li
Jinheng Xie
Yawen Huang
Yuexiang Li
Yefeng Zheng
Guohao Li
TTA
45
2
0
13 Jun 2023
Paste, Inpaint and Harmonize via Denoising: Subject-Driven Image Editing
  with Pre-Trained Diffusion Model
Paste, Inpaint and Harmonize via Denoising: Subject-Driven Image Editing with Pre-Trained Diffusion Model
Xinyu Zhang
Jiaxian Guo
Paul D. Yoo
Yutaka Matsuo
Yusuke Iwasawa
DiffM
51
22
0
13 Jun 2023
I See Dead People: Gray-Box Adversarial Attack on Image-To-Text Models
I See Dead People: Gray-Box Adversarial Attack on Image-To-Text Models
Raz Lapid
Moshe Sipper
AAML
37
17
0
13 Jun 2023
User-defined Event Sampling and Uncertainty Quantification in Diffusion
  Models for Physical Dynamical Systems
User-defined Event Sampling and Uncertainty Quantification in Diffusion Models for Physical Dynamical Systems
Marc Finzi
Anudhyan Boral
A. Wilson
Fei Sha
Leonardo Zepeda-Núñez
DiffM
43
21
0
13 Jun 2023
Controlling Text-to-Image Diffusion by Orthogonal Finetuning
Controlling Text-to-Image Diffusion by Orthogonal Finetuning
Zeju Qiu
Wei-yu Liu
Haiwen Feng
Yuxuan Xue
Yao Feng
Zhen Liu
Dan Zhang
Adrian Weller
Bernhard Schölkopf
DiffM
56
142
0
12 Jun 2023
Scalable 3D Captioning with Pretrained Models
Scalable 3D Captioning with Pretrained Models
Tiange Luo
C. Rockwell
Honglak Lee
Justin Johnson
37
155
0
12 Jun 2023
MovieFactory: Automatic Movie Creation from Text using Large Generative
  Models for Language and Images
MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images
Sitong Su
Huan Yang
Huiguo He
Wenjing Wang
Zixi Tuo
Wen-Huang Cheng
Lianli Gao
Jingkuan Song
Jianlong Fu
VGen
DiffM
47
40
0
12 Jun 2023
Fill-Up: Balancing Long-Tailed Data with Generative Models
Fill-Up: Balancing Long-Tailed Data with Generative Models
Joonghyuk Shin
Minguk Kang
Jaesik Park
53
30
0
12 Jun 2023
Diffusion Models for Black-Box Optimization
Diffusion Models for Black-Box Optimization
S. Krishnamoorthy
Satvik Mashkaria
Aditya Grover
DiffM
52
52
0
12 Jun 2023
InstructP2P: Learning to Edit 3D Point Clouds with Text Instructions
InstructP2P: Learning to Edit 3D Point Clouds with Text Instructions
Jiale Xu
Xintao Wang
Yannan Cao
Weihao Cheng
Ying Shan
Shenghua Gao
DiffM
33
11
0
12 Jun 2023
VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion
  Models
VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion Models
Sheng-Yen Chou
Pin-Yu Chen
Tsung-Yi Ho
DiffM
28
54
0
12 Jun 2023
Face0: Instantaneously Conditioning a Text-to-Image Model on a Face
Face0: Instantaneously Conditioning a Text-to-Image Model on a Face
Dani Valevski
Danny Lumen
Yossi Matias
Yaniv Leviathan
DiffM
VLM
29
76
0
11 Jun 2023
Image Vectorization: a Review
Image Vectorization: a Review
Maria Dziuba
Ivan Jarsky
Valeria Efimova
Andrey Filchenkov
3DV
DiffM
40
10
0
10 Jun 2023
AutoSAM: Adapting SAM to Medical Images by Overloading the Prompt
  Encoder
AutoSAM: Adapting SAM to Medical Images by Overloading the Prompt Encoder
Tal Shaharabany
Aviad Dahan
Raja Giryes
Lior Wolf
MedIm
VLM
24
68
0
10 Jun 2023
Boosting GUI Prototyping with Diffusion Models
Boosting GUI Prototyping with Diffusion Models
Jialiang Wei
A. Courbis
Thomas Lambolais
Binbin Xu
P. Bernard
Gérard Dray
DiffM
37
22
0
09 Jun 2023
Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract
  Scene Descriptions
Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract Scene Descriptions
Ian Huang
Vrishab Krishna
Omoruyi E. Atekha
Leonidas Guibas
DiffM
VGen
37
11
0
09 Jun 2023
The Age of Synthetic Realities: Challenges and Opportunities
The Age of Synthetic Realities: Challenges and Opportunities
J. P. Cardenuto
Jing Yang
Rafael Padilha
Renjie Wan
Daniel Moreira
Haoliang Li
Shiqi Wang
Fernanda A. Andaló
Sébastien Marcel
Anderson de Rezende Rocha
DeLMO
58
29
0
09 Jun 2023
Evaluating the Social Impact of Generative AI Systems in Systems and
  Society
Evaluating the Social Impact of Generative AI Systems in Systems and Society
Irene Solaiman
Zeerak Talat
William Agnew
Lama Ahmad
Dylan K. Baker
...
Marie-Therese Png
Shubham Singh
A. Strait
Lukas Struppek
Arjun Subramonian
ELM
EGVM
67
106
0
09 Jun 2023
RePaint-NeRF: NeRF Editting via Semantic Masks and Diffusion Models
RePaint-NeRF: NeRF Editting via Semantic Masks and Diffusion Models
Xing-Chun Zhou
Ying He
F. Richard Yu
Jianqiang Li
You Li
DiffM
57
18
0
09 Jun 2023
Safety and Fairness for Content Moderation in Generative Models
Safety and Fairness for Content Moderation in Generative Models
Susan Hao
Piyush Kumar
Sarah Laszlo
Shivani Poddar
Bhaktipriya Radharapu
Renee Shelby
EGVM
50
20
0
09 Jun 2023
BOOT: Data-free Distillation of Denoising Diffusion Models with
  Bootstrapping
BOOT: Data-free Distillation of Denoising Diffusion Models with Bootstrapping
Jiatao Gu
Shuangfei Zhai
Yizhe Zhang
Lingjie Liu
J. Susskind
DiffM
36
70
0
08 Jun 2023
Previous
123...656667...868788
Next