Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM
arXiv:2205.11487

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 1,384 papers shown
Prompt-tuning latent diffusion models for inverse problems
Hyungjin Chung
Jong Chul Ye
P. Milanfar
M. Delbracio
DiffM
108
44
0
02 Oct 2023
Feedback-guided Data Synthesis for Imbalanced Classification
Reyhane Askari Hemmat
Mohammad Pezeshki
Florian Bordes
M. Drozdzal
Adriana Romero Soriano
SyDa
96
21
0
29 Sep 2023
LLM-grounded Video Diffusion Models
Long Lian
Baifeng Shi
Semih Yavuz
Ye Liu
Boyi Li
DiffM
103
55
0
29 Sep 2023
Directly Fine-Tuning Diffusion Models on Differentiable Rewards
Amita Gajewar
Paul Vicol
G. Bansal
David J Fleet
110
177
0
29 Sep 2023
GAIA-1: A Generative World Model for Autonomous Driving
Masane Fuchi
Lloyd Russell
Hudson Yeo
Zak Murez
Hiroto Minami
Alex Kendall
Tomohiro Takagi
Gianluca Corrado
VGen
130
252
0
29 Sep 2023
AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive Computation
Shengkun Tang
Yaqing Wang
Maksim Dzhigil
Yi Liang
Yongbin Li
Dongkuan Xu
62
7
0
29 Sep 2023
Towards Few-Call Model Stealing via Active Self-Paced Knowledge Distillation and Diffusion-Based Image Generation
Vlad Hondru
Radu Tudor Ionescu
DiffM
104
2
0
29 Sep 2023
From LAION-5B to LAION-EO: Filtering Billions of Images Using Anchor Datasets for Satellite Image Extraction
Mikolaj Czerkawski
Alistair Francis
83
9
0
27 Sep 2023
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
David Junhao Zhang
Jay Zhangjie Wu
Jia-Wei Liu
Rui Zhao
L. Ran
Yuchao Gu
Difei Gao
Mike Zheng Shou
DiffM, VGen
129
223
0
27 Sep 2023
A Simple Text to Video Model via Transformer
Gang Chen
ViT
37
1
0
26 Sep 2023
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Jiahao Xie
Wei Li
Xiangtai Li
Ziwei Liu
Yew-Soon Ong
Chen Change Loy
DiffM, VLM
154
37
0
22 Sep 2023
Looking at words and points with attention: a benchmark for text-to-shape coherence
Andrea Amaduzzi
Giuseppe Lisanti
Samuele Salti
Luigi Di Stefano
49
2
0
14 Sep 2023
DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models
Namhyuk Ahn
Junsoo Lee
Chunggi Lee
Kunhee Kim
Daesik Kim
Seung-Hun Nam
Kibeom Hong
DiffM
89
24
0
13 Sep 2023
Elucidating the solution space of extended reverse-time SDE for diffusion models
Qinpeng Cui
Xinyi Zhang
Zongqing Lu
Qingmin Liao
DiffM
103
6
0
12 Sep 2023
Treatment-aware Diffusion Probabilistic Model for Longitudinal MRI Generation and Diffuse Glioma Growth Prediction
Qinghui Liu
E. Fuster-García
I. T. Hovden
Donatas Sederevičius
Karoline Skogen
...
Till Schellhorn
P. Brandal
A. Bjørnerud
K. Emblem
Kyrre Eeg Emblem
107
3
0
11 Sep 2023
SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models
Shuchen Xue
Mingyang Yi
Weijian Luo
Shifeng Zhang
Jiacheng Sun
Zechao Li
Zhi-Ming Ma
DiffM
165
52
0
10 Sep 2023
DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection
Manlin Zhang
Jie Wu
Yuxi Ren
Ming Li
Jie Qin
Xuefeng Xiao
Wei Liu
Rui Wang
Min Zheng
Andy J. Ma
DiffM
104
22
0
07 Sep 2023
Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
Jiaxi Gu
Shicong Wang
Haoyu Zhao
Tianyi Lu
Xing Zhang
Zuxuan Wu
Songcen Xu
Wei Zhang
Yu-Gang Jiang
Hang Xu
DiffM, VGen
82
48
0
07 Sep 2023
Elucidating the Exposure Bias in Diffusion Models
Mang Ning
Mingxiao Li
Jianlin Su
A. A. Salah
Itir Onal Ertugrul
DiffM
263
38
0
29 Aug 2023
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
Lin Geng Foo
Hossein Rahmani
Jing Liu
276
31
0
27 Aug 2023
Decoding Natural Images from EEG for Object Recognition
Yonghao Song
Bingchuan Liu
Xiang Li
Nanlin Shi
Yijun Wang
Xiaorong Gao
120
34
0
25 Aug 2023
Region-Disentangled Diffusion Model for High-Fidelity PPG-to-ECG Translation
Debaditya Shome
Pritam Sarkar
Ali Etemad
DiffM
83
12
0
25 Aug 2023
MOFA: A Model Simplification Roadmap for Image Restoration on Mobile Devices
Xiangyu Chen
Ruiwen Zhen
Shuai Li
Xiaotian Li
Guanghui Wang
59
1
0
24 Aug 2023
Blending-NeRF: Text-Driven Localized Editing in Neural Radiance Fields
H. Song
Seokhun Choi
Hoseok Do
Chul Lee
Taehyeong Kim
DiffM
116
24
0
23 Aug 2023
IT3D: Improved Text-to-3D Generation with Explicit View Synthesis
Yiwen Chen
Chi Zhang
Xiaofeng Yang
Zhongang Cai
Gang Yu
Lei Yang
Guo-Shing Lin
DiffM
98
64
0
22 Aug 2023
Backdooring Textual Inversion for Concept Censorship
Yutong Wu
Jiehan Zhang
Florian Kerschbaum
Tianwei Zhang
DiffM
93
7
0
21 Aug 2023
Make-It-4D: Synthesizing a Consistent Long-Term Dynamic Scene Video from a Single Image
Liao Shen
Xingyi Li
Huiqiang Sun
Juewen Peng
Ke Xian
Zhiguo Cao
Guo-Shing Lin
DiffM
103
15
0
20 Aug 2023
AltDiffusion: A Multilingual Text-to-Image Diffusion Model
Fulong Ye
Guangyi Liu
Xinya Wu
Ledell Yu Wu
VLM
103
29
0
19 Aug 2023
Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization
Soumik Mukhopadhyay
Saksham Suri
R. Gadde
Abhinav Shrivastava
DiffM
80
24
0
18 Aug 2023
Language-Guided Diffusion Model for Visual Grounding
Sijia Chen
Baochun Li
142
5
0
18 Aug 2023
Dual-Stream Diffusion Net for Text-to-Video Generation
Binhui Liu
Xin Liu
Anbo Dai
Zhiyong Zeng
Dan Wang
Zhen Cui
Jian Yang
DiffM, VGen
95
10
0
16 Aug 2023
Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis
Minho Park
Jooyeol Yun
Seunghwan Choi
Jaegul Choo
DiffM
75
11
0
16 Aug 2023
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Ouyang Hao
Qiuyu Wang
Yuxi Xiao
Qingyan Bai
Juntao Zhang
Kecheng Zheng
Xiaowei Zhou
Qifeng Chen
Yujun Shen
DiffM, VGen
123
85
0
15 Aug 2023
UniBrain: Unify Image Reconstruction and Captioning All in One Diffusion Model from Human Brain Activity
Weijian Mai
Zhijun Zhang
DiffM
81
35
0
14 Aug 2023
MarkovGen: Structured Prediction for Efficient Text-to-Image Generation
Sadeep Jayasumana
Daniel Glasner
Srikumar Ramalingam
Andreas Veit
Ayan Chakrabarti
Surinder Kumar
DiffM
47
0
0
14 Aug 2023
Neural radiance fields in the industrial and robotics domain: applications, research opportunities and use cases
Eugen Šlapak
Enric Pardo
Matús Dopiriak
T. Maksymyuk
Juraj Gazda
AI4CE
88
15
0
14 Aug 2023
Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks
David Junhao Zhang
Mutian Xu
Chuhui Xue
Wenqing Zhang
Xiaoguang Han
Song Bai
Mike Zheng Shou
DiffM
131
6
0
13 Aug 2023
LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts
Binbin Yang
Yinzheng Luo
Ziliang Chen
Guangrun Wang
Xiaodan Liang
Liang Lin
DiffM
95
15
0
13 Aug 2023
Large Language Models and Foundation Models in Smart Agriculture: Basics, Opportunities, and Challenges
Jiajia Li
Mingle Xu
Lirong Xiang
Dong Chen
Weichao Zhuang
Xunyuan Yin
Zhao Li
130
3
0
13 Aug 2023
White-box Membership Inference Attacks against Diffusion Models
Yan Pang
Tianhao Wang
Xu Kang
Mengdi Huai
Yang Zhang
AAML, DiffM
82
24
0
11 Aug 2023
IDiff-Face: Synthetic-based Face Recognition through Fizzy Identity-Conditioned Diffusion Models
Fadi Boutros
J. H. Grebe
Arjan Kuijper
Naser Damer
65
63
0
09 Aug 2023
JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models
Peike Li
Bo-Yu Chen
Yao Yao
Yikai Wang
Allen Wang
Alex Jinpeng Wang
MGen, VLM, DiffM
167
41
0
09 Aug 2023
From Fake to Real: Pretraining on Balanced Synthetic Images to Prevent Spurious Correlations in Image Recognition
Maan Qraitem
Kate Saenko
Bryan A. Plummer
121
4
0
08 Aug 2023
DiffCR: A Fast Conditional Diffusion Framework for Cloud Removal from Optical Satellite Images
Xuechao Zou
Keqin Li
Junliang Xing
Yu-an Zhang
Shiying Wang
Lei Jin
Pin Tao
DiffM
92
34
0
08 Aug 2023
3D Scene Diffusion Guidance using Scene Graphs
Mohammad Naanaa
Katharina Schmid
Y. Nie
DiffM
61
0
0
08 Aug 2023
Food-500 Cap: A Fine-Grained Food Caption Benchmark for Evaluating Vision-Language Models
Zheng Ma
Mianzhi Pan
Wenhan Wu
Ka Leong Cheng
Jianbing Zhang
Shujian Huang
Jiajun Chen
VLM, CoGe
76
5
0
06 Aug 2023
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation
Yasheng Sun
Yifan Yang
Houwen Peng
Yifei Shen
Yuqing Yang
Hang-Rui Hu
Lili Qiu
Hideki Koike
DiffM, LM&Ro
87
39
0
02 Aug 2023
DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial Training
H. Oh
Sang-Hoon Lee
Seong-Whan Lee
DiffM
102
16
0
31 Jul 2023
Generative AI for Medical Imaging: extending the MONAI Framework
W. H. Pinaya
M. Graham
E. Kerfoot
Petru-Daniel Tudosiu
J. Dafflon
...
Andrew Feng
Marc Modat
P. Nachev
Sebastien Ourselin
M. Jorge Cardoso
SyDa, MedIm
103
72
0
27 Jul 2023
MCMC-Correction of Score-Based Diffusion Models for Model Composition
Anders Sjöberg
Jakob Lindqvist
Magnus Önnheim
Mats Jirstrand
Lennart Svensson
DiffM
92
3
0
26 Jul 2023