2410.11815
Cited By
SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing
15 October 2024
Zhiyuan Zhang
Dongdong Chen
J. Liao
DiffM
Papers citing
"SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing"
17 / 17 papers shown
Title
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs
Ling Yang
Zhaochen Yu
Chenlin Meng
Minkai Xu
Stefano Ermon
Tengjiao Wang
CoGe
DiffM
105
137
0
22 Jan 2024
De-Diffusion Makes Text a Strong Cross-Modal Interface
Chen Wei
Chenxi Liu
Siyuan Qiao
Zhishuai Zhang
Alan Yuille
Jiahui Yu
VLM
DiffM
78
11
0
01 Nov 2023
Intriguing properties of synthetic images: from generative adversarial networks to diffusion models
Riccardo Corvi
D. Cozzolino
Giovanni Poggi
Koki Nagano
L. Verdoliva
DiffM
91
94
0
13 Apr 2023
GPT-4 Technical Report
OpenAI
Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAG
MLLM
1.5K
14,748
0
15 Mar 2023
InstructPix2Pix: Learning to Follow Image Editing Instructions
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
213
1,835
0
17 Nov 2022
Prompt-to-Prompt Image Editing with Cross Attention Control
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
206
1,790
0
02 Aug 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Rinon Gal
Yuval Alaluf
Yuval Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
168
1,897
0
02 Aug 2022
Blended Latent Diffusion
Omri Avrahami
Ohad Fried
Dani Lischinski
DiffM
138
390
0
06 Jun 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
Steven Hoi
MLLM
BDL
VLM
CLIP
555
4,413
0
28 Jan 2022
GODIVA: Generating Open-DomaIn Videos from nAtural Descriptions
Chenfei Wu
Lun Huang
Qianxi Zhang
Binyang Li
Lei Ji
Fan Yang
Guillermo Sapiro
Nan Duan
DiffM
VGen
84
243
0
30 Apr 2021
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
730
6,135
0
29 Apr 2021
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
295
7,492
0
06 Oct 2020
LayoutTransformer: Layout Generation and Completion with Self-attention
Kamal Gupta
Justin Lazarow
Alessandro Achille
Larry S. Davis
Vijay Mahadevan
Abhinav Shrivastava
ViT
91
137
0
25 Jun 2020
Semantic Image Manipulation Using Scene Graphs
Helisa Dhamo
Azade Farshad
Iro Laina
Nassir Navab
Gregory Hager
Federico Tombari
Christian Rupprecht
105
121
0
07 Apr 2020
Image Generation from Scene Graphs
Justin Johnson
Agrim Gupta
Li Fei-Fei
GNN
316
821
0
04 Apr 2018
The Unreasonable Effectiveness of Deep Features as a Perceptual Metric
Richard Y. Zhang
Phillip Isola
Alexei A. Efros
Eli Shechtman
Oliver Wang
EGVM
384
11,920
0
11 Jan 2018
SPICE: Semantic Propositional Image Caption Evaluation
Peter Anderson
Basura Fernando
Mark Johnson
Stephen Gould
EGVM
108
1,919
0
29 Jul 2016