Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.16785
Cited By
PromptFix: You Prompt and We Fix the Photo
27 May 2024
Yongsheng Yu
Ziyun Zeng
Hang Hua
Jianlong Fu
Jiebo Luo
MLLM
DiffM
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Github (780★)
Papers citing
"PromptFix: You Prompt and We Fix the Photo"
45 / 45 papers shown
Title
Step1X-Edit: A Practical Framework for General Image Editing
Shixuan Liu
Yucheng Han
Peng Xing
Fukun Yin
Rui Wang
...
Yibo Zhu
Binxing Jiao
Wei Wei
Gang Yu
Daxin Jiang
DiffM
193
24
0
24 Apr 2025
MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation
Petru-Daniel Tudosiu
Yongxin Yang
Shifeng Zhang
Fei Chen
Jingyu Sun
Gerasimos Lampouras
Ignacio Iacobacci
Sarah Parisot
86
12
0
03 Apr 2024
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Xiwei Hu
Rui Wang
Yixiao Fang
Bin-Bin Fu
Pei Cheng
Gang Yu
VLM
99
94
0
08 Mar 2024
Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks
Yuhao Liu
Zhanghan Ke
Fang Liu
Nanxuan Zhao
Rynson W. H. Lau
DiffM
76
23
0
01 Mar 2024
Inst-Inpaint: Instructing to Remove Objects with Diffusion Models
Ahmet Burak Yildirim
Vedat Baday
Erkut Erdem
Aykut Erdem
Aysegül Dündar
DiffM
98
63
0
06 Apr 2023
HIVE: Harnessing Human Feedback for Instructional Visual Editing
Shu Zhen Zhang
Xinyi Yang
Yihao Feng
Can Qin
Chia-Chih Chen
...
Haiquan Wang
Silvio Savarese
Stefano Ermon
Caiming Xiong
Ran Xu
82
115
0
16 Mar 2023
InstructPix2Pix: Learning to Follow Image Editing Instructions
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
213
1,835
0
17 Nov 2022
PromptCap: Prompt-Guided Task-Aware Image Captioning
Yushi Hu
Hang Hua
Zhengyuan Yang
Weijia Shi
Noah A. Smith
Jiebo Luo
100
104
0
15 Nov 2022
DiffEdit: Diffusion-based semantic image editing with mask guidance
Guillaume Couairon
Jakob Verbeek
Holger Schwenk
Matthieu Cord
DiffM
145
511
0
20 Oct 2022
LAION-5B: An open large-scale dataset for training next generation image-text models
Christoph Schuhmann
Romain Beaumont
Richard Vencu
Cade Gordon
Ross Wightman
...
Srivatsa Kundurthy
Katherine Crowson
Ludwig Schmidt
R. Kaczmarczyk
J. Jitsev
VLM
MLLM
CLIP
200
3,502
0
16 Oct 2022
High-Fidelity Image Inpainting with GAN Inversion
Yongsheng Yu
Libo Zhang
Hengrui Fan
Tiejian Luo
60
27
0
25 Aug 2022
Prompt-to-Prompt Image Editing with Cross Attention Control
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
206
1,790
0
02 Aug 2022
Elucidating the Design Space of Diffusion-Based Generative Models
Tero Karras
M. Aittala
Timo Aila
S. Laine
DiffM
220
2,033
0
01 Jun 2022
MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment
Sidi Yang
Tianhe Wu
Shu Shi
Shanshan Lao
S. Gong
Ming Cao
Jiahao Wang
Yujiu Yang
80
339
0
19 Apr 2022
VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance
Katherine Crowson
Stella Biderman
Daniel Kornis
Dashiell Stander
Eric Hallahan
Louis Castricato
Edward Raff
CLIP
135
381
0
18 Apr 2022
Text2LIVE: Text-Driven Layered Image and Video Editing
Omer Bar-Tal
Dolev Ofri-Amar
Rafail Fridman
Yoni Kasten
Tali Dekel
VGen
DiffM
101
317
0
05 Apr 2022
Generative Adversarial Networks
Gilad Cohen
Raja Giryes
GAN
298
30,152
0
01 Mar 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
502
15,788
0
20 Dec 2021
CLIPstyler: Image Style Transfer with a Single Text Condition
Gihyun Kwon
Jong Chul Ye
VLM
CLIP
86
247
0
01 Dec 2021
Blended Diffusion for Text-driven Editing of Natural Images
Omri Avrahami
Dani Lischinski
Ohad Fried
DiffM
135
954
0
29 Nov 2021
DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation
Gwanghyun Kim
Taesung Kwon
Jong Chul Ye
DiffM
203
656
0
06 Oct 2021
Resolution-robust Large Mask Inpainting with Fourier Convolutions
Roman Suvorov
Elizaveta Logacheva
Anton Mashikhin
Anastasia Remizova
Arsenii Ashukha
Aleksei Silvestrov
Naejin Kong
Harshith Goka
Kiwoong Park
Victor Lempitsky
108
868
0
15 Sep 2021
LoRA: Low-Rank Adaptation of Large Language Models
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
502
10,526
0
17 Jun 2021
TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text
Amanpreet Singh
Guan Pang
Mandy Toh
Jing Huang
Wojciech Galuba
Tal Hassner
68
174
0
12 May 2021
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
271
7,958
0
11 May 2021
WDNet: Watermark-Decomposition Network for Visible Watermark Removal
Yang Liu
Zhen Zhu
X. Bai
61
50
0
14 Dec 2020
Split then Refine: Stacked Attention-guided ResUNets for Blind Single Image Visible Watermark Removal
Xiaodong Cun
Chi-Man Pun
116
45
0
13 Dec 2020
Score-Based Generative Modeling through Stochastic Differential Equations
Yang Song
Jascha Narain Sohl-Dickstein
Diederik P. Kingma
Abhishek Kumar
Stefano Ermon
Ben Poole
DiffM
SyDa
358
6,586
0
26 Nov 2020
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
295
7,492
0
06 Oct 2020
Human-Aware Motion Deblurring
Ziyi Shen
Wenguan Wang
Xiankai Lu
Jianbing Shen
Haibin Ling
Tingfa Xu
Ling Shao
3DH
81
289
0
19 Jan 2020
Generative Modeling by Estimating Gradients of the Data Distribution
Yang Song
Stefano Ermon
SyDa
DiffM
258
3,961
0
12 Jul 2019
Dense Haze: A benchmark for image dehazing with dense-haze and haze-free images
C. Ancuti
Cosmin Ancuti
M. Sbert
Radu Timofte
53
303
0
05 Apr 2019
A Style-Based Generator Architecture for Generative Adversarial Networks
Tero Karras
S. Laine
Timo Aila
619
10,595
0
12 Dec 2018
Deep Retinex Decomposition for Low-Light Enhancement
Chen Wei
Wenjing Wang
Wenhan Yang
Jiaying Liu
108
1,734
0
14 Aug 2018
Learning to See in the Dark
Cheng Chen
Qifeng Chen
Jia Xu
V. Koltun
301
1,190
0
04 May 2018
The Unreasonable Effectiveness of Deep Features as a Perceptual Metric
Richard Y. Zhang
Phillip Isola
Alexei A. Efros
Eli Shechtman
Oliver Wang
EGVM
384
11,920
0
11 Jan 2018
Benchmarking Single Image Dehazing and Beyond
Boyi Li
Wenqi Ren
Dengpan Fu
Dacheng Tao
Dan Feng
Wenjun Zeng
Zhangyang Wang
VLM
72
1,542
0
12 Dec 2017
DesnowNet: Context-Aware Deep Network for Snow Removal
Yun-Fu Liu
Da-Wei Jaw
Shih-Chia Huang
Lei Li
65
334
0
15 Aug 2017
Deep Multi-scale Convolutional Neural Network for Dynamic Scene Deblurring
Seungjun Nah
Tae Hyun Kim
Kyoung Mu Lee
147
1,981
0
07 Dec 2016
Generative Adversarial Text to Image Synthesis
Scott E. Reed
Zeynep Akata
Xinchen Yan
Lajanugen Logeswaran
Bernt Schiele
Honglak Lee
GAN
209
3,149
0
17 May 2016
WIDER FACE: A Face Detection Benchmark
Shuo Yang
Ping Luo
Chen Change Loy
Xiaoou Tang
CVBM
101
1,595
0
20 Nov 2015
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models
Bryan A. Plummer
Liwei Wang
Christopher M. Cervantes
Juan C. Caicedo
Julia Hockenmaier
Svetlana Lazebnik
208
2,074
0
19 May 2015
VQA: Visual Question Answering
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
233
5,509
0
03 May 2015
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Jascha Narain Sohl-Dickstein
Eric A. Weiss
Niru Maheswaranathan
Surya Ganguli
SyDa
DiffM
312
7,031
0
12 Mar 2015
Deep Learning Face Attributes in the Wild
Ziwei Liu
Ping Luo
Xiaogang Wang
Xiaoou Tang
CVBM
253
8,429
0
28 Nov 2014
1