2211.09800
InstructPix2Pix: Learning to Follow Image Editing Instructions
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
Papers citing
"InstructPix2Pix: Learning to Follow Image Editing Instructions"
50 / 1,350 papers shown
Title
ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval
Zijia Zhao
Longteng Guo
Tongtian Yue
Erdong Hu
Shuai Shao
Zehuan Yuan
Hua Huang
Jiaheng Liu
28
1
0
24 Oct 2024
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances
Shilin Lu
Zihan Zhou
Jiayou Lu
Yuanzhi Zhu
A. Kong
WIGM
97
10
0
24 Oct 2024
Neural Cover Selection for Image Steganography
Karl Chahine
Hyeji Kim
DiffM
21
0
0
23 Oct 2024
WorldSimBench: Towards Video Generation Models as World Simulators
Yiran Qin
Zhelun Shi
Jiwen Yu
Xijun Wang
Enshen Zhou
...
Lu Sheng
Jing Shao
Junlin Wu
Wanli Ouyang
Ruimao Zhang
EGVM
VGen
126
381
0
23 Oct 2024
One-Step Diffusion Distillation through Score Implicit Matching
Weijian Luo
Zemin Huang
Zhengyang Geng
J. Zico Kolter
Guo-jun Qi
DiffM
37
13
0
22 Oct 2024
Progressive Compositionality in Text-to-Image Generative Models
Xu Han
Linghao Jin
Xiaofeng Liu
Paul Pu Liang
CoGe
106
2
0
22 Oct 2024
DocEdit-v2: Document Structure Editing Via Multimodal LLM Grounding
Manan Suri
Puneet Mathur
Franck Dernoncourt
R. Jain
Vlad I. Morariu
Ramit Sawhney
Preslav Nakov
Dinesh Manocha
37
1
0
21 Oct 2024
A roadmap for generative mapping: unlocking the power of generative AI for map-making
Sidi Wu
Katharina Henggeler
Yizi Chen
L. Hurni
19
1
0
21 Oct 2024
MedDiff-FM: A Diffusion-based Foundation Model for Versatile Medical Image Applications
Yongrui Yu
Yannian Gu
S. Zhang
Xiaofan Zhang
MedIm
41
2
0
20 Oct 2024
DiffuseST: Unleashing the Capability of the Diffusion Model for Style Transfer
Ying Hu
Chenyi Zhuang
Pan Gao
DiffM
28
0
0
19 Oct 2024
SeaS: Few-shot Industrial Anomaly Image Generation with Separation and Sharing Fine-tuning
Zhewei Dai
Shilei Zeng
Haotian Liu
Xurui Li
Feng Xue
Yu Zhou
DiffM
40
2
0
19 Oct 2024
HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image Generation
Bo Cheng
Yuhang Ma
Liebucha Wu
Shanyuan Liu
Ao Ma
Xiaoyu Wu
Dawei Leng
Yuhui Yin
DiffM
24
8
0
18 Oct 2024
PUMA: Empowering Unified MLLM with Multi-granular Visual Generation
Rongyao Fang
Chengqi Duan
Kun Wang
Hao Li
H. Tian
Xingyu Zeng
Rui Zhao
Jifeng Dai
Hongsheng Li
Xihui Liu
MLLM
36
11
0
17 Oct 2024
AdaptiveDrag: Semantic-Driven Dragging on Diffusion-Based Image Editing
DuoSheng Chen
Binghui Chen
Yifeng Geng
Liefeng Bo
DiffM
30
1
0
16 Oct 2024
Imagine2Servo: Intelligent Visual Servoing with Diffusion-Driven Goal Generation for Robotic Tasks
Pranjali Pathre
Gunjan Gupta
M. N. Qureshi
Mandyam Brunda
Samarth Brahmbhatt
K. M. Krishna
VGen
34
0
0
16 Oct 2024
SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing
Zhiyuan Zhang
Dongdong Chen
J. Liao
DiffM
26
3
0
15 Oct 2024
DreamSteerer: Enhancing Source Image Conditioned Editability using Personalized Diffusion Models
Zhengyang Yu
Zhaoyuan Yang
Jing Zhang
DiffM
26
2
0
15 Oct 2024
Improving Long-Text Alignment for Text-to-Image Diffusion Models
Luping Liu
Chao Du
Tianyu Pang
Zehan Wang
Chongxuan Li
Dong Xu
VLM
53
4
0
15 Oct 2024
Incorporating Task Progress Knowledge for Subgoal Generation in Robotic Manipulation through Image Edits
Xuhui Kang
Yen-Ling Kuo
38
3
0
14 Oct 2024
MagicEraser: Erasing Any Objects via Semantics-Aware Control
Fan Li
Zixiao Zhang
Yi Huang
Jianzhuang Liu
Renjing Pei
Bin Shao
Songcen Xu
DiffM
44
6
0
14 Oct 2024
Learning to Customize Text-to-Image Diffusion In Diverse Context
Taewook Kim
Wei Chen
Qiang Qiu
DiffM
38
2
0
14 Oct 2024
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Jinbin Bai
Tian-Chun Ye
Wei Chow
Enxin Song
Qing-Guo Chen
Hefei Ling
Zhen Dong
Lei Zhu
66
13
0
10 Oct 2024
AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation
Yukang Cao
Liang Pan
Kai Han
Kwan-Yee K. Wong
Ziwei Liu
VGen
38
6
0
09 Oct 2024
InstructG2I: Synthesizing Images from Multimodal Attributed Graphs
Bowen Jin
Ziqi Pang
Bingjun Guo
Yu-Xiong Wang
Jiaxuan You
Jiawei Han
DiffM
47
1
0
09 Oct 2024
Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control
Shimon Vainer
Konstantin Kutsy
Dante De Nigris
Ciara Rowles
Slava Elizarov
Simon Donné
DiffM
66
1
0
09 Oct 2024
HFH-Font: Few-shot Chinese Font Synthesis with Higher Quality, Faster Speed, and Higher Resolution
Yiming Li
Zhouhui Lian
64
2
0
09 Oct 2024
PixLens: A Novel Framework for Disentangled Evaluation in Diffusion-Based Image Editing with Object Detection + SAM
Stefan Stefanache
Lluís Pastor Pérez
Julen Costa Watanabe
Ernesto Sanchez Tejedor
Thomas Hofmann
Enis Simsar
EGVM
28
0
0
08 Oct 2024
DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image Editing
June Suk Choi
Kyungmin Lee
Jongheon Jeong
Saining Xie
Jinwoo Shin
Kimin Lee
DiffM
AAML
33
2
0
08 Oct 2024
Multimodal Large Language Models and Tunings: Vision, Language, Sensors, Audio, and Beyond
Soyeon Caren Han
Feiqi Cao
Josiah Poon
Roberto Navigli
MLLM
VLM
32
5
0
08 Oct 2024
Generative Portrait Shadow Removal
Jae Shin Yoon
Zhixin Shu
Mengwei Ren
Xuaner Zhang
Yannick Hold-Geoffroy
Krishna Kumar Singh
He Zhang
DiffM
21
1
0
07 Oct 2024
TextureMeDefect: LLM-based Defect Texture Generation for Railway Components on Mobile Devices
Rahatara Ferdousi
M. Anwar Hossain
Abdulmotaleb El Saddik
16
0
0
07 Oct 2024
GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting
Yukang Cao
Masoud Hadi
Liang Pan
Ziwei Liu
3DGS
DiffM
53
4
0
07 Oct 2024
Revealing Directions for Text-guided 3D Face Editing
Zhuo Chen
Yichao Yan
Sehngqi Liu
Yuhao Cheng
Weiming Zhao
Lincheng Li
Mengxiao Bi
Xiaokang Yang
DiffM
37
0
0
07 Oct 2024
Image Watermarks are Removable Using Controllable Regeneration from Clean Noise
Yepeng Liu
Yiren Song
Hai Ci
Yu Zhang
Haofan Wang
Mike Zheng Shou
Yuheng Bu
WIGM
56
3
0
07 Oct 2024
DeepONet for Solving Nonlinear Partial Differential Equations with Physics-Informed Training
Yahong Yang
25
0
0
06 Oct 2024
Real-World Benchmarks Make Membership Inference Attacks Fail on Diffusion Models
Chumeng Liang
Jiaxuan You
40
0
0
04 Oct 2024
ScriptViz: A Visualization Tool to Aid Scriptwriting based on a Large Movie Database
Anyi Rao
Jean-Peic Chou
Maneesh Agrawala
VGen
28
2
0
04 Oct 2024
SAFLEX: Self-Adaptive Augmentation via Feature Label Extrapolation
Mucong Ding
Bang An
Yuancheng Xu
Anirudh Satheesh
Furong Huang
27
1
0
03 Oct 2024
Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation
Muzhi Zhu
Yang Liu
Zekai Luo
Chenchen Jing
Hao Chen
Guangkai Xu
Xinlong Wang
Chunhua Shen
DiffM
VLM
36
3
0
03 Oct 2024
Towards Native Generative Model for 3D Head Avatar
Yiyu Zhuang
Yuxiao He
Jiawei Zhang
Yanwen Wang
Jiahe Zhu
Yao Yao
Siyu Zhu
Xun Cao
Hao Zhu
3DH
34
4
0
02 Oct 2024
ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer
Zhen Han
Zeyinzi Jiang
Yulin Pan
Jingfeng Zhang
Chaojie Mao
Chenwei Xie
Yu Liu
Jingren Zhou
DiffM
32
17
0
30 Sep 2024
FoAM: Foresight-Augmented Multi-Task Imitation Policy for Robotic Manipulation
Litao Liu
Wentao Wang
Yifan Han
Zhuoli Xie
Pengfei Yi
Junyan Li
Yi Qin
Wenzhao Lian
37
2
0
29 Sep 2024
Conditional Image Synthesis with Diffusion Models: A Survey
Zheyuan Zhan
Defang Chen
Jian-Ping Mei
Zhenghe Zhao
Jiawei Chen
Chun Chen
Siwei Lyu
Can Wang
VLM
45
5
0
28 Sep 2024
Multimodal Pragmatic Jailbreak on Text-to-image Models
Tong Liu
Zhixin Lai
Gengyuan Zhang
Philip Torr
Vera Demberg
Volker Tresp
Jindong Gu
37
4
0
27 Sep 2024
Word2Wave: Language Driven Mission Programming for Efficient Subsea Deployments of Marine Robots
Ruo Chen
David Blow
Adnan Abdullah
Md Jahidul Islam
50
1
0
27 Sep 2024
Text2FX: Harnessing CLAP Embeddings for Text-Guided Audio Effects
Annie Chu
P. O'Reilly
Julia Barnett
Bryan Pardo
CLIP
36
1
0
27 Sep 2024
Amodal Instance Segmentation with Diffusion Shape Prior Estimation
Minh Tran
Khoa T. Vo
Tri Nguyen
Ngan Le
DiffM
34
0
0
26 Sep 2024
FreeEdit: Mask-free Reference-based Image Editing with Multi-modal Instruction
Runze He
Kai Ma
Linjiang Huang
Shaofei Huang
Jialin Gao
Xiaoming Wei
Jiao Dai
Jizhong Han
Si Liu
DiffM
52
7
0
26 Sep 2024
Visual Data Diagnosis and Debiasing with Concept Graphs
Rwiddhi Chakraborty
Yinong Wang
Jialu Gao
Runkai Zheng
Cheng Zhang
Fernando de la Torre
37
2
0
26 Sep 2024
AnyLogo: Symbiotic Subject-Driven Diffusion System with Gemini Status
Jinghao Zhang
Wen Qian
Hao Luo
Fan Wang
Feng Zhao
DiffM
28
0
0
26 Sep 2024