Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.09800
Cited By
v1
v2 (latest)
InstructPix2Pix: Learning to Follow Image Editing Instructions
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"InstructPix2Pix: Learning to Follow Image Editing Instructions"
50 / 1,418 papers shown
Title
OCTO+: A Suite for Automatic Open-Vocabulary Object Placement in Mixed Reality
Aditya Sharma
Luke Yoffe
Tobias Höllerer
71
8
0
17 Jan 2024
Instilling Multi-round Thinking to Text-guided Image Generation
Lidong Zeng
Zhedong Zheng
Yinwei Wei
Tat-Seng Chua
108
5
0
16 Jan 2024
Revealing Vulnerabilities in Stable Diffusion via Targeted Attacks
Chenyu Zhang
Lanjun Wang
Anan Liu
108
6
0
16 Jan 2024
RotationDrag: Point-based Image Editing with Rotated Diffusion Features
Minxing Luo
Wentao Cheng
Jian Yang
DiffM
76
1
0
12 Jan 2024
PALP: Prompt Aligned Personalization of Text-to-Image Models
Moab Arar
Andrey Voynov
Amir Hertz
Omri Avrahami
Shlomi Fruchter
Yael Pritch
Daniel Cohen-Or
Ariel Shamir
DiffM
101
22
0
11 Jan 2024
InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes
Mohamad Shahbazi
Liesbeth Claessens
Michael Niemeyer
Edo Collins
A. Tonioni
Luc Van Gool
Federico Tombari
95
12
0
10 Jan 2024
Score Distillation Sampling with Learned Manifold Corrective
Thiemo Alldieck
Nikos Kolotouros
C. Sminchisescu
DiffM
72
14
0
10 Jan 2024
Uni3D-LLM: Unifying Point Cloud Perception, Generation and Editing with Large Language Models
Dingning Liu
Xiaoshui Huang
Yuenan Hou
Zhihui Wang
Zhen-fei Yin
Yongshun Gong
Peng Gao
Wanli Ouyang
49
11
0
09 Jan 2024
Large Language Models for Robotics: Opportunities, Challenges, and Perspectives
Jiaqi Wang
Zihao Wu
Yiwei Li
Hanqi Jiang
Peng Shu
...
Lin Zhao
Bao Ge
Xiang Li
Tianming Liu
Shu Zhang
LM&Ro
97
75
0
09 Jan 2024
SpecRef: A Fast Training-free Baseline of Specific Reference-Condition Real Image Editing
Songyan Chen
Jiancheng Huang
DiffM
50
7
0
07 Jan 2024
MirrorDiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation by Prompts Redescription and Beyond
Yu-Hsiang Lin
Xiaoyu Xian
Yukai Shi
Liang Lin
DiffM
61
6
0
06 Jan 2024
Preserving Image Properties Through Initializations in Diffusion Models
Jeffrey Zhang
Shao-Yu Chang
Kedan Li
David Forsyth
DiffM
46
6
0
04 Jan 2024
Image Sculpting: Precise Object Editing with 3D Geometry Control
Jiraphon Yenphraphai
Xichen Pan
Sainan Liu
Daniele Panozzo
Saining Xie
78
22
0
02 Jan 2024
Joint Generative Modeling of Scene Graphs and Images via Diffusion Models
Bicheng Xu
Qi Yan
Renjie Liao
Lele Wang
Leonid Sigal
DiffM
82
3
0
02 Jan 2024
DiffAugment: Diffusion based Long-Tailed Visual Relationship Recognition
Parul Gupta
Tuan Nguyen
Abhinav Dhall
Munawar Hayat
Trung Le
Thanh-Toan Do
60
0
0
01 Jan 2024
New Job, New Gender? Measuring the Social Bias in Image Generation Models
Wenxuan Wang
Haonan Bai
Jen-tse Huang
Yuxuan Wan
Youliang Yuan
Haoyi Qiu
Nanyun Peng
Michael R. Lyu
102
23
0
01 Jan 2024
DiffMorph: Text-less Image Morphing with Diffusion Models
Shounak Chatterjee
DiffM
28
0
0
01 Jan 2024
SynCDR : Training Cross Domain Retrieval Models with Synthetic Data
Samarth Mishra
Carlos D. Castillo
Hongcheng Wang
Kate Saenko
Venkatesh Saligrama
82
1
0
31 Dec 2023
Inpaint4DNeRF: Promptable Spatio-Temporal NeRF Inpainting with Generative Diffusion Models
Han Jiang
Haosen Sun
Ruoxuan Li
Chi-Keung Tang
Yu-Wing Tai
DiffM
92
2
0
30 Dec 2023
FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis
Feng Liang
Bichen Wu
Jialiang Wang
Licheng Yu
Kunpeng Li
...
Ishan Misra
Jia-Bin Huang
Peizhao Zhang
Peter Vajda
Diana Marculescu
VGen
DiffM
67
35
0
29 Dec 2023
Personalized Restoration via Dual-Pivot Tuning
Pradyumna Chari
Sizhuo Ma
Daniil Ostashev
A. Kadambi
Gurunandan Krishnan
Jian Wang
Kfir Aberman
DiffM
87
3
0
28 Dec 2023
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action
Jiasen Lu
Christopher Clark
Sangho Lee
Zichen Zhang
Savya Khosla
Ryan Marten
Derek Hoiem
Aniruddha Kembhavi
VLM
MLLM
102
175
0
28 Dec 2023
ZONE: Zero-Shot Instruction-Guided Local Editing
Shanglin Li
Bo-Wen Zeng
Yutang Feng
Sicheng Gao
Xuhui Liu
...
Li Lin
Xu Tang
Yao Hu
Jianzhuang Liu
Baochang Zhang
DiffM
101
35
0
28 Dec 2023
Semantic Guidance Tuning for Text-To-Image Diffusion Models
Hyun Kang
Dohae Lee
Myungjin Shin
In-Kwon Lee
51
1
0
26 Dec 2023
LLM-Powered Hierarchical Language Agent for Real-time Human-AI Coordination
Jijia Liu
Chao Yu
Jiaxuan Gao
Yuqing Xie
Qingmin Liao
Yi Wu
Yu Wang
LLMAG
LM&Ro
176
38
0
23 Dec 2023
VIEScore: Towards Explainable Metrics for Conditional Image Synthesis Evaluation
Max Ku
Dongfu Jiang
Cong Wei
Xiang Yue
Wenhu Chen
105
61
0
22 Dec 2023
Plan, Posture and Go: Towards Open-World Text-to-Motion Generation
Jinpeng Liu
Wen-Dao Dai
Chunyu Wang
Yiji Cheng
Yansong Tang
Xin Tong
VGen
DiffM
122
19
0
22 Dec 2023
VideoPoet: A Large Language Model for Zero-Shot Video Generation
Dan Kondratyuk
Lijun Yu
Xiuye Gu
José Lezama
Jonathan Huang
...
Irfan Essa
Huisheng Wang
David A. Ross
Bryan Seybold
Lu Jiang
VGen
152
273
0
21 Dec 2023
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
Hayk Manukyan
Andranik Sargsyan
Barsegh Atanyan
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
DiffM
97
31
0
21 Dec 2023
Free-Editor: Zero-shot Text-driven 3D Scene Editing
Nazmul Karim
Hasan Iqbal
Umar Khalid
Jingyi Hua
Chong Chen
DiffM
94
9
0
21 Dec 2023
DreamDistribution: Learning Prompt Distribution for Diverse In-distribution Generation
Brian Nlong Zhao
Yuhang Xiao
Lyne Tchapmi
Xinyang Jiang
Yifan Yang
Dongsheng Li
Laurent Itti
Vibhav Vineet
Yunhao Ge
VLM
144
7
0
21 Dec 2023
Generative Multimodal Models are In-Context Learners
Quan-Sen Sun
Yufeng Cui
Xiaosong Zhang
Fan Zhang
Qiying Yu
...
Yueze Wang
Yongming Rao
Jingjing Liu
Tiejun Huang
Xinlong Wang
MLLM
LRM
155
291
0
20 Dec 2023
Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis
Bichen Wu
Ching-Yao Chuang
Xiaoyan Wang
Yichen Jia
K. Krishnakumar
Tong Xiao
Feng Liang
Licheng Yu
Peter Vajda
DiffM
VGen
55
24
0
20 Dec 2023
Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models
Angela Castillo
Jonas Kohler
Juan C. Pérez
Juan Pablo Pérez
Albert Pumarola
Guohao Li
Pablo Arbelaez
Ali K. Thabet
111
13
0
19 Dec 2023
Optimizing Diffusion Noise Can Serve As Universal Motion Priors
Korrawe Karunratanakul
Konpat Preechakul
Emre Aksan
Thabo Beeler
Supasorn Suwajanakorn
Siyu Tang
DiffM
93
43
0
19 Dec 2023
CreativeConnect: Supporting Reference Recombination for Graphic Design Ideation with Generative AI
DaEun Choi
Sumin Hong
Jeongeon Park
John Joon Young Chung
Juho Kim
76
62
0
19 Dec 2023
MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers
Haoyu Ma
Shahin Mahdizadehaghdam
Bichen Wu
Zhipeng Fan
Yuchao Gu
Wenliang Zhao
Lior Shapira
Xiaohui Xie
DiffM
VGen
66
4
0
19 Dec 2023
Scene-Conditional 3D Object Stylization and Composition
Jinghao Zhou
Tomas Jakab
Philip Torr
Christian Rupprecht
DiffM
141
3
0
19 Dec 2023
MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance
Qi Mao
Lan Chen
Yuchao Gu
Zhen Fang
Mike Zheng Shou
DiffM
89
11
0
18 Dec 2023
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing
Zeyinzi Jiang
Chaojie Mao
Yulin Pan
Zhen Han
Jingfeng Zhang
71
30
0
18 Dec 2023
SPIRE: Semantic Prompt-Driven Image Restoration
Chenyang Qi
Zhengzhong Tu
Keren Ye
M. Delbracio
P. Milanfar
Qifeng Chen
Hossein Talebi
DiffM
101
12
0
18 Dec 2023
Your Student is Better Than Expected: Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models
Nikita Starodubcev
Artem Fedorov
Artem Babenko
Dmitry Baranchuk
DiffM
64
4
0
17 Dec 2023
CogCartoon: Towards Practical Story Visualization
Zhongyang Zhu
Jie Tang
DiffM
70
4
0
17 Dec 2023
Iterative Motion Editing with Natural Language
Purvi Goel
Kuan-Chieh Wang
Chenxi Liu
Kayvon Fatahalian
DiffM
127
22
0
15 Dec 2023
Rich Human Feedback for Text-to-Image Generation
Youwei Liang
Junfeng He
Gang Li
Peizhao Li
Arseniy Klimovskiy
...
Yiwen Luo
Yang Li
Kai Kohlhoff
Deepak Ramachandran
Vidhya Navalpakkam
EGVM
83
86
0
15 Dec 2023
Latent Diffusion Models with Image-Derived Annotations for Enhanced AI-Assisted Cancer Diagnosis in Histopathology
Pedro Osório
Guillermo Jiménez-Pérez
Javier Montalt-Tordera
Jens Hooge
Guillem Duran Ballester
...
Sabrina Schroeder
K. Siudak
Julia Vienenkoetter
Bettina Lawrenz
Sadegh Mohammadi
MedIm
39
10
0
15 Dec 2023
Collaborating Foundation Models for Domain Generalized Semantic Segmentation
Yasser Benigmim
Subhankar Roy
S. Essid
Vicky Kalogeiton
Stéphane Lathuilière
135
14
0
15 Dec 2023
Focus on Your Instruction: Fine-grained and Multi-instruction Image Editing by Attention Modulation
Qin Guo
Tianwei Lin
DiffM
76
36
0
15 Dec 2023
Tell Me What You See: Text-Guided Real-World Image Denoising
E. Yosef
Raja Giryes
DiffM
151
2
0
15 Dec 2023
InstructPipe: Generating Visual Blocks Pipelines with Human Instructions and LLMs
Zhongyi Zhou
Jing Jin
Vrushank Phadnis
Xiuxiu Yuan
Jun Jiang
...
A. Olwal
David Kim
Ram Iyengar
Na Li
Andrea Colaço
63
5
0
15 Dec 2023
Previous
1
2
3
...
19
20
21
...
27
28
29
Next