Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.09800
Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"InstructPix2Pix: Learning to Follow Image Editing Instructions"
50 / 1,352 papers shown
Title
Abstract Art Interpretation Using ControlNet
Rishabh Srivastava
Addrish Roy
13
0
0
23 Aug 2024
Diffusion-Based Visual Art Creation: A Survey and New Perspectives
Bingyuan Wang
Qifeng Chen
Zeyu Wang
54
7
0
22 Aug 2024
MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning
Haoning Wu
Shaocheng Shen
Qiang Hu
Xiaoyun Zhang
Ya Zhang
Yanfeng Wang
40
10
0
20 Aug 2024
Learning Instruction-Guided Manipulation Affordance via Large Models for Embodied Robotic Tasks
Dayou Li
Chenkun Zhao
Shuo Yang
Lin Ma
Yibin Li
Wei Zhang
LM&Ro
43
1
0
20 Aug 2024
Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data
Tao Yang
Yangming Shi
Yunwen Huang
Feng Chen
Yin Zheng
Lei Zhang
DiffM
VGen
70
0
0
19 Aug 2024
ARMADA: Attribute-Based Multimodal Data Augmentation
Xiaomeng Jin
Jeonghwan Kim
Yu Zhou
Kuan-Hao Huang
Te-Lin Wu
Nanyun Peng
Heng Ji
26
2
0
19 Aug 2024
Style-Editor: Text-driven object-centric style editing
Jihun Park
Jongmin Gim
Kyoungmin Lee
Seunghun Lee
Sunghoon Im
DiffM
37
0
0
16 Aug 2024
TurboEdit: Instant text-based image editing
Zongze Wu
Nicholas I. Kolkin
Jonathan Brandt
Richard Zhang
Eli Shechtman
DiffM
46
11
0
14 Aug 2024
DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency
Xiaojing Zhong
Xinyi Huang
Xiaofeng Yang
Guosheng Lin
Qingyao Wu
DiffM
43
3
0
14 Aug 2024
Connecting Dreams with Visual Brainstorming Instruction
Yasheng Sun
Bohan Li
Mingchen Zhuge
Deng-Ping Fan
Salman Khan
Fahad Shahbaz Khan
Hideki Koike
DiffM
42
0
0
14 Aug 2024
GRIF-DM: Generation of Rich Impression Fonts using Diffusion Models
Lei Kang
Fei Yang
Kai Wang
Mohamed Ali Souibgui
Lluís Gómez
Alicia Fornés
Ernest Valveny
Dimosthenis Karatzas
DiffM
26
0
0
14 Aug 2024
Controlling the World by Sleight of Hand
Sruthi Sudhakar
Ruoshi Liu
Basile Van Hoorick
Carl Vondrick
Richard Zemel
49
4
0
13 Aug 2024
EditScribe: Non-Visual Image Editing with Natural Language Verification Loops
Ruei-Che Chang
Yuxuan Liu
Lotus Zhang
Anhong Guo
DiffM
46
2
0
13 Aug 2024
Egocentric Vision Language Planning
Zhirui Fang
Ming Yang
Weishuai Zeng
Boyu Li
Junpeng Yue
Ziluo Ding
Xiu Li
Zongqing Lu
LM&Ro
39
1
0
11 Aug 2024
Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models
Qirui Jiao
Daoyuan Chen
Yilun Huang
Yaliang Li
Ying Shen
VLM
45
5
0
08 Aug 2024
InstantStyleGaussian: Efficient Art Style Transfer with 3D Gaussian Splatting
Xin-Yi Yu
Jun-Xin Yu
Li-Bo Zhou
Yan Wei
Lin-Lin Ou
3DGS
40
5
0
08 Aug 2024
IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts
Ciara Rowles
Shimon Vainer
Dante De Nigris
Slava Elizarov
Konstantin Kutsy
Simon Donné
DiffM
51
9
0
06 Aug 2024
Inference Optimizations for Large Language Models: Effects, Challenges, and Practical Considerations
Leo Donisch
Sigurd Schacht
Carsten Lanquillon
30
2
0
06 Aug 2024
UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model
Zhaowei Li
Wei Wang
Yiqing Cai
Xu Qi
Pengyu Wang
Dong Zhang
Hang Song
Botian Jiang
Zhida Huang
Tao Wang
AIFin
LRM
40
3
0
05 Aug 2024
ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning
Yanjie Wang
Alan Yuille
Zhuowan Li
Zilong Zheng
LRM
36
3
0
05 Aug 2024
SAT3D: Image-driven Semantic Attribute Transfer in 3D
Zhijun Zhai
Zengmao Wang
Xiaoxiao Long
Kaixuan Zhou
Bo Du
52
0
0
03 Aug 2024
Stimulating Imagination: Towards General-purpose Object Rearrangement
Jianyang Wu
Jie Gu
Xiaokang Ma
Chu Tang
Jingmin Chen
DiffM
LM&Ro
OCL
37
0
0
03 Aug 2024
Leveraging BEV Paradigm for Ground-to-Aerial Image Synthesis
Junyan Ye
Jun He
Weijia Li
Zhutao Lv
Yi Lin
Haote Yang
Haote Yang
Conghui He
37
0
0
03 Aug 2024
FBSDiff: Plug-and-Play Frequency Band Substitution of Diffusion Features for Highly Controllable Text-Driven Image Translation
Xiang Gao
Jiaying Liu
45
2
0
02 Aug 2024
TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models
Gilad Deutch
Rinon Gal
Daniel Garibi
Or Patashnik
Daniel Cohen-Or
DiffM
43
22
0
01 Aug 2024
MotionFix: Text-Driven 3D Human Motion Editing
Lavrentia Aravani
Alpár Ceske
Markos Diomataris
Michael J. Black
Gül Varol
VGen
DiffM
43
17
0
01 Aug 2024
Head360: Learning a Parametric 3D Full-Head for Free-View Synthesis in 360°
Yuxiao He
Yi Zhuang
Yuanxun Lu
Yao Yao
Siyu Zhu
Xiaofei Wu
Zixiao Zhang
Xun Cao
Hao Zhu
3DH
37
3
0
01 Aug 2024
Localized Gaussian Splatting Editing with Contextual Awareness
Hanyuan Xiao
Yingshu Chen
Huajian Huang
Haolin Xiong
Jing Yang
P. Prasad
Yajie Zhao
3DGS
DiffM
47
4
0
31 Jul 2024
Fine-gained Zero-shot Video Sampling
Dengsheng Chen
Jie Hu
Javier Segovia-Aguas
Enhua Wu
VGen
DiffM
44
0
0
31 Jul 2024
Matting by Generation
Zhixiang Wang
Baiang Li
Jian Wang
Yu-Lun Liu
Jinwei Gu
Yung-Yu Chuang
Shiníchi Satoh
DiffM
32
1
0
30 Jul 2024
Add-SD: Rational Generation without Manual Reference
Lingfeng Yang
Xinyu Zhang
Xiang Li
Jinwen Chen
Kun Yao
Gang Zhang
Errui Ding
Ling-Ling Liu
Jingdong Wang
Jian Yang
45
0
0
30 Jul 2024
SceneTeller: Language-to-3D Scene Generation
Basak Melis Öcal
Maxim Tatarchenko
Sezer Karaoglu
Theo Gevers
44
6
0
30 Jul 2024
Learning Feature-Preserving Portrait Editing from Generated Pairs
Bowei Chen
Tiancheng Zhi
Peihao Zhu
Shen Sang
Jing Liu
Linjie Luo
DiffM
35
0
0
29 Jul 2024
Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing
Ekaterina Iakovleva
Fabio Pizzati
Philip Torr
Stéphane Lathuiliere
DiffM
34
0
0
29 Jul 2024
Retinex-Diffusion: On Controlling Illumination Conditions in Diffusion Models via Retinex Theory
Xiaoyan Xing
Vincent Tao Hu
J. H. Metzen
Konrad Groh
Sezer Karaoglu
Theo Gevers
45
4
0
29 Jul 2024
Auto DragGAN: Editing the Generative Image Manifold in an Autoregressive Manner
Pengxiang Cai
Zhiwei Liu
Guibo Zhu
Yunfang Niu
Jinqiao Wang
DiffM
40
1
0
26 Jul 2024
Answerability Fields: Answerable Location Estimation via Diffusion Models
Daich Azuma
Taiki Miyanishi
Shuhei Kurita
Koya Sakamoto
M. Kawanabe
DiffM
48
0
0
26 Jul 2024
RegionDrag: Fast Region-Based Image Editing with Diffusion Models
Jingyi Lu
Xinghui Li
Kai Han
DiffM
36
11
0
25 Jul 2024
ReCorD: Reasoning and Correcting Diffusion for HOI Generation
Jian-Yu Jiang-Lin
Kang-Yang Huang
Ling Lo
Yi-Ning Huang
Terence Lin
Jhih-Ciang Wu
Hong-Han Shuai
Wen-Huang Cheng
DiffM
26
5
0
25 Jul 2024
PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control
Rishubh Parihar
VS Sachidanand
Sabariswaran Mani
Tejan Karmali
R. V. Babu
DiffM
44
13
0
24 Jul 2024
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
Lirui Zhao
Tianshuo Yang
Wenqi Shao
Yuxin Zhang
Yu Qiao
Ping Luo
Kaipeng Zhang
Rongrong Ji
DiffM
48
3
0
24 Jul 2024
Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions
Fabio Tosi
Pierluigi Zama Ramirez
Matteo Poggi
DiffM
MQ
MDE
35
9
0
23 Jul 2024
StylusAI: Stylistic Adaptation for Robust German Handwritten Text Generation
Nauman Riaz
S. Saifullah
S. Agne
Andreas Dengel
Sheraz Ahmed
DiffM
47
0
0
22 Jul 2024
DOPRA: Decoding Over-accumulation Penalization and Re-allocation in Specific Weighting Layer
Jinfeng Wei
Xiaofeng Zhang
28
13
0
21 Jul 2024
LSReGen: Large-Scale Regional Generator via Backward Guidance Framework
Bowen Zhang
Cheng Yang
Xuanhui Liu
DiffM
42
0
0
21 Jul 2024
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models
Zheng Chong
Xiao Dong
Haoxiang Li
Shiyue Zhang
Wenqing Zhang
Xujie Zhang
Hanqing Zhao
D. Jiang
Xiaodan Liang
DiffM
60
18
0
21 Jul 2024
Diffusion Models as Data Mining Tools
Ioannis Siglidis
Aleksander Holynski
Alexei A. Efros
Mathieu Aubry
Shiry Ginosar
DiffM
MedIm
44
3
0
20 Jul 2024
Visual Text Generation in the Wild
Yuanzhi Zhu
Jiawei Liu
Feiyu Gao
Wenyu Liu
Xinggang Wang
Peng Wang
Fei Huang
Cong Yao
Zhibo Yang
DiffM
58
10
0
19 Jul 2024
MEDIC: Zero-shot Music Editing with Disentangled Inversion Control
Huadai Liu
Jialei Wang
Rongjie Huang
Yang Liu
Jiayang Xu
Zhou Zhao
31
4
0
18 Jul 2024
Safe-SD: Safe and Traceable Stable Diffusion with Text Prompt Trigger for Invisible Generative Watermarking
Zhiyuan Ma
Guoli Jia
Biqing Qi
Bowen Zhou
WIGM
76
10
0
18 Jul 2024
Previous
1
2
3
...
9
10
11
...
26
27
28
Next