ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.09800
  4. Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions
v1v2 (latest)

InstructPix2Pix: Learning to Follow Image Editing Instructions

17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM
ArXiv (abs)PDFHTML

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,418 papers shown
Title
MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image
  Editing
MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing
Kai Zhang
Lingbo Mo
Wenhu Chen
Huan Sun
Yu-Chuan Su
EGVM
226
277
0
16 Jun 2023
Edit-DiffNeRF: Editing 3D Neural Radiance Fields using 2D Diffusion
  Model
Edit-DiffNeRF: Editing 3D Neural Radiance Fields using 2D Diffusion Model
Lu Yu
Weikang Xiang
Kang Han
DiffM
79
16
0
15 Jun 2023
UrbanIR: Large-Scale Urban Scene Inverse Rendering from a Single Video
UrbanIR: Large-Scale Urban Scene Inverse Rendering from a Single Video
Zhi-Hao Lin
Bohan Liu
Yi-Ting Chen
David A. Forsyth
Jia-Bin Huang
Jia-Bin Huang
Anand Bhattad
Shenlong Wang
VGen
140
10
0
15 Jun 2023
Evaluating Data Attribution for Text-to-Image Models
Evaluating Data Attribution for Text-to-Image Models
Sheng-Yu Wang
Alexei A. Efros
Jun-Yan Zhu
Richard Y. Zhang
TDI
94
33
0
15 Jun 2023
DreamSim: Learning New Dimensions of Human Visual Similarity using
  Synthetic Data
DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data
Stephanie Fu
Netanel Y. Tamir
Shobhita Sundaram
Lucy Chai
Richard Y. Zhang
Tali Dekel
Phillip Isola
EGVM
100
123
0
15 Jun 2023
Extending CLIP's Image-Text Alignment to Referring Image Segmentation
Extending CLIP's Image-Text Alignment to Referring Image Segmentation
Seoyeon Kim
Minguk Kang
Dongwon Kim
Jaesik Park
Suha Kwak
VLM
96
11
0
14 Jun 2023
On the Robustness of Latent Diffusion Models
On the Robustness of Latent Diffusion Models
Jianping Zhang
Zhuoer Xu
Shiwen Cui
Changhua Meng
Weibin Wu
Michael R. Lyu
AAML
82
20
0
14 Jun 2023
GeneCIS: A Benchmark for General Conditional Image Similarity
GeneCIS: A Benchmark for General Conditional Image Similarity
S. Vaze
Nicolas Carion
Ishan Misra
VLMDiffM
100
30
0
13 Jun 2023
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Shuai Yang
Yifan Zhou
Ziwei Liu
Chen Change Loy
VGenDiffM
102
221
0
13 Jun 2023
InstructP2P: Learning to Edit 3D Point Clouds with Text Instructions
InstructP2P: Learning to Edit 3D Point Clouds with Text Instructions
Jiale Xu
Xintao Wang
Yannan Cao
Weihao Cheng
Ying Shan
Shenghua Gao
DiffM
86
11
0
12 Jun 2023
Unsupervised Compositional Concepts Discovery with Text-to-Image
  Generative Models
Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models
Nan Liu
Yilun Du
Shuang Li
J. Tenenbaum
Antonio Torralba
DiffMCoGe
100
27
0
08 Jun 2023
SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions
SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions
Yuseung Lee
Kunho Kim
Hyunjin Kim
Minhyuk Sung
DiffM
127
67
0
08 Jun 2023
Designing a Better Asymmetric VQGAN for StableDiffusion
Designing a Better Asymmetric VQGAN for StableDiffusion
Zixin Zhu
Xuelu Feng
DongDong Chen
Jianmin Bao
Le Wang
Yinpeng Chen
Lu Yuan
Gang Hua
DiffM
95
35
0
07 Jun 2023
Fine-Grained Visual Prompting
Fine-Grained Visual Prompting
Lingfeng Yang
Yueze Wang
Xiang Li
Xinlong Wang
Jian Yang
ObjDVLM
115
68
0
07 Jun 2023
Emergent Correspondence from Image Diffusion
Emergent Correspondence from Image Diffusion
Luming Tang
Menglin Jia
Qianqian Wang
Cheng Perng Phoo
Bharath Hariharan
118
270
0
06 Jun 2023
HeadSculpt: Crafting 3D Head Avatars with Text
HeadSculpt: Crafting 3D Head Avatars with Text
Xiaoping Han
Yukang Cao
Kai Han
Xiatian Zhu
Jiankang Deng
Yi-Zhe Song
Tao Xiang
Kwan-Yee K. Wong
DiffM
73
47
0
05 Jun 2023
Instruct-Video2Avatar: Video-to-Avatar Generation with Instructions
Instruct-Video2Avatar: Video-to-Avatar Generation with Instructions
Shaoxu Li
DiffM
60
5
0
05 Jun 2023
Towards Unified Text-based Person Retrieval: A Large-scale
  Multi-Attribute and Language Search Benchmark
Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark
Shuyu Yang
Yinan Zhou
Yaxiong Wang
Yujiao Wu
Li Zhu
Zhedong Zheng
VLMDiffM
144
92
0
05 Jun 2023
User-friendly Image Editing with Minimal Text Input: Leveraging
  Captioning and Injection Techniques
User-friendly Image Editing with Minimal Text Input: Leveraging Captioning and Injection Techniques
Sunwoo Kim
Wooseok Jang
Hyunsung Kim
Junho Kim
Yunjey Choi
Seung Wook Kim
Gayeong Lee
DiffM
65
6
0
05 Jun 2023
Video Colorization with Pre-trained Text-to-Image Diffusion Models
Video Colorization with Pre-trained Text-to-Image Diffusion Models
Hanyuan Liu
M. Xie
Jinbo Xing
Chengze Li
T. Wong
VLMDiffM
104
13
0
02 Jun 2023
Adjustable Visual Appearance for Generalizable Novel View Synthesis
Adjustable Visual Appearance for Generalizable Novel View Synthesis
Josef Bengtson
David Nilsson
Che-Tsung Lin
Marcel Büsching
Fredrik Kahl
98
0
0
02 Jun 2023
Diffusion Self-Guidance for Controllable Image Generation
Diffusion Self-Guidance for Controllable Image Generation
Dave Epstein
Allan Jabri
Ben Poole
Alexei A. Efros
Aleksander Holynski
111
266
0
01 Jun 2023
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion
  Models
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models
Chang-rui Liu
Haoning Wu
Yujie Zhong
Xiaoyu Zhang
Yanfeng Wang
Weidi Xie
DiffMVLM
154
44
0
01 Jun 2023
ViCo: Plug-and-play Visual Condition for Personalized Text-to-image
  Generation
ViCo: Plug-and-play Visual Condition for Personalized Text-to-image Generation
Shaozhe Hao
Kai Han
Shihao Zhao
Kwan-Yee K. Wong
88
10
0
01 Jun 2023
Differential Diffusion: Giving Each Pixel Its Strength
Differential Diffusion: Giving Each Pixel Its Strength
E. Levin
Ohad Fried
DiffM
88
21
0
01 Jun 2023
Versatile Backdoor Attack with Visible, Semantic, Sample-Specific, and
  Compatible Triggers
Versatile Backdoor Attack with Visible, Semantic, Sample-Specific, and Compatible Triggers
Ke Xu
Hongrui Chen
Zihao Zhu
Li Liu
Baoyuan Wu
DiffM
126
11
0
01 Jun 2023
AvatarStudio: Text-driven Editing of 3D Dynamic Human Head Avatars
AvatarStudio: Text-driven Editing of 3D Dynamic Human Head Avatars
Mohit Mendiratta
Xingang Pan
Mohamed A. Elgharib
Kartik Teotia
Mallikarjun B R
A. Tewari
Vladislav Golyanik
Adam Kortylewski
Christian Theobalt
DiffM
75
30
0
01 Jun 2023
Diffusion Brush: A Latent Diffusion Model-based Editing Tool for
  AI-generated Images
Diffusion Brush: A Latent Diffusion Model-based Editing Tool for AI-generated Images
Peyman Gholami
R. Xiao
DiffM
116
3
0
31 May 2023
Control4D: Efficient 4D Portrait Editing with Text
Control4D: Efficient 4D Portrait Editing with Text
Ruizhi Shao
Jingxiang Sun
Cheng Peng
Zerong Zheng
Boyao Zhou
Hongwen Zhang
Yebin Liu
DiffM
116
25
0
31 May 2023
A Unified Conditional Framework for Diffusion-based Image Restoration
A Unified Conditional Framework for Diffusion-based Image Restoration
Yuanhang Zhang
Xiaoyu Shi
Dasong Li
Xiaogang Wang
Jian Wang
Hongsheng Li
DiffM
94
26
0
31 May 2023
PanoGen: Text-Conditioned Panoramic Environment Generation for
  Vision-and-Language Navigation
PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation
Jialu Li
Joey Tianyi Zhou
DiffM
101
55
0
30 May 2023
Video ControlNet: Towards Temporally Consistent Synthetic-to-Real Video
  Translation Using Conditional Image Diffusion Models
Video ControlNet: Towards Temporally Consistent Synthetic-to-Real Video Translation Using Conditional Image Diffusion Models
Ernie Chu
Shuohao Lin
Jun-Cheng Chen
DiffM
64
21
0
30 May 2023
LANCE: Stress-testing Visual Models by Generating Language-guided
  Counterfactual Images
LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images
Viraj Prabhu
Sriram Yenamandra
Prithvijit Chattopadhyay
Judy Hoffman
104
42
0
30 May 2023
Diffusion Model for Dense Matching
Diffusion Model for Dense Matching
Jisu Nam
Gyuseong Lee
Sunwoo Kim
Ines Hyeonsu Kim
Hyoungwon Cho
Seyeong Kim
Seung Wook Kim
DiffM
84
10
0
30 May 2023
Nested Diffusion Processes for Anytime Image Generation
Nested Diffusion Processes for Anytime Image Generation
Noam Elata
Bahjat Kawar
T. Michaeli
Michael Elad
DiffM
62
4
0
30 May 2023
GPT4Tools: Teaching Large Language Model to Use Tools via
  Self-instruction
GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction
Rui Yang
Lin Song
Yanwei Li
Sijie Zhao
Yixiao Ge
Xiu Li
Ying Shan
SyDaMLLM
88
227
0
30 May 2023
Real-World Image Variation by Aligning Diffusion Inversion Chain
Real-World Image Variation by Aligning Diffusion Inversion Chain
Yuechen Zhang
Jinbo Xing
Eric Lo
Jiaya Jia
99
35
0
30 May 2023
Controllable Text-to-Image Generation with GPT-4
Controllable Text-to-Image Generation with GPT-4
Tianjun Zhang
Yi Zhang
Vibhav Vineet
Neel Joshi
Xin Eric Wang
DiffM
150
44
0
29 May 2023
InstructEdit: Improving Automatic Masks for Diffusion-based Image
  Editing With User Instructions
InstructEdit: Improving Automatic Masks for Diffusion-based Image Editing With User Instructions
Qian Wang
Biao Zhang
Michael Birsak
Peter Wonka
DiffM
69
37
0
29 May 2023
FuseCap: Leveraging Large Language Models for Enriched Fused Image
  Captions
FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions
Noam Rotstein
David Bensaid
Shaked Brody
Roy Ganz
Ron Kimmel
VLM
81
31
0
28 May 2023
Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion
  Inference
Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion Inference
Zihao Yu
Haoyang Li
Fangcheng Fu
Xupeng Miao
Tengjiao Wang
DiffM
93
8
0
27 May 2023
CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image
  Steganography
CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganography
Jiwen Yu
Xuanyu Zhang
You-song Xu
Jian Zhang
DiffM
99
53
0
26 May 2023
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Shihao Zhao
Dongdong Chen
Yen-Chun Chen
Jianmin Bao
Shaozhe Hao
Lu Yuan
Kwan-Yee K. Wong
115
268
0
25 May 2023
Break-A-Scene: Extracting Multiple Concepts from a Single Image
Break-A-Scene: Extracting Multiple Concepts from a Single Image
Omri Avrahami
Kfir Aberman
Ohad Fried
Daniel Cohen-Or
Dani Lischinski
VLMDiffM
106
178
0
25 May 2023
Diversify Your Vision Datasets with Automatic Diffusion-Based
  Augmentation
Diversify Your Vision Datasets with Automatic Diffusion-Based Augmentation
Lisa Dunlap
Alyssa Umino
Han Zhang
Jiezhi Yang
Joseph E. Gonzalez
Trevor Darrell
DiffM
91
79
0
25 May 2023
CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graph
  Diffusion
CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graph Diffusion
Guangyao Zhai
Evin Pınar Örnek
Shun-cheng Wu
Yan Di
F. Tombari
Nassir Navab
Benjamin Busam
DiffM
139
14
0
25 May 2023
ProSpect: Prompt Spectrum for Attribute-Aware Personalization of
  Diffusion Models
ProSpect: Prompt Spectrum for Attribute-Aware Personalization of Diffusion Models
Yuxin Zhang
Weiming Dong
Fan Tang
Nisha Huang
Haibin Huang
Chongyang Ma
Tong-Yee Lee
Oliver Deussen
Changsheng Xu
DiffM
99
81
0
25 May 2023
Towards Language-guided Interactive 3D Generation: LLMs as Layout
  Interpreter with Generative Feedback
Towards Language-guided Interactive 3D Generation: LLMs as Layout Interpreter with Generative Feedback
Yiqi Lin
Hao Wu
Ruichen Wang
H. Lu
Xiaodong Lin
Hui Xiong
Lin Wang
3DV
63
13
0
25 May 2023
Balancing the Picture: Debiasing Vision-Language Datasets with Synthetic
  Contrast Sets
Balancing the Picture: Debiasing Vision-Language Datasets with Synthetic Contrast Sets
Brandon Smith
Miguel Farinha
Elizaveta Semenova
Hannah Rose Kirk
Aleksandar Shtedritski
Max Bain
90
19
0
24 May 2023
LayoutGPT: Compositional Visual Planning and Generation with Large
  Language Models
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models
Weixi Feng
Wanrong Zhu
Tsu-Jui Fu
Varun Jampani
Arjun Reddy Akula
Xuehai He
Sugato Basu
Xinze Wang
William Yang Wang
MLLM
125
180
0
24 May 2023
Previous
123...2526272829
Next