ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.09800
  4. Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions

InstructPix2Pix: Learning to Follow Image Editing Instructions

17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM
ArXivPDFHTML

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,350 papers shown
Title
MIO: A Foundation Model on Multimodal Tokens
MIO: A Foundation Model on Multimodal Tokens
Zekun Wang
King Zhu
Chunpu Xu
Wangchunshu Zhou
Jiaheng Liu
...
Yuanxing Zhang
Ge Zhang
Ke Xu
Jie Fu
Wenhao Huang
MLLM
AuLLM
60
11
0
26 Sep 2024
HazeSpace2M: A Dataset for Haze Aware Single Image Dehazing
HazeSpace2M: A Dataset for Haze Aware Single Image Dehazing
Md Tanvir Islam
Nasir Rahim
Saeed Anwar
Muhammad Saqib
Sambit Bakshi
Khan Muhammad
41
4
0
25 Sep 2024
GeoBiked: A Dataset with Geometric Features and Automated Labeling
  Techniques to Enable Deep Generative Models in Engineering Design
GeoBiked: A Dataset with Geometric Features and Automated Labeling Techniques to Enable Deep Generative Models in Engineering Design
Phillip Mueller
Sebastian Mueller
Lars Mikelsons
26
1
0
25 Sep 2024
Skyeyes: Ground Roaming using Aerial View Images
Skyeyes: Ground Roaming using Aerial View Images
Zhiyuan Gao
Wenbin Teng
Gonglin Chen
Jinsen Wu
Ningli Xu
R. Qin
Andrew Feng
Yajie Zhao
VGen
36
1
0
25 Sep 2024
Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts
  in Diffusion Models
Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models
Deepak Sridhar
Nuno Vasconcelos
DiffM
36
0
0
25 Sep 2024
Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion Model
Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion Model
Hongliang Zhong
Can Wang
Jingbo Zhang
Jing Liao
3DGS
DiffM
39
2
0
25 Sep 2024
ImPoster: Text and Frequency Guidance for Subject Driven Action
  Personalization using Diffusion Models
ImPoster: Text and Frequency Guidance for Subject Driven Action Personalization using Diffusion Models
D. Kothandaraman
Kuldeep Kulkarni
Sumit Shekhar
Balaji Vasan Srinivasan
Dinesh Manocha
DiffM
46
1
0
24 Sep 2024
TextToon: Real-Time Text Toonify Head Avatar from Single Video
TextToon: Real-Time Text Toonify Head Avatar from Single Video
Luchuan Song
Lele Chen
Celong Liu
Pinxin Liu
Chenliang Xu
DiffM
32
7
0
23 Sep 2024
MaterialFusion: Enhancing Inverse Rendering with Material Diffusion Priors
MaterialFusion: Enhancing Inverse Rendering with Material Diffusion Priors
Yehonathan Litman
Or Patashnik
Kangle Deng
Aviral Agrawal
Rushikesh Zawar
Fernando de la Torre
Shubham Tulsiani
52
5
0
23 Sep 2024
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
Weifeng Lin
Xinyu Wei
Renrui Zhang
Le Zhuo
Shitian Zhao
...
Junlin Xie
Junlin Xie
Yu Qiao
Peng Gao
Hongsheng Li
MLLM
DiffM
60
10
0
23 Sep 2024
Self-Supervised Audio-Visual Soundscape Stylization
Self-Supervised Audio-Visual Soundscape Stylization
Tingle Li
Renhao Wang
Po-Yao Huang
Andrew Owens
Gopala Anumanchipalli
DiffM
SSL
38
4
0
22 Sep 2024
Dormant: Defending against Pose-driven Human Image Animation
Dormant: Defending against Pose-driven Human Image Animation
Jiachen Zhou
Mingsi Wang
Tianlin Li
Guozhu Meng
Kai Chen
67
3
0
22 Sep 2024
JVID: Joint Video-Image Diffusion for Visual-Quality and
  Temporal-Consistency in Video Generation
JVID: Joint Video-Image Diffusion for Visual-Quality and Temporal-Consistency in Video Generation
Hadrien Reynaud
Matthew Baugh
Mischa Dombrowski
Sarah Cechnicka
Qingjie Meng
Bernhard Kainz
VLM
42
0
0
21 Sep 2024
Portrait Video Editing Empowered by Multimodal Generative Priors
Portrait Video Editing Empowered by Multimodal Generative Priors
Xuan Gao
Haiyao Xiao
Chenglai Zhong
Shimin Hu
Yudong Guo
Juyong Zhang
VGen
3DGS
42
4
0
20 Sep 2024
DNI: Dilutional Noise Initialization for Diffusion Video Editing
DNI: Dilutional Noise Initialization for Diffusion Video Editing
Sunjae Yoon
Gwanhyeong Koo
Ji Woo Hong
Chang D. Yoo
DiffM
43
2
0
19 Sep 2024
Vision Language Models Can Parse Floor Plan Maps
Vision Language Models Can Parse Floor Plan Maps
David DeFazio
Hrudayangam Mehta
Jeremy Blackburn
Shiqi Zhang
CoGe
23
0
0
19 Sep 2024
LEMON: Localized Editing with Mesh Optimization and Neural Shaders
LEMON: Localized Editing with Mesh Optimization and Neural Shaders
Furkan Mert Algan
Umut Yazgan
Driton Salihu
Cem Eteke
Eckehard G. Steinbach
DiffM
21
0
0
18 Sep 2024
ORB-SfMLearner: ORB-Guided Self-supervised Visual Odometry with Selective Online Adaptation
ORB-SfMLearner: ORB-Guided Self-supervised Visual Odometry with Selective Online Adaptation
Yanlin Jin
Rui-Yang Ju
Haojun Liu
Yuzhong Zhong
34
0
0
18 Sep 2024
OmniGen: Unified Image Generation
OmniGen: Unified Image Generation
Shitao Xiao
Yueze Wang
Yueze Wang
Huaying Yuan
Xingrun Xing
Ruiran Yan
Shuting Wang
Tiejun Huang
Zheng Liu
DiffM
VLM
SyDa
59
65
0
17 Sep 2024
SimInversion: A Simple Framework for Inversion-Based Text-to-Image
  Editing
SimInversion: A Simple Framework for Inversion-Based Text-to-Image Editing
Qi Qian
Haiyang Xu
Ming Yan
Juhua Hu
DiffM
37
1
0
16 Sep 2024
TextureDiffusion: Target Prompt Disentangled Editing for Various Texture Transfer
TextureDiffusion: Target Prompt Disentangled Editing for Various Texture Transfer
Zihan Su
Junhao Zhuang
Chun Yuan
DiffM
53
0
0
15 Sep 2024
InstantDrag: Improving Interactivity in Drag-based Image Editing
InstantDrag: Improving Interactivity in Drag-based Image Editing
Joonghyuk Shin
Daehyeon Choi
Jaesik Park
DiffM
46
7
0
13 Sep 2024
Improving Text-guided Object Inpainting with Semantic Pre-inpainting
Improving Text-guided Object Inpainting with Semantic Pre-inpainting
Yifu Chen
Jingwen Chen
Yingwei Pan
Yehao Li
Ting Yao
Zhineng Chen
Tao Mei
DiffM
31
6
0
12 Sep 2024
Diffusion-Based Image-to-Image Translation by Noise Correction via
  Prompt Interpolation
Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation
Junsung Lee
Minsoo Kang
Bohyung Han
DiffM
VLM
31
3
0
12 Sep 2024
Data Augmentation via Latent Diffusion for Saliency Prediction
Data Augmentation via Latent Diffusion for Saliency Prediction
Bahar Aydemir
Deblina Bhattacharjee
Tong Zhang
Mathieu Salzmann
Sabine Süsstrunk
31
1
0
11 Sep 2024
Towards Predicting Temporal Changes in a Patient's Chest X-ray Images based on Electronic Health Records
Towards Predicting Temporal Changes in a Patient's Chest X-ray Images based on Electronic Health Records
Daeun Kyung
J. Kim
Tackeun Kim
Edward Choi
MedIm
DiffM
43
1
0
11 Sep 2024
Quantifying and Enabling the Interpretability of CLIP-like Models
Quantifying and Enabling the Interpretability of CLIP-like Models
Avinash Madasu
Yossi Gandelsman
Vasudev Lal
Phillip Howard
VLM
56
2
0
10 Sep 2024
PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose
  Representation
PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation
Ginger Delmas
Philippe Weinzaepfel
Francesc Moreno-Noguer
Grégory Rogez
39
2
0
10 Sep 2024
NeIn: Telling What You Don't Want
NeIn: Telling What You Don't Want
Nhat-Tan Bui
Dinh-Hieu Hoang
Quoc-Huy Trinh
Minh-Triet Tran
Truong Nguyen
Susan Gauch
43
2
0
09 Sep 2024
Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation
Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation
Jiaxin Cheng
Zixu Zhao
Tong He
Tianjun Xiao
Yicong Zhou
Zheng Zhang
DiffM
47
0
0
07 Sep 2024
DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic
  Compensation
DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation
Wenliang Zhao
Haolin Wang
Jie Zhou
Jiwen Lu
DiffM
27
1
0
05 Sep 2024
DiVE: DiT-based Video Generation with Enhanced Control
DiVE: DiT-based Video Generation with Enhanced Control
Junpeng Jiang
Gangyi Hong
Lijun Zhou
Enhui Ma
Hengtong Hu
...
Kaicheng Yu
Haiyang Sun
Kun Zhan
Peng Jia
Miao Zhang
VGen
DiffM
38
11
0
03 Sep 2024
Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free
  Real Image Editing
Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing
Vadim Titov
Madina Khalmatova
Alexandra Ivanova
Dmitry Vetrov
Aibek Alanov
DiffM
48
5
0
02 Sep 2024
COMOGen: A Controllable Text-to-3D Multi-object Generation Framework
COMOGen: A Controllable Text-to-3D Multi-object Generation Framework
Shaorong Sun
Shuchao Pang
Yazhou Yao
Xiaoshui Huang
29
1
0
01 Sep 2024
EraseDraw: Learning to Insert Objects by Erasing Them from Images
EraseDraw: Learning to Insert Objects by Erasing Them from Images
Alper Canberk
Maksym Bondarenko
Ege Ozguroglu
Ruoshi Liu
Carl Vondrick
DiffM
35
2
0
31 Aug 2024
Training-Free Sketch-Guided Diffusion with Latent Optimization
Training-Free Sketch-Guided Diffusion with Latent Optimization
Sandra Zhang Ding
Jiafeng Mao
Kiyoharu Aizawa
DiffM
99
2
0
31 Aug 2024
Box2Flow: Instance-based Action Flow Graphs from Videos
Box2Flow: Instance-based Action Flow Graphs from Videos
Jiatong Li
Kalliopi Basioti
Vladimir Pavlovic
45
0
0
30 Aug 2024
GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative
  Models
GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative Models
Moreno DÍncà
E. Peruzzo
Massimiliano Mancini
Xingqian Xu
Humphrey Shi
N. Sebe
44
0
0
29 Aug 2024
TEDRA: Text-based Editing of Dynamic and Photoreal Actors
TEDRA: Text-based Editing of Dynamic and Photoreal Actors
Basavaraj Sunagad
Heming Zhu
Mohit Mendiratta
Adam Kortylewski
Christian Theobalt
Marc Habermann
DiffM
37
1
0
28 Aug 2024
Merging and Splitting Diffusion Paths for Semantically Coherent
  Panoramas
Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas
Fabio Quattrini
Vittorio Pippi
Silvia Cascianelli
Rita Cucchiara
45
3
0
28 Aug 2024
Alfie: Democratising RGBA Image Generation With No $$$
Alfie: Democratising RGBA Image Generation With No
Fabio Quattrini
Vittorio Pippi
Silvia Cascianelli
Rita Cucchiara
DiffM
46
5
0
27 Aug 2024
CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View
  Synthesis
CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis
Weijia Li
Jun He
Junyan Ye
Huaping Zhong
Zhimeng Zheng
Zilong Huang
Dahua Lin
Conghui He
41
6
0
27 Aug 2024
DefectTwin: When LLM Meets Digital Twin for Railway Defect Inspection
DefectTwin: When LLM Meets Digital Twin for Railway Defect Inspection
Rahatara Ferdousi
M. Anwar Hossain
Chunsheng Yang
Abdulmotaleb El Saddik
16
3
0
26 Aug 2024
GR-MG: Leveraging Partially Annotated Data via Multi-Modal Goal
  Conditioned Policy
GR-MG: Leveraging Partially Annotated Data via Multi-Modal Goal Conditioned Policy
Peiyan Li
Hongtao Wu
Yan Huang
Chilam Cheang
Liang Wang
Tao Kong
VGen
54
11
0
26 Aug 2024
ConceptMix: A Compositional Image Generation Benchmark with Controllable
  Difficulty
ConceptMix: A Compositional Image Generation Benchmark with Controllable Difficulty
Xindi Wu
Dingli Yu
Yangsibo Huang
Olga Russakovsky
Sanjeev Arora
CoGe
EGVM
53
12
0
26 Aug 2024
I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing
I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing
Yiwei Ma
Jiayi Ji
Ke Ye
Weihuang Lin
Zhibin Wang
Yonghan Zheng
Qiang-feng Zhou
Xiaoshuai Sun
Rongrong Ji
46
6
0
26 Aug 2024
Avatar Concept Slider: Controllable Editing of Concepts in 3D Human Avatars
Avatar Concept Slider: Controllable Editing of Concepts in 3D Human Avatars
Yixuan He
Lin Geng Foo
Ajmal Mian
Hossein Rahmani
Jun Liu
Christian Theobalt
35
1
0
26 Aug 2024
Prompt-Softbox-Prompt: A free-text Embedding Control for Image Editing
Prompt-Softbox-Prompt: A free-text Embedding Control for Image Editing
Yitong Yang
Yinglin Wang
Jing Wang
Tian Zhang
DiffM
40
1
0
24 Aug 2024
Latent Space Disentanglement in Diffusion Transformers Enables Zero-shot
  Fine-grained Semantic Editing
Latent Space Disentanglement in Diffusion Transformers Enables Zero-shot Fine-grained Semantic Editing
Zitao Shuai
Chenwei Wu
Zhengxu Tang
Bowen Song
Liyue Shen
33
0
0
23 Aug 2024
Abstract Art Interpretation Using ControlNet
Abstract Art Interpretation Using ControlNet
Rishabh Srivastava
Addrish Roy
13
0
0
23 Aug 2024
Previous
123...8910...252627
Next