ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.09800
  4. Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions
v1v2 (latest)

InstructPix2Pix: Learning to Follow Image Editing Instructions

17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM
ArXiv (abs)PDFHTML

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,418 papers shown
Title
What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models
What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models
Lorenzo Baraldi
Davide Bucciarelli
Federico Betti
Marcella Cornia
Lorenzo Baraldi
N. Sebe
Rita Cucchiara
231
0
0
26 May 2025
WeatherEdit: Controllable Weather Editing with 4D Gaussian Field
WeatherEdit: Controllable Weather Editing with 4D Gaussian Field
Chenghao Qian
Wenjing Li
Yuhu Guo
Gustav Markkula
DiffM
23
0
0
26 May 2025
Structure Disruption: Subverting Malicious Diffusion-Based Inpainting via Self-Attention Query Perturbation
Structure Disruption: Subverting Malicious Diffusion-Based Inpainting via Self-Attention Query Perturbation
Yuhao He
Jinyu Tian
Haiwei Wu
Jianqing Li
DiffMAAML
46
0
0
26 May 2025
PreP-OCR: A Complete Pipeline for Document Image Restoration and Enhanced OCR Accuracy
PreP-OCR: A Complete Pipeline for Document Image Restoration and Enhanced OCR Accuracy
Shuhao Guan
Moule Lin
Cheng Xu
Xinyi Liu
Jinman Zhao
Jiexin Fan
Qi Xu
Derek Greene
65
2
0
26 May 2025
MIND-Edit: MLLM Insight-Driven Editing via Language-Vision Projection
MIND-Edit: MLLM Insight-Driven Editing via Language-Vision Projection
Shuyu Wang
Weiqi Li
Qian Wang
Shijie Zhao
Jian Zhang
DiffM
53
0
0
25 May 2025
Improving Novel view synthesis of 360$^\circ$ Scenes in Extremely Sparse Views by Jointly Training Hemisphere Sampled Synthetic Images
Improving Novel view synthesis of 360∘^\circ∘ Scenes in Extremely Sparse Views by Jointly Training Hemisphere Sampled Synthetic Images
Guangan Chen
A. Truong
Hanhe Lin
M. Vlaminck
Wilfried Philips
H. Luong
3DGS
35
0
0
25 May 2025
Beyond Editing Pairs: Fine-Grained Instructional Image Editing via Multi-Scale Learnable Regions
Beyond Editing Pairs: Fine-Grained Instructional Image Editing via Multi-Scale Learnable Regions
Chenrui Ma
Xi Xiao
Tianyang Wang
Yanning Shen
DiffM
44
0
0
25 May 2025
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data
Yiren Song
Cheng Liu
Mike Zheng Shou
DiffM
178
2
0
24 May 2025
Affective Image Editing: Shaping Emotional Factors via Text Descriptions
Affective Image Editing: Shaping Emotional Factors via Text Descriptions
Peixuan Zhang
Shuchen Weng
Chengxuan Zhu
Binghao Tang
Zijian Jia
Si Li
Boxin Shi
DiffM
29
0
0
24 May 2025
DetailFusion: A Dual-branch Framework with Detail Enhancement for Composed Image Retrieval
DetailFusion: A Dual-branch Framework with Detail Enhancement for Composed Image Retrieval
Yuxin Yang
Yinan Zhou
Yuxin Chen
Ziqi Zhang
Zongyang Ma
...
Bing Li
Lin Song
Jun Gao
Peng Li
Weiming Hu
199
0
0
23 May 2025
ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback
ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback
Litao Guo
Xinli Xu
Luozhou Wang
Jiantao Lin
Jinsong Zhou
Zixin Zhang
Bolan Su
Ying-Cong Chen
LLMAGLRM
86
1
0
23 May 2025
Slot-MLLM: Object-Centric Visual Tokenization for Multimodal LLM
Slot-MLLM: Object-Centric Visual Tokenization for Multimodal LLM
Donghwan Chi
Hyomin Kim
Yoonjin Oh
Yongjin Kim
Donghoon Lee
DaeJin Jo
Jongmin Kim
Junyeob Baek
Sungjin Ahn
Sungwoong Kim
MLLMVLM
480
0
0
23 May 2025
Conditional Panoramic Image Generation via Masked Autoregressive Modeling
Conditional Panoramic Image Generation via Masked Autoregressive Modeling
Chaoyang Wang
Xiangtai Li
Lu Qi
X. Lin
Jinbin Bai
Qianyu Zhou
Yunhai Tong
DiffM
87
1
0
22 May 2025
Forward-only Diffusion Probabilistic Models
Forward-only Diffusion Probabilistic Models
Ziwei Luo
Fredrik K. Gustafsson
Jens Sjölund
Thomas B. Schön
64
0
0
22 May 2025
KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models
KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models
Yongliang Wu
Zonghui Li
Xinting Hu
Xinyu Ye
Xianfang Zeng
Gang Yu
Wenbo Zhu
Bernt Schiele
Ming-Hsuan Yang
Xu Yang
VLM
93
0
0
22 May 2025
Exploring the Limits of Vision-Language-Action Manipulations in Cross-task Generalization
Exploring the Limits of Vision-Language-Action Manipulations in Cross-task Generalization
Jiaming Zhou
Ke Ye
Jiayi Liu
Teli Ma
Zifang Wang
Ronghe Qiu
Kun-Yu Lin
Zhilin Zhao
Junwei Liang
117
2
0
21 May 2025
OmniStyle: Filtering High Quality Style Transfer Data at Scale
OmniStyle: Filtering High Quality Style Transfer Data at Scale
Ye Wang
Ruiqi Liu
Jiang Lin
Fei Liu
Zili Yi
Yilin Wang
Rui Ma
74
0
0
20 May 2025
Anti-Inpainting: A Proactive Defense against Malicious Diffusion-based Inpainters under Unknown Conditions
Anti-Inpainting: A Proactive Defense against Malicious Diffusion-based Inpainters under Unknown Conditions
Yimao Guo
Zuomin Qu
Wei Lu
Xiangyang Luo
DiffMAAML
63
0
0
19 May 2025
CompBench: Benchmarking Complex Instruction-guided Image Editing
CompBench: Benchmarking Complex Instruction-guided Image Editing
Bohan Jia
Wenxuan Huang
Yuntian Tang
Junbo Qiao
Jincheng Liao
...
Lin Chen
Fei Zhao
Zihan Wang
Yuan Xie
Shaohui Lin
CoGe
154
1
0
18 May 2025
SGD-Mix: Enhancing Domain-Specific Image Classification with Label-Preserving Data Augmentation
SGD-Mix: Enhancing Domain-Specific Image Classification with Label-Preserving Data Augmentation
Yixuan Dong
Fang-Yi Su
Jung-Hsien Chiang
DiffM
60
0
0
17 May 2025
iSegMan: Interactive Segment-and-Manipulate 3D Gaussians
iSegMan: Interactive Segment-and-Manipulate 3D Gaussians
Yian Zhao
Wanshi Xu
Ruochong Zheng
Pengchong Qiao
Chang Liu
Jie Chen
3DGS
91
0
0
17 May 2025
DDAE++: Enhancing Diffusion Models Towards Unified Generative and Discriminative Learning
DDAE++: Enhancing Diffusion Models Towards Unified Generative and Discriminative Learning
Weilai Xiang
Hongyu Yang
Di Huang
Yunhong Wang
122
0
0
16 May 2025
X-Edit: Detecting and Localizing Edits in Images Altered by Text-Guided Diffusion Models
X-Edit: Detecting and Localizing Edits in Images Altered by Text-Guided Diffusion Models
Valentina Bazyleva
Nicolo Bonettini
Gaurav Bharaj
DiffM
113
0
0
16 May 2025
NeuSEditor: From Multi-View Images to Text-Guided Neural Surface Edits
NeuSEditor: From Multi-View Images to Text-Guided Neural Surface Edits
Nail Ibrahimli
Julian F. P. Kooij
Liangliang Nan
58
0
0
16 May 2025
3D-Fixup: Advancing Photo Editing with 3D Priors
3D-Fixup: Advancing Photo Editing with 3D Priors
Yen-Chi Cheng
Krishna Kumar Singh
Jae Shin Yoon
Alex Schwing
Liangyan Gui
Matheus Gadelha
Paul Guerrero
Nanxuan Zhao
DiffM
122
0
0
15 May 2025
Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data
Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data
Yiwen Liu
Jessica Bader
Jae Myung Kim
DiffM
77
1
0
15 May 2025
IMAGE-ALCHEMY: Advancing subject fidelity in personalised text-to-image generation
IMAGE-ALCHEMY: Advancing subject fidelity in personalised text-to-image generation
Amritanshu Tiwari
Cherish Puniani
Kaustubh Sharma
Ojasva Nema
DiffM
101
0
0
15 May 2025
Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis
Bingxin Ke
Kevin Qu
Tianfu Wang
Nando Metzger
Shengyu Huang
Bo Li
Anton Obukhov
Konrad Schindler
DiffMVLM
122
1
0
14 May 2025
Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models
Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models
Donghoon Kim
Minji Bae
Kyuhong Shim
B. Shim
75
1
0
13 May 2025
Object detection in adverse weather conditions for autonomous vehicles using Instruct Pix2Pix
Object detection in adverse weather conditions for autonomous vehicles using Instruct Pix2Pix
Unai Gurbindo
Axel Brando
Jaume Abella
Caroline König
89
0
0
13 May 2025
IntrinsicEdit: Precise generative image manipulation in intrinsic space
IntrinsicEdit: Precise generative image manipulation in intrinsic space
Linjie Lyu
Valentin Deschaintre
Yannick Hold-Geoffroy
Jian Yang
Jae Shin Yoon
Thomas Leimkuhler
Christian Theobalt
Iliyan Georgiev
DiffM
77
0
0
13 May 2025
UniSkill: Imitating Human Videos via Cross-Embodiment Skill Representations
UniSkill: Imitating Human Videos via Cross-Embodiment Skill Representations
Hanjung Kim
Jaehyun Kang
Hyolim Kang
Meedeum Cho
Seon Joo Kim
Youngwoon Lee
103
0
0
13 May 2025
HistDiST: Histopathological Diffusion-based Stain Transfer
HistDiST: Histopathological Diffusion-based Stain Transfer
Erik Großkopf
Valay Bundele
Mehran Hossienzadeh
Hendrik Lensch
79
1
0
11 May 2025
DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models
DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models
Junhao Xia
Chaoyang Zhang
Yecheng Zhang
Chengyang Zhou
Zhichang Wang
Bochun Liu
Dongshuo Yin
DiffMVGen
96
0
0
11 May 2025
MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills
MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills
Niladri Shekhar Dutt
Duygu Ceylan
Niloy J. Mitra
DiffM
62
0
0
09 May 2025
MDE-Edit: Masked Dual-Editing for Multi-Object Image Editing via Diffusion Models
MDE-Edit: Masked Dual-Editing for Multi-Object Image Editing via Diffusion Models
Hongyang Zhu
Haipeng Liu
Bo Fu
Yang Wang
DiffM
129
0
0
08 May 2025
A Preliminary Study for GPT-4o on Image Restoration
A Preliminary Study for GPT-4o on Image Restoration
Hao Yang
Yiran Yang
Ruikun Zhang
Liyuan Pan
103
1
0
08 May 2025
MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation
MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation
Zilong Chen
Yikai Wang
Wenqiang Sun
Feng Wang
Yiwen Chen
Huaping Liu
88
0
0
07 May 2025
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing
Ming Li
Xin Gu
Fan Chen
X. Xing
Longyin Wen
Chong Chen
Sijie Zhu
DiffM
263
2
0
05 May 2025
Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction
Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction
Biao Gong
Cheng Zou
Dandan Zheng
Hu Yu
Jingdong Chen
...
Qingpei Guo
Rui Liu
Weilong Chai
Xinyu Xiao
Ziyuan Huang
MLLM
218
3
0
05 May 2025
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities
Wei Wei
Jintao Guo
Shanshan Zhao
Minghao Fu
Lunhao Duan
...
Guo-Hua Wang
Qing-Guo Chen
Zhao Xu
Weihua Luo
Kaifu Zhang
DiffM
303
1
0
05 May 2025
Benchmarking Feature Upsampling Methods for Vision Foundation Models using Interactive Segmentation
Benchmarking Feature Upsampling Methods for Vision Foundation Models using Interactive Segmentation
Volodymyr Havrylov
Haiwen Huang
Dan Zhang
Andreas Geiger
495
0
0
04 May 2025
Segment Any RGB-Thermal Model with Language-aided Distillation
Segment Any RGB-Thermal Model with Language-aided Distillation
Dong Xing
Xianxun Zhu
Wei Zhou
Qika Lin
Hang Yang
Yuqing Wang
VLM
185
0
0
04 May 2025
Rethinking Score Distilling Sampling for 3D Editing and Generation
Rethinking Score Distilling Sampling for 3D Editing and Generation
Xingyu Miao
Haoran Duan
Yang Long
Jiawei Han
95
1
0
03 May 2025
InstructAttribute: Fine-grained Object Attributes editing with Instruction
InstructAttribute: Fine-grained Object Attributes editing with Instruction
Xingxi Yin
Jingfeng Zhang
Zhi Li
You Li
Yanzhe Zhang
Yin Zhang
DiffM
455
1
0
01 May 2025
Controllable Weather Synthesis and Removal with Video Diffusion Models
Controllable Weather Synthesis and Removal with Video Diffusion Models
Chih-Hao Lin
Ziyi Wang
Ruofan Liang
Yuxuan Zhang
Sanja Fidler
Shenlong Wang
Zan Gojcic
DiffMVGen
70
1
0
01 May 2025
Multi-Modal Language Models as Text-to-Image Model Evaluators
Multi-Modal Language Models as Text-to-Image Model Evaluators
Jiahui Chen
Candace Ross
Reyhane Askari Hemmat
Koustuv Sinha
Melissa Hall
M. Drozdzal
Adriana Romero-Soriano
EGVM
105
0
0
01 May 2025
In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer
In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer
Zechuan Zhang
Ji Xie
Yu Lu
Zongxin Yang
Yue Yang
DiffM
146
11
0
29 Apr 2025
Creating Your Editable 3D Photorealistic Avatar with Tetrahedron-constrained Gaussian Splatting
Creating Your Editable 3D Photorealistic Avatar with Tetrahedron-constrained Gaussian Splatting
Hanxi Liu
Yifang Men
Zhouhui Lian
3DGS
73
0
0
29 Apr 2025
Advance Fake Video Detection via Vision Transformers
Advance Fake Video Detection via Vision Transformers
Joy Battocchio
S. Dell’Anna
Andrea Montibeller
Giulia Boato
ViTVGen
81
0
0
29 Apr 2025
Previous
12345...272829
Next