ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.09800
  4. Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions

InstructPix2Pix: Learning to Follow Image Editing Instructions

17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM
ArXivPDFHTML

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,348 papers shown
Title
Training Data Synthesis with Difficulty Controlled Diffusion Model
Training Data Synthesis with Difficulty Controlled Diffusion Model
Zerun Wang
Jiafeng Mao
Xueting Wang
Toshihiko Yamasaki
DiffM
80
0
0
27 Nov 2024
Generative Image Layer Decomposition with Visual Effects
Generative Image Layer Decomposition with Visual Effects
Jinrui Yang
Qing Liu
Y. Li
S. Kim
D. Pakhomov
Mengwei Ren
Jianming Zhang
Zhe-nan Lin
Cihang Xie
Yuyin Zhou
DiffM
105
2
0
26 Nov 2024
InsightEdit: Towards Better Instruction Following for Image Editing
InsightEdit: Towards Better Instruction Following for Image Editing
Yingjing Xu
Jie Kong
Jiazhi Wang
Xiao Pan
Bo Lin
Qiang Liu
DiffM
91
1
0
26 Nov 2024
Omegance: A Single Parameter for Various Granularities in
  Diffusion-Based Synthesis
Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis
Xinyu Hou
Zongsheng Yue
Xiaoming Li
Chen Change Loy
VGen
DiffM
99
0
0
26 Nov 2024
GenDeg: Diffusion-based Degradation Synthesis for Generalizable All-In-One Image Restoration
GenDeg: Diffusion-based Degradation Synthesis for Generalizable All-In-One Image Restoration
Sudarshan Rajagopalan
Nithin Gopalakrishnan Nair
Jay N. Paranjape
Vishal M. Patel
DiffM
90
0
0
26 Nov 2024
One Diffusion to Generate Them All
One Diffusion to Generate Them All
Duong H. Le
Tuan Pham
Sangho Lee
Christopher Clark
Aniruddha Kembhavi
Stephan Mandt
Ranjay Krishna
Jiasen Lu
VLM
74
5
0
25 Nov 2024
Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian
  Theory
Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory
Eric Hanchen Jiang
Yasi Zhang
Zhi Zhang
Yixin Wan
Andrew Lizarraga
Shufan Li
Ying Nian Wu
DiffM
77
2
0
25 Nov 2024
UVCG: Leveraging Temporal Consistency for Universal Video Protection
UVCG: Leveraging Temporal Consistency for Universal Video Protection
KaiZhou Li
Jindong Gu
Xinchun Yu
Junjie Cao
Yansong Tang
Xiao-Ping Zhang
AAML
79
0
0
25 Nov 2024
Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing
Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing
Hanhui Wang
Yihua Zhang
Ruizheng Bai
Yue Zhao
Sijia Liu
Z. Tu
AAML
PICV
98
2
0
25 Nov 2024
DynamicAvatars: Accurate Dynamic Facial Avatars Reconstruction and
  Precise Editing with Diffusion Models
DynamicAvatars: Accurate Dynamic Facial Avatars Reconstruction and Precise Editing with Diffusion Models
Yangyang Qian
Yuan Sun
Yu-Xiao Guo
DiffM
176
0
0
24 Nov 2024
Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
P. Xu
Boyuan Jiang
Xiaobin Hu
Donghao Luo
Q. He
Jingyang Zhang
Chengjie Wang
Yunsheng Wu
Charles Ling
Boyu Wang
92
2
0
24 Nov 2024
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Qifan Yu
Wei Chow
Zhongqi Yue
Kaihang Pan
Yang Wu
Xiaoyang Wan
Juncheng Billy Li
Siliang Tang
H. Zhang
Yueting Zhuang
DiffM
106
17
0
24 Nov 2024
FATE: Full-head Gaussian Avatar with Textural Editing from Monocular Video
FATE: Full-head Gaussian Avatar with Textural Editing from Monocular Video
Jiawei Zhang
Zijian Wu
Zhiyang Liang
Yicheng Gong
Dongfang Hu
Yao Yao
Xun Cao
Hao Zhu
3DGS
84
1
0
23 Nov 2024
GIFT: A Framework for Global Interpretable Faithful Textual Explanations of Vision Classifiers
GIFT: A Framework for Global Interpretable Faithful Textual Explanations of Vision Classifiers
Éloi Zablocki
Valentin Gerard
Amaia Cardiel
Eric Gaussier
Matthieu Cord
Eduardo Valle
81
0
0
23 Nov 2024
TKG-DM: Training-free Chroma Key Content Generation Diffusion Model
TKG-DM: Training-free Chroma Key Content Generation Diffusion Model
Ryugo Morita
Stanislav Frolov
Brian B. Moser
Takahiro Shirakawa
Ko Watanabe
Andreas Dengel
Jinjia Zhou
DiffM
82
0
0
23 Nov 2024
VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video
  Local Editing
VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing
Jiahao Hu
Tianxiong Zhong
Xuebo Wang
Boyuan Jiang
Xingye Tian
Fei Yang
Pengfei Wan
Di Zhang
VGen
74
2
0
22 Nov 2024
HyperGAN-CLIP: A Unified Framework for Domain Adaptation, Image
  Synthesis and Manipulation
HyperGAN-CLIP: A Unified Framework for Domain Adaptation, Image Synthesis and Manipulation
Abdul Basit Anees
A. Baykal
Muhammed Burak Kizil
Duygu Ceylan
Erkut Erdem
Aykut Erdem
CLIP
69
1
0
19 Nov 2024
StableV2V: Stablizing Shape Consistency in Video-to-Video Editing
Chang-Shu Liu
Rui Li
Kaidong Zhang
Yunwei Lan
Dong Liu
DiffM
VGen
58
3
0
17 Nov 2024
Generating Compositional Scenes via Text-to-image RGBA Instance Generation
Alessandro Fontanella
Petru-Daniel Tudosiu
Yongxin Yang
Shifeng Zhang
Sarah Parisot
38
2
0
16 Nov 2024
MaskMedPaint: Masked Medical Image Inpainting with Diffusion Models for
  Mitigation of Spurious Correlations
MaskMedPaint: Masked Medical Image Inpainting with Diffusion Models for Mitigation of Spurious Correlations
Qixuan Jin
Walter Gerych
Marzyeh Ghassemi
DiffM
MedIm
33
0
0
16 Nov 2024
ColorEdit: Training-free Image-Guided Color editing with diffusion model
ColorEdit: Training-free Image-Guided Color editing with diffusion model
Xingxi Yin
Zhi Li
Jingfeng Zhang
Chenglin Li
Yin Zhang
DiffM
51
0
0
15 Nov 2024
Latent Space Disentanglement in Diffusion Transformers Enables Precise
  Zero-shot Semantic Editing
Latent Space Disentanglement in Diffusion Transformers Enables Precise Zero-shot Semantic Editing
Zitao Shuai
Chenwei Wu
Zhengxu Tang
Bowen Song
Liyue Shen
DiffM
55
0
0
12 Nov 2024
Material Transforms from Disentangled NeRF Representations
Material Transforms from Disentangled NeRF Representations
Ivan Lopes
Jean-François Lalonde
Raoul de Charette
34
0
0
12 Nov 2024
Semi-Truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating
  Robustness of AI-Generated Image detectors
Semi-Truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating Robustness of AI-Generated Image detectors
Anisha Pal
Julia Kruk
Mansi Phute
Manognya Bhattaram
Diyi Yang
Duen Horng Chau
Judy Hoffman
AAML
47
2
0
12 Nov 2024
Add-it: Training-Free Object Insertion in Images With Pretrained
  Diffusion Models
Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models
Yoad Tewel
Rinon Gal
Dvir Samuel
Y. Atzmon
Lior Wolf
Gal Chechik
VLM
56
6
0
11 Nov 2024
Extreme Rotation Estimation in the Wild
Extreme Rotation Estimation in the Wild
Hana Bezalel
Dotan Ankri
Ruojin Cai
Hadar Averbuch-Elor
36
2
0
11 Nov 2024
OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision
OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision
Cong Wei
Zheyang Xiong
Weiming Ren
Xinrun Du
Ge Zhang
Wenhu Chen
107
19
0
11 Nov 2024
ProEdit: Simple Progression is All You Need for High-Quality 3D Scene
  Editing
ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing
Jun-Kun Chen
Yu-Xiong Wang
DiffM
45
4
0
07 Nov 2024
Controlling Human Shape and Pose in Text-to-Image Diffusion Models via
  Domain Adaptation
Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation
Benito Buchheim
M. Reimann
Jürgen Döllner
26
0
0
07 Nov 2024
ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models
ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models
Ashutosh Srivastava
Tarun Ram Menta
Abhinav Java
Avadhoot Jadhav
Silky Singh
Surgan Jandial
Balaji Krishnamurthy
DiffM
38
1
0
06 Nov 2024
TI-PREGO: Chain of Thought and In-Context Learning for Online Mistake
  Detection in PRocedural EGOcentric Videos
TI-PREGO: Chain of Thought and In-Context Learning for Online Mistake Detection in PRocedural EGOcentric Videos
Leonardo Plini
Luca Scofano
Edoardo De Matteis
Guido Maria DÁmely di Melendugno
Alessandro Flaborea
Andrea Sanchietti
G. Farinella
Fabio Galasso
Antonino Furnari
EgoV
LRM
45
1
0
04 Nov 2024
AutoVFX: Physically Realistic Video Editing from Natural Language
  Instructions
AutoVFX: Physically Realistic Video Editing from Natural Language Instructions
Hao-Yu Hsu
Zhi-Hao Lin
Albert Zhai
Hongchi Xia
Shenlong Wang
VGen
50
9
0
04 Nov 2024
DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for
  Efficient Robot Execution
DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution
Yang Yue
Yulin Wang
Bingyi Kang
Yizeng Han
Shenzhi Wang
Shiji Song
Jiashi Feng
Gao Huang
VLM
40
16
0
04 Nov 2024
NeRF-Aug: Data Augmentation for Robotics with Neural Radiance Fields
NeRF-Aug: Data Augmentation for Robotics with Neural Radiance Fields
Eric Zhu
Mara Levy
M. Gwilliam
Abhinav Shrivastava
48
0
0
04 Nov 2024
Towards Small Object Editing: A Benchmark Dataset and A Training-Free
  Approach
Towards Small Object Editing: A Benchmark Dataset and A Training-Free Approach
Qihe Pan
Zhen Zhao
Zicheng Wang
Sifan Long
Yiming Wu
Wei Ji
Haoran Liang
Ronghua Liang
31
1
0
03 Nov 2024
MultiPull: Detailing Signed Distance Functions by Pulling Multi-Level
  Queries at Multi-Step
MultiPull: Detailing Signed Distance Functions by Pulling Multi-Level Queries at Multi-Step
Takeshi Noda
C. L. P. Chen
Weiqi Zhang
Xinhai Liu
Y. Liu
Zhizhong Han
3DPC
45
8
0
02 Nov 2024
X-Drive: Cross-modality consistent multi-sensor data synthesis for
  driving scenarios
X-Drive: Cross-modality consistent multi-sensor data synthesis for driving scenarios
Yichen Xie
Chenfeng Xu
C-T.John Peng
Shuqi Zhao
Nhat Ho
Alexander T. Pham
Mingyu Ding
M. Tomizuka
W. Zhan
DiffM
41
2
0
02 Nov 2024
Cityscape-Adverse: Benchmarking Robustness of Semantic Segmentation with
  Realistic Scene Modifications via Diffusion-Based Image Editing
Cityscape-Adverse: Benchmarking Robustness of Semantic Segmentation with Realistic Scene Modifications via Diffusion-Based Image Editing
Naufal Suryanto
Andro Aprila Adiputra
Ahmada Yusril Kadiptya
Thi-Thu-Huong Le
Derry Pratama
Yongsu Kim
Howon Kim
DiffM
57
0
0
01 Nov 2024
Fashion-VDM: Video Diffusion Model for Virtual Try-On
Fashion-VDM: Video Diffusion Model for Virtual Try-On
J. Karras
Yingwei Li
Nan Liu
Luyang Zhu
Innfarn Yoo
Andreas Lugmayr
Chris Lee
Ira Kemelmacher-Shlizerman
DiffM
VGen
40
4
0
31 Oct 2024
Scaling Concept With Text-Guided Diffusion Models
Scaling Concept With Text-Guided Diffusion Models
Chao Huang
Susan Liang
Yunlong Tang
Yapeng Tian
Anurag Kumar
Chenliang Xu
DiffM
51
6
0
31 Oct 2024
PACA: Perspective-Aware Cross-Attention Representation for Zero-Shot
  Scene Rearrangement
PACA: Perspective-Aware Cross-Attention Representation for Zero-Shot Scene Rearrangement
Shutong Jin
Ruiyu Wang
Kuangyi Chen
Florian T. Pokorny
29
0
0
29 Oct 2024
Adapting Diffusion Models for Improved Prompt Compliance and
  Controllable Image Synthesis
Adapting Diffusion Models for Improved Prompt Compliance and Controllable Image Synthesis
Deepak Sridhar
Abhishek Peri
Rohith Rachala
Nuno Vasconcelos
DiffM
37
0
0
29 Oct 2024
AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion models
AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion models
Yaopei Zeng
Yuanpu Cao
Bochuan Cao
Yurui Chang
Jinghui Chen
Lu Lin
DiffM
36
3
0
28 Oct 2024
Novel Object Synthesis via Adaptive Text-Image Harmony
Novel Object Synthesis via Adaptive Text-Image Harmony
Zeren Xiong
Zedong Zhang
Zikun Chen
Shuo Chen
Xianrui Li
Gan Sun
Jian Yang
Jun Li
DiffM
40
4
0
28 Oct 2024
GHIL-Glue: Hierarchical Control with Filtered Subgoal Images
GHIL-Glue: Hierarchical Control with Filtered Subgoal Images
Kyle Hatch
Ashwin Balakrishna
Oier Mees
Suraj Nair
Seohong Park
...
Masha Itkina
Benjamin Eysenbach
Sergey Levine
Thomas Kollar
Benjamin Burchfiel
65
2
0
26 Oct 2024
ArCSEM: Artistic Colorization of SEM Images via Gaussian Splatting
ArCSEM: Artistic Colorization of SEM Images via Gaussian Splatting
Takuma Nishimura
Andreea Dogaru
Martin Oeggerli
Bernhard Egger
33
0
0
25 Oct 2024
BIFRÖST: 3D-Aware Image compositing with Language Instructions
BIFRÖST: 3D-Aware Image compositing with Language Instructions
Lingxiao Li
Kaixiong Gong
Weihong Li
Xili Dai
Tao Chen
Xiaojun Yuan
Xiangyu Yue
29
2
0
24 Oct 2024
Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for
  Image Editing
Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing
Haonan Lin
Mengmeng Wang
Jiahao Wang
Wenbin An
Yan Chen
Yong Liu
Feng Tian
Guang Dai
Jingdong Wang
Qianying Wang
DiffM
43
9
0
24 Oct 2024
ChatSearch: a Dataset and a Generative Retrieval Model for General
  Conversational Image Retrieval
ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval
Zijia Zhao
Longteng Guo
Tongtian Yue
Erdong Hu
Shuai Shao
Zehuan Yuan
Hua Huang
Jiaheng Liu
26
1
0
24 Oct 2024
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances
Shilin Lu
Zihan Zhou
Jiayou Lu
Yuanzhi Zhu
A. Kong
WIGM
97
10
0
24 Oct 2024
Previous
123...678...252627
Next