ResearchTrend.AI
InstructPix2Pix: Learning to Follow Image Editing Instructions
arXiv: 2211.09800

17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,356 papers shown
Edit-Your-Motion: Space-Time Diffusion Decoupling Learning for Video Motion Editing
Yi Zuo
Lingling Li
Licheng Jiao
Fang Liu
Xu Liu
Wenping Ma
Shuyuan Yang
Yuwei Guo
VGen
DiffM
41
1
0
07 May 2024
SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image Editing
Yuying Ge
Sijie Zhao
Chen Li
Yixiao Ge
Ying Shan
43
29
0
07 May 2024
Video Diffusion Models: A Survey
Andrew Melnik
Michal Ljubljanac
Cong Lu
Qi Yan
Weiming Ren
Helge J. Ritter
VGen
77
13
0
06 May 2024
Exploring Text-based Realistic Building Facades Editing Applicaiton
Jing Wang
Xin Zhang
AI4CE
47
1
0
05 May 2024
Auto-Encoding Morph-Tokens for Multimodal LLM
Kaihang Pan
Siliang Tang
Juncheng Li
Zhaoyu Fan
Wei Chow
Shuicheng Yan
Tat-Seng Chua
Yueting Zhuang
Hanwang Zhang
MLLM
35
18
0
03 May 2024
Customizing Text-to-Image Models with a Single Image Pair
Maxwell Jones
Sheng-Yu Wang
Nupur Kumari
David Bau
Jun-Yan Zhu
DiffM
30
20
0
02 May 2024
LocInv: Localization-aware Inversion for Text-Guided Image Editing
Chuanming Tang
Kai Wang
Fei Yang
Joost van de Weijer
DiffM
49
3
0
02 May 2024
TexSliders: Diffusion-Based Texture Editing in CLIP Space
Julia Guerrero-Viu
Milos Hasan
Arthur Roullier
Midhun Harikumar
Yiwei Hu
Paul Guerrero
Diego F. F. Gutierrez
B. Masiá
Valentin Deschaintre
DiffM
33
13
0
01 May 2024
RGB↔X: Image decomposition and synthesis using material- and lighting-aware diffusion models
Zheng Zeng
Valentin Deschaintre
Iliyan Georgiev
Yannick Hold-Geoffroy
Yiwei Hu
Fujun Luan
Ling-Qi Yan
Miloš Hašan
DiffM
52
36
0
01 May 2024
GraCo: Granularity-Controllable Interactive Segmentation
Yian Zhao
Kehan Li
Ze-Long Cheng
Pengchong Qiao
Xiawu Zheng
Rongrong Ji
Chang Liu
Li-ming Yuan
Jie Chen
44
9
0
01 May 2024
Streamlining Image Editing with Layered Diffusion Brushes
Peyman Gholami
Robert Xiao
DiffM
34
1
0
01 May 2024
Synthetic Image Verification in the Era of Generative AI: What Works and What Isn't There Yet
D. Tariang
Riccardo Corvi
D. Cozzolino
Giovanni Poggi
Koki Nagano
L. Verdoliva
61
8
0
30 Apr 2024
NeRF-Insert: 3D Local Editing with Multimodal Control Signals
Benet Oriol Sabat
Alessandro Achille
Matthew Trager
Stefano Soatto
37
2
0
30 Apr 2024
DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing
Minghao Chen
Iro Laina
Andrea Vedaldi
3DGS
55
23
0
29 Apr 2024
G-Refine: A General Quality Refiner for Text-to-Image Generation
Chunyi Li
Haoning Wu
Hongkun Hao
Zicheng Zhang
Tengchuan Kou
Chaofeng Chen
Lei Bai
Xiaohong Liu
Weisi Lin
Guangtao Zhai
37
4
0
29 Apr 2024
WorldGPT: Empowering LLM as Multimodal World Model
Zhiqi Ge
Hongzhe Huang
Mingze Zhou
Juncheng Li
Guoming Wang
Siliang Tang
Yueting Zhuang
43
27
0
28 Apr 2024
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Navve Wasserman
Noam Rotstein
Roy Ganz
Ron Kimmel
DiffM
44
15
0
28 Apr 2024
DM-Align: Leveraging the Power of Natural Language Instructions to Make Changes to Images
Maria Mihaela Truşcǎ
Tinne Tuytelaars
Marie-Francine Moens
DiffM
54
1
0
27 Apr 2024
SDFD: Building a Versatile Synthetic Face Image Dataset with Diverse Attributes
Georgia Baltsou
Ioannis Sarridis
C. Koutlis
Symeon Papadopoulos
34
2
0
26 Apr 2024
ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion
Ziyue Zhang
Mingbao Lin
Rongrong Ji
Yuxin Zhang
DiffM
59
3
0
26 Apr 2024
Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior
Han Wang
Xinning Chai
Yiwen Wang
Yuhong Zhang
Rong Xie
Li Song
DiffM
36
2
0
25 Apr 2024
Editable Image Elements for Controllable Synthesis
Jiteng Mu
Michael Gharbi
Richard Zhang
Eli Shechtman
Nuno Vasconcelos
Xiaolong Wang
Taesung Park
DiffM
58
9
0
24 Apr 2024
ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
Weifeng Chen
Jiacheng Zhang
Jie Wu
Hefeng Wu
Xuefeng Xiao
Liang Lin
47
12
0
23 Apr 2024
Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
Amirmojtaba Sabour
Sanja Fidler
Karsten Kreis
DiffM
45
26
0
22 Apr 2024
Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting
Weili Zeng
Yichao Yan
Qi Zhu
Zhuo Chen
Pengzhi Chu
Weiming Zhao
Xiaokang Yang
105
9
0
22 Apr 2024
U Can't Gen This? A Survey of Intellectual Property Protection Methods for Data in Generative AI
Tanja Sarcevic
Alicja Karlowicz
Rudolf Mayer
Ricardo A. Baeza-Yates
Andreas Rauber
56
6
0
22 Apr 2024
Gorgeous: Create Your Desired Character Facial Makeup from Any Ideas
Jia Wei Sii
Chee Seng Chan
DiffM
58
0
0
22 Apr 2024
A Multimodal Automated Interpretability Agent
Tamar Rott Shaham
Sarah Schwettmann
Franklin Wang
Achyuta Rajaram
Evan Hernandez
Jacob Andreas
Antonio Torralba
43
18
0
22 Apr 2024
SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation
Yuying Ge
Sijie Zhao
Jinguo Zhu
Yixiao Ge
Kun Yi
Lin Song
Chen Li
Xiaohan Ding
Ying Shan
VLM
70
113
0
22 Apr 2024
LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation
Haoyu Zheng
Wenqiao Zhang
Yaoke Wang
Hao Zhou
Jiang Liu
Juncheng Li
Zheqi Lv
Siliang Tang
Yueting Zhuang
49
1
0
21 Apr 2024
Generating Daylight-driven Architectural Design via Diffusion Models
Pengzhi Li
Baijuan Li
AI4CE
DiffM
26
11
0
20 Apr 2024
FilterPrompt: Guiding Image Transfer in Diffusion Models
Xi Wang
Yichen Peng
Heng Fang
Haoran Xie
Xi Yang
Chuntao Li
DiffM
45
0
0
20 Apr 2024
Physical Backdoor Attack can Jeopardize Driving with Vision-Large-Language Models
Zhenyang Ni
Rui Ye
Yuxian Wei
Zhen Xiang
Yanfeng Wang
Siheng Chen
AAML
43
10
0
19 Apr 2024
Customizing Text-to-Image Diffusion with Camera Viewpoint Control
Nupur Kumari
Grace Su
Richard Zhang
Taesung Park
Eli Shechtman
Jun-Yan Zhu
DiffM
46
3
0
18 Apr 2024
Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation
Qiyuan Dai
Sibei Yang
34
8
0
18 Apr 2024
Factorized Diffusion: Perceptual Illusions by Noise Decomposition
Daniel Geng
Inbum Park
Andrew Owens
DiffM
49
16
0
17 Apr 2024
Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives
Zhangchi Feng
Richong Zhang
Zhijie Nie
51
7
0
17 Apr 2024
TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing
Sherry X. Chen
Yaron Vaxman
Elad Ben Baruch
David Asulin
Aviad Moreshet
Kuo-Chin Lien
Misha Sra
Pradeep Sen
39
3
0
17 Apr 2024
HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing
Mude Hui
Siwei Yang
Bingchen Zhao
Yichun Shi
Heng Wang
Peng Wang
Yuyin Zhou
Cihang Xie
43
57
0
15 Apr 2024
MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models
Nithin Gopalakrishnan Nair
Jeya Maria Jose Valanarasu
Vishal M. Patel
MoMe
42
7
0
15 Apr 2024
Conditional Prototype Rectification Prompt Learning
Haoxing Chen
Yaohui Li
Zizheng Huang
Yan Hong
Zhuoer Xu
Zhangxuan Gu
Jun Lan
Huijia Zhu
Weiqiang Wang
VLM
50
3
0
15 Apr 2024
In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation
Han Xue
Qianru Sun
Li Song
Wenjun Zhang
Zhiwu Huang
MLLM
44
0
0
15 Apr 2024
Magic Clothing: Controllable Garment-Driven Image Synthesis
Weifeng Chen
Tao Gu
Yuhao Xu
Chengcai Chen
48
17
0
15 Apr 2024
PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI
Yandan Yang
Baoxiong Jia
Peiyuan Zhi
Siyuan Huang
LM&Ro
VGen
54
43
0
15 Apr 2024
S3Editor: A Sparse Semantic-Disentangled Self-Training Framework for Face Video Editing
Guangzhi Wang
Tianyi Chen
Kamran Ghasedi
HsiangTao Wu
Tianyu Ding
Chris Nuesmeyer
Ilya Zharkov
Mohan Kankanhalli
Luming Liang
44
1
0
11 Apr 2024
OpenBias: Open-set Bias Detection in Text-to-Image Generative Models
Moreno D'Incà
E. Peruzzo
Massimiliano Mancini
Dejia Xu
Vidit Goel
Xingqian Xu
Zhangyang Wang
Humphrey Shi
N. Sebe
58
32
0
11 Apr 2024
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
Ming Li
Taojiannan Yang
Huafeng Kuang
Jie Wu
Zhaoning Wang
Xuefeng Xiao
Chong Chen
47
63
0
11 Apr 2024
Accelerating Inference in Large Language Models with a Unified Layer Skipping Strategy
Yijin Liu
Fandong Meng
Jie Zhou
AI4CE
37
8
0
10 Apr 2024
UDiFF: Generating Conditional Unsigned Distance Fields with Optimal Wavelet Diffusion
Junsheng Zhou
Weiqi Zhang
Baorui Ma
Kanle Shi
Yu-Shen Liu
Zhizhong Han
59
18
0
10 Apr 2024
Tuning-Free Adaptive Style Incorporation for Structure-Consistent Text-Driven Style Transfer
Yanqi Ge
Jiaqi Liu
Qingnan Fan
Xi Jiang
Ye Huang
Shuai Qin
Hong Gu
Wen Li
Lixin Duan
DiffM
54
1
0
10 Apr 2024