ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.09800
  4. Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions
v1v2 (latest)

InstructPix2Pix: Learning to Follow Image Editing Instructions

17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM
ArXiv (abs)PDFHTML

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,418 papers shown
Title
Neural Radiance Field-based Visual Rendering: A Comprehensive Review
Neural Radiance Field-based Visual Rendering: A Comprehensive Review
Mingyuan Yao
Yukang Huo
Yang Ran
Qingbin Tian
Ruifeng Wang
Haihua Wang
AI4CE
84
9
0
31 Mar 2024
A Review of Modern Recommender Systems Using Generative Models
  (Gen-RecSys)
A Review of Modern Recommender Systems Using Generative Models (Gen-RecSys)
Yashar Deldjoo
Zhankui He
Julian McAuley
Anton Korikov
Scott Sanner
Arnau Ramisa
René Vidal
M. Sathiamoorthy
Atoosa Kasirzadeh
Silvia Milano
VLM
152
61
0
31 Mar 2024
Benchmarking Counterfactual Image Generation
Benchmarking Counterfactual Image Generation
Thomas Melistas
Nikos Spyrou
Nefeli Gkouti
Pedro Sanchez
Athanasios Vlontzos
Yannis Panagakis
G. Papanastasiou
Sotirios A. Tsaftaris
EGVMCML
134
11
0
29 Mar 2024
U-VAP: User-specified Visual Appearance Personalization via Decoupled
  Self Augmentation
U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation
You Wu
Kean Liu
Xiaoyue Mi
Fan Tang
Juan Cao
Jintao Li
DiffM
91
5
0
29 Mar 2024
InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction
InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction
Sirui Xu
Ziyin Wang
Yu Wang
Liangyan Gui
107
31
0
28 Mar 2024
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
Kai Zhang
Yi Luan
Hexiang Hu
Kenton Lee
Siyuan Qiao
Wenhu Chen
Yu-Chuan Su
Ming-Wei Chang
VLMLRM
102
40
0
28 Mar 2024
Enhance Image Classification via Inter-Class Image Mixup with Diffusion
  Model
Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model
Zhicai Wang
Longhui Wei
Tan Wang
Heyu Chen
Yanbin Hao
Xiang Wang
Xiangnan He
Qi Tian
VLMDiffM
78
18
0
28 Mar 2024
Locate, Assign, Refine: Taming Customized Promptable Image Inpainting
Locate, Assign, Refine: Taming Customized Promptable Image Inpainting
Yulin Pan
Chaojie Mao
Zeyinzi Jiang
Zhen Han
Jingfeng Zhang
Xiangteng He
DiffM
78
2
0
28 Mar 2024
Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D
Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D
Mukund Varma
Peihao Wang
Zhiwen Fan
Zhangyang Wang
Hao Su
R. Ramamoorthi
VLM
88
8
0
27 Mar 2024
CPR: Retrieval Augmented Generation for Copyright Protection
CPR: Retrieval Augmented Generation for Copyright Protection
Aditya Golatkar
Alessandro Achille
Luca Zancato
Yu-Xiang Wang
Ashwin Swaminathan
Stefano Soatto
DiffM
82
17
0
27 Mar 2024
ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object
  Removal and Insertion
ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion
Daniel Winter
Matan Cohen
Shlomi Fruchter
Yael Pritch
Alex Rav-Acha
Yedid Hoshen
DiffM
95
32
0
27 Mar 2024
ImageNet-D: Benchmarking Neural Network Robustness on Diffusion
  Synthetic Object
ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object
Chenshuang Zhang
Fei Pan
Junmo Kim
In So Kweon
Chengzhi Mao
85
11
1
27 Mar 2024
InstructBrush: Learning Attention-based Instruction Optimization for
  Image Editing
InstructBrush: Learning Attention-based Instruction Optimization for Image Editing
Ruoyu Zhao
Qingnan Fan
Fei Kou
Shuai Qin
Hong Gu
Wei Wu
Pengcheng Xu
Mingrui Zhu
Nannan Wang
Xinbo Gao
70
4
0
27 Mar 2024
FlexEdit: Flexible and Controllable Diffusion-based Object-centric Image
  Editing
FlexEdit: Flexible and Controllable Diffusion-based Object-centric Image Editing
Trong-Tung Nguyen
Duc A. Nguyen
Anh Tran
Cuong Pham
DiffM
77
7
0
27 Mar 2024
NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual
  Pretraining and Multi-level Modulation
NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual Pretraining and Multi-level Modulation
Jingyang Huo
Yikai Wang
Xuelin Qian
Yun Wang
Chong Li
Jianfeng Feng
Yanwei Fu
DiffMMedIm
77
10
0
27 Mar 2024
AID: Attention Interpolation of Text-to-Image Diffusion
AID: Attention Interpolation of Text-to-Image Diffusion
Qiyuan He
Jinghao Wang
Ziwei Liu
Angela Yao
DiffM
84
10
0
26 Mar 2024
AniArtAvatar: Animatable 3D Art Avatar from a Single Image
AniArtAvatar: Animatable 3D Art Avatar from a Single Image
Shaoxu Li
83
1
0
26 Mar 2024
InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse
  Diffusion
InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse Diffusion
Jihyun Lee
Shunsuke Saito
Giljoo Nam
Minhyuk Sung
Tae-Kyun Kim
73
14
0
26 Mar 2024
TRIP: Temporal Residual Learning with Image Noise Prior for
  Image-to-Video Diffusion Models
TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models
Zhongwei Zhang
Fuchen Long
Yingwei Pan
Zhaofan Qiu
Ting Yao
Yang Cao
Tao Mei
VGen
95
29
0
25 Mar 2024
SD-DiT: Unleashing the Power of Self-supervised Discrimination in
  Diffusion Transformer
SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer
Rui Zhu
Yingwei Pan
Yehao Li
Ting Yao
Zhenglong Sun
Tao Mei
C. Chen
120
26
0
25 Mar 2024
Composed Video Retrieval via Enriched Context and Discriminative
  Embeddings
Composed Video Retrieval via Enriched Context and Discriminative Embeddings
Omkar Thawakar
Muzammal Naseer
Rao Muhammad Anwer
Salman Khan
Michael Felsberg
Mubarak Shah
Fahad Shahbaz Khan
56
11
0
25 Mar 2024
Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions
Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions
S. A. Baumann
Felix Krause
Michael Neumayr
Nick Stracke
Vincent Tao Hu
Bjorn Ommer
Björn Ommer
DiffMLM&Ro
134
12
0
25 Mar 2024
Pose-Guided Self-Training with Two-Stage Clustering for Unsupervised
  Landmark Discovery
Pose-Guided Self-Training with Two-Stage Clustering for Unsupervised Landmark Discovery
Siddharth Tourani
Ahmed Alwheibi
Arif Mahmood
Muhammad Haris Khan
DiffM
83
2
0
24 Mar 2024
ThemeStation: Generating Theme-Aware 3D Assets from Few Exemplars
ThemeStation: Generating Theme-Aware 3D Assets from Few Exemplars
Zhenwei Wang
Tengfei Wang
Gerhard Hancke
Ziwei Liu
Rynson W. H. Lau
DiffMVGen
89
6
0
22 Mar 2024
DragAPart: Learning a Part-Level Motion Prior for Articulated Objects
DragAPart: Learning a Part-Level Motion Prior for Articulated Objects
Ruining Li
Chuanxia Zheng
Christian Rupprecht
Andrea Vedaldi
DiffM
110
19
0
22 Mar 2024
Recent Trends in 3D Reconstruction of General Non-Rigid Scenes
Recent Trends in 3D Reconstruction of General Non-Rigid Scenes
Raza Yunus
J. E. Lenssen
Michael Niemeyer
Yiyi Liao
Christian Rupprecht
Christian Theobalt
Gerard Pons-Moll
Jia-Bin Huang
Vladislav Golyanik
Eddy Ilg
138
26
0
22 Mar 2024
DreamFlow: High-Quality Text-to-3D Generation by Approximating
  Probability Flow
DreamFlow: High-Quality Text-to-3D Generation by Approximating Probability Flow
Kyungmin Lee
Kihyuk Sohn
Jinwoo Shin
80
19
0
22 Mar 2024
Controlled Training Data Generation with Diffusion Models
Controlled Training Data Generation with Diffusion Models
Teresa Yeo
Andrei Atanov
Harold Benoit
Aleksandr Alekseev
Ruchira Ray
Pooya Esmaeil Akhoondi
Amir Zamir
109
6
0
22 Mar 2024
AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks
AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks
Max Ku
Cong Wei
Weiming Ren
Huan Yang
Wenhu Chen
VGenDiffM
168
29
0
21 Mar 2024
Efficient Video Diffusion Models via Content-Frame Motion-Latent
  Decomposition
Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
Sihyun Yu
Weili Nie
De-An Huang
Boyi Li
Jinwoo Shin
A. Anandkumar
VGenDiffM
100
19
0
21 Mar 2024
TimeRewind: Rewinding Time with Image-and-Events Video Diffusion
TimeRewind: Rewinding Time with Image-and-Events Video Diffusion
Jingxi Chen
Brandon Yushan Feng
Haoming Cai
Mingyang Xie
Christopher A. Metzler
Cornelia Fermuller
Yiannis Aloimonos
90
4
0
20 Mar 2024
DepthFM: Fast Monocular Depth Estimation with Flow Matching
DepthFM: Fast Monocular Depth Estimation with Flow Matching
Ming Gui
Johannes S. Fischer
Ulrich Prestel
Pingchuan Ma
Dmytro Kotovenko
Olga Grebenkova
S. A. Baumann
Vincent Tao Hu
Bjorn Ommer
MDE
107
59
0
20 Mar 2024
ZoDi: Zero-Shot Domain Adaptation with Diffusion-Based Image Transfer
ZoDi: Zero-Shot Domain Adaptation with Diffusion-Based Image Transfer
Hiroki Azuma
Yusuke Matsui
Atsuto Maki
VLM
72
1
0
20 Mar 2024
Ground-A-Score: Scaling Up the Score Distillation for Multi-Attribute
  Editing
Ground-A-Score: Scaling Up the Score Distillation for Multi-Attribute Editing
Hangeol Chang
Jinho Chang
Jong Chul Ye
DiffM
106
3
0
20 Mar 2024
IDAdapter: Learning Mixed Features for Tuning-Free Personalization of
  Text-to-Image Models
IDAdapter: Learning Mixed Features for Tuning-Free Personalization of Text-to-Image Models
Siying Cui
Jia Guo
Xiang An
Jiankang Deng
Yongle Zhao
Xinyu Wei
Ziyong Feng
DiffM
94
24
0
20 Mar 2024
Mora: Enabling Generalist Video Generation via A Multi-Agent Framework
Mora: Enabling Generalist Video Generation via A Multi-Agent Framework
Zhengqing Yuan
Ruoxi Chen
Zhaoxu Li
Haolong Jia
Lifang He
Chi Wang
Lichao Sun
VGen
109
28
0
20 Mar 2024
Magic Fixup: Streamlining Photo Editing by Watching Dynamic Videos
Magic Fixup: Streamlining Photo Editing by Watching Dynamic Videos
Hadi Alzayer
Zhihao Xia
Xuaner Zhang
Eli Shechtman
Jia-Bin Huang
Michael Gharbi
DiffMVGen
67
20
0
19 Mar 2024
Wear-Any-Way: Manipulable Virtual Try-on via Sparse Correspondence
  Alignment
Wear-Any-Way: Manipulable Virtual Try-on via Sparse Correspondence Alignment
Mengting Chen
Xi Chen
Zhonghua Zhai
Chen Ju
Xuewen Hong
Jinsong Lan
Shuai Xiao
OODDiffM
88
26
0
19 Mar 2024
FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
Shuai Yang
Yifan Zhou
Ziwei Liu
Chen Change Loy
VGenDiffM
129
33
0
19 Mar 2024
You Only Sample Once: Taming One-Step Text-to-Image Synthesis by
  Self-Cooperative Diffusion GANs
You Only Sample Once: Taming One-Step Text-to-Image Synthesis by Self-Cooperative Diffusion GANs
Yihong Luo
Xiaolong Chen
Xinghua Qu
Jing Tang
94
11
0
19 Mar 2024
Generative Enhancement for 3D Medical Images
Generative Enhancement for 3D Medical Images
Lingting Zhu
Noel Codella
Dongdong Chen
Zhenchao Jin
Lu Yuan
Lequan Yu
DiffMMedIm
95
10
0
19 Mar 2024
Tuning-Free Image Customization with Image and Text Guidance
Tuning-Free Image Customization with Image and Text Guidance
Pengzhi Li
Qiang Nie
Ying Chen
Xi Jiang
Kai Wu
Yuhuan Lin
Yong-Jin Liu
Jinlong Peng
Chengjie Wang
Feng Zheng
DiffM
68
21
0
19 Mar 2024
LASPA: Latent Spatial Alignment for Fast Training-free Single Image
  Editing
LASPA: Latent Spatial Alignment for Fast Training-free Single Image Editing
Yazeed Alharbi
Peter Wonka
DiffM
66
0
0
19 Mar 2024
MineDreamer: Learning to Follow Instructions via Chain-of-Imagination
  for Simulated-World Control
MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control
Enshen Zhou
Yiran Qin
Zhen-fei Yin
Yuzhou Huang
Ruimao Zhang
Lu Sheng
Yu Qiao
Jing Shao
LM&RoAI4CE
113
36
0
18 Mar 2024
One-Step Image Translation with Text-to-Image Models
One-Step Image Translation with Text-to-Image Models
Gaurav Parmar
Taesung Park
Srinivasa Narasimhan
Jun-Yan Zhu
106
57
0
18 Mar 2024
Generic 3D Diffusion Adapter Using Controlled Multi-View Editing
Generic 3D Diffusion Adapter Using Controlled Multi-View Editing
Hansheng Chen
Ruoxi Shi
Yulin Liu
Bokui Shen
Jiayuan Gu
Gordon Wetzstein
Hao Su
Leonidas Guibas
DiffM
106
22
0
18 Mar 2024
Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion
  Distillation
Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation
Axel Sauer
Frederic Boesel
Tim Dockhorn
A. Blattmann
Patrick Esser
Robin Rombach
DiffM
116
135
0
18 Mar 2024
SpatialPIN: Enhancing Spatial Reasoning Capabilities of Vision-Language
  Models through Prompting and Interacting 3D Priors
SpatialPIN: Enhancing Spatial Reasoning Capabilities of Vision-Language Models through Prompting and Interacting 3D Priors
Chenyang Ma
Kai Lu
Ta-Ying Cheng
Niki Trigoni
Andrew Markham
LRM
85
16
0
18 Mar 2024
EffiVED:Efficient Video Editing via Text-instruction Diffusion Models
EffiVED:Efficient Video Editing via Text-instruction Diffusion Models
Zhenghao Zhang
Zuozhuo Dai
Long Qin
Weizhi Wang
DiffMVGen
74
2
0
18 Mar 2024
Diffusion Models are Geometry Critics: Single Image 3D Editing Using
  Pre-Trained Diffusion Priors
Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors
Ruicheng Wang
Jianfeng Xiang
Jiaolong Yang
Xin Tong
DiffM
91
5
0
18 Mar 2024
Previous
123...161718...272829
Next