ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.09800
  4. Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions

InstructPix2Pix: Learning to Follow Image Editing Instructions

17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM
ArXivPDFHTML

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,356 papers shown
Title
RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion
RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion
Jaidev Shriram
Alex Trevithick
Lingjie Liu
Ravi Ramamoorthi
DiffM
3DGS
77
56
0
10 Apr 2024
GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis
GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis
Srikumar Sastry
Subash Khanal
Aayush Dhakal
Nathan Jacobs
69
7
0
09 Apr 2024
SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions
SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions
Xiaoyu Liu
Yuxiang Wei
Ming-Yu Liu
Xianhui Lin
Peiran Ren
Xuansong Xie
Wangmeng Zuo
DiffM
52
5
0
09 Apr 2024
ZeST: Zero-Shot Material Transfer from a Single Image
ZeST: Zero-Shot Material Transfer from a Single Image
Ta-Ying Cheng
Prafull Sharma
Andrew Markham
Niki Trigoni
Varun Jampani
43
10
0
09 Apr 2024
DreamView: Injecting View-specific Text Guidance into Text-to-3D
  Generation
DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation
Junkai Yan
Yipeng Gao
Q. Yang
Xihan Wei
Xuansong Xie
Ancong Wu
Wei-Shi Zheng
45
2
0
09 Apr 2024
Tackling Structural Hallucination in Image Translation with Local
  Diffusion
Tackling Structural Hallucination in Image Translation with Local Diffusion
Seunghoi Kim
Chen Jin
Tom Diethe
Matteo Figini
Henry F. J. Tregidgo
A. Mullokandov
Philip Teare
Daniel C. Alexander
MedIm
35
5
0
09 Apr 2024
UniFL: Improve Stable Diffusion via Unified Feedback Learning
UniFL: Improve Stable Diffusion via Unified Feedback Learning
Jiacheng Zhang
Jie Wu
Yuxi Ren
Xin Xia
Huafeng Kuang
...
Jiashi Li
Xuefeng Xiao
Min Zheng
Lean Fu
Guanbin Li
45
5
0
08 Apr 2024
Responsible Visual Editing
Responsible Visual Editing
Minheng Ni
Yeli Shen
Lei Zhang
W. Zuo
DiffM
35
0
0
08 Apr 2024
Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask
  Prompt
Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt
Zhiqi Huang
Hui Xiong
Haoyu Wang
Longguang Wang
Zhiheng Li
DiffM
40
0
0
08 Apr 2024
DATENeRF: Depth-Aware Text-based Editing of NeRFs
DATENeRF: Depth-Aware Text-based Editing of NeRFs
Sara Rojas
Julien Philip
Kai Zhang
Sai Bi
Fujun Luan
Guohao Li
Kalyan Sunkavalli
DiffM
32
3
0
06 Apr 2024
Aligning Diffusion Models by Optimizing Human Utility
Aligning Diffusion Models by Optimizing Human Utility
Shufan Li
Konstantinos Kallidromitis
Akash Gokul
Yusuke Kato
Kazuki Kozuka
109
30
0
06 Apr 2024
Idea-2-3D: Collaborative LMM Agents Enable 3D Model Generation from
  Interleaved Multimodal Inputs
Idea-2-3D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
Junhao Chen
Xiang Li
Xiaojun Ye
Chao Li
Zhaoxin Fan
Hao Zhao
VGen
3DV
220
4
0
05 Apr 2024
InstructHumans: Editing Animated 3D Human Textures with Instructions
InstructHumans: Editing Animated 3D Human Textures with Instructions
Jiayin Zhu
Linlin Yang
Angela Yao
DiffM
54
1
0
05 Apr 2024
DiffuseMix: Label-Preserving Data Augmentation with Diffusion Models
DiffuseMix: Label-Preserving Data Augmentation with Diffusion Models
Khawar Islam
Muhammad Zaigham Zaheer
Arif Mahmood
Karthik Nandakumar
DiffM
42
28
0
05 Apr 2024
Gen3DSR: Generalizable 3D Scene Reconstruction via Divide and Conquer from a Single View
Gen3DSR: Generalizable 3D Scene Reconstruction via Divide and Conquer from a Single View
Andreea Dogaru
M. Ozer
Bernhard Egger
3DGS
64
4
0
04 Apr 2024
Many-to-many Image Generation with Auto-regressive Diffusion Models
Many-to-many Image Generation with Auto-regressive Diffusion Models
Ying Shen
Yizhe Zhang
Shuangfei Zhai
Lifu Huang
J. Susskind
Jiatao Gu
51
6
0
03 Apr 2024
MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image
  Generation
MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation
Petru-Daniel Tudosiu
Yongxin Yang
Shifeng Zhang
Fei Chen
Jingyu Sun
Gerasimos Lampouras
Ignacio Iacobacci
Sarah Parisot
50
10
0
03 Apr 2024
GenN2N: Generative NeRF2NeRF Translation
GenN2N: Generative NeRF2NeRF Translation
Xiangyue Liu
Han Xue
Kunming Luo
Ping Tan
Li Yi
29
4
0
03 Apr 2024
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image
  Generation
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation
Haofan Wang
Matteo Spinelli
Qixun Wang
Xu Bai
Zekui Qin
Anthony Chen
DiffM
47
86
0
03 Apr 2024
Freditor: High-Fidelity and Transferable NeRF Editing by Frequency
  Decomposition
Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition
Yisheng He
Weihao Yuan
Siyu Zhu
Zilong Dong
Liefeng Bo
Qixing Huang
47
3
0
03 Apr 2024
GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from
  a Single Image
GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image
Chong Bao
Yinda Zhang
Yuan Li
Xiyu Zhang
Bangbang Yang
Hujun Bao
Marc Pollefeys
Guofeng Zhang
Zhaopeng Cui
DiffM
47
5
0
02 Apr 2024
PREGO: online mistake detection in PRocedural EGOcentric videos
PREGO: online mistake detection in PRocedural EGOcentric videos
Alessandro Flaborea
Guido Maria DÁmely di Melendugno
Leonardo Plini
Luca Scofano
Edoardo De Matteis
Antonino Furnari
G. Farinella
Fabio Galasso
EgoV
61
11
0
02 Apr 2024
MotionChain: Conversational Motion Controllers via Multimodal Prompts
MotionChain: Conversational Motion Controllers via Multimodal Prompts
Biao Jiang
Xin Chen
C. Zhang
Fukun Yin
Zhuoyuan Li
Gang Yu
Jiayuan Fan
VGen
LRM
40
10
0
02 Apr 2024
FashionEngine: Interactive 3D Human Generation and Editing via
  Multimodal Controls
FashionEngine: Interactive 3D Human Generation and Editing via Multimodal Controls
Tao Hu
Fangzhou Hong
Zhaoxi Chen
Ziwei Liu
DiffM
40
0
0
02 Apr 2024
Evaluating Text-to-Visual Generation with Image-to-Text Generation
Evaluating Text-to-Visual Generation with Image-to-Text Generation
Zhiqiu Lin
Deepak Pathak
Baiqi Li
Jiayao Li
Xide Xia
Graham Neubig
Pengchuan Zhang
Deva Ramanan
EGVM
58
133
0
01 Apr 2024
Large Motion Model for Unified Multi-Modal Motion Generation
Large Motion Model for Unified Multi-Modal Motion Generation
Mingyuan Zhang
Daisheng Jin
Chenyang Gu
Fangzhou Hong
Zhongang Cai
...
Chongzhi Zhang
Xinying Guo
Lei Yang
Ying He
Ziwei Liu
VGen
62
25
0
01 Apr 2024
An image speaks a thousand words, but can everyone listen? On image
  transcreation for cultural relevance
An image speaks a thousand words, but can everyone listen? On image transcreation for cultural relevance
Simran Khanuja
Sathyanarayanan Ramamoorthy
Yueqi Song
Graham Neubig
DiffM
24
11
0
01 Apr 2024
Feature Splatting: Language-Driven Physics-Based Scene Synthesis and
  Editing
Feature Splatting: Language-Driven Physics-Based Scene Synthesis and Editing
Ri-Zhao Qiu
Ge Yang
Weijia Zeng
Xiaolong Wang
3DGS
49
22
0
01 Apr 2024
CLIPtone: Unsupervised Learning for Text-based Image Tone Adjustment
CLIPtone: Unsupervised Learning for Text-based Image Tone Adjustment
Hyeongmin Lee
Kyoungkook Kang
Jungseul Ok
Sunghyun Cho
CLIP
40
2
0
01 Apr 2024
Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic
  Propagation
Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic Propagation
Haofeng Liu
Chenshu Xu
Yifei Yang
Lihua Zeng
Shengfeng He
DiffM
68
23
0
01 Apr 2024
Neural Radiance Field-based Visual Rendering: A Comprehensive Review
Neural Radiance Field-based Visual Rendering: A Comprehensive Review
Mingyuan Yao
Yukang Huo
Yang Ran
Qingbin Tian
Ruifeng Wang
Haihua Wang
AI4CE
46
9
0
31 Mar 2024
A Review of Modern Recommender Systems Using Generative Models
  (Gen-RecSys)
A Review of Modern Recommender Systems Using Generative Models (Gen-RecSys)
Yashar Deldjoo
Zhankui He
Julian McAuley
Anton Korikov
Scott Sanner
Arnau Ramisa
René Vidal
M. Sathiamoorthy
Atoosa Kasirzadeh
Silvia Milano
VLM
41
42
0
31 Mar 2024
Benchmarking Counterfactual Image Generation
Benchmarking Counterfactual Image Generation
Thomas Melistas
Nikos Spyrou
Nefeli Gkouti
Pedro Sanchez
Athanasios Vlontzos
Yannis Panagakis
G. Papanastasiou
Sotirios A. Tsaftaris
EGVM
CML
51
7
0
29 Mar 2024
U-VAP: User-specified Visual Appearance Personalization via Decoupled
  Self Augmentation
U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation
You Wu
Kean Liu
Xiaoyue Mi
Fan Tang
Juan Cao
Jintao Li
DiffM
40
4
0
29 Mar 2024
InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction
InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction
Sirui Xu
Ziyin Wang
Yu Wang
Liangyan Gui
50
25
0
28 Mar 2024
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
Kai Zhang
Yi Luan
Hexiang Hu
Kenton Lee
Siyuan Qiao
Wenhu Chen
Yu-Chuan Su
Ming-Wei Chang
VLM
LRM
47
2
0
28 Mar 2024
Enhance Image Classification via Inter-Class Image Mixup with Diffusion
  Model
Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model
Zhicai Wang
Longhui Wei
Tan Wang
Heyu Chen
Yanbin Hao
Xiang Wang
Xiangnan He
Qi Tian
VLM
DiffM
40
17
0
28 Mar 2024
Locate, Assign, Refine: Taming Customized Promptable Image Inpainting
Locate, Assign, Refine: Taming Customized Promptable Image Inpainting
Yulin Pan
Chaojie Mao
Zeyinzi Jiang
Zhen Han
Jingfeng Zhang
Xiangteng He
DiffM
44
2
0
28 Mar 2024
Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D
Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D
Mukund Varma
Peihao Wang
Zhiwen Fan
Zhangyang Wang
Hao Su
R. Ramamoorthi
VLM
45
8
0
27 Mar 2024
CPR: Retrieval Augmented Generation for Copyright Protection
CPR: Retrieval Augmented Generation for Copyright Protection
Aditya Golatkar
Alessandro Achille
L. Zancato
Yu-Xiang Wang
Ashwin Swaminathan
Stefano Soatto
DiffM
40
16
0
27 Mar 2024
ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object
  Removal and Insertion
ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion
Daniel Winter
Matan Cohen
Shlomi Fruchter
Yael Pritch
Alex Rav-Acha
Yedid Hoshen
DiffM
48
27
0
27 Mar 2024
ImageNet-D: Benchmarking Neural Network Robustness on Diffusion
  Synthetic Object
ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object
Chenshuang Zhang
Fei Pan
Junmo Kim
In So Kweon
Chengzhi Mao
57
11
1
27 Mar 2024
InstructBrush: Learning Attention-based Instruction Optimization for
  Image Editing
InstructBrush: Learning Attention-based Instruction Optimization for Image Editing
Ruoyu Zhao
Qingnan Fan
Fei Kou
Shuai Qin
Hong Gu
Wei Wu
Pengcheng Xu
Mingrui Zhu
Nannan Wang
Xinbo Gao
43
4
0
27 Mar 2024
FlexEdit: Flexible and Controllable Diffusion-based Object-centric Image
  Editing
FlexEdit: Flexible and Controllable Diffusion-based Object-centric Image Editing
Trong-Tung Nguyen
Duc A. Nguyen
Anh Tran
Cuong Pham
DiffM
55
7
0
27 Mar 2024
NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual
  Pretraining and Multi-level Modulation
NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual Pretraining and Multi-level Modulation
Jingyang Huo
Yikai Wang
Xuelin Qian
Yun Wang
Chong Li
Jianfeng Feng
Yanwei Fu
DiffM
MedIm
48
10
0
27 Mar 2024
AID: Attention Interpolation of Text-to-Image Diffusion
AID: Attention Interpolation of Text-to-Image Diffusion
Qiyuan He
Jinghao Wang
Ziwei Liu
Angela Yao
DiffM
45
9
0
26 Mar 2024
AniArtAvatar: Animatable 3D Art Avatar from a Single Image
AniArtAvatar: Animatable 3D Art Avatar from a Single Image
Shaoxu Li
47
1
0
26 Mar 2024
InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse
  Diffusion
InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse Diffusion
Jihyun Lee
Shunsuke Saito
Giljoo Nam
Minhyuk Sung
Tae-Kyun Kim
50
11
0
26 Mar 2024
TRIP: Temporal Residual Learning with Image Noise Prior for
  Image-to-Video Diffusion Models
TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models
Zhongwei Zhang
Fuchen Long
Yingwei Pan
Zhaofan Qiu
Ting Yao
Yang Cao
Tao Mei
VGen
48
25
0
25 Mar 2024
SD-DiT: Unleashing the Power of Self-supervised Discrimination in
  Diffusion Transformer
SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer
Rui Zhu
Yingwei Pan
Yehao Li
Ting Yao
Zhenglong Sun
Tao Mei
C. Chen
50
25
0
25 Mar 2024
Previous
123...141516...262728
Next