ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.09800
  4. Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions

InstructPix2Pix: Learning to Follow Image Editing Instructions

17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM
ArXivPDFHTML

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,348 papers shown
Title
In-Context Learning Unlocked for Diffusion Models
In-Context Learning Unlocked for Diffusion Models
Zhendong Wang
Yifan Jiang
Yadong Lu
Yelong Shen
Pengcheng He
Weizhu Chen
Zhangyang Wang
Mingyuan Zhou
VLM
DiffM
86
68
0
01 May 2023
Let the Chart Spark: Embedding Semantic Context into Chart with
  Text-to-Image Generative Model
Let the Chart Spark: Embedding Semantic Context into Chart with Text-to-Image Generative Model
Shishi Xiao
Suizi Huang
Yue Lin
Yilin Ye
Weizhen Zeng
36
30
0
28 Apr 2023
IconShop: Text-Guided Vector Icon Synthesis with Autoregressive
  Transformers
IconShop: Text-Guided Vector Icon Synthesis with Autoregressive Transformers
Rong Wu
Wanchao Su
Kede Ma
Jing Liao
29
34
0
27 Apr 2023
Learning Human-Human Interactions in Images from Weak Textual
  Supervision
Learning Human-Human Interactions in Images from Weak Textual Supervision
Morris Alper
Hadar Averbuch-Elor
VLM
37
2
0
27 Apr 2023
Patch Diffusion: Faster and More Data-Efficient Training of Diffusion
  Models
Patch Diffusion: Faster and More Data-Efficient Training of Diffusion Models
Zhendong Wang
Yifan Jiang
Huangjie Zheng
Peihao Wang
Pengcheng He
Zhangyang Wang
Weizhu Chen
Mingyuan Zhou
25
96
0
25 Apr 2023
SINC: Spatial Composition of 3D Human Motions for Simultaneous Action
  Generation
SINC: Spatial Composition of 3D Human Motions for Simultaneous Action Generation
Nikos Athanasiou
Mathis Petrovich
Michael J. Black
Gül Varol
13
40
0
20 Apr 2023
HyperStyle3D: Text-Guided 3D Portrait Stylization via Hypernetworks
HyperStyle3D: Text-Guided 3D Portrait Stylization via Hypernetworks
Zhuo Chen
Xudong Xu
Yichao Yan
Ye Pan
Wenhan Zhu
Wayne Wu
Bo Dai
Xiaokang Yang
3DH
24
8
0
19 Apr 2023
Visual Instruction Tuning
Visual Instruction Tuning
Haotian Liu
Chunyuan Li
Qingyang Wu
Yong Jae Lee
SyDa
VLM
MLLM
82
4,242
0
17 Apr 2023
Delta Denoising Score
Delta Denoising Score
Amir Hertz
Kfir Aberman
Daniel Cohen-Or
DiffM
25
89
0
14 Apr 2023
One-Shot Stylization for Full-Body Human Images
One-Shot Stylization for Full-Body Human Images
Aiyu Cui
Svetlana Lazebnik
3DH
21
0
0
14 Apr 2023
Expressive Text-to-Image Generation with Rich Text
Expressive Text-to-Image Generation with Rich Text
Songwei Ge
Taesung Park
Jun-Yan Zhu
Jia-Bin Huang
DiffM
79
79
0
13 Apr 2023
Segment Everything Everywhere All at Once
Segment Everything Everywhere All at Once
Xueyan Zou
Jianwei Yang
Hao Zhang
Feng Li
Linjie Li
Jianfeng Wang
Lijuan Wang
Jianfeng Gao
Yong Jae Lee
MLLM
VLM
9
457
0
13 Apr 2023
An Edit Friendly DDPM Noise Space: Inversion and Manipulations
An Edit Friendly DDPM Noise Space: Inversion and Manipulations
Inbar Huberman-Spiegelglas
Vladimir Kulikov
T. Michaeli
DiffM
13
140
0
12 Apr 2023
DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion
DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion
J. Karras
Aleksander Holynski
Ting-Chun Wang
Ira Kemelmacher-Shlizerman
DiffM
VGen
21
137
0
12 Apr 2023
Improving Diffusion Models for Scene Text Editing with Dual Encoders
Improving Diffusion Models for Scene Text Editing with Dual Encoders
Jiabao Ji
Guanhua Zhang
Zhaowen Wang
Bairu Hou
Zhifei Zhang
Brian L. Price
Shiyu Chang
DiffM
30
29
0
12 Apr 2023
Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into
  3D, alleviate Janus problem and Beyond
Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond
Mohammadreza Armandpour
A. Sadeghian
Huangjie Zheng
Amir Sadeghian
Mingyuan Zhou
DiffM
18
123
0
11 Apr 2023
Leveraging Neural Representations for Audio Manipulation
Leveraging Neural Representations for Audio Manipulation
Scott H. Hawley
C. Steinmetz
25
2
0
10 Apr 2023
Towards Real-time Text-driven Image Manipulation with Unconditional
  Diffusion Models
Towards Real-time Text-driven Image Manipulation with Unconditional Diffusion Models
Nikita Starodubcev
Dmitry Baranchuk
Valentin Khrulkov
Artem Babenko
DiffM
47
4
0
10 Apr 2023
InstantBooth: Personalized Text-to-Image Generation without Test-Time
  Finetuning
InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning
Jing Shi
Wei Xiong
Zhe-nan Lin
H. J. Jung
DiffM
119
278
0
06 Apr 2023
Inst-Inpaint: Instructing to Remove Objects with Diffusion Models
Inst-Inpaint: Instructing to Remove Objects with Diffusion Models
Ahmet Burak Yildirim
Vedat Baday
Erkut Erdem
Aykut Erdem
Aysegül Dündar
DiffM
14
60
0
06 Apr 2023
Taming Encoder for Zero Fine-tuning Image Customization with
  Text-to-Image Diffusion Models
Taming Encoder for Zero Fine-tuning Image Customization with Text-to-Image Diffusion Models
Xuhui Jia
Yang Zhao
Kelvin C. K. Chan
Yandong Li
Han-Ying Zhang
Boqing Gong
Tingbo Hou
H. Wang
Yu-Chuan Su
DiffM
19
100
0
05 Apr 2023
AUDIT: Audio Editing by Following Instructions with Latent Diffusion
  Models
AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
Yuancheng Wang
Zeqian Ju
Xuejiao Tan
Lei He
Zhizheng Wu
Jiang Bian
Sheng Zhao
DiffM
19
47
0
03 Apr 2023
Subject-driven Text-to-Image Generation via Apprenticeship Learning
Subject-driven Text-to-Image Generation via Apprenticeship Learning
Wenhu Chen
Hexiang Hu
Yandong Li
Nataniel Rui
Xuhui Jia
Ming-Wei Chang
William W. Cohen
DiffM
11
187
0
01 Apr 2023
Going Beyond Nouns With Vision & Language Models Using Synthetic Data
Going Beyond Nouns With Vision & Language Models Using Synthetic Data
Paola Cascante-Bonilla
Khaled Shehada
James Smith
Sivan Doveh
Donghyun Kim
...
Gül Varol
A. Oliva
Vicente Ordonez
Rogerio Feris
Leonid Karlinsky
VLM
SyDa
22
40
0
30 Mar 2023
PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image Editor
PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image Editor
Vidit Goel
E. Peruzzo
Yifan Jiang
Dejia Xu
Xingqian Xu
N. Sebe
Trevor Darrell
Zhangyang Wang
Humphrey Shi
DiffM
22
6
0
30 Mar 2023
MDP: A Generalized Framework for Text-Guided Image Editing by
  Manipulating the Diffusion Path
MDP: A Generalized Framework for Text-Guided Image Editing by Manipulating the Diffusion Path
Qian Wang
Biao Zhang
Michael Birsak
Peter Wonka
DiffM
24
17
0
29 Mar 2023
Instruct 3D-to-3D: Text Instruction Guided 3D-to-3D conversion
Instruct 3D-to-3D: Text Instruction Guided 3D-to-3D conversion
Hiromichi Kamata
Yuiko Sakuma
Akio Hayakawa
Masato Ishii
T. Narihira
DiffM
29
37
0
28 Mar 2023
The Stable Signature: Rooting Watermarks in Latent Diffusion Models
The Stable Signature: Rooting Watermarks in Latent Diffusion Models
Pierre Fernandez
Guillaume Couairon
Hervé Jégou
Matthijs Douze
Teddy Furon
WIGM
15
176
0
27 Mar 2023
Training-free Content Injection using h-space in Diffusion Models
Training-free Content Injection using h-space in Diffusion Models
Jaeseok Jeong
Mingi Kwon
Youngjung Uh
DiffM
18
24
0
27 Mar 2023
Guiding AI-Generated Digital Content with Wireless Perception
Guiding AI-Generated Digital Content with Wireless Perception
Jiacheng Wang
Hongyang Du
Dusit Niyato
Zehui Xiong
Jiawen Kang
Shiwen Mao
Xuemin
X. Shen
26
12
0
26 Mar 2023
Human Preference Score: Better Aligning Text-to-Image Models with Human
  Preference
Human Preference Score: Better Aligning Text-to-Image Models with Human Preference
Xiaoshi Wu
Keqiang Sun
Feng Zhu
Rui Zhao
Hongsheng Li
20
131
0
25 Mar 2023
DreamBooth3D: Subject-Driven Text-to-3D Generation
DreamBooth3D: Subject-Driven Text-to-3D Generation
Amit Raj
S. Kaza
Ben Poole
Michael Niemeyer
Nataniel Ruiz
...
Kfir Aberman
Michael Rubinstein
Jonathan T. Barron
Yuanzhen Li
Varun Jampani
DiffM
24
219
0
23 Mar 2023
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video
  Generators
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators
Levon Khachatryan
A. Movsisyan
Vahram Tadevosyan
Roberto Henschel
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
VGen
27
541
0
23 Mar 2023
Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions
Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions
Ayaan Haque
Matthew Tancik
Alexei A. Efros
Aleksander Holynski
Angjoo Kanazawa
VGen
DiffM
25
360
0
22 Mar 2023
Pix2Video: Video Editing using Image Diffusion
Pix2Video: Video Editing using Image Diffusion
Duygu Ceylan
C. Huang
Niloy J. Mitra
DiffM
VGen
35
245
0
22 Mar 2023
LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation
LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation
K. Pnvr
Bharat Singh
P. Ghosh
Behjat Siddiquie
David Jacobs
DiffM
22
29
0
22 Mar 2023
Vox-E: Text-guided Voxel Editing of 3D Objects
Vox-E: Text-guided Voxel Editing of 3D Objects
Etai Sella
Gal Fiebelman
Peter Hedman
Hadar Averbuch-Elor
DiffM
28
73
0
21 Mar 2023
Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models
Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models
Lukas Höllein
Ang Cao
Andrew Owens
Justin Johnson
Matthias Nießner
DiffM
30
177
0
21 Mar 2023
CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion
CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion
Geonmo Gu
Sanghyuk Chun
Wonjae Kim
HeeJae Jun
Yoohoon Kang
Sangdoo Yun
DiffM
23
50
0
21 Mar 2023
Zero-1-to-3: Zero-shot One Image to 3D Object
Zero-1-to-3: Zero-shot One Image to 3D Object
Ruoshi Liu
Rundi Wu
Basile Van Hoorick
P. Tokmakov
Sergey Zakharov
Carl Vondrick
DiffM
29
1,046
0
20 Mar 2023
Localizing Object-level Shape Variations with Text-to-Image Diffusion
  Models
Localizing Object-level Shape Variations with Text-to-Image Diffusion Models
Or Patashnik
Daniel Garibi
Idan Azuri
Hadar Averbuch-Elor
Daniel Cohen-Or
DiffM
29
110
0
20 Mar 2023
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
Ligong Han
Yinxiao Li
Han Zhang
P. Milanfar
Dimitris N. Metaxas
Feng Yang
DiffM
38
270
0
20 Mar 2023
DialogPaint: A Dialog-based Image Editing Model
DialogPaint: A Dialog-based Image Editing Model
Jingxuan Wei
Shiyu Wu
Xin Jiang
Yequan Wang
KELM
DiffM
22
5
0
17 Mar 2023
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation
Can Qin
Ning Yu
Chen Xing
Shu Zhen Zhang
Zeyuan Chen
Stefano Ermon
Yun Fu
Caiming Xiong
Ran Xu
DiffM
35
19
0
17 Mar 2023
HIVE: Harnessing Human Feedback for Instructional Visual Editing
HIVE: Harnessing Human Feedback for Instructional Visual Editing
Shu Zhen Zhang
Xinyi Yang
Yihao Feng
Can Qin
Chia-Chih Chen
...
Haiquan Wang
Silvio Savarese
Stefano Ermon
Caiming Xiong
Ran Xu
18
103
0
16 Mar 2023
Efficient Diffusion Training via Min-SNR Weighting Strategy
Efficient Diffusion Training via Min-SNR Weighting Strategy
Tiankai Hang
Shuyang Gu
Chen Li
Jianmin Bao
Dong Chen
Han Hu
Xin Geng
B. Guo
16
150
0
16 Mar 2023
P+: Extended Textual Conditioning in Text-to-Image Generation
P+: Extended Textual Conditioning in Text-to-Image Generation
A. Voynov
Qinghao Chu
Daniel Cohen-Or
Kfir Aberman
VLM
DiffM
46
176
0
16 Mar 2023
Automatic Geo-alignment of Artwork in Children's Story Books
Automatic Geo-alignment of Artwork in Children's Story Books
Jakub J Dylag
V. Suarez
James Wald
Aneesha Amodini Uvara
DiffM
36
0
0
16 Mar 2023
Aerial Diffusion: Text Guided Ground-to-Aerial View Translation from a
  Single Image using Diffusion Models
Aerial Diffusion: Text Guided Ground-to-Aerial View Translation from a Single Image using Diffusion Models
D. Kothandaraman
Tianyi Zhou
Ming Lin
Dinesh Manocha
24
5
0
15 Mar 2023
Class-Guided Image-to-Image Diffusion: Cell Painting from Brightfield
  Images with Class Labels
Class-Guided Image-to-Image Diffusion: Cell Painting from Brightfield Images with Class Labels
J. Cross-Zamirski
P. Anand
Guy B. Williams
E. Mouchet
Yinhai Wang
Carola-Bibiane Schönlieb
VLM
DiffM
MedIm
6
8
0
15 Mar 2023
Previous
123...252627
Next