ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.09800
  4. Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions

InstructPix2Pix: Learning to Follow Image Editing Instructions

17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM
ArXivPDFHTML

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,355 papers shown
Title
SeamlessNeRF: Stitching Part NeRFs with Gradient Propagation
SeamlessNeRF: Stitching Part NeRFs with Gradient Propagation
Bingchen Gong
Yuehao Wang
Xiaoguang Han
Qi Dou
34
3
0
30 Oct 2023
IterInv: Iterative Inversion for Pixel-Level T2I Models
IterInv: Iterative Inversion for Pixel-Level T2I Models
Chuanming Tang
Kai Wang
Joost van de Weijer
43
4
0
30 Oct 2023
Learning to Follow Object-Centric Image Editing Instructions Faithfully
Learning to Follow Object-Centric Image Editing Instructions Faithfully
Tuhin Chakrabarty
Kanishk Singh
Arkadiy Saakyan
Smaranda Muresan
DiffM
27
6
0
29 Oct 2023
SD4Match: Learning to Prompt Stable Diffusion Model for Semantic
  Matching
SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching
Xinghui Li
Jingyi Lu
Kai Han
V. Prisacariu
DiffM
30
19
0
26 Oct 2023
PERF: Panoramic Neural Radiance Field from a Single Panorama
PERF: Panoramic Neural Radiance Field from a Single Panorama
Guangcong Wang
Peng Wang
Zhaoxi Chen
Wenping Wang
Chen Change Loy
Ziwei Liu
MDE
25
31
0
25 Oct 2023
Fuse Your Latents: Video Editing with Multi-source Latent Diffusion
  Models
Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models
Tianyi Lu
Xing Zhang
Jiaxi Gu
Hang Xu
Renjing Pei
Songcen Xu
Zuxuan Wu
DiffM
VGen
33
4
0
25 Oct 2023
On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts
On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts
Yixin Wu
Ning Yu
Michael Backes
Yun Shen
Yang Zhang
DiffM
59
8
0
25 Oct 2023
CVPR 2023 Text Guided Video Editing Competition
CVPR 2023 Text Guided Video Editing Competition
Jay Zhangjie Wu
Xiuyu Li
Difei Gao
Zhen Dong
Jinbin Bai
...
Xu Cheng
Jie Tang
Mike Zheng Shou
Kurt Keutzer
Forrest N. Iandola
38
34
0
24 Oct 2023
WordArt Designer: User-Driven Artistic Typography Synthesis using Large
  Language Models
WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models
Jun-Yan He
Zhi-Qi Cheng
Chenyang Li
Jingdong Sun
Wangmeng Xiang
...
Yusen Hu
Bin Luo
Yifeng Geng
Xuansong Xie
Jingren Zhou
26
13
0
20 Oct 2023
EMIT-Diff: Enhancing Medical Image Segmentation via Text-Guided
  Diffusion Model
EMIT-Diff: Enhancing Medical Image Segmentation via Text-Guided Diffusion Model
Zheyu Zhang
Lanhong Yao
Bin Wang
Debesh Jha
Elif Keles
Alpay Medetalibeyoglu
Ulas Bagci
MedIm
46
10
0
19 Oct 2023
Audio Editing with Non-Rigid Text Prompts
Audio Editing with Non-Rigid Text Prompts
Francesco Paissan
Luca Della Libera
Zhepei Wang
Mirco Ravanelli
Paris Smaragdis
Cem Subakan
DiffM
46
5
0
19 Oct 2023
Object-aware Inversion and Reassembly for Image Editing
Object-aware Inversion and Reassembly for Image Editing
Zhen Yang
Dinggang Gui
Wen Wang
Hao Chen
Bohan Zhuang
Chunhua Shen
DiffM
38
15
0
18 Oct 2023
Progressive3D: Progressively Local Editing for Text-to-3D Content
  Creation with Complex Semantic Prompts
Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts
Xinhua Cheng
Tianyu Yang
Jianan Wang
Yu Li
Lei Zhang
Jian Zhang
Li-ming Yuan
DiffM
33
43
0
18 Oct 2023
Learning Neural Implicit through Volume Rendering with Attentive Depth
  Fusion Priors
Learning Neural Implicit through Volume Rendering with Attentive Depth Fusion Priors
Pengchong Hu
Zhizhong Han
38
12
0
17 Oct 2023
BiomedJourney: Counterfactual Biomedical Image Generation by
  Instruction-Learning from Multimodal Patient Journeys
BiomedJourney: Counterfactual Biomedical Image Generation by Instruction-Learning from Multimodal Patient Journeys
Yu Gu
Jianwei Yang
Naoto Usuyama
Chun-yue Li
Sheng Zhang
M. Lungren
Jianfeng Gao
Hoifung Poon
MedIm
32
22
0
16 Oct 2023
A Survey on Video Diffusion Models
A Survey on Video Diffusion Models
Zhen Xing
Qijun Feng
Haoran Chen
Qi Dai
Hang-Rui Hu
Hang Xu
Zuxuan Wu
Yu-Gang Jiang
EGVM
VGen
62
119
0
16 Oct 2023
TOSS:High-quality Text-guided Novel View Synthesis from a Single Image
TOSS:High-quality Text-guided Novel View Synthesis from a Single Image
Yukai Shi
Jianan Wang
He Cao
Boshi Tang
Xianbiao Qi
Tianyu Yang
Yukun Huang
Shilong Liu
Lei Zhang
H. Shum
DiffM
32
20
0
16 Oct 2023
Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion
  Models
Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion Models
Kevin Black
Mitsuhiko Nakamoto
P. Atreya
Homer Walke
Chelsea Finn
Aviral Kumar
Sergey Levine
DiffM
LM&Ro
35
133
0
16 Oct 2023
ConsistNet: Enforcing 3D Consistency for Multi-view Images Diffusion
ConsistNet: Enforcing 3D Consistency for Multi-view Images Diffusion
Jiayu Yang
Ziang Cheng
Yunfei Duan
Pan Ji
Hongdong Li
DiffM
47
54
0
16 Oct 2023
AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion
AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion
Yitong Jiang
Zhaoyang Zhang
Tianfan Xue
Liang Feng
DiffM
54
43
0
16 Oct 2023
ProteusNeRF: Fast Lightweight NeRF Editing using 3D-Aware Image Context
ProteusNeRF: Fast Lightweight NeRF Editing using 3D-Aware Image Context
Binglun Wang
Niladri Shekhar Dutt
Niloy J. Mitra
52
10
0
15 Oct 2023
Compositional Abilities Emerge Multiplicatively: Exploring Diffusion
  Models on a Synthetic Task
Compositional Abilities Emerge Multiplicatively: Exploring Diffusion Models on a Synthetic Task
Maya Okawa
Ekdeep Singh Lubana
Robert P. Dick
Hidenori Tanaka
CoGe
DiffM
39
46
0
13 Oct 2023
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic
  Image Design and Generation
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation
Zhengyuan Yang
Jianfeng Wang
Linjie Li
Kevin Qinghong Lin
Chung-Ching Lin
Zicheng Liu
Lijuan Wang
LRM
MLLM
DiffM
13
22
0
12 Oct 2023
Mini-DALLE3: Interactive Text to Image by Prompting Large Language
  Models
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
Zeqiang Lai
Xizhou Zhu
Jifeng Dai
Yu Qiao
Wenhai Wang
MLLM
DiffM
54
23
0
11 Oct 2023
KwaiYiiMath: Technical Report
KwaiYiiMath: Technical Report
Jia-Yi Fu
Lei Lin
Xiaoyang Gao
Pengli Liu
Zhengzong Chen
...
Zijia Lin
Fuzheng Zhang
Zhongyuan Wang
Di Zhang
Kun Gai
LRM
ReLM
RALM
53
2
0
11 Oct 2023
Multi-Concept T2I-Zero: Tweaking Only The Text Embeddings and Nothing
  Else
Multi-Concept T2I-Zero: Tweaking Only The Text Embeddings and Nothing Else
Hazarapet Tunanyan
Dejia Xu
Shant Navasardyan
Zhangyang Wang
Humphrey Shi
DiffM
88
7
0
11 Oct 2023
Uni-paint: A Unified Framework for Multimodal Image Inpainting with
  Pretrained Diffusion Model
Uni-paint: A Unified Framework for Multimodal Image Inpainting with Pretrained Diffusion Model
Shiyuan Yang
Xiaodong Chen
Jing Liao
DiffM
30
59
0
11 Oct 2023
State of the Art on Diffusion Models for Visual Computing
State of the Art on Diffusion Models for Visual Computing
Ryan Po
Wang Yifan
Vladislav Golyanik
Kfir Aberman
Jonathan T. Barron
...
Matthias Nießner
Bjorn Ommer
Christian Theobalt
Peter Wonka
Gordon Wetzstein
38
103
0
11 Oct 2023
Latent Diffusion Counterfactual Explanations
Latent Diffusion Counterfactual Explanations
Karim Farid
Simon Schrodi
Max Argus
Thomas Brox
DiffM
48
13
0
10 Oct 2023
FireAct: Toward Language Agent Fine-tuning
FireAct: Toward Language Agent Fine-tuning
Baian Chen
Chang Shu
Ehsan Shareghi
Nigel Collier
Karthik Narasimhan
Shunyu Yao
ALM
LLMAG
107
98
0
09 Oct 2023
IPDreamer: Appearance-Controllable 3D Object Generation with Complex
  Image Prompts
IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image Prompts
Bo-Wen Zeng
Shanglin Li
Yutang Feng
Ling Yang
Hong Li
...
Conghui He
Wentao Zhang
Jianzhuang Liu
Baochang Zhang
Shuicheng Yan
DiffM
40
2
0
09 Oct 2023
Efficient-3DiM: Learning a Generalizable Single-image Novel-view
  Synthesizer in One Day
Efficient-3DiM: Learning a Generalizable Single-image Novel-view Synthesizer in One Day
Yi Ding
Hao Tang
Jen-Hao Rick Chang
Liangchen Song
Zhangyang Wang
Liangliang Cao
DiffM
48
10
0
04 Oct 2023
Kosmos-G: Generating Images in Context with Multimodal Large Language
  Models
Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Xichen Pan
Li Dong
Shaohan Huang
Zhiliang Peng
Wenhu Chen
Furu Wei
VLM
11
62
0
04 Oct 2023
Probing Intersectional Biases in Vision-Language Models with
  Counterfactual Examples
Probing Intersectional Biases in Vision-Language Models with Counterfactual Examples
Phillip Howard
Avinash Madasu
Tiep Le
Gustavo Lujan Moreno
Vasudev Lal
VLM
29
4
0
04 Oct 2023
T$^3$Bench: Benchmarking Current Progress in Text-to-3D Generation
T3^33Bench: Benchmarking Current Progress in Text-to-3D Generation
Yuze He
Yushi Bai
Matthieu Lin
Wang Zhao
Yubin Hu
Jenny Sheng
Ran Yi
Juanzi Li
Yong Liu
44
31
0
04 Oct 2023
Magicremover: Tuning-free Text-guided Image inpainting with Diffusion
  Models
Magicremover: Tuning-free Text-guided Image inpainting with Diffusion Models
Si-hang Yang
Lu Zhang
Liqian Ma
Yu Liu
JingJing Fu
You He
DiffM
36
11
0
04 Oct 2023
MagicDrive: Street View Generation with Diverse 3D Geometry Control
MagicDrive: Street View Generation with Diverse 3D Geometry Control
Ruiyuan Gao
Kai Chen
Enze Xie
Lanqing Hong
Zhenguo Li
Dit-Yan Yeung
Qiang Xu
DiffM
44
104
0
04 Oct 2023
EditVal: Benchmarking Diffusion Based Text-Guided Image Editing Methods
EditVal: Benchmarking Diffusion Based Text-Guided Image Editing Methods
Samyadeep Basu
Mehrdad Saberi
S. Bhardwaj
Atoosa Malemir Chegini
Daniela Massiceti
Maziar Sanjabi
S. Hu
S. Feizi
59
17
0
03 Oct 2023
TP2O: Creative Text Pair-to-Object Generation using Balance
  Swap-Sampling
TP2O: Creative Text Pair-to-Object Generation using Balance Swap-Sampling
Jun Li
Zedong Zhang
Jian Yang
DiffM
44
6
0
03 Oct 2023
ImagenHub: Standardizing the evaluation of conditional image generation
  models
ImagenHub: Standardizing the evaluation of conditional image generation models
Max W.F. Ku
Tianle Li
Kai Zhang
Yujie Lu
Xingyu Fu
Wenwen Zhuang
Wenhu Chen
EGVM
44
47
0
02 Oct 2023
Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code
Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code
Xu Ju
Ailing Zeng
Hao Wang
Shaoteng Liu
Qiang Xu
DiffM
45
68
0
02 Oct 2023
CoDi: Conditional Diffusion Distillation for Higher-Fidelity and Faster
  Image Generation
CoDi: Conditional Diffusion Distillation for Higher-Fidelity and Faster Image Generation
Kangfu Mei
M. Delbracio
Hossein Talebi
Zhengzhong Tu
Vishal M. Patel
P. Milanfar
VLM
DiffM
64
11
0
02 Oct 2023
Making LLaMA SEE and Draw with SEED Tokenizer
Making LLaMA SEE and Draw with SEED Tokenizer
Yuying Ge
Sijie Zhao
Ziyun Zeng
Yixiao Ge
Chen Li
Xintao Wang
Ying Shan
38
128
0
02 Oct 2023
Controlling Vision-Language Models for Multi-Task Image Restoration
Controlling Vision-Language Models for Multi-Task Image Restoration
Ziwei Luo
Fredrik K. Gustafsson
Zheng Zhao
Jens Sjölund
Thomas B. Schon
VLM
81
32
0
02 Oct 2023
DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose
  Generation via Diffusion Models
DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models
Zhiyao Sun
Tian Lv
Sheng Ye
Matthieu Lin
Jenny Sheng
Yuhui Wen
Minjing Yu
Yong Liu
DiffM
49
45
0
30 Sep 2023
InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision
  Generalists
InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists
Yulu Gan
Sungwoo Park
Alexander Schubert
Anthony Philippakis
Ahmed Alaa
VLM
44
22
0
30 Sep 2023
Guiding Instruction-based Image Editing via Multimodal Large Language
  Models
Guiding Instruction-based Image Editing via Multimodal Large Language Models
Johannes Frey
Wenze Hu
Xianzhi Du
William Yang Wang
Yinfei Yang
Zhe Gan
45
89
0
29 Sep 2023
Leveraging Optimization for Adaptive Attacks on Image Watermarks
Leveraging Optimization for Adaptive Attacks on Image Watermarks
Nils Lukas
Abdulrahman Diaa
L. Fenaux
Florian Kerschbaum
AAML
WIGM
27
24
0
29 Sep 2023
RealFill: Reference-Driven Generation for Authentic Image Completion
RealFill: Reference-Driven Generation for Authentic Image Completion
Luming Tang
Nataniel Ruiz
Qinghao Chu
Yuanzhen Li
Aleksander Holynski
...
Bharath Hariharan
Yael Pritch
Neal Wadhwa
Kfir Aberman
Michael Rubinstein
DiffM
23
43
0
28 Sep 2023
KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image
  Action Editing
KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image Action Editing
Jiarui Yao
Yifan Liu
Simon S. Du
Shifeng Chen
DiffM
24
24
0
28 Sep 2023
Previous
123...212223...262728
Next