ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.09800
  4. Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions
v1v2 (latest)

InstructPix2Pix: Learning to Follow Image Editing Instructions

17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM
ArXiv (abs)PDFHTML

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,418 papers shown
Title
Antagonising explanation and revealing bias directly through sequencing
  and multimodal inference
Antagonising explanation and revealing bias directly through sequencing and multimodal inference
Luís Arandas
Mick Grierson
Miguel Carvalhais
PINNDiffM
46
3
0
25 Aug 2023
GridPull: Towards Scalability in Learning Implicit Representations from
  3D Point Clouds
GridPull: Towards Scalability in Learning Implicit Representations from 3D Point Clouds
Chao Chen
Yu-Shen Liu
Zhizhong Han
3DPC
75
12
0
25 Aug 2023
A Survey of Diffusion Based Image Generation Models: Issues and Their
  Solutions
A Survey of Diffusion Based Image Generation Models: Issues and Their Solutions
Tianyi Zhang
Zheng Wang
Jin Huang
M. M. Tasnim
Wei Shi
VLM
83
22
0
25 Aug 2023
Instruction Position Matters in Sequence Generation with Large Language
  Models
Instruction Position Matters in Sequence Generation with Large Language Models
Yanjun Liu
Xianfeng Zeng
Fandong Meng
Jie Zhou
LRM
107
9
0
23 Aug 2023
IT3D: Improved Text-to-3D Generation with Explicit View Synthesis
IT3D: Improved Text-to-3D Generation with Explicit View Synthesis
Yiwen Chen
Chi Zhang
Xiaofeng Yang
Zhongang Cai
Gang Yu
Lei Yang
Guo-Shing Lin
DiffM
98
64
0
22 Aug 2023
Learning a More Continuous Zero Level Set in Unsigned Distance Fields
  through Level Set Projection
Learning a More Continuous Zero Level Set in Unsigned Distance Fields through Level Set Projection
Junsheng Zhou
Baorui Ma
Shujuan Li
Yu-Shen Liu
Zhizhong Han
111
38
0
22 Aug 2023
Instruction Tuning for Large Language Models: A Survey
Instruction Tuning for Large Language Models: A Survey
Shengyu Zhang
Linfeng Dong
Xiaoya Li
Sen Zhang
Xiaofei Sun
...
Jiwei Li
Runyi Hu
Tianwei Zhang
Leilei Gan
Guoyin Wang
LM&MA
110
610
0
21 Aug 2023
FocalDreamer: Text-driven 3D Editing via Focal-fusion Assembly
FocalDreamer: Text-driven 3D Editing via Focal-fusion Assembly
Yuhan Li
Yishun Dou
Yue Shi
Yulai Lei
Xuanhong Chen
Yi Zhang
Peng Zhou
Bingbing Ni
3DGS
102
58
0
21 Aug 2023
ASPIRE: Language-Guided Data Augmentation for Improving Robustness
  Against Spurious Correlations
ASPIRE: Language-Guided Data Augmentation for Improving Robustness Against Spurious Correlations
Sreyan Ghosh
Chandra Kiran Reddy Evuru
Sonal Kumar
Utkarsh Tyagi
Sakshi Singh
Sanjoy Chowdhury
Dinesh Manocha
OOD
61
1
0
19 Aug 2023
MeDM: Mediating Image Diffusion Models for Video-to-Video Translation
  with Temporal Correspondence Guidance
MeDM: Mediating Image Diffusion Models for Video-to-Video Translation with Temporal Correspondence Guidance
Ernie Chu
Tzu-Hua Huang
Shuohao Lin
Jun-Cheng Chen
DiffMVGen
77
16
0
19 Aug 2023
User-centric AIGC products: Explainable Artificial Intelligence and AIGC
  products
User-centric AIGC products: Explainable Artificial Intelligence and AIGC products
Hanjie Yu
Yan Dong
Qiong Wu
30
5
0
19 Aug 2023
SimDA: Simple Diffusion Adapter for Efficient Video Generation
SimDA: Simple Diffusion Adapter for Efficient Video Generation
Zhen Xing
Qi Dai
Hang-Rui Hu
Zuxuan Wu
Yu-Gang Jiang
VGenDiffM
103
84
0
18 Aug 2023
StableVideo: Text-driven Consistency-aware Diffusion Video Editing
StableVideo: Text-driven Consistency-aware Diffusion Video Editing
Wenhao Chai
Xun Guo
Gaoang Wang
Yang Lu
VGenDiffM
91
157
0
18 Aug 2023
O$^2$-Recon: Completing 3D Reconstruction of Occluded Objects in the
  Scene with a Pre-trained 2D Diffusion Model
O2^22-Recon: Completing 3D Reconstruction of Occluded Objects in the Scene with a Pre-trained 2D Diffusion Model
Yubin Hu
Sheng Ye
Wang Zhao
Matthieu Lin
Yuze He
Yu-Hui Wen
Ying He
Yang Liu
DiffM
85
4
0
18 Aug 2023
Watch Your Steps: Local Image and Scene Editing by Text Instructions
Watch Your Steps: Local Image and Scene Editing by Text Instructions
Ashkan Mirzaei
Tristan Aumentado-Armstrong
Marcus A. Brubaker
J. Kelly
Alex Levinshtein
Konstantinos G. Derpanis
Igor Gilitschenski
DiffM
72
36
0
17 Aug 2023
CoDeF: Content Deformation Fields for Temporally Consistent Video
  Processing
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Ouyang Hao
Qiuyu Wang
Yuxi Xiao
Qingyan Bai
Juntao Zhang
Kecheng Zheng
Xiaowei Zhou
Qifeng Chen
Yujun Shen
DiffMVGen
123
85
0
15 Aug 2023
Dancing Avatar: Pose and Text-Guided Human Motion Videos Synthesis with
  Image Diffusion Model
Dancing Avatar: Pose and Text-Guided Human Motion Videos Synthesis with Image Diffusion Model
Bosheng Qin
Wentao Ye
Qifan Yu
Siliang Tang
Yueting Zhuang
DiffMVGen
52
13
0
15 Aug 2023
A Survey on Model Compression for Large Language Models
A Survey on Model Compression for Large Language Models
Xunyu Zhu
Jian Li
Yong Liu
Can Ma
Weiping Wang
139
233
0
15 Aug 2023
SGDiff: A Style Guided Diffusion Model for Fashion Synthesis
SGDiff: A Style Guided Diffusion Model for Fashion Synthesis
Zheng Sun
Yanghong Zhou
Honghong He
P. Y. Mok
DiffM
89
30
0
15 Aug 2023
Masked-Attention Diffusion Guidance for Spatially Controlling
  Text-to-Image Generation
Masked-Attention Diffusion Guidance for Spatially Controlling Text-to-Image Generation
Yuki Endo
70
8
0
11 Aug 2023
PromptPaint: Steering Text-to-Image Generation Through Paint Medium-like
  Interactions
PromptPaint: Steering Text-to-Image Generation Through Paint Medium-like Interactions
John Joon Young Chung
Eytan Adar
DiffM
74
66
0
09 Aug 2023
GEMRec: Towards Generative Model Recommendation
GEMRec: Towards Generative Model Recommendation
Yuanhe Guo
Haoming Liu
Hongyi Wen
DiffMVLM
25
2
0
04 Aug 2023
ConceptLab: Creative Concept Generation using VLM-Guided Diffusion Prior
  Constraints
ConceptLab: Creative Concept Generation using VLM-Guided Diffusion Prior Constraints
Elad Richardson
Kfir Goldberg
Yuval Alaluf
Daniel Cohen-Or
DiffM
95
12
0
03 Aug 2023
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based
  Image Manipulation
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation
Yasheng Sun
Yifan Yang
Houwen Peng
Yifei Shen
Yuqing Yang
Hang-Rui Hu
Lili Qiu
Hideki Koike
DiffMLM&Ro
87
39
0
02 Aug 2023
Visual Instruction Inversion: Image Editing via Visual Prompting
Visual Instruction Inversion: Image Editing via Visual Prompting
Thao Nguyen
Yuheng Li
Utkarsh Ojha
Yong Jae Lee
DiffM
41
24
0
26 Jul 2023
Points-to-3D: Bridging the Gap between Sparse Points and
  Shape-Controllable Text-to-3D Generation
Points-to-3D: Bridging the Gap between Sparse Points and Shape-Controllable Text-to-3D Generation
Chaohui Yu
Qiang-feng Zhou
Jingliang Li
Zhe Zhang
Zhibin Wang
Fan Wang
DiffM
89
41
0
26 Jul 2023
Understanding the Latent Space of Diffusion Models through the Lens of
  Riemannian Geometry
Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry
Yong-Hyun Park
Mingi Kwon
J. Choi
Junghyo Jo
Youngjung Uh
DiffM
126
72
0
24 Jul 2023
Interpolating between Images with Diffusion Models
Interpolating between Images with Diffusion Models
Clinton Jia Wang
Polina Golland
DiffM
86
20
0
24 Jul 2023
Subject-Diffusion:Open Domain Personalized Text-to-Image Generation
  without Test-time Fine-tuning
Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning
Jiancang Ma
Junhao Liang
Chen Chen
H. Lu
108
150
0
21 Jul 2023
Text2Layer: Layered Image Generation using Latent Diffusion Model
Text2Layer: Layered Image Generation using Latent Diffusion Model
Xinyang Zhang
Wentian Zhao
Xin Lu
J. Chien
DiffM
63
12
0
19 Jul 2023
Let's ViCE! Mimicking Human Cognitive Behavior in Image Generation
  Evaluation
Let's ViCE! Mimicking Human Cognitive Behavior in Image Generation Evaluation
Federico Betti
Jacopo Staiano
Lorenzo Baraldi
Lorenzo Baraldi
Rita Cucchiara
N. Sebe
EGVM
50
7
0
18 Jul 2023
Can Pre-Trained Text-to-Image Models Generate Visual Goals for
  Reinforcement Learning?
Can Pre-Trained Text-to-Image Models Generate Visual Goals for Reinforcement Learning?
Jialu Gao
Kaizhe Hu
Guowei Xu
Huazhe Xu
LM&Ro
89
17
0
15 Jul 2023
Domain-Agnostic Tuning-Encoder for Fast Personalization of Text-To-Image
  Models
Domain-Agnostic Tuning-Encoder for Fast Personalization of Text-To-Image Models
Moab Arar
Rinon Gal
Yuval Atzmon
Gal Chechik
Daniel Cohen-Or
Ariel Shamir
Amit H. Bermano
DiffM
92
76
0
13 Jul 2023
FreeDrag: Feature Dragging for Reliable Point-based Image Editing
FreeDrag: Feature Dragging for Reliable Point-based Image Editing
Pengyang Ling
Lin Chen
Pan Zhang
H. Chen
Yi Jin
Jinjin Zheng
DiffM
108
16
0
10 Jul 2023
Text-Guided Synthesis of Eulerian Cinemagraphs
Text-Guided Synthesis of Eulerian Cinemagraphs
Aniruddha Mahapatra
Aliaksandr Siarohin
Hsin-Ying Lee
Sergey Tulyakov
Sitong Su
DiffMVGen
94
21
0
06 Jul 2023
Human Inspired Progressive Alignment and Comparative Learning for
  Grounded Word Acquisition
Human Inspired Progressive Alignment and Comparative Learning for Grounded Word Acquisition
Yuwei Bao
B. Lattimer
J. Chai
CLL
84
1
0
05 Jul 2023
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models
Chong Mou
Xintao Wang
Jie Song
Ying Shan
Jian Zhang
DiffM
125
154
0
05 Jul 2023
Collaborative Score Distillation for Consistent Visual Synthesis
Collaborative Score Distillation for Consistent Visual Synthesis
Subin Kim
Kyungmin Lee
June Suk Choi
Jongheon Jeong
Kihyuk Sohn
Jinwoo Shin
DiffM
62
21
0
04 Jul 2023
LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance
LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance
Linoy Tsaban
Apolinário Passos
DiffM
78
42
0
02 Jul 2023
DreamIdentity: Improved Editability for Efficient Face-identity
  Preserved Image Generation
DreamIdentity: Improved Editability for Efficient Face-identity Preserved Image Generation
Zhuowei Chen
Shancheng Fang
Wei Liu
Qian He
Mengqi Huang
Yongdong Zhang
Zhendong Mao
DiffM
125
24
0
01 Jul 2023
DisCo: Disentangled Control for Realistic Human Dance Generation
DisCo: Disentangled Control for Realistic Human Dance Generation
Tan Wang
Linjie Li
Kevin Qinghong Lin
Yuanhao Zhai
Chung-Ching Lin
Zhengyuan Yang
Hanwang Zhang
Zicheng Liu
Lijuan Wang
VGen
141
89
0
30 Jun 2023
High Fidelity Image Counterfactuals with Probabilistic Causal Models
High Fidelity Image Counterfactuals with Probabilistic Causal Models
Fabio De Sousa Ribeiro
Tian Xia
M. Monteiro
Nick Pawlowski
Ben Glocker
DiffM
86
40
0
27 Jun 2023
Freestyle 3D-Aware Portrait Synthesis Based on Compositional Generative
  Priors
Freestyle 3D-Aware Portrait Synthesis Based on Compositional Generative Priors
Tianxiang Ma
Kang Zhao
Jianxin Sun
Yingya Zhang
Jing Dong
71
1
0
27 Jun 2023
A-STAR: Test-time Attention Segregation and Retention for Text-to-image
  Synthesis
A-STAR: Test-time Attention Segregation and Retention for Text-to-image Synthesis
Aishwarya Agarwal
Srikrishna Karanam
K. J. Joseph
Apoorv Saxena
Koustava Goswami
Balaji Vasan Srinivasan
VLMDiffM
39
51
0
26 Jun 2023
DragDiffusion: Harnessing Diffusion Models for Interactive Point-based
  Image Editing
DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing
Yujun Shi
Chuhui Xue
Jun Hao Liew
Jiachun Pan
Hanshu Yan
Wenqing Zhang
Vincent Y. F. Tan
Song Bai
149
220
0
26 Jun 2023
Text-Anchored Score Composition: Tackling Condition Misalignment in
  Text-to-Image Diffusion Models
Text-Anchored Score Composition: Tackling Condition Misalignment in Text-to-Image Diffusion Models
Luozhou Wang
Guibao Shen
Wenhang Ge
Guangyong Chen
Yijun Li
Yingke Chen
DiffM
78
4
0
26 Jun 2023
DreamEditor: Text-Driven 3D Scene Editing with Neural Fields
DreamEditor: Text-Driven 3D Scene Editing with Neural Fields
Jingyu Zhuang
Chen Wang
Lingjie Liu
Liang Lin
Guanbin Li
DiffM
78
130
0
23 Jun 2023
Blended-NeRF: Zero-Shot Object Generation and Blending in Existing
  Neural Radiance Fields
Blended-NeRF: Zero-Shot Object Generation and Blending in Existing Neural Radiance Fields
Ori Gordon
Omri Avrahami
Dani Lischinski
DiffM
80
22
0
22 Jun 2023
TauPETGen: Text-Conditional Tau PET Image Synthesis Based on Latent
  Diffusion Models
TauPETGen: Text-Conditional Tau PET Image Synthesis Based on Latent Diffusion Models
Se-In Jang
C. Lois
Emma G. Thibault
J. Becker
Yafei Dong
M. Normandin
Julie C Price
Keith A. Johnson
Xiaofeng Liu
Kuang Gong
DiffMMedIm
29
10
0
21 Jun 2023
Instruct-NeuralTalker: Editing Audio-Driven Talking Radiance Fields with
  Instructions
Instruct-NeuralTalker: Editing Audio-Driven Talking Radiance Fields with Instructions
Yuqi Sun
Reian He
Weimin Tan
Bo Yan
DiffM
63
2
0
19 Jun 2023
Previous
123...242526272829
Next