ResearchTrend.AI
InstructPix2Pix: Learning to Follow Image Editing Instructions

17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,350 papers shown
Multimodal Deep Learning for Scientific Imaging Interpretation
Abdulelah S. Alshehri
Franklin L. Lee
Shihu Wang
29
2
0
21 Sep 2023
PIE: Simulating Disease Progression via Progressive Image Editing
Kaizhao Liang
Xu Cao
Kuei-Da Liao
Tianren Gao
Wenqian Ye
Zhengyu Chen
Jianguo Cao
Tejas Nama
Jimeng Sun
MedIm
AI4CE
26
5
0
21 Sep 2023
Interactive Flexible Style Transfer for Vector Graphics
Jeremy Warner
Kyu Won Kim
Bjoern Hartmann
13
9
0
20 Sep 2023
Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates
Kashun Shum
Jaeyeon Kim
Binh-Son Hua
Duc Thanh Nguyen
Sai-Kit Yeung
3DH
AI4CE
21
7
0
20 Sep 2023
Forgedit: Text Guided Image Editing via Learning and Forgetting
Shiwen Zhang
Shuai Xiao
Weilin Huang
DiffM
31
18
0
19 Sep 2023
Diffusion Methods for Generating Transition Paths
Luke Triplett
Jianfeng Lu
25
5
0
19 Sep 2023
Progressive Text-to-Image Diffusion with Soft Latent Direction
Yuteng Ye
Jiale Cai
Hang Zhou
Guanwen Li
Youjia Zhang
Zikai Song
Chenxing Gao
Junqing Yu
Wei Yang
43
5
0
18 Sep 2023
PoseFix: Correcting 3D Human Poses with Natural Language
Ginger Delmas
Philippe Weinzaepfel
Francesc Moreno-Noguer
Grégory Rogez
27
22
0
15 Sep 2023
Limitations of Face Image Generation
Harrison Rosenberg
Shimaa Ahmed
Guruprasad V Ramesh
Ramya Korlakai Vinayak
Kassem Fawaz
36
1
0
13 Sep 2023
ITI-GEN: Inclusive Text-to-Image Generation
Cheng Zhang
Xuanbai Chen
Siqi Chai
Chen Henry Wu
Dmitry Lagun
Thabo Beeler
Fernando de la Torre
VLM
35
52
0
11 Sep 2023
Editing 3D Scenes via Text Prompts without Retraining
Shuangkang Fang
Yufeng Wang
Yezhou Yang
Yi-Hsuan Tsai
Wenrui Ding
Shuchang Zhou
Ming Yang
DiffM
24
2
0
10 Sep 2023
MoEController: Instruction-based Arbitrary Image Manipulation with Mixture-of-Expert Controllers
Sijia Li
Chen Chen
H. Lu
DiffM
58
9
0
08 Sep 2023
InstructDiffusion: A Generalist Modeling Interface for Vision Tasks
Zigang Geng
Binxin Yang
Tiankai Hang
Chen Li
Shuyang Gu
...
Jianmin Bao
Zheng-Wei Zhang
Han Hu
Dongdong Chen
Baining Guo
DiffM
VLM
53
93
0
07 Sep 2023
My Art My Choice: Adversarial Protection Against Unruly AI
Anthony Rhodes
Ram Bhagat
U. Ciftci
Ilke Demir
DiffM
45
4
0
06 Sep 2023
SLiMe: Segment Like Me
Aliasghar Khani
Saeid Asgari Taghanaki
Aditya Sanghi
Ali Mahdavi-Amiri
Ghassan Hamarneh
VLM
37
30
0
06 Sep 2023
Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
L. Yu
Bowen Shi
Ramakanth Pasunuru
Benjamin Muller
O. Yu. Golovneva
...
Yaniv Taigman
Maryam Fazel-Zarandi
Asli Celikyilmaz
Luke Zettlemoyer
Armen Aghajanyan
MLLM
38
135
0
05 Sep 2023
Hierarchical Masked 3D Diffusion Model for Video Outpainting
Fanda Fan
Chaoxu Guo
Litong Gong
Biao Wang
T. Ge
Yuning Jiang
Chunjie Luo
Jianfeng Zhan
DiffM
VGen
21
13
0
05 Sep 2023
Iterative Multi-granular Image Editing using Diffusion Models
K. J. Joseph
Prateksha Udhayanan
Tripti Shukla
Aishwarya Agarwal
Srikrishna Karanam
Koustava Goswami
Balaji Vasan Srinivasan
DiffM
33
16
0
01 Sep 2023
VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation
Xin Li
Wenqing Chu
Ye Wu
Weihang Yuan
Fanglong Liu
Qi Zhang
Fu Li
Haocheng Feng
Errui Ding
Jingdong Wang
VGen
45
52
0
01 Sep 2023
Affective Visual Dialog: A Large-Scale Benchmark for Emotional Reasoning Based on Visually Grounded Conversations
Kilichbek Haydarov
Xiaoqian Shen
Avinash Madasu
Mahmoud Salem
Jia Li
Gamaleldin F. Elsayed
Mohamed Elhoseiny
39
4
0
30 Aug 2023
CoVR: Learning Composed Video Retrieval from Web Video Captions
Lucas Ventura
Antoine Yang
Cordelia Schmid
Gül Varol
22
26
0
28 Aug 2023
Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization
Tao Yang
Rongyuan Wu
Peiran Ren
Xuansong Xie
Lei Zhang
DiffM
47
138
0
28 Aug 2023
ORES: Open-vocabulary Responsible Visual Synthesis
Minheng Ni
Chenfei Wu
Xiaodong Wang
Sheng-Siang Yin
Lijuan Wang
Zicheng Liu
Nan Duan
DiffM
31
8
0
26 Aug 2023
Antagonising explanation and revealing bias directly through sequencing and multimodal inference
Luís Arandas
Mick Grierson
Miguel Carvalhais
PINN
DiffM
21
2
0
25 Aug 2023
GridPull: Towards Scalability in Learning Implicit Representations from 3D Point Clouds
Chao Chen
Yu-Shen Liu
Zhizhong Han
3DPC
36
12
0
25 Aug 2023
A Survey of Diffusion Based Image Generation Models: Issues and Their Solutions
Tianyi Zhang
Zheng Wang
Jin Huang
M. M. Tasnim
Wei Shi
VLM
16
21
0
25 Aug 2023
Instruction Position Matters in Sequence Generation with Large Language Models
Yanjun Liu
Xianfeng Zeng
Fandong Meng
Jie Zhou
LRM
54
8
0
23 Aug 2023
IT3D: Improved Text-to-3D Generation with Explicit View Synthesis
Yiwen Chen
Chi Zhang
Xiaofeng Yang
Zhongang Cai
Gang Yu
Lei Yang
Guo-Shing Lin
DiffM
34
62
0
22 Aug 2023
Learning a More Continuous Zero Level Set in Unsigned Distance Fields through Level Set Projection
Junsheng Zhou
Baorui Ma
Shujuan Li
Yu-Shen Liu
Zhizhong Han
40
36
0
22 Aug 2023
Instruction Tuning for Large Language Models: A Survey
Shengyu Zhang
Linfeng Dong
Xiaoya Li
Sen Zhang
Xiaofei Sun
...
Jiwei Li
Runyi Hu
Tianwei Zhang
Fei Wu
Guoyin Wang
LM&MA
24
546
0
21 Aug 2023
FocalDreamer: Text-driven 3D Editing via Focal-fusion Assembly
Yuhan Li
Yishun Dou
Yue Shi
Yulai Lei
Xuanhong Chen
Yi Zhang
Peng Zhou
Bingbing Ni
3DGS
30
56
0
21 Aug 2023
ASPIRE: Language-Guided Data Augmentation for Improving Robustness Against Spurious Correlations
Sreyan Ghosh
Chandra Kiran Reddy Evuru
Sonal Kumar
Utkarsh Tyagi
Sakshi Singh
Sanjoy Chowdhury
Dinesh Manocha
OOD
30
1
0
19 Aug 2023
MeDM: Mediating Image Diffusion Models for Video-to-Video Translation with Temporal Correspondence Guidance
Ernie Chu
Tzu-Hua Huang
Shuohao Lin
Jun-Cheng Chen
DiffM
VGen
34
13
0
19 Aug 2023
User-centric AIGC products: Explainable Artificial Intelligence and AIGC products
Hanjie Yu
Yan Dong
Qiong Wu
18
4
0
19 Aug 2023
SimDA: Simple Diffusion Adapter for Efficient Video Generation
Zhen Xing
Qi Dai
Hang-Rui Hu
Zuxuan Wu
Yu-Gang Jiang
VGen
DiffM
35
81
0
18 Aug 2023
StableVideo: Text-driven Consistency-aware Diffusion Video Editing
Wenhao Chai
Xun Guo
Gaoang Wang
Yang Lu
VGen
DiffM
27
147
0
18 Aug 2023
O$^2$-Recon: Completing 3D Reconstruction of Occluded Objects in the Scene with a Pre-trained 2D Diffusion Model
Yubin Hu
Sheng Ye
Wang Zhao
Matthieu Lin
Yuze He
Yu-Hui Wen
Ying He
Yong-Jin Liu
DiffM
33
4
0
18 Aug 2023
Watch Your Steps: Local Image and Scene Editing by Text Instructions
Ashkan Mirzaei
Tristan Aumentado-Armstrong
Marcus A. Brubaker
J. Kelly
Alex Levinshtein
Konstantinos G. Derpanis
Igor Gilitschenski
DiffM
27
34
0
17 Aug 2023
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Ouyang Hao
Qiuyu Wang
Yuxi Xiao
Qingyan Bai
Juntao Zhang
Kecheng Zheng
Xiaowei Zhou
Qifeng Chen
Yujun Shen
DiffM
VGen
46
81
0
15 Aug 2023
Dancing Avatar: Pose and Text-Guided Human Motion Videos Synthesis with Image Diffusion Model
Bosheng Qin
Wentao Ye
Qifan Yu
Siliang Tang
Yueting Zhuang
DiffM
VGen
29
13
0
15 Aug 2023
A Survey on Model Compression for Large Language Models
Xunyu Zhu
Jian Li
Yong Liu
Can Ma
Weiping Wang
36
193
0
15 Aug 2023
SGDiff: A Style Guided Diffusion Model for Fashion Synthesis
Zheng Sun
Yanghong Zhou
Honghong He
P. Y. Mok
DiffM
37
26
0
15 Aug 2023
Masked-Attention Diffusion Guidance for Spatially Controlling Text-to-Image Generation
Yuki Endo
33
8
0
11 Aug 2023
PromptPaint: Steering Text-to-Image Generation Through Paint Medium-like Interactions
John Joon Young Chung
Eytan Adar
DiffM
33
57
0
09 Aug 2023
GEMRec: Towards Generative Model Recommendation
Yuanhe Guo
Haoming Liu
Hongyi Wen
DiffM
VLM
15
1
0
04 Aug 2023
ConceptLab: Creative Concept Generation using VLM-Guided Diffusion Prior Constraints
Elad Richardson
Kfir Goldberg
Yuval Alaluf
Daniel Cohen-Or
DiffM
31
10
0
03 Aug 2023
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation
Yasheng Sun
Yifan Yang
Houwen Peng
Yifei Shen
Yuqing Yang
Hang-Rui Hu
Lili Qiu
Hideki Koike
DiffM
LM&Ro
37
33
0
02 Aug 2023
Visual Instruction Inversion: Image Editing via Visual Prompting
Thao Nguyen
Yuheng Li
Utkarsh Ojha
Yong Jae Lee
DiffM
32
22
0
26 Jul 2023
Points-to-3D: Bridging the Gap between Sparse Points and Shape-Controllable Text-to-3D Generation
Chaohui Yu
Qiang-feng Zhou
Jingliang Li
Zhe Zhang
Zhibin Wang
Fan Wang
DiffM
30
38
0
26 Jul 2023
Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry
Yong-Hyun Park
Mingi Kwon
J. Choi
Junghyo Jo
Youngjung Uh
DiffM
40
60
0
24 Jul 2023