InstructPix2Pix: Learning to Follow Image Editing Instructions


17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,355 papers shown
ZONE: Zero-Shot Instruction-Guided Local Editing
Shanglin Li
Bo-Wen Zeng
Yutang Feng
Sicheng Gao
Xuhui Liu
...
Li Lin
Xu Tang
Yao Hu
Jianzhuang Liu
Baochang Zhang
DiffM
38
30
0
28 Dec 2023
Semantic Guidance Tuning for Text-To-Image Diffusion Models
Hyun Kang
Dohae Lee
Myungjin Shin
In-Kwon Lee
32
1
0
26 Dec 2023
LLM-Powered Hierarchical Language Agent for Real-time Human-AI Coordination
Jijia Liu
Chao Yu
Jiaxuan Gao
Yuqing Xie
Qingmin Liao
Yi Wu
Yu Wang
LLMAG
LM&Ro
92
35
0
23 Dec 2023
VIEScore: Towards Explainable Metrics for Conditional Image Synthesis Evaluation
Max W.F. Ku
Dongfu Jiang
Cong Wei
Xiang Yue
Wenhu Chen
34
50
0
22 Dec 2023
Plan, Posture and Go: Towards Open-World Text-to-Motion Generation
Jinpeng Liu
Wen-Dao Dai
Chunyu Wang
Yiji Cheng
Yansong Tang
Xin Tong
VGen
DiffM
77
17
0
22 Dec 2023
VideoPoet: A Large Language Model for Zero-Shot Video Generation
Dan Kondratyuk
Lijun Yu
Xiuye Gu
José Lezama
Jonathan Huang
...
Irfan Essa
Huisheng Wang
David A. Ross
Bryan Seybold
Lu Jiang
VGen
25
240
0
21 Dec 2023
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
Hayk Manukyan
Andranik Sargsyan
Barsegh Atanyan
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
DiffM
40
28
0
21 Dec 2023
Free-Editor: Zero-shot Text-driven 3D Scene Editing
Nazmul Karim
Hasan Iqbal
Umar Khalid
Jingyi Hua
Chong Chen
DiffM
51
8
0
21 Dec 2023
DreamDistribution: Learning Prompt Distribution for Diverse In-distribution Generation
Brian Nlong Zhao
Yuhang Xiao
Lyne Tchapmi
Xinyang Jiang
Yifan Yang
Dongsheng Li
Laurent Itti
Vibhav Vineet
Yunhao Ge
VLM
115
7
0
21 Dec 2023
Generative Multimodal Models are In-Context Learners
Quan-Sen Sun
Yufeng Cui
Xiaosong Zhang
Fan Zhang
Qiying Yu
...
Yueze Wang
Yongming Rao
Jingjing Liu
Tiejun Huang
Xinlong Wang
MLLM
LRM
45
248
0
20 Dec 2023
Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis
Bichen Wu
Ching-Yao Chuang
Xiaoyan Wang
Yichen Jia
K. Krishnakumar
Tong Xiao
Feng Liang
Licheng Yu
Peter Vajda
DiffM
VGen
20
22
0
20 Dec 2023
Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models
Angela Castillo
Jonas Kohler
Juan C. Pérez
Juan Pablo Pérez
Albert Pumarola
Guohao Li
Pablo Arbelaez
Ali K. Thabet
37
12
0
19 Dec 2023
Optimizing Diffusion Noise Can Serve As Universal Motion Priors
Korrawe Karunratanakul
Konpat Preechakul
Emre Aksan
Thabo Beeler
Supasorn Suwajanakorn
Siyu Tang
DiffM
38
37
0
19 Dec 2023
CreativeConnect: Supporting Reference Recombination for Graphic Design Ideation with Generative AI
DaEun Choi
Sumin Hong
Jeongeon Park
John Joon Young Chung
Juho Kim
25
52
0
19 Dec 2023
MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers
Haoyu Ma
Shahin Mahdizadehaghdam
Bichen Wu
Zhipeng Fan
Yuchao Gu
Wenliang Zhao
Lior Shapira
Xiaohui Xie
DiffM
VGen
32
4
0
19 Dec 2023
Scene-Conditional 3D Object Stylization and Composition
Jinghao Zhou
Tomas Jakab
Philip Torr
Christian Rupprecht
DiffM
75
2
0
19 Dec 2023
MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance
Qi Mao
Lan Chen
Yuchao Gu
Zhen Fang
Mike Zheng Shou
DiffM
38
9
0
18 Dec 2023
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing
Zeyinzi Jiang
Chaojie Mao
Yulin Pan
Zhen Han
Jingfeng Zhang
32
28
0
18 Dec 2023
SPIRE: Semantic Prompt-Driven Image Restoration
Chenyang Qi
Zhengzhong Tu
Keren Ye
M. Delbracio
P. Milanfar
Qifeng Chen
Hossein Talebi
DiffM
38
11
0
18 Dec 2023
Your Student is Better Than Expected: Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models
Nikita Starodubcev
Artem Fedorov
Artem Babenko
Dmitry Baranchuk
DiffM
53
3
0
17 Dec 2023
CogCartoon: Towards Practical Story Visualization
Zhongyang Zhu
Jie Tang
DiffM
37
3
0
17 Dec 2023
Iterative Motion Editing with Natural Language
Purvi Goel
Kuan-Chieh Wang
Chenxi Liu
Kayvon Fatahalian
DiffM
43
22
0
15 Dec 2023
Rich Human Feedback for Text-to-Image Generation
Youwei Liang
Junfeng He
Gang Li
Peizhao Li
Arseniy Klimovskiy
...
Yiwen Luo
Yang Li
Kai Kohlhoff
Deepak Ramachandran
Vidhya Navalpakkam
EGVM
29
68
0
15 Dec 2023
Latent Diffusion Models with Image-Derived Annotations for Enhanced AI-Assisted Cancer Diagnosis in Histopathology
Pedro Osório
Guillermo Jiménez-Pérez
Javier Montalt-Tordera
Jens Hooge
Guillem Duran Ballester
...
Sabrina Schroeder
K. Siudak
Julia Vienenkoetter
Bettina Lawrenz
Sadegh Mohammadi
MedIm
30
8
0
15 Dec 2023
Collaborating Foundation Models for Domain Generalized Semantic Segmentation
Yasser Benigmim
Subhankar Roy
S. Essid
Vicky Kalogeiton
Stéphane Lathuilière
30
12
0
15 Dec 2023
Focus on Your Instruction: Fine-grained and Multi-instruction Image Editing by Attention Modulation
Qin Guo
Tianwei Lin
DiffM
27
31
0
15 Dec 2023
Tell Me What You See: Text-Guided Real-World Image Denoising
E. Yosef
Raja Giryes
DiffM
61
2
0
15 Dec 2023
InstructPipe: Generating Visual Blocks Pipelines with Human Instructions and LLMs
Zhongyi Zhou
Jing Jin
Vrushank Phadnis
Xiuxiu Yuan
Jun Jiang
...
A. Olwal
David Kim
Ram Iyengar
Na Li
Andrea Colaço
35
0
0
15 Dec 2023
LatentEditor: Text Driven Local Editing of 3D Scenes
Umar Khalid
Hasan Iqbal
Nazmul Karim
Jingyi Hua
Chong Chen
DiffM
3DGS
29
13
0
14 Dec 2023
LIME: Localized Image Editing via Attention Regularization in Diffusion Models
Enis Simsar
A. Tonioni
Yongqin Xian
Thomas Hofmann
Federico Tombari
DiffM
44
8
0
14 Dec 2023
VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation
Jinguo Zhu
Xiaohan Ding
Yixiao Ge
Yuying Ge
Sijie Zhao
Hengshuang Zhao
Xiaohua Wang
Ying Shan
ViT
VLM
24
33
0
14 Dec 2023
ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining
Ruoxi Shi
Xinyue Wei
Cheng Wang
Hao Su
43
16
0
14 Dec 2023
SHAP-EDITOR: Instruction-guided Latent 3D Editing in Seconds
Minghao Chen
Junyu Xie
Iro Laina
Andrea Vedaldi
KELM
55
9
0
14 Dec 2023
Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models
Liangchen Song
Liangliang Cao
Jiatao Gu
Yifan Jiang
Junsong Yuan
Hao Tang
DiffM
31
13
0
13 Dec 2023
EVP: Enhanced Visual Perception using Inverse Multi-Attentive Feature Refinement and Regularized Image-Text Alignment
M. Lavrenyuk
Shariq Farooq Bhat
Matthias Müller
Peter Wonka
ObjD
MDE
36
9
0
13 Dec 2023
ToViLaG: Your Visual-Language Generative Model is Also An Evildoer
Xinpeng Wang
Xiaoyuan Yi
Han Jiang
Shanlin Zhou
Zhihua Wei
Xing Xie
40
13
0
13 Dec 2023
DiffuseRAW: End-to-End Generative RAW Image Processing for Low-Light Images
Rishit Dagli
32
2
0
13 Dec 2023
The Lottery Ticket Hypothesis in Denoising: Towards Semantic-Driven Initialization
Jiafeng Mao
Xueting Wang
Kiyoharu Aizawa
DiffM
71
3
0
13 Dec 2023
FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition
Sicheng Mo
Fangzhou Mu
Kuan Heng Lin
Yanli Liu
Bochen Guan
Yin Li
Bolei Zhou
DiffM
53
60
0
12 Dec 2023
GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos
Tomáš Souček
Dima Damen
Michael Wray
Ivan Laptev
Josef Sivic
VGen
21
19
0
12 Dec 2023
MaTe3D: Mask-guided Text-based 3D-aware Portrait Editing
Kangneng Zhou
Daiheng Gao
Xuan Wang
Jie Zhang
Peng Zhang
...
Shiqi Yang
Bang Zhang
Liefeng Bo
Yaxing Wang
Ming-Ming Cheng
DiffM
41
4
0
12 Dec 2023
Relightful Harmonization: Lighting-aware Portrait Background Replacement
Mengwei Ren
Wei Xiong
Jae Shin Yoon
Zhixin Shu
Jianming Zhang
HyunJoon Jung
Guido Gerig
He Zhang
DiffM
43
18
0
11 Dec 2023
CAD: Photorealistic 3D Generation via Adversarial Distillation
Bo Liu
Despoina Paschalidou
Ian Huang
Hongyu Liu
Bokui Shen
Xiaoyu Xiang
Jing Liao
Leonidas J. Guibas
DiffM
78
11
0
11 Dec 2023
UpFusion: Novel View Diffusion from Unposed Sparse View Observations
Bharath Raj Nagoor Kani
Hsin-Ying Lee
Sergey Tulyakov
Shubham Tulsiani
43
5
0
11 Dec 2023
SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models
Yuzhou Huang
Liangbin Xie
Xintao Wang
Ziyang Yuan
Xiaodong Cun
...
Jiantao Zhou
Chao Dong
Rui Huang
Ruimao Zhang
Ying Shan
DiffM
42
60
0
11 Dec 2023
InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following
Shufan Li
Harkanwar Singh
Aditya Grover
DiffM
30
7
0
11 Dec 2023
Stellar: Systematic Evaluation of Human-Centric Personalized Text-to-Image Methods
Panos Achlioptas
Alexandros Benetatos
Iordanis Fostiropoulos
Dimitris Skourtis
34
8
0
11 Dec 2023
Learning Naturally Aggregated Appearance for Efficient 3D Editing
Ka Leong Cheng
Qiuyu Wang
Zifan Shi
Kecheng Zheng
Yinghao Xu
Ouyang Hao
Qifeng Chen
Yujun Shen
3DH
63
4
0
11 Dec 2023
Neutral Editing Framework for Diffusion-based Video Editing
Sunjae Yoon
Gwanhyeong Koo
Jiajing Hong
Changdong Yoo
VGen
DiffM
27
1
0
10 Dec 2023
A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing
Maomao Li
Yu Li
Tianyu Yang
Yunfei Liu
Dongxu Yue
Zhihui Lin
Dong Xu
VGen
34
8
0
10 Dec 2023