ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.09800
  4. Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions
v1v2 (latest)

InstructPix2Pix: Learning to Follow Image Editing Instructions

17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM
ArXiv (abs)PDFHTML

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,418 papers shown
Title
Recent Advances in 3D Gaussian Splatting
Recent Advances in 3D Gaussian Splatting
Tong Wu
Yu-Jie Yuan
Ling-Xiao Zhang
Jie Yang
Yan-Pei Cao
Ling-Qi Yan
Lin Gao
3DGS
157
106
0
17 Mar 2024
Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural
  Radiance Fields
Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields
Yonggan Fu
Huaizhi Qu
Zhifan Ye
Chaojian Li
Kevin Zhao
Yingyan Lin
AI4CE
111
0
0
17 Mar 2024
Source Prompt Disentangled Inversion for Boosting Image Editability with
  Diffusion Models
Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models
Rui Li
Ruihuang Li
Song Guo
Lei Zhang
DiffM
82
10
0
17 Mar 2024
StainDiffuser: MultiTask Dual Diffusion Model for Virtual Staining
StainDiffuser: MultiTask Dual Diffusion Model for Virtual Staining
Tushar Kataria
Beatrice Knudsen
Shireen Y. Elhabian
DiffMMedIm
105
10
0
17 Mar 2024
Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation
Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation
Yeongtak Oh
Jonghyun Lee
Jooyoung Choi
Dahuin Jung
Uiwon Hwang
Sungroh Yoon
TTADiffM
82
5
0
16 Mar 2024
MicroDiffusion: Implicit Representation-Guided Diffusion for 3D
  Reconstruction from Limited 2D Microscopy Projections
MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections
Mude Hui
Zihao Wei
Hongru Zhu
Fei Xia
Yuyin Zhou
MedIm
71
8
0
16 Mar 2024
Strong and Controllable Blind Image Decomposition
Strong and Controllable Blind Image Decomposition
Zeyu Zhang
Junlin Han
Chenhui Gou
Hongdong Li
Liang Zheng
82
2
0
15 Mar 2024
E4C: Enhance Editability for Text-Based Image Editing by Harnessing
  Efficient CLIP Guidance
E4C: Enhance Editability for Text-Based Image Editing by Harnessing Efficient CLIP Guidance
Tianrui Huang
Pu Cao
Lu Yang
Chun Liu
Mengjie Hu
Zhiwei Liu
Qing-Huang Song
DiffM
79
0
0
15 Mar 2024
ST-LDM: A Universal Framework for Text-Grounded Object Generation in
  Real Images
ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images
Xiangtian Xue
Jiasong Wu
Youyong Kong
L. Senhadji
Huazhong Shu
DiffM
71
1
0
15 Mar 2024
Eta Inversion: Designing an Optimal Eta Function for Diffusion-based
  Real Image Editing
Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing
Wonjun Kang
Kevin Galim
Hyung Il Koo
DiffM
75
5
0
14 Mar 2024
Video Editing via Factorized Diffusion Distillation
Video Editing via Factorized Diffusion Distillation
Uriel Singer
Amit Zohar
Yuval Kirstain
Shelly Sheynin
Adam Polyak
Devi Parikh
Yaniv Taigman
DiffMVGen
91
15
0
14 Mar 2024
Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse
  Mixture-of-Experts
Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts
Byeongjun Park
Hyojun Go
Jin-Young Kim
Sangmin Woo
Seokil Ham
Changick Kim
DiffMMoE
106
13
0
14 Mar 2024
Rethinking Referring Object Removal
Rethinking Referring Object Removal
Xiangtian Xue
Jiasong Wu
Youyong Kong
L. Senhadji
Huazhong Shu
DiffM
79
0
0
14 Mar 2024
Explore In-Context Segmentation via Latent Diffusion Models
Explore In-Context Segmentation via Latent Diffusion Models
Chaoyang Wang
Xiangtai Li
Henghui Ding
Lu Qi
Jiangning Zhang
Yunhai Tong
Chen Change Loy
Shuicheng Yan
DiffM
158
7
0
14 Mar 2024
Unveiling the Truth: Exploring Human Gaze Patterns in Fake Images
Unveiling the Truth: Exploring Human Gaze Patterns in Fake Images
Giuseppe Cartella
Vittorio Cuculo
Marcella Cornia
Rita Cucchiara
DiffM
128
5
0
13 Mar 2024
GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting
  Editing
GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing
Jing Wu
Jiawang Bian
Xinghui Li
Guangrun Wang
Ian D Reid
Philip Torr
V. Prisacariu
3DGS
98
42
0
13 Mar 2024
Make Me Happier: Evoking Emotions Through Image Diffusion Models
Make Me Happier: Evoking Emotions Through Image Diffusion Models
Qing Lin
Jingfeng Zhang
Yew-Soon Ong
Mengmi Zhang
62
3
0
13 Mar 2024
Bridging Different Language Models and Generative Vision Models for
  Text-to-Image Generation
Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation
Shihao Zhao
Shaozhe Hao
Bojia Zi
Huaizhe Xu
Kwan-Yee K. Wong
DiffMVLM
108
9
0
12 Mar 2024
V3D: Video Diffusion Models are Effective 3D Generators
V3D: Video Diffusion Models are Effective 3D Generators
Zilong Chen
Yikai Wang
Feng Wang
Zhengyi Wang
Huaping Liu
VGen
117
68
0
11 Mar 2024
GSEdit: Efficient Text-Guided Editing of 3D Objects via Gaussian
  Splatting
GSEdit: Efficient Text-Guided Editing of 3D Objects via Gaussian Splatting
Francesco Palandra
Andrea Sanchietti
Daniele Baieri
Emanuele Rodolà
3DGS
79
22
0
08 Mar 2024
XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution
XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution
Yunpeng Qu
Kun Yuan
Kai Zhao
Qizhi Xie
Jinhua Hao
Ming Sun
Chao Zhou
88
19
0
08 Mar 2024
InstructGIE: Towards Generalizable Image Editing
InstructGIE: Towards Generalizable Image Editing
Zichong Meng
Changdi Yang
Jun Liu
Hao Tang
Pu Zhao
Yanzhi Wang
DiffM
99
9
0
08 Mar 2024
Pix2Gif: Motion-Guided Diffusion for GIF Generation
Pix2Gif: Motion-Guided Diffusion for GIF Generation
Hitesh Kandala
Jianfeng Gao
Jianwei Yang
VGenDiffM
85
3
0
07 Mar 2024
StableDrag: Stable Dragging for Point-based Image Editing
StableDrag: Stable Dragging for Point-based Image Editing
Yutao Cui
Xiaotong Zhao
Guozhen Zhang
Shengming Cao
Kai Ma
Limin Wang
89
14
0
07 Mar 2024
NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on
  Noise Cropping and Merging
NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging
Takahiro Shirakawa
Seiichi Uchida
DiffM
62
19
0
06 Mar 2024
Towards Understanding Cross and Self-Attention in Stable Diffusion for
  Text-Guided Image Editing
Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing
Bingyan Liu
Chengyu Wang
Tingfeng Cao
Kui Jia
Jun Huang
DiffM
82
63
0
06 Mar 2024
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
...
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
DiffM
321
1,410
0
05 Mar 2024
Doubly Abductive Counterfactual Inference for Text-based Image Editing
Doubly Abductive Counterfactual Inference for Text-based Image Editing
Xue Song
Jiequan Cui
Hanwang Zhang
Jingjing Chen
Richang Hong
Yu-Gang Jiang
DiffM
61
13
0
05 Mar 2024
RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
Priya Sundaresan
Q. Vuong
Jiayuan Gu
Peng Xu
Ted Xiao
...
Ajinkya Jain
Karol Hausman
Dorsa Sadigh
Jeannette Bohg
S. Schaal
VGen
99
26
0
05 Mar 2024
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable
  Virtual Try-on
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Yuhao Xu
Tao Gu
Weifeng Chen
Chengcai Chen
DiffM
91
66
0
04 Mar 2024
Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks
Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks
Yuhao Liu
Zhanghan Ke
Fang Liu
Nanxuan Zhao
Rynson W. H. Lau
DiffM
114
23
0
01 Mar 2024
LoMOE: Localized Multi-Object Editing via Multi-Diffusion
LoMOE: Localized Multi-Object Editing via Multi-Diffusion
Goirik Chakrabarty
Aditya Chandrasekar
Ramya Hebbalaguppe
AP Prathosh
DiffM
80
6
0
01 Mar 2024
Ask Your Distribution Shift if Pre-Training is Right for You
Ask Your Distribution Shift if Pre-Training is Right for You
Benjamin Cohen-Wang
Joshua Vendrow
Aleksander Madry
OOD
94
3
0
29 Feb 2024
Large Language Models and Games: A Survey and Roadmap
Large Language Models and Games: A Survey and Roadmap
Roberto Gallotta
Graham Todd
Marvin Zammit
Sam Earle
Antonios Liapis
Julian Togelius
Georgios N. Yannakakis
LLMAGLM&MAAI4CELRM
131
86
0
28 Feb 2024
From Summary to Action: Enhancing Large Language Models for Complex
  Tasks with Open World APIs
From Summary to Action: Enhancing Large Language Models for Complex Tasks with Open World APIs
Yulong Liu
Yunlong Yuan
Chunwei Wang
Jianhua Han
Yongqiang Ma
Li Zhang
Nanning Zheng
Hang Xu
LLMAG
58
5
0
28 Feb 2024
Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis
Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis
Yanzuo Lu
Manlin Zhang
Andy J. Ma
Xiaohua Xie
Jian-Huang Lai
DiffM
68
24
0
28 Feb 2024
CustomSketching: Sketch Concept Extraction for Sketch-based Image
  Synthesis and Editing
CustomSketching: Sketch Concept Extraction for Sketch-based Image Synthesis and Editing
Chufeng Xiao
Hongbo Fu
DiffM
83
3
0
27 Feb 2024
Diffusion Model-Based Image Editing: A Survey
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
Liangliang Cao
EGVM
263
103
0
27 Feb 2024
Placing Objects in Context via Inpainting for Out-of-distribution
  Segmentation
Placing Objects in Context via Inpainting for Out-of-distribution Segmentation
Pau de Jorge
Riccardo Volpi
P. Dokania
Philip Torr
Grégory Rogez
DiffM
116
5
0
26 Feb 2024
Intelligent Director: An Automatic Framework for Dynamic Visual
  Composition using ChatGPT
Intelligent Director: An Automatic Framework for Dynamic Visual Composition using ChatGPT
Sixiao Zheng
Jingyang Huo
Yu Wang
Yanwei Fu
VGenDiffM
69
1
0
24 Feb 2024
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video
  Synthesis
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Willi Menapace
Aliaksandr Siarohin
Ivan Skorokhodov
Ekaterina Deyneka
Tsai-Shien Chen
...
Yuwei Fang
A. Stoliar
Elisa Ricci
Jian Ren
Sergey Tulyakov
VGen
134
62
0
22 Feb 2024
Consolidating Attention Features for Multi-view Image Editing
Consolidating Attention Features for Multi-view Image Editing
Or Patashnik
Rinon Gal
Daniel Cohen-Or
Jun-Yan Zhu
Fernando de la Torre
73
6
0
22 Feb 2024
LLMBind: A Unified Modality-Task Integration Framework
LLMBind: A Unified Modality-Task Integration Framework
Bin Zhu
Munan Ning
Peng Jin
Bin Lin
Jinfa Huang
...
Junwu Zhang
Zhenyu Tang
Mingjun Pan
Xing Zhou
Li-ming Yuan
MLLM
74
6
0
22 Feb 2024
Real-time 3D-aware Portrait Editing from a Single Image
Real-time 3D-aware Portrait Editing from a Single Image
Qingyan Bai
Zifan Shi
Yinghao Xu
Hao Ouyang
Qiuyu Wang
Ceyuan Yang
Xuan Wang
Gordon Wetzstein
Yujun Shen
Qifeng Chen
3DHDiffM
122
10
0
21 Feb 2024
CoFRIDA: Self-Supervised Fine-Tuning for Human-Robot Co-Painting
CoFRIDA: Self-Supervised Fine-Tuning for Human-Robot Co-Painting
Peter Schaldenbrand
Gaurav Parmar
Jun-Yan Zhu
James McCann
Jean Oh
63
14
0
21 Feb 2024
UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance
  Editing
UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing
Jianhong Bai
Tianyu He
Yuchi Wang
Junliang Guo
Haoji Hu
Zuozhu Liu
Jiang Bian
VGen
100
30
0
20 Feb 2024
Robust-Wide: Robust Watermarking against Instruction-driven Image
  Editing
Robust-Wide: Robust Watermarking against Instruction-driven Image Editing
Runyi Hu
Jie Zhang
Ting Xu
Jiwei Li
Tianwei Zhang
DiffMWIGM
117
8
0
20 Feb 2024
The Revolution of Multimodal Large Language Models: A Survey
The Revolution of Multimodal Large Language Models: A Survey
Davide Caffagni
Federico Cocchi
Luca Barsellotti
Nicholas Moratelli
Sara Sarto
Lorenzo Baraldi
Lorenzo Baraldi
Marcella Cornia
Rita Cucchiara
LRMVLM
135
64
0
19 Feb 2024
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image
  Generation
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation
Chong Zeng
Yue Dong
Pieter Peers
Youkang Kong
Hongzhi Wu
Xin Tong
126
35
0
19 Feb 2024
On Good Practices for Task-Specific Distillation of Large Pretrained
  Visual Models
On Good Practices for Task-Specific Distillation of Large Pretrained Visual Models
Juliette Marrie
Michael Arbel
Julien Mairal
Diane Larlus
VLMMQ
92
1
0
17 Feb 2024
Previous
123...171819...272829
Next