ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.09800
  4. Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions

InstructPix2Pix: Learning to Follow Image Editing Instructions

17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM
ArXivPDFHTML

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,356 papers shown
Title
Composed Video Retrieval via Enriched Context and Discriminative
  Embeddings
Composed Video Retrieval via Enriched Context and Discriminative Embeddings
Omkar Thawakar
Muzammal Naseer
Rao Muhammad Anwer
Salman Khan
Michael Felsberg
Mubarak Shah
Fahad Shahbaz Khan
34
8
0
25 Mar 2024
Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions
Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions
S. A. Baumann
Felix Krause
Michael Neumayr
Nick Stracke
Vincent Tao Hu
Bjorn Ommer
Björn Ommer
DiffM
LM&Ro
76
11
0
25 Mar 2024
Pose-Guided Self-Training with Two-Stage Clustering for Unsupervised
  Landmark Discovery
Pose-Guided Self-Training with Two-Stage Clustering for Unsupervised Landmark Discovery
Siddharth Tourani
Ahmed Alwheibi
Arif Mahmood
Muhammad Haris Khan
DiffM
46
1
0
24 Mar 2024
ThemeStation: Generating Theme-Aware 3D Assets from Few Exemplars
ThemeStation: Generating Theme-Aware 3D Assets from Few Exemplars
Zhenwei Wang
Tengfei Wang
Gerhard Hancke
Ziwei Liu
Rynson W. H. Lau
DiffM
VGen
43
6
0
22 Mar 2024
DragAPart: Learning a Part-Level Motion Prior for Articulated Objects
DragAPart: Learning a Part-Level Motion Prior for Articulated Objects
Ruining Li
Chuanxia Zheng
Christian Rupprecht
Andrea Vedaldi
DiffM
45
18
0
22 Mar 2024
Recent Trends in 3D Reconstruction of General Non-Rigid Scenes
Recent Trends in 3D Reconstruction of General Non-Rigid Scenes
Raza Yunus
J. E. Lenssen
Michael Niemeyer
Yiyi Liao
Christian Rupprecht
Christian Theobalt
Gerard Pons-Moll
Jia-Bin Huang
Vladislav Golyanik
Eddy Ilg
53
25
0
22 Mar 2024
DreamFlow: High-Quality Text-to-3D Generation by Approximating
  Probability Flow
DreamFlow: High-Quality Text-to-3D Generation by Approximating Probability Flow
Kyungmin Lee
Kihyuk Sohn
Jinwoo Shin
53
19
0
22 Mar 2024
Controlled Training Data Generation with Diffusion Models
Controlled Training Data Generation with Diffusion Models
Teresa Yeo
Andrei Atanov
Harold Benoit
Aleksandr Alekseev
Ruchira Ray
Pooya Esmaeil Akhoondi
Amir Zamir
52
4
0
22 Mar 2024
AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks
AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks
Max W.F. Ku
Cong Wei
Weiming Ren
Huan Yang
Wenhu Chen
VGen
DiffM
80
21
0
21 Mar 2024
Efficient Video Diffusion Models via Content-Frame Motion-Latent
  Decomposition
Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
Sihyun Yu
Weili Nie
De-An Huang
Boyi Li
Jinwoo Shin
A. Anandkumar
VGen
DiffM
36
15
0
21 Mar 2024
TimeRewind: Rewinding Time with Image-and-Events Video Diffusion
TimeRewind: Rewinding Time with Image-and-Events Video Diffusion
Jingxi Chen
Brandon Yushan Feng
Haoming Cai
Mingyang Xie
Christopher A. Metzler
Cornelia Fermuller
Yiannis Aloimonos
40
4
0
20 Mar 2024
DepthFM: Fast Monocular Depth Estimation with Flow Matching
DepthFM: Fast Monocular Depth Estimation with Flow Matching
Ming Gui
Johannes S. Fischer
Ulrich Prestel
Pingchuan Ma
Dmytro Kotovenko
Olga Grebenkova
S. A. Baumann
Vincent Tao Hu
Bjorn Ommer
MDE
41
53
0
20 Mar 2024
ZoDi: Zero-Shot Domain Adaptation with Diffusion-Based Image Transfer
ZoDi: Zero-Shot Domain Adaptation with Diffusion-Based Image Transfer
Hiroki Azuma
Yusuke Matsui
Atsuto Maki
VLM
44
1
0
20 Mar 2024
Ground-A-Score: Scaling Up the Score Distillation for Multi-Attribute
  Editing
Ground-A-Score: Scaling Up the Score Distillation for Multi-Attribute Editing
Hangeol Chang
Jinho Chang
Jong Chul Ye
DiffM
53
3
0
20 Mar 2024
IDAdapter: Learning Mixed Features for Tuning-Free Personalization of
  Text-to-Image Models
IDAdapter: Learning Mixed Features for Tuning-Free Personalization of Text-to-Image Models
Siying Cui
Jia Guo
Xiang An
Jiankang Deng
Yongle Zhao
Xinyu Wei
Ziyong Feng
DiffM
42
21
0
20 Mar 2024
Mora: Enabling Generalist Video Generation via A Multi-Agent Framework
Mora: Enabling Generalist Video Generation via A Multi-Agent Framework
Zhengqing Yuan
Ruoxi Chen
Zhaoxu Li
Haolong Jia
Lifang He
Chi Wang
Lichao Sun
VGen
68
27
0
20 Mar 2024
Magic Fixup: Streamlining Photo Editing by Watching Dynamic Videos
Magic Fixup: Streamlining Photo Editing by Watching Dynamic Videos
Hadi Alzayer
Zhihao Xia
Xuaner Zhang
Eli Shechtman
Jia-Bin Huang
Michael Gharbi
DiffM
VGen
37
19
0
19 Mar 2024
Wear-Any-Way: Manipulable Virtual Try-on via Sparse Correspondence
  Alignment
Wear-Any-Way: Manipulable Virtual Try-on via Sparse Correspondence Alignment
Mengting Chen
Xi Chen
Zhonghua Zhai
Chen Ju
Xuewen Hong
Jinsong Lan
Shuai Xiao
OOD
DiffM
53
23
0
19 Mar 2024
FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
Shuai Yang
Yifan Zhou
Ziwei Liu
Chen Change Loy
VGen
DiffM
74
27
0
19 Mar 2024
You Only Sample Once: Taming One-Step Text-to-Image Synthesis by
  Self-Cooperative Diffusion GANs
You Only Sample Once: Taming One-Step Text-to-Image Synthesis by Self-Cooperative Diffusion GANs
Yihong Luo
Xiaolong Chen
Xinghua Qu
Jing Tang
61
6
0
19 Mar 2024
Generative Enhancement for 3D Medical Images
Generative Enhancement for 3D Medical Images
Lingting Zhu
Noel Codella
Dongdong Chen
Zhenchao Jin
Lu Yuan
Lequan Yu
DiffM
MedIm
47
10
0
19 Mar 2024
Tuning-Free Image Customization with Image and Text Guidance
Tuning-Free Image Customization with Image and Text Guidance
Pengzhi Li
Qiang Nie
Ying Chen
Xi Jiang
Kai Wu
Yuhuan Lin
Yong-Jin Liu
Jinlong Peng
Chengjie Wang
Feng Zheng
DiffM
30
19
0
19 Mar 2024
LASPA: Latent Spatial Alignment for Fast Training-free Single Image
  Editing
LASPA: Latent Spatial Alignment for Fast Training-free Single Image Editing
Yazeed Alharbi
Peter Wonka
DiffM
40
0
0
19 Mar 2024
MineDreamer: Learning to Follow Instructions via Chain-of-Imagination
  for Simulated-World Control
MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control
Enshen Zhou
Yiran Qin
Zhen-fei Yin
Yuzhou Huang
Ruimao Zhang
Lu Sheng
Yu Qiao
Jing Shao
LM&Ro
AI4CE
52
34
0
18 Mar 2024
One-Step Image Translation with Text-to-Image Models
One-Step Image Translation with Text-to-Image Models
Gaurav Parmar
Taesung Park
Srinivasa Narasimhan
Jun-Yan Zhu
42
45
0
18 Mar 2024
Generic 3D Diffusion Adapter Using Controlled Multi-View Editing
Generic 3D Diffusion Adapter Using Controlled Multi-View Editing
Hansheng Chen
Ruoxi Shi
Yulin Liu
Bokui Shen
Jiayuan Gu
Gordon Wetzstein
Hao Su
Leonidas J. Guibas
DiffM
55
19
0
18 Mar 2024
Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion
  Distillation
Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation
Axel Sauer
Frederic Boesel
Tim Dockhorn
A. Blattmann
Patrick Esser
Robin Rombach
DiffM
55
110
0
18 Mar 2024
SpatialPIN: Enhancing Spatial Reasoning Capabilities of Vision-Language
  Models through Prompting and Interacting 3D Priors
SpatialPIN: Enhancing Spatial Reasoning Capabilities of Vision-Language Models through Prompting and Interacting 3D Priors
Chenyang Ma
Kai Lu
Ta-Ying Cheng
Niki Trigoni
Andrew Markham
LRM
40
8
0
18 Mar 2024
EffiVED:Efficient Video Editing via Text-instruction Diffusion Models
EffiVED:Efficient Video Editing via Text-instruction Diffusion Models
Zhenghao Zhang
Zuozhuo Dai
Long Qin
Weizhi Wang
DiffM
VGen
47
2
0
18 Mar 2024
Diffusion Models are Geometry Critics: Single Image 3D Editing Using
  Pre-Trained Diffusion Priors
Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors
Ruicheng Wang
Jianfeng Xiang
Jiaolong Yang
Xin Tong
DiffM
44
3
0
18 Mar 2024
Recent Advances in 3D Gaussian Splatting
Recent Advances in 3D Gaussian Splatting
Tong Wu
Yu-Jie Yuan
Ling-Xiao Zhang
Jie Yang
Yan-Pei Cao
Ling-Qi Yan
Lin Gao
3DGS
76
87
0
17 Mar 2024
Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural
  Radiance Fields
Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields
Yonggan Fu
Huaizhi Qu
Zhifan Ye
Chaojian Li
Kevin Zhao
Yingyan Lin
AI4CE
47
0
0
17 Mar 2024
Source Prompt Disentangled Inversion for Boosting Image Editability with
  Diffusion Models
Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models
Rui Li
Ruihuang Li
Song Guo
Lei Zhang
DiffM
42
7
0
17 Mar 2024
StainDiffuser: MultiTask Dual Diffusion Model for Virtual Staining
StainDiffuser: MultiTask Dual Diffusion Model for Virtual Staining
Tushar Kataria
Beatrice Knudsen
Shireen Y. Elhabian
DiffM
MedIm
37
9
0
17 Mar 2024
Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation
Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation
Yeongtak Oh
Jonghyun Lee
Jooyoung Choi
Dahuin Jung
Uiwon Hwang
Sungroh Yoon
TTA
DiffM
50
4
0
16 Mar 2024
MicroDiffusion: Implicit Representation-Guided Diffusion for 3D
  Reconstruction from Limited 2D Microscopy Projections
MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections
Mude Hui
Zihao Wei
Hongru Zhu
Fei Xia
Yuyin Zhou
MedIm
54
8
0
16 Mar 2024
Strong and Controllable Blind Image Decomposition
Strong and Controllable Blind Image Decomposition
Zeyu Zhang
Junlin Han
Chenhui Gou
Hongdong Li
Liang Zheng
46
1
0
15 Mar 2024
ST-LDM: A Universal Framework for Text-Grounded Object Generation in
  Real Images
ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images
Xiangtian Xue
Jiasong Wu
Youyong Kong
L. Senhadji
Huazhong Shu
DiffM
48
1
0
15 Mar 2024
Eta Inversion: Designing an Optimal Eta Function for Diffusion-based
  Real Image Editing
Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing
Wonjun Kang
Kevin Galim
Hyung Il Koo
DiffM
39
5
0
14 Mar 2024
Video Editing via Factorized Diffusion Distillation
Video Editing via Factorized Diffusion Distillation
Uriel Singer
Amit Zohar
Yuval Kirstain
Shelly Sheynin
Adam Polyak
Devi Parikh
Yaniv Taigman
DiffM
VGen
51
12
0
14 Mar 2024
Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse
  Mixture-of-Experts
Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts
Byeongjun Park
Hyojun Go
Jin-Young Kim
Sangmin Woo
Seokil Ham
Changick Kim
DiffM
MoE
66
13
0
14 Mar 2024
Rethinking Referring Object Removal
Rethinking Referring Object Removal
Xiangtian Xue
Jiasong Wu
Youyong Kong
L. Senhadji
Huazhong Shu
DiffM
42
0
0
14 Mar 2024
Explore In-Context Segmentation via Latent Diffusion Models
Explore In-Context Segmentation via Latent Diffusion Models
Chaoyang Wang
Xiangtai Li
Henghui Ding
Lu Qi
Jiangning Zhang
Yunhai Tong
Chen Change Loy
Shuicheng Yan
DiffM
63
6
0
14 Mar 2024
Unveiling the Truth: Exploring Human Gaze Patterns in Fake Images
Unveiling the Truth: Exploring Human Gaze Patterns in Fake Images
Giuseppe Cartella
Vittorio Cuculo
Marcella Cornia
Rita Cucchiara
DiffM
81
5
0
13 Mar 2024
GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting
  Editing
GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing
Jing Wu
Jiawang Bian
Xinghui Li
Guangrun Wang
Ian D Reid
Philip Torr
V. Prisacariu
3DGS
32
33
0
13 Mar 2024
Make Me Happier: Evoking Emotions Through Image Diffusion Models
Make Me Happier: Evoking Emotions Through Image Diffusion Models
Qing Lin
Jingfeng Zhang
Yew-Soon Ong
Mengmi Zhang
42
3
0
13 Mar 2024
Bridging Different Language Models and Generative Vision Models for
  Text-to-Image Generation
Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation
Shihao Zhao
Shaozhe Hao
Bojia Zi
Huaizhe Xu
Kwan-Yee K. Wong
DiffM
VLM
68
8
0
12 Mar 2024
V3D: Video Diffusion Models are Effective 3D Generators
V3D: Video Diffusion Models are Effective 3D Generators
Zilong Chen
Yikai Wang
Feng Wang
Zhengyi Wang
Huaping Liu
VGen
45
62
0
11 Mar 2024
GSEdit: Efficient Text-Guided Editing of 3D Objects via Gaussian
  Splatting
GSEdit: Efficient Text-Guided Editing of 3D Objects via Gaussian Splatting
Francesco Palandra
Andrea Sanchietti
Daniele Baieri
Emanuele Rodolà
3DGS
48
20
0
08 Mar 2024
XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution
XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution
Yunpeng Qu
Kun Yuan
Kai Zhao
Qizhi Xie
Jinhua Hao
Ming Sun
Chao Zhou
32
17
0
08 Mar 2024
Previous
123...151617...262728
Next