ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.09800
  4. Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions

InstructPix2Pix: Learning to Follow Image Editing Instructions

17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM
ArXivPDFHTML

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,353 papers shown
Title
V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data
V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data
Rotem Shalev-Arkushin
Aharon Azulay
Tavi Halperin
Eitan Richardson
Amit H. Bermano
Ohad Fried
DiffM
54
0
0
20 Jun 2024
Advancing Fine-Grained Classification by Structure and Subject Preserving Augmentation
Advancing Fine-Grained Classification by Structure and Subject Preserving Augmentation
Eyal Michaeli
Ohad Fried
62
1
0
20 Jun 2024
Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images
Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images
Haruo Fujiwara
Yusuke Mukuta
Tatsuya Harada
64
4
0
19 Jun 2024
VIA: Unified Spatiotemporal Video Adaptation Framework for Global and Local Video Editing
VIA: Unified Spatiotemporal Video Adaptation Framework for Global and Local Video Editing
Jing Gu
Yuwei Fang
Ivan Skorokhodov
Peter Wonka
Xinya Du
Sergey Tulyakov
Xin Eric Wang
45
1
0
18 Jun 2024
Generative Visual Instruction Tuning
Generative Visual Instruction Tuning
Jefferson Hernandez
Ruben Villegas
Vicente Ordonez
VLM
38
3
0
17 Jun 2024
Adding Conditional Control to Diffusion Models with Reinforcement Learning
Adding Conditional Control to Diffusion Models with Reinforcement Learning
Yulai Zhao
Masatoshi Uehara
Gabriele Scalia
Tommaso Biancalani
Sergey Levine
Ehsan Hajiramezanali
Ehsan Hajiramezanali
AI4CE
65
3
0
17 Jun 2024
Poetry2Image: An Iterative Correction Framework for Images Generated
  from Chinese Classical Poetry
Poetry2Image: An Iterative Correction Framework for Images Generated from Chinese Classical Poetry
Jing Jiang
Yiran Ling
Binzhu Li
Pengxiang Li
Junming Piao
Yu Zhang
EGVM
DiffM
37
1
0
15 Jun 2024
Crafting Parts for Expressive Object Composition
Crafting Parts for Expressive Object Composition
Harsh Rangwani
Aishwarya Agarwal
Kuldeep Kulkarni
R. Venkatesh Babu
Srikrishna Karanam
DiffM
49
2
0
14 Jun 2024
InstructRL4Pix: Training Diffusion for Image Editing by Reinforcement
  Learning
InstructRL4Pix: Training Diffusion for Image Editing by Reinforcement Learning
Tiancheng Li
Yu Lei
Huajun Chen
Nan Zhuang
EGVM
40
0
0
14 Jun 2024
ControlVAR: Exploring Controllable Visual Autoregressive Modeling
ControlVAR: Exploring Controllable Visual Autoregressive Modeling
Xiang Li
Kai Qiu
Hao Chen
Jason Kuen
Zhe-nan Lin
Rita Singh
Bhiksha Raj
DiffM
43
21
0
14 Jun 2024
ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene
  Editing
ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene Editing
Jun-Kun Chen
Samuel Rota Buló
Norman Muller
Lorenzo Porzi
Peter Kontschieder
Yu-Xiong Wang
DiffM
36
9
0
13 Jun 2024
Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D
  Diffusion
Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion
Linzhan Mou
Jun-Kun Chen
Yu-Xiong Wang
VGen
DiffM
44
10
0
13 Jun 2024
CLIPAway: Harmonizing Focused Embeddings for Removing Objects via
  Diffusion Models
CLIPAway: Harmonizing Focused Embeddings for Removing Objects via Diffusion Models
Yigit Ekin
Ahmet Burak Yildirim
Erdem Çağlar
Aykut Erdem
Erkut Erdem
Aysegül Dündar
DiffM
39
8
0
13 Jun 2024
CMC-Bench: Towards a New Paradigm of Visual Signal Compression
CMC-Bench: Towards a New Paradigm of Visual Signal Compression
Chunyi Li
Xiele Wu
H. Wu
Donghui Feng
Zicheng Zhang
Guo Lu
Xiongkuo Min
Xiaohong Liu
Guangtao Zhai
Weisi Lin
VLM
51
5
0
13 Jun 2024
Toffee: Efficient Million-Scale Dataset Construction for Subject-Driven
  Text-to-Image Generation
Toffee: Efficient Million-Scale Dataset Construction for Subject-Driven Text-to-Image Generation
Yufan Zhou
Ruiyi Zhang
Kaizhi Zheng
Nanxuan Zhao
Jiuxiang Gu
Zichao Wang
Xin Eric Wang
Tong Sun
DiffM
35
2
0
13 Jun 2024
Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image
  Diffusion Models
Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models
Ziyi Wu
Yulia Rubanova
Rishabh Kabra
Drew A. Hudson
Igor Gilitschenski
Yusuf Aytar
Sjoerd van Steenkiste
Kelsey R. Allen
Thomas Kipf
VGen
DiffM
51
9
0
13 Jun 2024
Preserving Identity with Variational Score for General-purpose 3D
  Editing
Preserving Identity with Variational Score for General-purpose 3D Editing
Duong H. Le
Tuan Pham
Aniruddha Kembhavi
Stephan Mandt
Wei-Chiu Ma
Jiasen Lu
54
0
0
13 Jun 2024
COVE: Unleashing the Diffusion Feature Correspondence for Consistent
  Video Editing
COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing
Jiangshan Wang
Yue Ma
Jiayi Guo
Yicheng Xiao
Gao Huang
Xiu Li
DiffM
36
18
0
13 Jun 2024
MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs
MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs
Xuannan Liu
Zekun Li
Peipei Li
Shuhan Xia
Xing Cui
Linzhi Huang
Huaibo Huang
Weihong Deng
Zhaofeng He
58
15
0
13 Jun 2024
ICE-G: Image Conditional Editing of 3D Gaussian Splats
ICE-G: Image Conditional Editing of 3D Gaussian Splats
Vishnu Jaganathan
Hannah Hanyun Huang
Muhammad Zubair Irshad
Varun Jampani
Amit Raj
Z. Kira
3DGS
50
8
0
12 Jun 2024
HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction
  Awareness
HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction Awareness
Zihui Xue
Mi Luo
Changan Chen
Kristen Grauman
DiffM
37
6
0
11 Jun 2024
Zero-shot Image Editing with Reference Imitation
Zero-shot Image Editing with Reference Imitation
Xi Chen
Yutong Feng
Mengting Chen
Yiyang Wang
Shilong Zhang
Yu Liu
Yujun Shen
Hengshuang Zhao
DiffM
37
21
0
11 Jun 2024
Neural Gaffer: Relighting Any Object via Diffusion
Neural Gaffer: Relighting Any Object via Diffusion
Haian Jin
Yuan Li
Fujun Luan
Yuanbo Xiangli
Sai Bi
Kai Zhang
Zexiang Xu
Jin Sun
Noah Snavely
46
15
0
11 Jun 2024
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with
  Foundation Models
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models
Athanasios Tragakis
Marco Aversa
Chaitanya Kaul
Roderick Murray-Smith
Daniele Faccio
62
2
0
11 Jun 2024
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
X. Wang
Siming Fu
Qihan Huang
Wanggui He
Hao Jiang
DiffM
53
41
0
11 Jun 2024
NaRCan: Natural Refined Canonical Image with Integration of Diffusion
  Prior for Video Editing
NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video Editing
Ting-Hsuan Chen
Jiewen Chan
Hau-Shiang Shiu
Shih-Han Yen
Chang-Han Yeh
Yu-Lun Liu
VGen
DiffM
51
3
0
10 Jun 2024
AID: Adapting Image2Video Diffusion Models for Instruction-guided Video
  Prediction
AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction
Zhen Xing
Qi Dai
Zejia Weng
Zuxuan Wu
Yu-Gang Jiang
VGen
49
14
0
10 Jun 2024
Diffusion-RPO: Aligning Diffusion Models through Relative Preference
  Optimization
Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization
Yi Gu
Zhendong Wang
Yueqin Yin
Yujia Xie
Mingyuan Zhou
38
15
0
10 Jun 2024
Tuning-Free Visual Customization via View Iterative Self-Attention
  Control
Tuning-Free Visual Customization via View Iterative Self-Attention Control
Xiaojie Li
Chenghao Gu
Shuzhao Xie
Yunpeng Bai
Weixiang Zhang
Zhi Wang
47
0
0
10 Jun 2024
InfoGaussian: Structure-Aware Dynamic Gaussians through Lightweight
  Information Shaping
InfoGaussian: Structure-Aware Dynamic Gaussians through Lightweight Information Shaping
Yunchao Zhang
Guandao Yang
Leonidas J. Guibas
Yanchao Yang
3DGS
33
1
0
09 Jun 2024
OmniControlNet: Dual-stage Integration for Conditional Image Generation
OmniControlNet: Dual-stage Integration for Conditional Image Generation
Yilin Wang
Haiyang Xu
Xiang Zhang
Zeyuan Chen
Zhizhou Sha
Zirui Wang
Zhuowen Tu
VLM
34
15
0
09 Jun 2024
Can Prompt Modifiers Control Bias? A Comparative Analysis of
  Text-to-Image Generative Models
Can Prompt Modifiers Control Bias? A Comparative Analysis of Text-to-Image Generative Models
P. W. Shin
Jihyun Janice Ahn
Wenpeng Yin
Jack Sampson
Vijaykrishnan Narayanan
38
3
0
09 Jun 2024
GenHeld: Generating and Editing Handheld Objects
GenHeld: Generating and Editing Handheld Objects
Chaerin Min
Srinath Sridhar
49
0
0
07 Jun 2024
Varying Manifolds in Diffusion: From Time-varying Geometries to Visual
  Saliency
Varying Manifolds in Diffusion: From Time-varying Geometries to Visual Saliency
Junhao Chen
Manyi Li
Zherong Pan
Xifeng Gao
Changhe Tu
DiffM
43
2
0
07 Jun 2024
Towards Semantic Equivalence of Tokenization in Multimodal LLM
Towards Semantic Equivalence of Tokenization in Multimodal LLM
Shengqiong Wu
Hao Fei
Xiangtai Li
Jiayi Ji
Hanwang Zhang
Tat-Seng Chua
Shuicheng Yan
MLLM
65
32
0
07 Jun 2024
M&M VTO: Multi-Garment Virtual Try-On and Editing
M&M VTO: Multi-Garment Virtual Try-On and Editing
Luyang Zhu
Yingwei Li
Nan Liu
Hao Peng
Dawei Yang
Ira Kemelmacher-Shlizerman
DiffM
55
7
0
06 Jun 2024
GenAI Arena: An Open Evaluation Platform for Generative Models
GenAI Arena: An Open Evaluation Platform for Generative Models
Dongfu Jiang
Max W.F. Ku
Tianle Li
Yuansheng Ni
Shizhuo Sun
Rongqi Fan
Wenhu Chen
EGVM
46
20
0
06 Jun 2024
VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval
VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval
Yueze Wang
Zheng Liu
Shitao Xiao
Bo Zhao
Yongping Xiong
51
23
0
06 Jun 2024
JIGMARK: A Black-Box Approach for Enhancing Image Watermarks against
  Diffusion Model Edits
JIGMARK: A Black-Box Approach for Enhancing Image Watermarks against Diffusion Model Edits
Minzhou Pan
Yi Zeng
Xue Lin
Ning Yu
Cho-Jui Hsieh
Peter Henderson
Ruoxi Jia
WIGM
48
3
0
06 Jun 2024
Bayesian Power Steering: An Effective Approach for Domain Adaptation of
  Diffusion Models
Bayesian Power Steering: An Effective Approach for Domain Adaptation of Diffusion Models
Ding Huang
Ting Li
Jian Huang
DiffM
51
1
0
06 Jun 2024
Dream-in-Style: Text-to-3D Generation Using Stylized Score Distillation
Dream-in-Style: Text-to-3D Generation Using Stylized Score Distillation
Hubert Kompanowski
Binh-Son Hua
DiffM
64
3
0
05 Jun 2024
Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait
  Animation
Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation
Yi Ma
Hongyu Liu
Haozhao Wang
Heng Pan
Yingqing He
...
Ailing Zeng
Chengfei Cai
H. Shum
Wen Liu
Qifeng Chen
52
52
0
04 Jun 2024
Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting
Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting
Inkyu Shin
Qihang Yu
Xiaohui Shen
In So Kweon
KuK-Jin Yoon
Liang-Chieh Chen
VGen
DiffM
71
1
0
04 Jun 2024
Turning Text and Imagery into Captivating Visual Video
Turning Text and Imagery into Captivating Visual Video
Mingming Wang
Elijah Miller
VGen
42
0
0
03 Jun 2024
DiffUHaul: A Training-Free Method for Object Dragging in Images
DiffUHaul: A Training-Free Method for Object Dragging in Images
Omri Avrahami
Rinon Gal
Gal Chechik
Ohad Fried
Dani Lischinski
Arash Vahdat
Weili Nie
55
15
0
03 Jun 2024
Learning Temporally Consistent Video Depth from Video Diffusion Priors
Learning Temporally Consistent Video Depth from Video Diffusion Priors
Jiahao Shao
Yuanbo Yang
Hongyu Zhou
Youmin Zhang
Yujun Shen
Matteo Poggi
Yiyi Liao
VGen
DiffM
MDE
47
38
0
03 Jun 2024
Report on Methods and Applications for Crafting 3D Humans
Report on Methods and Applications for Crafting 3D Humans
Lei Liu
K. Zhao
64
0
0
03 Jun 2024
Dimba: Transformer-Mamba Diffusion Models
Dimba: Transformer-Mamba Diffusion Models
Zhengcong Fei
Mingyuan Fan
Changqian Yu
Debang Li
Youqiang Zhang
Junshi Huang
Mamba
67
17
0
03 Jun 2024
Uni-ISP: Unifying the Learning of ISPs from Multiple Cameras
Uni-ISP: Unifying the Learning of ISPs from Multiple Cameras
Lingen Li
Mingde Yao
Xingyu Meng
Muquan Yu
Tianfan Xue
Liang Feng
44
0
0
03 Jun 2024
Bilateral Guided Radiance Field Processing
Bilateral Guided Radiance Field Processing
Yuehao Wang
Chaoyi Wang
Bingchen Gong
Tianfan Xue
47
7
0
01 Jun 2024
Previous
123...111213...262728
Next