2211.09800
InstructPix2Pix: Learning to Follow Image Editing Instructions
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"
50 / 1,418 papers shown
Recent Advances in 3D Gaussian Splatting
Tong Wu
Yu-Jie Yuan
Ling-Xiao Zhang
Jie Yang
Yan-Pei Cao
Ling-Qi Yan
Lin Gao
3DGS
157
106
0
17 Mar 2024
Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields
Yonggan Fu
Huaizhi Qu
Zhifan Ye
Chaojian Li
Kevin Zhao
Yingyan Lin
AI4CE
111
0
0
17 Mar 2024
Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models
Rui Li
Ruihuang Li
Song Guo
Lei Zhang
DiffM
82
10
0
17 Mar 2024
StainDiffuser: MultiTask Dual Diffusion Model for Virtual Staining
Tushar Kataria
Beatrice Knudsen
Shireen Y. Elhabian
DiffM
MedIm
105
10
0
17 Mar 2024
Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation
Yeongtak Oh
Jonghyun Lee
Jooyoung Choi
Dahuin Jung
Uiwon Hwang
Sungroh Yoon
TTA
DiffM
82
5
0
16 Mar 2024
MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections
Mude Hui
Zihao Wei
Hongru Zhu
Fei Xia
Yuyin Zhou
MedIm
71
8
0
16 Mar 2024
Strong and Controllable Blind Image Decomposition
Zeyu Zhang
Junlin Han
Chenhui Gou
Hongdong Li
Liang Zheng
82
2
0
15 Mar 2024
E4C: Enhance Editability for Text-Based Image Editing by Harnessing Efficient CLIP Guidance
Tianrui Huang
Pu Cao
Lu Yang
Chun Liu
Mengjie Hu
Zhiwei Liu
Qing-Huang Song
DiffM
79
0
0
15 Mar 2024
ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images
Xiangtian Xue
Jiasong Wu
Youyong Kong
L. Senhadji
Huazhong Shu
DiffM
71
1
0
15 Mar 2024
Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing
Wonjun Kang
Kevin Galim
Hyung Il Koo
DiffM
75
5
0
14 Mar 2024
Video Editing via Factorized Diffusion Distillation
Uriel Singer
Amit Zohar
Yuval Kirstain
Shelly Sheynin
Adam Polyak
Devi Parikh
Yaniv Taigman
DiffM
VGen
91
15
0
14 Mar 2024
Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts
Byeongjun Park
Hyojun Go
Jin-Young Kim
Sangmin Woo
Seokil Ham
Changick Kim
DiffM
MoE
106
13
0
14 Mar 2024
Rethinking Referring Object Removal
Xiangtian Xue
Jiasong Wu
Youyong Kong
L. Senhadji
Huazhong Shu
DiffM
79
0
0
14 Mar 2024
Explore In-Context Segmentation via Latent Diffusion Models
Chaoyang Wang
Xiangtai Li
Henghui Ding
Lu Qi
Jiangning Zhang
Yunhai Tong
Chen Change Loy
Shuicheng Yan
DiffM
158
7
0
14 Mar 2024
Unveiling the Truth: Exploring Human Gaze Patterns in Fake Images
Giuseppe Cartella
Vittorio Cuculo
Marcella Cornia
Rita Cucchiara
DiffM
128
5
0
13 Mar 2024
GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing
Jing Wu
Jiawang Bian
Xinghui Li
Guangrun Wang
Ian D Reid
Philip Torr
V. Prisacariu
3DGS
98
42
0
13 Mar 2024
Make Me Happier: Evoking Emotions Through Image Diffusion Models
Qing Lin
Jingfeng Zhang
Yew-Soon Ong
Mengmi Zhang
62
3
0
13 Mar 2024
Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation
Shihao Zhao
Shaozhe Hao
Bojia Zi
Huaizhe Xu
Kwan-Yee K. Wong
DiffM
VLM
108
9
0
12 Mar 2024
V3D: Video Diffusion Models are Effective 3D Generators
Zilong Chen
Yikai Wang
Feng Wang
Zhengyi Wang
Huaping Liu
VGen
117
68
0
11 Mar 2024
GSEdit: Efficient Text-Guided Editing of 3D Objects via Gaussian Splatting
Francesco Palandra
Andrea Sanchietti
Daniele Baieri
Emanuele Rodolà
3DGS
79
22
0
08 Mar 2024
XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution
Yunpeng Qu
Kun Yuan
Kai Zhao
Qizhi Xie
Jinhua Hao
Ming Sun
Chao Zhou
88
19
0
08 Mar 2024
InstructGIE: Towards Generalizable Image Editing
Zichong Meng
Changdi Yang
Jun Liu
Hao Tang
Pu Zhao
Yanzhi Wang
DiffM
99
9
0
08 Mar 2024
Pix2Gif: Motion-Guided Diffusion for GIF Generation
Hitesh Kandala
Jianfeng Gao
Jianwei Yang
VGen
DiffM
85
3
0
07 Mar 2024
StableDrag: Stable Dragging for Point-based Image Editing
Yutao Cui
Xiaotong Zhao
Guozhen Zhang
Shengming Cao
Kai Ma
Limin Wang
89
14
0
07 Mar 2024
NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging
Takahiro Shirakawa
Seiichi Uchida
DiffM
62
19
0
06 Mar 2024
Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing
Bingyan Liu
Chengyu Wang
Tingfeng Cao
Kui Jia
Jun Huang
DiffM
82
63
0
06 Mar 2024
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
...
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
DiffM
321
1,410
0
05 Mar 2024
Doubly Abductive Counterfactual Inference for Text-based Image Editing
Xue Song
Jiequan Cui
Hanwang Zhang
Jingjing Chen
Richang Hong
Yu-Gang Jiang
DiffM
61
13
0
05 Mar 2024
RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
Priya Sundaresan
Q. Vuong
Jiayuan Gu
Peng Xu
Ted Xiao
...
Ajinkya Jain
Karol Hausman
Dorsa Sadigh
Jeannette Bohg
S. Schaal
VGen
99
26
0
05 Mar 2024
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Yuhao Xu
Tao Gu
Weifeng Chen
Chengcai Chen
DiffM
91
66
0
04 Mar 2024
Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks
Yuhao Liu
Zhanghan Ke
Fang Liu
Nanxuan Zhao
Rynson W. H. Lau
DiffM
114
23
0
01 Mar 2024
LoMOE: Localized Multi-Object Editing via Multi-Diffusion
Goirik Chakrabarty
Aditya Chandrasekar
Ramya Hebbalaguppe
AP Prathosh
DiffM
80
6
0
01 Mar 2024
Ask Your Distribution Shift if Pre-Training is Right for You
Benjamin Cohen-Wang
Joshua Vendrow
Aleksander Madry
OOD
94
3
0
29 Feb 2024
Large Language Models and Games: A Survey and Roadmap
Roberto Gallotta
Graham Todd
Marvin Zammit
Sam Earle
Antonios Liapis
Julian Togelius
Georgios N. Yannakakis
LLMAG
LM&MA
AI4CE
LRM
131
86
0
28 Feb 2024
From Summary to Action: Enhancing Large Language Models for Complex Tasks with Open World APIs
Yulong Liu
Yunlong Yuan
Chunwei Wang
Jianhua Han
Yongqiang Ma
Li Zhang
Nanning Zheng
Hang Xu
LLMAG
58
5
0
28 Feb 2024
Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis
Yanzuo Lu
Manlin Zhang
Andy J. Ma
Xiaohua Xie
Jian-Huang Lai
DiffM
68
24
0
28 Feb 2024
CustomSketching: Sketch Concept Extraction for Sketch-based Image Synthesis and Editing
Chufeng Xiao
Hongbo Fu
DiffM
83
3
0
27 Feb 2024
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Shifeng Chen
Liangliang Cao
EGVM
263
103
0
27 Feb 2024
Placing Objects in Context via Inpainting for Out-of-distribution Segmentation
Pau de Jorge
Riccardo Volpi
P. Dokania
Philip Torr
Grégory Rogez
DiffM
116
5
0
26 Feb 2024
Intelligent Director: An Automatic Framework for Dynamic Visual Composition using ChatGPT
Sixiao Zheng
Jingyang Huo
Yu Wang
Yanwei Fu
VGen
DiffM
69
1
0
24 Feb 2024
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Willi Menapace
Aliaksandr Siarohin
Ivan Skorokhodov
Ekaterina Deyneka
Tsai-Shien Chen
...
Yuwei Fang
A. Stoliar
Elisa Ricci
Jian Ren
Sergey Tulyakov
VGen
134
62
0
22 Feb 2024
Consolidating Attention Features for Multi-view Image Editing
Or Patashnik
Rinon Gal
Daniel Cohen-Or
Jun-Yan Zhu
Fernando de la Torre
73
6
0
22 Feb 2024
LLMBind: A Unified Modality-Task Integration Framework
Bin Zhu
Munan Ning
Peng Jin
Bin Lin
Jinfa Huang
...
Junwu Zhang
Zhenyu Tang
Mingjun Pan
Xing Zhou
Li-ming Yuan
MLLM
74
6
0
22 Feb 2024
Real-time 3D-aware Portrait Editing from a Single Image
Qingyan Bai
Zifan Shi
Yinghao Xu
Hao Ouyang
Qiuyu Wang
Ceyuan Yang
Xuan Wang
Gordon Wetzstein
Yujun Shen
Qifeng Chen
3DH
DiffM
122
10
0
21 Feb 2024
CoFRIDA: Self-Supervised Fine-Tuning for Human-Robot Co-Painting
Peter Schaldenbrand
Gaurav Parmar
Jun-Yan Zhu
James McCann
Jean Oh
63
14
0
21 Feb 2024
UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing
Jianhong Bai
Tianyu He
Yuchi Wang
Junliang Guo
Haoji Hu
Zuozhu Liu
Jiang Bian
VGen
100
30
0
20 Feb 2024
Robust-Wide: Robust Watermarking against Instruction-driven Image Editing
Runyi Hu
Jie Zhang
Ting Xu
Jiwei Li
Tianwei Zhang
DiffM
WIGM
117
8
0
20 Feb 2024
The Revolution of Multimodal Large Language Models: A Survey
Davide Caffagni
Federico Cocchi
Luca Barsellotti
Nicholas Moratelli
Sara Sarto
Lorenzo Baraldi
Lorenzo Baraldi
Marcella Cornia
Rita Cucchiara
LRM
VLM
135
64
0
19 Feb 2024
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation
Chong Zeng
Yue Dong
Pieter Peers
Youkang Kong
Hongzhi Wu
Xin Tong
128
35
0
19 Feb 2024
On Good Practices for Task-Specific Distillation of Large Pretrained Visual Models
Juliette Marrie
Michael Arbel
Julien Mairal
Diane Larlus
VLM
MQ
92
1
0
17 Feb 2024