2211.09800
InstructPix2Pix: Learning to Follow Image Editing Instructions
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
Papers citing
"InstructPix2Pix: Learning to Follow Image Editing Instructions"
50 / 1,418 papers shown
Title
Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review
Thang-Anh-Quan Nguyen
Amine Bourki
Mátyás Macudzinski
Anthony Brunel
M. Bennamoun
129
13
0
17 Feb 2024
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations
Tsung-Wei Ke
N. Gkanatsios
Katerina Fragkiadaki
VGen
119
127
0
16 Feb 2024
PEGASUS: Personalized Generative 3D Avatars with Composable Attributes
Hyunsoo Cha
Byungjun Kim
Hanbyul Joo
41
4
0
16 Feb 2024
Classification Diffusion Models: Revitalizing Density Ratio Estimation
Shahar Yadin
Noam Elata
T. Michaeli
DiffM
77
2
0
15 Feb 2024
Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion
Hila Manor
T. Michaeli
DiffM
100
29
0
15 Feb 2024
DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization
Jisu Nam
Heesu Kim
Dongjae Lee
Siyoon Jin
Seungryong Kim
Seunggyu Chang
DiffM
109
44
0
15 Feb 2024
Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency
Yannis Kalantidis
Mert Bulent Sariyildiz
Rafael Sampaio de Rezende
Philippe Weinzaepfel
Diane Larlus
G. Csurka
67
0
0
14 Feb 2024
Learning Continuous 3D Words for Text-to-Image Generation
Ta-Ying Cheng
Matheus Gadelha
Thibault Groueix
Matthew Fisher
R. Měch
Andrew Markham
Niki Trigoni
DiffM
76
14
0
13 Feb 2024
NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs
Michael Fischer
Zhengqin Li
Thu Nguyen-Phuoc
Aljaz Bozic
Zhao Dong
Carl S. Marshall
Tobias Ritschel
79
11
0
13 Feb 2024
Fine-Tuning Text-To-Image Diffusion Models for Class-Wise Spurious Feature Generation
AprilPyone Maungmaung
H. Nguyen
Hitoshi Kiya
Isao Echizen
83
6
0
13 Feb 2024
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting
Zhen Zhou
Fan Ma
Hehe Fan
Yi Yang
3DGS
90
24
0
09 Feb 2024
Animated Stickers: Bringing Stickers to Life with Video Diffusion
David Yan
Winnie Zhang
Luxin Zhang
Anmol Kalia
Dingkang Wang
...
Guan Pang
Ali K. Thabet
Peter Vajda
Amy Bearman
Licheng Yu
VGen
DiffM
101
2
0
08 Feb 2024
Real-World Robot Applications of Foundation Models: A Review
Kento Kawaharazuka
T. Matsushima
Andrew Gambardella
Jiaxian Guo
Chris Paxton
Andy Zeng
OffRL
VLM
LM&Ro
116
54
0
08 Feb 2024
Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models
Senmao Li
Joost van de Weijer
Taihang Hu
Fahad Shahbaz Khan
Qibin Hou
Yaxing Wang
Jian Yang
DiffM
89
30
0
08 Feb 2024
Counterfactual Image Editing
Yushu Pan
Elias Bareinboim
BDL
CML
76
8
0
07 Feb 2024
SPAD : Spatially Aware Multiview Diffusers
Yash Kant
Ziyi Wu
Michael Vasilkovsky
Guocheng Qian
Jian Ren
R. A. Guler
Guohao Li
Sergey Tulyakov
Igor Gilitschenski
Aliaksandr Siarohin
DiffM
108
38
0
07 Feb 2024
GenLens: A Systematic Evaluation of Visual GenAI Model Outputs
Tica Lin
Hanspeter Pfister
Jui-Hsien Wang
ELM
45
1
0
06 Feb 2024
Point and Instruct: Enabling Precise Image Editing by Unifying Direct Manipulation and Text Instructions
Alec Helbling
Seongmin Lee
Polo Chau
40
0
0
05 Feb 2024
DisDet: Exploring Detectability of Backdoor Attack on Diffusion Models
Yang Sui
Huy Phan
Jinqi Xiao
Tian-Di Zhang
Zijie Tang
Cong Shi
Yan Wang
Yingying Chen
Bo Yuan
DiffM
AAML
72
13
0
05 Feb 2024
CNS-Edit: 3D Shape Editing via Coupled Neural Shape Optimization
Jingyu Hu
Ka-Hei Hui
Zhengzhe Liu
Hao Zhang
Chi-Wing Fu
71
4
0
04 Feb 2024
Image Fusion via Vision-Language Model
Zixiang Zhao
Lilun Deng
Haowen Bai
Yukun Cui
Zhipeng Zhang
...
Haotong Qin
Dongdong Chen
Jiangshe Zhang
Peng Wang
Luc Van Gool
VLM
108
27
0
03 Feb 2024
ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields
Jiahua Dong
Yu-Xiong Wang
136
39
0
01 Feb 2024
Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators
Daniel Geng
Andrew Owens
DiffM
83
34
0
31 Jan 2024
Advances in 3D Generation: A Survey
Xiaoyu Li
Qi Zhang
Di Kang
Weihao Cheng
Yiming Gao
Jingbo Zhang
Zhihao Liang
Jing Liao
Yan-Pei Cao
Ying Shan
160
43
0
31 Jan 2024
Image Anything: Towards Reasoning-coherent and Training-free Multi-modal Image Generation
Yuanhuiyi Lyu
Xueye Zheng
Lin Wang
DiffM
104
11
0
31 Jan 2024
BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation
Zhennan Wu
Yang Li
Han Yan
Taizhang Shang
Weixuan Sun
...
Ruikai Cui
Weizhe Liu
Hiroyuki Sato
Hongdong Li
Pan Ji
113
37
0
30 Jan 2024
InstructIR: High-Quality Image Restoration Following Human Instructions
Marcos V. Conde
Gregor Geigle
Radu Timofte
DiffM
127
58
0
29 Jan 2024
Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors
Shiyin Dong
Mingrui Zhu
Kun Cheng
Nannan Wang
Xinbo Gao
DiffM
42
3
0
29 Jan 2024
Object-Driven One-Shot Fine-tuning of Text-to-Image Diffusion with Prototypical Embedding
Jianxiang Lu
Cong Xie
Hui Guo
DiffM
188
5
0
28 Jan 2024
IntentTuner: An Interactive Framework for Integrating Human Intents in Fine-tuning Text-to-Image Generative Models
Xingchen Zeng
Ziyao Gao
Yilin Ye
Wei Zeng
48
13
0
28 Jan 2024
A Survey on Data Augmentation in Large Model Era
Yue Zhou
Chenlu Guo
Xu Wang
Yi-Ju Chang
Yuan Wu
LM&MA
VLM
128
27
0
27 Jan 2024
Annotated Hands for Generative Models
Yue Yang
Atith N Gandhi
Greg Turk
DiffM
GAN
33
3
0
26 Jan 2024
TIP-Editor: An Accurate 3D Editor Following Both Text-Prompts And Image-Prompts
Jingyu Zhuang
Di Kang
Yan-Pei Cao
Guanbin Li
Liang Lin
Ying Shan
DiffM
3DGS
128
42
0
26 Jan 2024
pix2gestalt: Amodal Segmentation by Synthesizing Wholes
Ege Ozguroglu
Ruoshi Liu
Dídac Surís
Dian Chen
Achal Dave
P. Tokmakov
Carl Vondrick
DiffM
VLM
131
34
0
25 Jan 2024
Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation
Minglin Chen
Weihao Yuan
Yukun Wang
Zhe Sheng
Yisheng He
Zilong Dong
Liefeng Bo
Yulan Guo
DiffM
90
4
0
25 Jan 2024
StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models
Mohan Zhou
Yalong Bai
Qing Yang
Tiejun Zhao
52
0
0
25 Jan 2024
CreativeSynth: Cross-Art-Attention for Artistic Image Synthesis with Multimodal Diffusion
Nisha Huang
Weiming Dong
Yuxin Zhang
Fan Tang
Ronghui Li
Chongyang Ma
Xiu Li
Tong-Yee Lee
Changsheng Xu
DiffM
90
7
0
25 Jan 2024
GraphiMind: LLM-centric Interface for Information Graphics Design
Qirui Huang
Min Lu
J. Lanir
Dani Lischinski
Daniel Cohen-Or
Hui Huang
MLLM
81
8
0
24 Jan 2024
GALA: Generating Animatable Layered Assets from a Single Scan
Taeksoo Kim
Byungjun Kim
Shunsuke Saito
Hanbyul Joo
3DH
92
13
0
23 Jan 2024
CCA: Collaborative Competitive Agents for Image Editing
Tiankai Hang
Shuyang Gu
Dong Chen
Xin Geng
Baining Guo
164
5
0
23 Jan 2024
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs
Ling Yang
Zhaochen Yu
Chenlin Meng
Minkai Xu
Stefano Ermon
Tengjiao Wang
CoGe
DiffM
125
137
0
22 Jan 2024
Text-to-Image Cross-Modal Generation: A Systematic Review
Maciej Żelaszczyk
Jacek Mańdziuk
100
4
0
21 Jan 2024
LLMRA: Multi-modal Large Language Model based Restoration Assistant
Xiaoyu Jin
Yuan Shi
Bin Xia
Wenming Yang
100
4
0
21 Jan 2024
Fast Registration of Photorealistic Avatars for VR Facial Animation
Chaitanya Patel
Shaojie Bai
Te-Li Wang
Jason M. Saragih
S. Wei
84
0
0
19 Jan 2024
Towards Language-Driven Video Inpainting via Multimodal Large Language Models
Jianzong Wu
Xiangtai Li
Chenyang Si
Shangchen Zhou
Jingkang Yang
...
Yining Li
Kai Chen
Yunhai Tong
Ziwei Liu
Chen Change Loy
VGen
DiffM
MLLM
121
17
0
18 Jan 2024
Supervised Fine-tuning in turn Improves Visual Foundation Models
Xiaohu Jiang
Yixiao Ge
Yuying Ge
Dachuan Shi
Chun Yuan
Ying Shan
VLM
CLIP
94
9
0
18 Jan 2024
Edit One for All: Interactive Batch Image Editing
Thao Nguyen
Utkarsh Ojha
Yuheng Li
Haotian Liu
Yong Jae Lee
DiffM
89
3
0
18 Jan 2024
BlenDA: Domain Adaptive Object Detection through diffusion-based blending
Tzuhsuan Huang
Chen-Che Huang
Chung-Hao Ku
Jun-Cheng Chen
95
5
0
18 Jan 2024
Image Translation as Diffusion Visual Programmers
Cheng Han
James Liang
Qifan Wang
Majid Rabbani
S. Dianat
Raghuveer M. Rao
Ying Nian Wu
Dongfang Liu
79
8
0
18 Jan 2024
Data Attribution for Diffusion Models: Timestep-induced Bias in Influence Estimation
Tong Xie
Haoyu Li
Andrew Bai
Cho-Jui Hsieh
TDI
102
4
0
17 Jan 2024