Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.09800
Cited By
v1
v2 (latest)
InstructPix2Pix: Learning to Follow Image Editing Instructions
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"InstructPix2Pix: Learning to Follow Image Editing Instructions"
50 / 1,418 papers shown
Title
Antagonising explanation and revealing bias directly through sequencing and multimodal inference
Luís Arandas
Mick Grierson
Miguel Carvalhais
PINN
DiffM
46
3
0
25 Aug 2023
GridPull: Towards Scalability in Learning Implicit Representations from 3D Point Clouds
Chao Chen
Yu-Shen Liu
Zhizhong Han
3DPC
75
12
0
25 Aug 2023
A Survey of Diffusion Based Image Generation Models: Issues and Their Solutions
Tianyi Zhang
Zheng Wang
Jin Huang
M. M. Tasnim
Wei Shi
VLM
83
22
0
25 Aug 2023
Instruction Position Matters in Sequence Generation with Large Language Models
Yanjun Liu
Xianfeng Zeng
Fandong Meng
Jie Zhou
LRM
107
9
0
23 Aug 2023
IT3D: Improved Text-to-3D Generation with Explicit View Synthesis
Yiwen Chen
Chi Zhang
Xiaofeng Yang
Zhongang Cai
Gang Yu
Lei Yang
Guo-Shing Lin
DiffM
98
64
0
22 Aug 2023
Learning a More Continuous Zero Level Set in Unsigned Distance Fields through Level Set Projection
Junsheng Zhou
Baorui Ma
Shujuan Li
Yu-Shen Liu
Zhizhong Han
111
38
0
22 Aug 2023
Instruction Tuning for Large Language Models: A Survey
Shengyu Zhang
Linfeng Dong
Xiaoya Li
Sen Zhang
Xiaofei Sun
...
Jiwei Li
Runyi Hu
Tianwei Zhang
Leilei Gan
Guoyin Wang
LM&MA
110
610
0
21 Aug 2023
FocalDreamer: Text-driven 3D Editing via Focal-fusion Assembly
Yuhan Li
Yishun Dou
Yue Shi
Yulai Lei
Xuanhong Chen
Yi Zhang
Peng Zhou
Bingbing Ni
3DGS
102
58
0
21 Aug 2023
ASPIRE: Language-Guided Data Augmentation for Improving Robustness Against Spurious Correlations
Sreyan Ghosh
Chandra Kiran Reddy Evuru
Sonal Kumar
Utkarsh Tyagi
Sakshi Singh
Sanjoy Chowdhury
Dinesh Manocha
OOD
61
1
0
19 Aug 2023
MeDM: Mediating Image Diffusion Models for Video-to-Video Translation with Temporal Correspondence Guidance
Ernie Chu
Tzu-Hua Huang
Shuohao Lin
Jun-Cheng Chen
DiffM
VGen
77
16
0
19 Aug 2023
User-centric AIGC products: Explainable Artificial Intelligence and AIGC products
Hanjie Yu
Yan Dong
Qiong Wu
30
5
0
19 Aug 2023
SimDA: Simple Diffusion Adapter for Efficient Video Generation
Zhen Xing
Qi Dai
Hang-Rui Hu
Zuxuan Wu
Yu-Gang Jiang
VGen
DiffM
103
84
0
18 Aug 2023
StableVideo: Text-driven Consistency-aware Diffusion Video Editing
Wenhao Chai
Xun Guo
Gaoang Wang
Yang Lu
VGen
DiffM
91
157
0
18 Aug 2023
O
2
^2
2
-Recon: Completing 3D Reconstruction of Occluded Objects in the Scene with a Pre-trained 2D Diffusion Model
Yubin Hu
Sheng Ye
Wang Zhao
Matthieu Lin
Yuze He
Yu-Hui Wen
Ying He
Yang Liu
DiffM
85
4
0
18 Aug 2023
Watch Your Steps: Local Image and Scene Editing by Text Instructions
Ashkan Mirzaei
Tristan Aumentado-Armstrong
Marcus A. Brubaker
J. Kelly
Alex Levinshtein
Konstantinos G. Derpanis
Igor Gilitschenski
DiffM
72
36
0
17 Aug 2023
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Ouyang Hao
Qiuyu Wang
Yuxi Xiao
Qingyan Bai
Juntao Zhang
Kecheng Zheng
Xiaowei Zhou
Qifeng Chen
Yujun Shen
DiffM
VGen
123
85
0
15 Aug 2023
Dancing Avatar: Pose and Text-Guided Human Motion Videos Synthesis with Image Diffusion Model
Bosheng Qin
Wentao Ye
Qifan Yu
Siliang Tang
Yueting Zhuang
DiffM
VGen
52
13
0
15 Aug 2023
A Survey on Model Compression for Large Language Models
Xunyu Zhu
Jian Li
Yong Liu
Can Ma
Weiping Wang
139
233
0
15 Aug 2023
SGDiff: A Style Guided Diffusion Model for Fashion Synthesis
Zheng Sun
Yanghong Zhou
Honghong He
P. Y. Mok
DiffM
89
30
0
15 Aug 2023
Masked-Attention Diffusion Guidance for Spatially Controlling Text-to-Image Generation
Yuki Endo
70
8
0
11 Aug 2023
PromptPaint: Steering Text-to-Image Generation Through Paint Medium-like Interactions
John Joon Young Chung
Eytan Adar
DiffM
74
66
0
09 Aug 2023
GEMRec: Towards Generative Model Recommendation
Yuanhe Guo
Haoming Liu
Hongyi Wen
DiffM
VLM
25
2
0
04 Aug 2023
ConceptLab: Creative Concept Generation using VLM-Guided Diffusion Prior Constraints
Elad Richardson
Kfir Goldberg
Yuval Alaluf
Daniel Cohen-Or
DiffM
95
12
0
03 Aug 2023
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation
Yasheng Sun
Yifan Yang
Houwen Peng
Yifei Shen
Yuqing Yang
Hang-Rui Hu
Lili Qiu
Hideki Koike
DiffM
LM&Ro
87
39
0
02 Aug 2023
Visual Instruction Inversion: Image Editing via Visual Prompting
Thao Nguyen
Yuheng Li
Utkarsh Ojha
Yong Jae Lee
DiffM
41
24
0
26 Jul 2023
Points-to-3D: Bridging the Gap between Sparse Points and Shape-Controllable Text-to-3D Generation
Chaohui Yu
Qiang-feng Zhou
Jingliang Li
Zhe Zhang
Zhibin Wang
Fan Wang
DiffM
89
41
0
26 Jul 2023
Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry
Yong-Hyun Park
Mingi Kwon
J. Choi
Junghyo Jo
Youngjung Uh
DiffM
126
72
0
24 Jul 2023
Interpolating between Images with Diffusion Models
Clinton Jia Wang
Polina Golland
DiffM
86
20
0
24 Jul 2023
Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning
Jiancang Ma
Junhao Liang
Chen Chen
H. Lu
108
150
0
21 Jul 2023
Text2Layer: Layered Image Generation using Latent Diffusion Model
Xinyang Zhang
Wentian Zhao
Xin Lu
J. Chien
DiffM
63
12
0
19 Jul 2023
Let's ViCE! Mimicking Human Cognitive Behavior in Image Generation Evaluation
Federico Betti
Jacopo Staiano
Lorenzo Baraldi
Lorenzo Baraldi
Rita Cucchiara
N. Sebe
EGVM
50
7
0
18 Jul 2023
Can Pre-Trained Text-to-Image Models Generate Visual Goals for Reinforcement Learning?
Jialu Gao
Kaizhe Hu
Guowei Xu
Huazhe Xu
LM&Ro
89
17
0
15 Jul 2023
Domain-Agnostic Tuning-Encoder for Fast Personalization of Text-To-Image Models
Moab Arar
Rinon Gal
Yuval Atzmon
Gal Chechik
Daniel Cohen-Or
Ariel Shamir
Amit H. Bermano
DiffM
92
76
0
13 Jul 2023
FreeDrag: Feature Dragging for Reliable Point-based Image Editing
Pengyang Ling
Lin Chen
Pan Zhang
H. Chen
Yi Jin
Jinjin Zheng
DiffM
108
16
0
10 Jul 2023
Text-Guided Synthesis of Eulerian Cinemagraphs
Aniruddha Mahapatra
Aliaksandr Siarohin
Hsin-Ying Lee
Sergey Tulyakov
Sitong Su
DiffM
VGen
94
21
0
06 Jul 2023
Human Inspired Progressive Alignment and Comparative Learning for Grounded Word Acquisition
Yuwei Bao
B. Lattimer
J. Chai
CLL
84
1
0
05 Jul 2023
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models
Chong Mou
Xintao Wang
Jie Song
Ying Shan
Jian Zhang
DiffM
125
154
0
05 Jul 2023
Collaborative Score Distillation for Consistent Visual Synthesis
Subin Kim
Kyungmin Lee
June Suk Choi
Jongheon Jeong
Kihyuk Sohn
Jinwoo Shin
DiffM
62
21
0
04 Jul 2023
LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance
Linoy Tsaban
Apolinário Passos
DiffM
78
42
0
02 Jul 2023
DreamIdentity: Improved Editability for Efficient Face-identity Preserved Image Generation
Zhuowei Chen
Shancheng Fang
Wei Liu
Qian He
Mengqi Huang
Yongdong Zhang
Zhendong Mao
DiffM
125
24
0
01 Jul 2023
DisCo: Disentangled Control for Realistic Human Dance Generation
Tan Wang
Linjie Li
Kevin Qinghong Lin
Yuanhao Zhai
Chung-Ching Lin
Zhengyuan Yang
Hanwang Zhang
Zicheng Liu
Lijuan Wang
VGen
141
89
0
30 Jun 2023
High Fidelity Image Counterfactuals with Probabilistic Causal Models
Fabio De Sousa Ribeiro
Tian Xia
M. Monteiro
Nick Pawlowski
Ben Glocker
DiffM
86
40
0
27 Jun 2023
Freestyle 3D-Aware Portrait Synthesis Based on Compositional Generative Priors
Tianxiang Ma
Kang Zhao
Jianxin Sun
Yingya Zhang
Jing Dong
71
1
0
27 Jun 2023
A-STAR: Test-time Attention Segregation and Retention for Text-to-image Synthesis
Aishwarya Agarwal
Srikrishna Karanam
K. J. Joseph
Apoorv Saxena
Koustava Goswami
Balaji Vasan Srinivasan
VLM
DiffM
39
51
0
26 Jun 2023
DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing
Yujun Shi
Chuhui Xue
Jun Hao Liew
Jiachun Pan
Hanshu Yan
Wenqing Zhang
Vincent Y. F. Tan
Song Bai
149
220
0
26 Jun 2023
Text-Anchored Score Composition: Tackling Condition Misalignment in Text-to-Image Diffusion Models
Luozhou Wang
Guibao Shen
Wenhang Ge
Guangyong Chen
Yijun Li
Yingke Chen
DiffM
78
4
0
26 Jun 2023
DreamEditor: Text-Driven 3D Scene Editing with Neural Fields
Jingyu Zhuang
Chen Wang
Lingjie Liu
Liang Lin
Guanbin Li
DiffM
78
130
0
23 Jun 2023
Blended-NeRF: Zero-Shot Object Generation and Blending in Existing Neural Radiance Fields
Ori Gordon
Omri Avrahami
Dani Lischinski
DiffM
80
22
0
22 Jun 2023
TauPETGen: Text-Conditional Tau PET Image Synthesis Based on Latent Diffusion Models
Se-In Jang
C. Lois
Emma G. Thibault
J. Becker
Yafei Dong
M. Normandin
Julie C Price
Keith A. Johnson
Xiaofeng Liu
Kuang Gong
DiffM
MedIm
29
10
0
21 Jun 2023
Instruct-NeuralTalker: Editing Audio-Driven Talking Radiance Fields with Instructions
Yuqi Sun
Reian He
Weimin Tan
Bo Yan
DiffM
63
2
0
19 Jun 2023
Previous
1
2
3
...
24
25
26
27
28
29
Next