arXiv: 2211.09800
InstructPix2Pix: Learning to Follow Image Editing Instructions
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"
50 / 1,355 papers shown
VA3: Virtually Assured Amplification Attack on Probabilistic Copyright Protection for Text-to-Image Generative Models
Xiang Li
Qianli Shen
Kenji Kawaguchi
27
4
0
29 Nov 2023
SmoothVideo: Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning
Liang Peng
Haoran Cheng
Zheng Yang
Ruisi Zhao
Linxuan Xia
Chaotian Song
Qinglin Lu
Boxi Wu
Wei Liu
VGen
31
2
0
29 Nov 2023
MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing
Haoyu Zhao
Tianyi Lu
Jiaxi Gu
Xing Zhang
Qingping Zheng
Zuxuan Wu
Hang Xu
Yu-Gang Jiang
VGen
DiffM
38
11
0
29 Nov 2023
Unlocking Spatial Comprehension in Text-to-Image Diffusion Models
Mohammad Mahdi Derakhshani
Menglin Xia
Harkirat Singh Behl
Cees G. M. Snoek
Victor Rühle
37
2
0
28 Nov 2023
UniIR: Training and Benchmarking Universal Multimodal Information Retrievers
Cong Wei
Yang Chen
Haonan Chen
Hexiang Hu
Ge Zhang
Jie Fu
Alan Ritter
Wenhu Chen
47
53
0
28 Nov 2023
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
Yutong Feng
Biao Gong
Di Chen
Yujun Shen
Yu Liu
Jingren Zhou
DiffM
39
43
0
28 Nov 2023
COLE: A Hierarchical Generation Framework for Multi-Layered and Editable Graphic Design
Peidong Jia
Chenxuan Li
Yuhui Yuan
Zeyu Liu
Yichao Shen
...
Dong Chen
Ji Li
Xiaodong Xie
Shanghang Zhang
Baining Guo
30
6
0
28 Nov 2023
Reason out Your Layout: Evoking the Layout Master from Large Language Models for Text-to-Image Synthesis
Xiaohui Chen
Yongfei Liu
Yingxiang Yang
Jianbo Yuan
Quanzeng You
Liping Liu
Hongxia Yang
DiffM
56
11
0
28 Nov 2023
LEDITS++: Limitless Image Editing using Text-to-Image Models
Manuel Brack
Felix Friedrich
Katharina Kornmeier
Linoy Tsaban
P. Schramowski
Kristian Kersting
Apolinário Passos
DiffM
40
69
0
28 Nov 2023
ROSO: Improving Robotic Policy Inference via Synthetic Observations
Yusuke Miyashita
Dimitris Gahtidis
Colin La
Jeremy Rabinowicz
Juxi Leitner
37
1
0
28 Nov 2023
MotionZero: Exploiting Motion Priors for Zero-shot Text-to-Video Generation
Jingkuan Song
Litao Guo
Lianli Gao
Hengtao Shen
Jingkuan Song
VGen
34
4
0
28 Nov 2023
Text-Driven Image Editing via Learnable Regions
Yuanze Lin
Yi-Wen Chen
Yi-Hsuan Tsai
Lu Jiang
Ming-Hsuan Yang
DiffM
39
16
0
28 Nov 2023
CLiC: Concept Learning in Context
Mehdi Safaee
Aryan Mikaeili
Or Patashnik
Daniel Cohen-Or
Ali Mahdavi-Amiri
34
11
0
28 Nov 2023
Self-correcting LLM-controlled Diffusion Models
Tsung-Han Wu
Long Lian
Joseph E. Gonzalez
Boyi Li
Trevor Darrell
79
53
0
27 Nov 2023
GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions
Jiemin Fang
Junjie Wang
Xiaopeng Zhang
Lingxi Xie
Qi Tian
3DGS
DiffM
53
109
0
27 Nov 2023
LLMGA: Multimodal Large Language Model based Generation Assistant
Bin Xia
Shiyin Wang
Yingfan Tao
Yitong Wang
Jiaya Jia
MLLM
43
12
0
27 Nov 2023
Instruct2Attack: Language-Guided Semantic Adversarial Attacks
Jiang-Long Liu
Chen Wei
Yuxiang Guo
Heng Yu
Alan Yuille
S. Feizi
Chun Pong Lau
Rama Chellappa
DiffM
AAML
31
5
0
27 Nov 2023
DreamCreature: Crafting Photorealistic Virtual Creatures from Imagination
KamWoh Ng
Xiatian Zhu
Yi-Zhe Song
Tao Xiang
DiffM
20
6
0
27 Nov 2023
Z*: Zero-shot Style Transfer via Attention Rearrangement
Yingying Deng
Xiangyu He
Fan Tang
Weiming Dong
DiffM
36
7
0
25 Nov 2023
GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting
Yiwen Chen
Zilong Chen
Chi Zhang
Feng Wang
Xiaofeng Yang
Yikai Wang
Zhongang Cai
Lei Yang
Huaping Liu
Guosheng Lin
3DGS
108
187
0
24 Nov 2023
Highly Detailed and Temporal Consistent Video Stylization via Synchronized Multi-Frame Diffusion
M. Xie
Hanyuan Liu
Chengze Li
Tien-Tsin Wong
VGen
DiffM
38
0
0
24 Nov 2023
DemoFusion: Democratising High-Resolution Image Generation With No $$$
Ruoyi Du
Dongliang Chang
Timothy M. Hospedales
Yi-Zhe Song
Zhanyu Ma
41
49
0
24 Nov 2023
Image Super-Resolution with Text Prompt Diffusion
Zheng Chen
Yulun Zhang
Jinjin Gu
Xin Yuan
Linghe Kong
Guihai Chen
Xiaokang Yang
DiffM
40
19
0
24 Nov 2023
Posterior Distillation Sampling
Juil Koo
Chanho Park
Minhyuk Sung
DiffM
36
27
0
23 Nov 2023
A Somewhat Robust Image Watermark against Diffusion-based Editing Models
Mingtian Tan
Tianhao Wang
Somesh Jha
WIGM
34
3
0
22 Nov 2023
GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning
Jiaxi Lv
Yi Huang
Mingfu Yan
Jiancheng Huang
Jianzhuang Liu
Yifan Liu
Yafei Wen
Xiaoxin Chen
Shifeng Chen
VGen
DiffM
32
23
0
21 Nov 2023
Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models
Rohit Gandikota
Joanna Materzyńska
Tingrui Zhou
Antonio Torralba
David Bau
DiffM
46
62
0
20 Nov 2023
FrePolad: Frequency-Rectified Point Latent Diffusion for Point Cloud Generation
Chenliang Zhou
Fangcheng Zhong
Param Hanji
Zhilin Guo
Kyle Fogarty
Alejandro Sztrajman
Hongyun Gao
Cengiz Öztireli
23
3
0
20 Nov 2023
EditShield: Protecting Unauthorized Image Editing by Instruction-guided Diffusion Models
Ruoxi Chen
Haibo Jin
Yixin Liu
Jinyin Chen
Haohan Wang
Lichao Sun
34
10
0
19 Nov 2023
Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning
Rohit Girdhar
Mannat Singh
Andrew Brown
Quentin Duval
S. Azadi
Sai Saketh Rambhatla
Akbar Shah
Xi Yin
Devi Parikh
Ishan Misra
DiffM
VGen
61
190
0
17 Nov 2023
SelfEval: Leveraging the discriminative nature of generative models for evaluation
Sai Saketh Rambhatla
Ishan Misra
EGVM
38
4
0
17 Nov 2023
Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression
Animesh Sinha
Bo Sun
Anmol Kalia
Arantxa Casanova
Elliot Blanchard
...
Ankit Ramchandani
Maziar Sanjabi
Sonal Gupta
Amy Bearman
Dhruv Mahajan
DiffM
36
4
0
17 Nov 2023
Emu Edit: Precise Image Editing via Recognition and Generation Tasks
Shelly Sheynin
Adam Polyak
Uriel Singer
Yuval Kirstain
Amit Zohar
Oron Ashual
Devi Parikh
Yaniv Taigman
27
131
0
16 Nov 2023
3D Paintbrush: Local Stylization of 3D Shapes with Cascaded Score Distillation
Dale Decatur
Itai Lang
Kfir Aberman
Rana Hanocka
43
16
0
16 Nov 2023
UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs
Yanwu Xu
Yang Zhao
Zhisheng Xiao
Tingbo Hou
137
107
0
14 Nov 2023
Unlock the Power: Competitive Distillation for Multi-Modal Large Language Models
Xinwei Li
Li Lin
Shuai Wang
Chen Qian
25
3
0
14 Nov 2023
FIRST: A Million-Entry Dataset for Text-Driven Fashion Synthesis and Design
Zhen Huang
Yihao Li
Dong Pei
Jiapeng Zhou
Xuliang Ning
Jianlin Han
Xiaoguang Han
Xuejun Chen
53
3
0
13 Nov 2023
MonoDiffusion: Self-Supervised Monocular Depth Estimation Using Diffusion Model
Shuwei Shao
Zhongcai Pei
Weihai Chen
Dingchi Sun
Peter C. Y. Chen
Zhengguo Li
MDE
DiffM
29
7
0
13 Nov 2023
IMPUS: Image Morphing with Perceptually-Uniform Sampling Using Diffusion Models
Zhaoyuan Yang
Zhengyang Yu
Zhiwei Xu
Jaskirat Singh
Jing Zhang
Dylan Campbell
Peter Tu
Richard Hartley
30
11
0
12 Nov 2023
Finetuning Text-to-Image Diffusion Models for Fairness
Xudong Shen
Chao Du
Tianyu Pang
Min Lin
Yongkang Wong
Mohan S. Kankanhalli
31
50
0
11 Nov 2023
3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with 2D Diffusion Models
Haibo Yang
Yang Chen
Yingwei Pan
Ting Yao
Zhineng Chen
Tao Mei
37
19
0
09 Nov 2023
ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors
Jingwen Chen
Yingwei Pan
Ting Yao
Tao Mei
DiffM
39
38
0
09 Nov 2023
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Shilong Liu
Hao Cheng
Haotian Liu
Hao Zhang
Feng Li
...
Hang Su
Jun Zhu
Lei Zhang
Jianfeng Gao
Chun-yue Li
MLLM
VLM
56
106
0
09 Nov 2023
ConRad: Image Constrained Radiance Fields for 3D Generation from a Single Image
Senthil Purushwalkam
Nikhil Naik
42
5
0
09 Nov 2023
A Data Perspective on Enhanced Identity Preservation for Diffusion Personalization
Xingzhe He
Zhiwen Cao
Nicholas I. Kolkin
Lantao Yu
Kun Wan
Helge Rhodin
Ratheesh Kalarot
45
13
0
07 Nov 2023
Cross-Image Attention for Zero-Shot Appearance Transfer
Yuval Alaluf
Daniel Garibi
Or Patashnik
Hadar Averbuch-Elor
Daniel Cohen-Or
DiffM
43
70
0
06 Nov 2023
AnyText: Multilingual Visual Text Generation And Editing
Yuxiang Tuo
Wangmeng Xiang
Jun-Yan He
Yifeng Geng
Xuansong Xie
DiffM
38
76
0
06 Nov 2023
InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image
Jianhui Li
Shilong Liu
Zidong Liu
Yikai Wang
Kaiwen Zheng
Jinghui Xu
Jianmin Li
Jun Zhu
47
10
0
06 Nov 2023
LLaVA-Interactive: An All-in-One Demo for Image Chat, Segmentation, Generation and Editing
Wei-Ge Chen
Irina Spiridonova
Jianwei Yang
Jianfeng Gao
Chun-yue Li
MLLM
VLM
15
34
0
01 Nov 2023
Consistent Video-to-Video Transfer Using Synthetic Dataset
Jiaxin Cheng
Tianjun Xiao
Tong He
VGen
DiffM
44
14
0
01 Nov 2023