Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.09800
Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"InstructPix2Pix: Learning to Follow Image Editing Instructions"
50 / 1,356 papers shown
Title
InstructGIE: Towards Generalizable Image Editing
Zichong Meng
Changdi Yang
Jun Liu
Hao Tang
Pu Zhao
Yanzhi Wang
DiffM
54
6
0
08 Mar 2024
Pix2Gif: Motion-Guided Diffusion for GIF Generation
Hitesh Kandala
Jianfeng Gao
Jianwei Yang
VGen
DiffM
38
4
0
07 Mar 2024
StableDrag: Stable Dragging for Point-based Image Editing
Yutao Cui
Xiaotong Zhao
Guozhen Zhang
Shengming Cao
Kai Ma
Limin Wang
44
11
0
07 Mar 2024
NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging
Takahiro Shirakawa
Seiichi Uchida
DiffM
40
15
0
06 Mar 2024
Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing
Bingyan Liu
Chengyu Wang
Tingfeng Cao
Kui Jia
Jun Huang
DiffM
48
53
0
06 Mar 2024
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
...
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
DiffM
147
1,103
0
05 Mar 2024
Doubly Abductive Counterfactual Inference for Text-based Image Editing
Xue Song
Jiequan Cui
Hanwang Zhang
Jingjing Chen
Richang Hong
Yu-Gang Jiang
DiffM
31
9
0
05 Mar 2024
RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
Priya Sundaresan
Q. Vuong
Jiayuan Gu
Peng Xu
Ted Xiao
...
Ajinkya Jain
Karol Hausman
Dorsa Sadigh
Jeannette Bohg
S. Schaal
VGen
39
26
0
05 Mar 2024
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Yuhao Xu
Tao Gu
Weifeng Chen
Chengcai Chen
DiffM
35
52
0
04 Mar 2024
Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks
Yuhao Liu
Zhanghan Ke
Fang Liu
Nanxuan Zhao
Rynson W. H. Lau
DiffM
48
19
0
01 Mar 2024
LoMOE: Localized Multi-Object Editing via Multi-Diffusion
Goirik Chakrabarty
Aditya Chandrasekar
Ramya Hebbalaguppe
AP Prathosh
DiffM
50
6
0
01 Mar 2024
Ask Your Distribution Shift if Pre-Training is Right for You
Benjamin Cohen-Wang
Joshua Vendrow
Aleksander Madry
OOD
39
3
0
29 Feb 2024
Large Language Models and Games: A Survey and Roadmap
Roberto Gallotta
Graham Todd
Marvin Zammit
Sam Earle
Antonios Liapis
Julian Togelius
Georgios N. Yannakakis
LLMAG
LM&MA
AI4CE
LRM
55
73
0
28 Feb 2024
From Summary to Action: Enhancing Large Language Models for Complex Tasks with Open World APIs
Yulong Liu
Yunlong Yuan
Chunwei Wang
Jianhua Han
Yongqiang Ma
Li Zhang
Nanning Zheng
Hang Xu
LLMAG
45
5
0
28 Feb 2024
Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis
Yanzuo Lu
Manlin Zhang
Andy J. Ma
Xiaohua Xie
Jian-Huang Lai
DiffM
27
22
0
28 Feb 2024
CustomSketching: Sketch Concept Extraction for Sketch-based Image Synthesis and Editing
Chufeng Xiao
Hongbo Fu
DiffM
43
3
0
27 Feb 2024
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
Liangliang Cao
EGVM
71
88
0
27 Feb 2024
Placing Objects in Context via Inpainting for Out-of-distribution Segmentation
Pau de Jorge
Riccardo Volpi
P. Dokania
Philip Torr
Grégory Rogez
DiffM
60
5
0
26 Feb 2024
Intelligent Director: An Automatic Framework for Dynamic Visual Composition using ChatGPT
Sixiao Zheng
Jingyang Huo
Yu Wang
Yanwei Fu
VGen
DiffM
44
1
0
24 Feb 2024
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Willi Menapace
Aliaksandr Siarohin
Ivan Skorokhodov
Ekaterina Deyneka
Tsai-Shien Chen
...
Yuwei Fang
A. Stoliar
Elisa Ricci
Jian Ren
Sergey Tulyakov
VGen
57
57
0
22 Feb 2024
Consolidating Attention Features for Multi-view Image Editing
Or Patashnik
Rinon Gal
Daniel Cohen-Or
Jun-Yan Zhu
Fernando de la Torre
37
5
0
22 Feb 2024
LLMBind: A Unified Modality-Task Integration Framework
Bin Zhu
Munan Ning
Peng Jin
Bin Lin
Jinfa Huang
...
Junwu Zhang
Zhenyu Tang
Mingjun Pan
Xing Zhou
Li-ming Yuan
MLLM
40
6
0
22 Feb 2024
Real-time 3D-aware Portrait Editing from a Single Image
Qingyan Bai
Zifan Shi
Yinghao Xu
Hao Ouyang
Qiuyu Wang
Ceyuan Yang
Xuan Wang
Gordon Wetzstein
Yujun Shen
Qifeng Chen
3DH
DiffM
72
9
0
21 Feb 2024
CoFRIDA: Self-Supervised Fine-Tuning for Human-Robot Co-Painting
Peter Schaldenbrand
Gaurav Parmar
Jun-Yan Zhu
James McCann
Jean Oh
37
13
0
21 Feb 2024
UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing
Jianhong Bai
Tianyu He
Yuchi Wang
Junliang Guo
Haoji Hu
Zuozhu Liu
Jiang Bian
VGen
33
26
0
20 Feb 2024
Robust-Wide: Robust Watermarking against Instruction-driven Image Editing
Runyi Hu
Jie Zhang
Ting Xu
Jiwei Li
Tianwei Zhang
DiffM
WIGM
51
7
0
20 Feb 2024
The Revolution of Multimodal Large Language Models: A Survey
Davide Caffagni
Federico Cocchi
Luca Barsellotti
Nicholas Moratelli
Sara Sarto
Lorenzo Baraldi
Lorenzo Baraldi
Marcella Cornia
Rita Cucchiara
LRM
VLM
66
43
0
19 Feb 2024
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation
Chong Zeng
Yue Dong
Pieter Peers
Youkang Kong
Hongzhi Wu
Xin Tong
44
29
0
19 Feb 2024
On Good Practices for Task-Specific Distillation of Large Pretrained Visual Models
Juliette Marrie
Michael Arbel
Julien Mairal
Diane Larlus
VLM
MQ
53
1
0
17 Feb 2024
Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review
Thang-Anh-Quan Nguyen
Amine Bourki
Mátyás Macudzinski
Anthony Brunel
M. Bennamoun
48
11
0
17 Feb 2024
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations
Tsung-Wei Ke
N. Gkanatsios
Katerina Fragkiadaki
VGen
39
109
0
16 Feb 2024
PEGASUS: Personalized Generative 3D Avatars with Composable Attributes
Hyunsoo Cha
Byungjun Kim
Hanbyul Joo
23
4
0
16 Feb 2024
Classification Diffusion Models: Revitalizing Density Ratio Estimation
Shahar Yadin
Noam Elata
T. Michaeli
DiffM
48
1
0
15 Feb 2024
Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion
Hila Manor
T. Michaeli
DiffM
37
25
0
15 Feb 2024
DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization
Jisu Nam
Heesu Kim
Dongjae Lee
Siyoon Jin
Seungryong Kim
Seunggyu Chang
DiffM
32
40
0
15 Feb 2024
Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency
Yannis Kalantidis
Mert Bulent Sariyildiz
Rafael Sampaio de Rezende
Philippe Weinzaepfel
Diane Larlus
G. Csurka
24
0
0
14 Feb 2024
Learning Continuous 3D Words for Text-to-Image Generation
Ta-Ying Cheng
Matheus Gadelha
Thibault Groueix
Matthew Fisher
R. Měch
Andrew Markham
Niki Trigoni
DiffM
43
13
0
13 Feb 2024
NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs
Michael Fischer
Zhengqin Li
Thu Nguyen-Phuoc
Aljaz Bozic
Zhao Dong
Carl S. Marshall
Tobias Ritschel
49
10
0
13 Feb 2024
Fine-Tuning Text-To-Image Diffusion Models for Class-Wise Spurious Feature Generation
AprilPyone Maungmaung
H. Nguyen
Hitoshi Kiya
Isao Echizen
28
6
0
13 Feb 2024
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting
Zhen Zhou
Fan Ma
Hehe Fan
Yi Yang
3DGS
40
24
0
09 Feb 2024
Animated Stickers: Bringing Stickers to Life with Video Diffusion
David Yan
Winnie Zhang
Luxin Zhang
Anmol Kalia
Dingkang Wang
...
Guan Pang
Ali K. Thabet
Peter Vajda
Amy Bearman
Licheng Yu
VGen
DiffM
67
2
0
08 Feb 2024
Real-World Robot Applications of Foundation Models: A Review
Kento Kawaharazuka
T. Matsushima
Andrew Gambardella
Jiaxian Guo
Chris Paxton
Andy Zeng
OffRL
VLM
LM&Ro
51
47
0
08 Feb 2024
Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models
Senmao Li
Joost van de Weijer
Taihang Hu
Fahad Shahbaz Khan
Qibin Hou
Yaxing Wang
Jian Yang
DiffM
55
27
0
08 Feb 2024
Counterfactual Image Editing
Yushu Pan
Elias Bareinboim
BDL
CML
35
5
0
07 Feb 2024
SPAD : Spatially Aware Multiview Diffusers
Yash Kant
Ziyi Wu
Michael Vasilkovsky
Guocheng Qian
Jian Ren
R. A. Guler
Guohao Li
Sergey Tulyakov
Igor Gilitschenski
Aliaksandr Siarohin
DiffM
31
36
0
07 Feb 2024
GenLens: A Systematic Evaluation of Visual GenAI Model Outputs
Tica Lin
Hanspeter Pfister
Jui-Hsien Wang
ELM
23
1
0
06 Feb 2024
Point and Instruct: Enabling Precise Image Editing by Unifying Direct Manipulation and Text Instructions
Alec Helbling
Seongmin Lee
Polo Chau
27
0
0
05 Feb 2024
DisDet: Exploring Detectability of Backdoor Attack on Diffusion Models
Yang Sui
Huy Phan
Jinqi Xiao
Tian-Di Zhang
Zijie Tang
Cong Shi
Yan Wang
Yingying Chen
Bo Yuan
DiffM
AAML
32
12
0
05 Feb 2024
CNS-Edit: 3D Shape Editing via Coupled Neural Shape Optimization
Jingyu Hu
Ka-Hei Hui
Zhengzhe Liu
Hao Zhang
Chi-Wing Fu
36
4
0
04 Feb 2024
Image Fusion via Vision-Language Model
Zixiang Zhao
Lilun Deng
Haowen Bai
Yukun Cui
Zhipeng Zhang
...
Haotong Qin
Dongdong Chen
Jiangshe Zhang
Peng Wang
Luc Van Gool
VLM
43
20
0
03 Feb 2024
Previous
1
2
3
...
16
17
18
...
26
27
28
Next