Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.09800
Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"InstructPix2Pix: Learning to Follow Image Editing Instructions"
50 / 1,348 papers shown
Title
Unforgettable Lessons from Forgettable Images: Intra-Class Memorability Matters in Computer Vision Tasks
Jie Jing
Qing Lin
Shuangpeng Han
Lucia Schiatti
Yen-Ling Kuo
Mengmi Zhang
VLM
23
0
0
30 Dec 2024
DPBridge: Latent Diffusion Bridge for Dense Prediction
Haorui Ji
Taojun Lin
Hongdong Li
DiffM
51
1
0
29 Dec 2024
Bridging Interpretability and Robustness Using LIME-Guided Model Refinement
Navid Nayyem
Abdullah Rakin
Longwei Wang
AAML
FAtt
63
1
0
25 Dec 2024
Forensics of Transpiled Quantum Circuits
Rupshali Roy
Archisman Ghosh
Swaroop Ghosh
56
0
0
25 Dec 2024
Fashionability-Enhancing Outfit Image Editing with Conditional Diffusion Models
Qice Qin
Yuki Hirakawa
Ryotaro Shimizu
Takuya Furusawa
Edgar Simo-Serra
DiffM
36
0
0
24 Dec 2024
Editing Implicit and Explicit Representations of Radiance Fields: A Survey
Arthur Hubert
Gamal Elghazaly
R. Frank
AI4CE
129
0
0
23 Dec 2024
DreamOmni: Unified Image Generation and Editing
Bin Xia
Yuechen Zhang
Jingyao Li
Chengyao Wang
Yitong Wang
Xinglong Wu
Bei Yu
Jiaya Jia
SyDa
MLLM
89
3
0
22 Dec 2024
Mapping the Mind of an Instruction-based Image Editing using SMILE
Zeinab Dehghani
Koorosh Aslansefat
Adil Khan
Adín Ramirez Rivera
Franky George
Muhammad Khalid
DiffM
80
0
0
20 Dec 2024
Reframing Image Difference Captioning with BLIP2IDC and Synthetic Augmentation
Gautier Evennou
Antoine Chaffin
Vivien Chappelier
Ewa Kijak
DiffM
68
0
0
20 Dec 2024
Diffusion-Based Conditional Image Editing through Optimized Inference with Guidance
Hyunsoo Lee
Minsoo Kang
Bohyung Han
74
0
0
20 Dec 2024
Dataset Augmentation by Mixing Visual Concepts
Abdullah Al Rahat
Hemanth Venkateswara
DiffM
76
0
0
19 Dec 2024
DragScene: Interactive 3D Scene Editing with Single-view Drag Instructions
Chenghao Gu
Zhenzhe Li
Zhengqi Zhang
Yunpeng Bai
Shuzhao Xie
Zhi Wang
DiffM
66
1
0
18 Dec 2024
Urban Air Temperature Prediction using Conditional Diffusion Models
Siyang Dai
Jun Liu
Ngai-Man Cheung
82
0
0
18 Dec 2024
Prompt Augmentation for Self-supervised Text-guided Image Manipulation
Rumeysa Bodur
Binod Bhattarai
Tae-Kyun Kim
DiffM
60
2
0
17 Dec 2024
Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning
Moritz Reuss
Jyothish Pari
Pulkit Agrawal
Rudolf Lioutikov
DiffM
MoE
76
5
0
17 Dec 2024
Towards a Training Free Approach for 3D Scene Editing
Vivek Madhavaram
Shivangana Rawat
Chaitanya Devaguptapu
Charu Sharma
Manohar Kaul
DiffM
67
0
0
17 Dec 2024
IDEA-Bench: How Far are Generative Models from Professional Designing?
C. Liang
Lianghua Huang
Jingwu Fang
Huanzhang Dou
Wei Wang
Zhi-Fan Wu
Yupeng Shi
Junge Zhang
Xin Zhao
Yu Liu
3DV
77
1
0
16 Dec 2024
LineArt: A Knowledge-guided Training-free High-quality Appearance Transfer for Design Drawing with Diffusion Model
Xi Wang
H. Li
Heng Fang
Yichen Peng
H. Xie
Xi Yang
Chuntao Li
DiffM
72
0
0
16 Dec 2024
InterDyn: Controllable Interactive Dynamics with Video Diffusion Models
Rick Akkerman
Haiwen Feng
M. Black
Dimitrios Tzionas
Victoria Fernandez-Abrevaya
VGen
AI4CE
105
3
0
16 Dec 2024
EditSplat: Multi-View Fusion and Attention-Guided Optimization for View-Consistent 3D Scene Editing with 3D Gaussian Splatting
Dong In Lee
Hyeongcheol Park
Jiyoung Seo
Eunbyung Park
Hyunje Park
Ha Dam Baek
Shin Sangheon
Sangmin kim
Sangpil Kim
3DGS
102
1
0
16 Dec 2024
ColorFlow: Retrieval-Augmented Image Sequence Colorization
Junhao Zhuang
Xuan Ju
Z. Zhang
Yong-Jin Liu
Shiyi Zhang
Chun Yuan
Ying Shan
DiffM
99
1
0
16 Dec 2024
Dual-Schedule Inversion: Training- and Tuning-Free Inversion for Real Image Editing
Jiancheng Huang
Yi Huang
Jianzhuang Liu
Donghao Zhou
Y. Liu
Shifeng Chen
DiffM
92
0
0
15 Dec 2024
VinTAGe: Joint Video and Text Conditioning for Holistic Audio Generation
Saksham Singh Kushwaha
Yapeng Tian
DiffM
VGen
82
2
0
14 Dec 2024
EVLM: Self-Reflective Multimodal Reasoning for Cross-Dimensional Visual Editing
Umar Khalid
Hasan Iqbal
Azib Farooq
Nazanin Rahnavard
Jing Hua
...
H. Iqbal
Azib Farooq
Nazanin Rahnavard
Jing Hua
Chen Chen
72
0
0
13 Dec 2024
SuperMark: Robust and Training-free Image Watermarking via Diffusion-based Super-Resolution
Runyi Hu
J. Zhang
Yiming Li
Jiwei Li
Qing-Wu Guo
Han Qiu
Tianwei Zhang
WIGM
AAML
82
1
0
13 Dec 2024
OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs
Yuanzhi Zhu
R. Wang
Shilin Lu
Junnan Li
Hanshu Yan
K. Zhang
SupR
79
3
0
12 Dec 2024
Olympus: A Universal Task Router for Computer Vision Tasks
Yuanze Lin
Yunsheng Li
Dongdong Chen
Weijian Xu
Ronald Clark
Philip H. S. Torr
VLM
ObjD
182
0
0
12 Dec 2024
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics
Xi Chen
Zhifei Zhang
He Zhang
Yuqian Zhou
S. Kim
...
Nanxuan Zhao
Yilin Wang
Hui Ding
Zhe Lin
Hengshuang Zhao
VGen
DiffM
123
21
0
10 Dec 2024
FireFlow: Fast Inversion of Rectified Flow for Image Semantic Editing
Yingying Deng
Xiangyu He
Changwang Mei
Peisong Wang
Fan Tang
78
7
0
10 Dec 2024
PrEditor3D: Fast and Precise 3D Shape Editing
Ziya Erkoç
Can Gümeli
Chaoyang Wang
Matthias Nießner
Angela Dai
Peter Wonka
Hsin-Ying Lee
Peiye Zhuang
71
2
0
09 Dec 2024
HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing
Jinbin Bai
Wei Chow
L. Yang
Xiangtai Li
Juncheng Billy Li
H. Zhang
Shuicheng Yan
101
3
0
05 Dec 2024
DIVE: Taming DINO for Subject-Driven Video Editing
Yi Huang
Wei Xiong
He Zhang
Chaoqi Chen
Jianzhuang Liu
Mingfu Yan
Shifeng Chen
VGen
DiffM
76
0
0
04 Dec 2024
Composed Image Retrieval for Training-Free Domain Conversion
Nikos Efthymiadis
Bill Psomas
Zakaria Laskar
Konstantinos Karantzalos
Yannis Avrithis
Ondřej Chum
Giorgos Tolias
65
0
0
04 Dec 2024
DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Q. He
Jinlong Peng
P. Xu
Boyuan Jiang
Xiaobin Hu
...
Y. Liu
Y. Wang
Chengjie Wang
X. Li
J. Zhang
DiffM
122
1
0
04 Dec 2024
Sharp-It: A Multi-view to Multi-view Diffusion Model for 3D Synthesis and Manipulation
Yiftach Edelstein
Or Patashnik
Dana Cohen-Bar
Lihi Zelnik-Manor
76
0
0
03 Dec 2024
GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing
Khawar Islam
M. Zaheer
Arif Mahmood
Karthik Nandakumar
Naveed Akhtar
DiffM
80
2
0
03 Dec 2024
RandAR: Decoder-only Autoregressive Visual Generation in Random Orders
Ziqi Pang
Tianyuan Zhang
Fujun Luan
Yunze Man
Hao Tan
Kai Zhang
William T. Freeman
Yu-Xiong Wang
VGen
71
13
0
02 Dec 2024
CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D Diffusion
Kai He
Chin-Hsuan Wu
Igor Gilitschenski
DiffM
3DGS
70
0
0
02 Dec 2024
3DSceneEditor: Controllable 3D Scene Editing with Gaussian Splatting
Ziyang Yan
Lei Li
Yihua Shao
Siyu Chen
Wuzong Kai
Jenq-Neng Hwang
Hao Zhao
Fabio Remondino
3DGS
77
3
0
02 Dec 2024
PainterNet: Adaptive Image Inpainting with Actual-Token Attention and Diverse Mask Control
Ruichen Wang
Junliang Zhang
Qingsong Xie
Chen Chen
H. Lu
DiffM
90
1
0
02 Dec 2024
InstantSwap: Fast Customized Concept Swapping across Sharp Shape Differences
Chenyang Zhu
Kai Li
Yue Ma
Longxiang Tang
Chengyu Fang
Chubin Chen
Qifeng Chen
Xiu Li
87
9
0
02 Dec 2024
Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation
Bolin Lai
F. Xu
Miao Liu
Xiaoliang Dai
Nikhil Mehta
...
Zeyi Huang
James M. Rehg
Sangmin Lee
Ning Zhang
Tong Xiao
73
2
0
02 Dec 2024
OmniGuard: Hybrid Manipulation Localization via Augmented Versatile Deep Image Watermarking
X. Zhang
Zecheng Tang
Zhipei Xu
Runyi Li
Youmin Xu
Bin Chen
Feng Gao
Jian Andrew Zhang
WIGM
93
4
0
02 Dec 2024
Lightweight Contenders: Navigating Semi-Supervised Text Mining through Peer Collaboration and Self Transcendence
Qianren Mao
Weifeng Jiang
J. Liu
Chenghua Lin
Qian Li
Xianqing Wen
Jianxin Li
Jinhu Lu
67
0
0
01 Dec 2024
DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses
Yatian Pang
Bin Zhu
Bin Lin
Mingzhe Zheng
Francis E. H. Tay
Ser-Nam Lim
Harry Yang
Li Yuan
VGen
3DH
79
3
0
30 Nov 2024
Uniform Attention Maps: Boosting Image Fidelity in Reconstruction and Editing
Wenyi Mo
Tianyu Zhang
Yalong Bai
Bing-Huang Su
Ji-Rong Wen
DiffM
73
0
0
29 Nov 2024
LoRA of Change: Learning to Generate LoRA for the Editing Instruction from A Single Before-After Image Pair
Xue Song
Jiequan Cui
H. Zhang
Jiaxin Shi
Jingjing Chen
Chi Zhang
Yu-Gang Jiang
94
0
0
28 Nov 2024
3D-WAG: Hierarchical Wavelet-Guided Autoregressive Generation for High-Fidelity 3D Shapes
Tejaswini Medi
Arianna Rampini
Pradyumna Reddy
P. Jayaraman
M. Keuper
DiffM
84
0
0
28 Nov 2024
Steering Rectified Flow Models in the Vector Field for Controlled Image Generation
Maitreya Patel
Song Wen
Dimitris N. Metaxas
Yezhou Yang
DiffM
114
3
0
27 Nov 2024
Diffusion Self-Distillation for Zero-Shot Customized Image Generation
Shengqu Cai
Eric Ryan Chan
Yunzhi Zhang
Leonidas J. Guibas
Jiajun Wu
Gordon Wetzstein
75
8
0
27 Nov 2024
Previous
1
2
3
...
5
6
7
...
25
26
27
Next