ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.09800
  4. Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions
v1v2 (latest)

InstructPix2Pix: Learning to Follow Image Editing Instructions

17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM
ArXiv (abs)PDFHTML

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,418 papers shown
Title
Generating Daylight-driven Architectural Design via Diffusion Models
Generating Daylight-driven Architectural Design via Diffusion Models
Pengzhi Li
Baijuan Li
AI4CEDiffM
70
12
0
20 Apr 2024
FilterPrompt: Guiding Image Transfer in Diffusion Models
FilterPrompt: Guiding Image Transfer in Diffusion Models
Xi Wang
Yichen Peng
Heng Fang
Haoran Xie
Xi Yang
Chuntao Li
DiffM
82
0
0
20 Apr 2024
Physical Backdoor Attack can Jeopardize Driving with
  Vision-Large-Language Models
Physical Backdoor Attack can Jeopardize Driving with Vision-Large-Language Models
Zhenyang Ni
Rui Ye
Yuxian Wei
Zhen Xiang
Yanfeng Wang
Siheng Chen
AAML
98
13
0
19 Apr 2024
Customizing Text-to-Image Diffusion with Camera Viewpoint Control
Customizing Text-to-Image Diffusion with Camera Viewpoint Control
Nupur Kumari
Grace Su
Richard Zhang
Taesung Park
Eli Shechtman
Jun-Yan Zhu
DiffM
92
5
0
18 Apr 2024
Curriculum Point Prompting for Weakly-Supervised Referring Image
  Segmentation
Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation
Qiyuan Dai
Sibei Yang
91
9
0
18 Apr 2024
Factorized Diffusion: Perceptual Illusions by Noise Decomposition
Factorized Diffusion: Perceptual Illusions by Noise Decomposition
Daniel Geng
Inbum Park
Andrew Owens
DiffM
150
16
0
17 Apr 2024
Improving Composed Image Retrieval via Contrastive Learning with Scaling
  Positives and Negatives
Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives
Zhangchi Feng
Richong Zhang
Zhijie Nie
148
10
0
17 Apr 2024
TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based
  Image Editing
TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing
Sherry X. Chen
Yaron Vaxman
Elad Ben Baruch
David Asulin
Aviad Moreshet
Kuo-Chin Lien
Misha Sra
Pradeep Sen
65
4
0
17 Apr 2024
HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing
HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing
Mude Hui
Siwei Yang
Bingchen Zhao
Yichun Shi
Heng Wang
Peng Wang
Yuyin Zhou
Cihang Xie
84
73
0
15 Apr 2024
MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion
  Models
MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models
Nithin Gopalakrishnan Nair
Jeya Maria Jose Valanarasu
Vishal M. Patel
MoMe
88
7
0
15 Apr 2024
Conditional Prototype Rectification Prompt Learning
Conditional Prototype Rectification Prompt Learning
Haoxing Chen
Yaohui Li
Zizheng Huang
Yan Hong
Zhuoer Xu
Zhangxuan Gu
Jun Lan
Huijia Zhu
Weiqiang Wang
VLM
98
3
0
15 Apr 2024
In-Context Translation: Towards Unifying Image Recognition, Processing,
  and Generation
In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation
Han Xue
Qianru Sun
Li Song
Wenjun Zhang
Zhiwu Huang
MLLM
72
0
0
15 Apr 2024
Magic Clothing: Controllable Garment-Driven Image Synthesis
Magic Clothing: Controllable Garment-Driven Image Synthesis
Weifeng Chen
Tao Gu
Yuhao Xu
Chengcai Chen
96
18
0
15 Apr 2024
PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI
PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI
Yandan Yang
Baoxiong Jia
Peiyuan Zhi
Siyuan Huang
LM&RoVGen
139
47
0
15 Apr 2024
S3Editor: A Sparse Semantic-Disentangled Self-Training Framework for
  Face Video Editing
S3Editor: A Sparse Semantic-Disentangled Self-Training Framework for Face Video Editing
Guangzhi Wang
Tianyi Chen
Kamran Ghasedi
HsiangTao Wu
Tianyu Ding
Chris Nuesmeyer
Ilya Zharkov
Mohan Kankanhalli
Luming Liang
75
1
0
11 Apr 2024
OpenBias: Open-set Bias Detection in Text-to-Image Generative Models
OpenBias: Open-set Bias Detection in Text-to-Image Generative Models
Moreno DÍncà
E. Peruzzo
Massimiliano Mancini
Dejia Xu
Vidit Goel
Xingqian Xu
Zhangyang Wang
Humphrey Shi
N. Sebe
117
37
0
11 Apr 2024
ControlNet++: Improving Conditional Controls with Efficient Consistency
  Feedback
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
Ming Li
Taojiannan Yang
Huafeng Kuang
Jie Wu
Zhaoning Wang
Xuefeng Xiao
Chong Chen
86
82
0
11 Apr 2024
Accelerating Inference in Large Language Models with a Unified Layer
  Skipping Strategy
Accelerating Inference in Large Language Models with a Unified Layer Skipping Strategy
Yijin Liu
Fandong Meng
Jie Zhou
AI4CE
81
9
0
10 Apr 2024
UDiFF: Generating Conditional Unsigned Distance Fields with Optimal
  Wavelet Diffusion
UDiFF: Generating Conditional Unsigned Distance Fields with Optimal Wavelet Diffusion
Junsheng Zhou
Weiqi Zhang
Baorui Ma
Kanle Shi
Yu-Shen Liu
Zhizhong Han
121
19
0
10 Apr 2024
Tuning-Free Adaptive Style Incorporation for Structure-Consistent
  Text-Driven Style Transfer
Tuning-Free Adaptive Style Incorporation for Structure-Consistent Text-Driven Style Transfer
Yanqi Ge
Jiaqi Liu
Qingnan Fan
Xi Jiang
Ye Huang
Shuai Qin
Hong Gu
Wen Li
Lixin Duan
DiffM
100
1
0
10 Apr 2024
RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion
RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion
Jaidev Shriram
Alex Trevithick
Lingjie Liu
Ravi Ramamoorthi
DiffM3DGS
161
59
0
10 Apr 2024
GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis
GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis
Srikumar Sastry
Subash Khanal
Aayush Dhakal
Nathan Jacobs
93
9
0
09 Apr 2024
SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions
SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions
Xiaoyu Liu
Yuxiang Wei
Ming-Yu Liu
Xianhui Lin
Peiran Ren
Xuansong Xie
Wangmeng Zuo
DiffM
88
6
0
09 Apr 2024
ZeST: Zero-Shot Material Transfer from a Single Image
ZeST: Zero-Shot Material Transfer from a Single Image
Ta-Ying Cheng
Prafull Sharma
Andrew Markham
Niki Trigoni
Varun Jampani
78
11
0
09 Apr 2024
DreamView: Injecting View-specific Text Guidance into Text-to-3D
  Generation
DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation
Junkai Yan
Yipeng Gao
Q. Yang
Xihan Wei
Xuansong Xie
Ancong Wu
Wei-Shi Zheng
78
2
0
09 Apr 2024
Tackling Structural Hallucination in Image Translation with Local
  Diffusion
Tackling Structural Hallucination in Image Translation with Local Diffusion
Seunghoi Kim
Chen Jin
Tom Diethe
Matteo Figini
Henry F. J. Tregidgo
A. Mullokandov
Philip Teare
Daniel C. Alexander
MedIm
94
7
0
09 Apr 2024
UniFL: Improve Stable Diffusion via Unified Feedback Learning
UniFL: Improve Stable Diffusion via Unified Feedback Learning
Jiacheng Zhang
Jie Wu
Yuxi Ren
Xin Xia
Huafeng Kuang
...
Jiashi Li
Xuefeng Xiao
Min Zheng
Lean Fu
Guanbin Li
93
2
0
08 Apr 2024
Responsible Visual Editing
Responsible Visual Editing
Minheng Ni
Yeli Shen
Lei Zhang
W. Zuo
DiffM
42
0
0
08 Apr 2024
Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask
  Prompt
Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt
Zhiqi Huang
Hui Xiong
Haoyu Wang
Longguang Wang
Zhiheng Li
DiffM
63
0
0
08 Apr 2024
DATENeRF: Depth-Aware Text-based Editing of NeRFs
DATENeRF: Depth-Aware Text-based Editing of NeRFs
Sara Rojas
Julien Philip
Kai Zhang
Sai Bi
Fujun Luan
Guohao Li
Kalyan Sunkavalli
DiffM
70
3
0
06 Apr 2024
Aligning Diffusion Models by Optimizing Human Utility
Aligning Diffusion Models by Optimizing Human Utility
Shufan Li
Konstantinos Kallidromitis
Akash Gokul
Yusuke Kato
Kazuki Kozuka
157
34
0
06 Apr 2024
Idea-2-3D: Collaborative LMM Agents Enable 3D Model Generation from
  Interleaved Multimodal Inputs
Idea-2-3D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
Junhao Chen
Xiang Li
Xiaojun Ye
Chao Li
Zhaoxin Fan
Hao Zhao
VGen3DV
247
5
0
05 Apr 2024
InstructHumans: Editing Animated 3D Human Textures with Instructions
InstructHumans: Editing Animated 3D Human Textures with Instructions
Jiayin Zhu
Linlin Yang
Angela Yao
DiffM
78
1
0
05 Apr 2024
DiffuseMix: Label-Preserving Data Augmentation with Diffusion Models
DiffuseMix: Label-Preserving Data Augmentation with Diffusion Models
Khawar Islam
Muhammad Zaigham Zaheer
Arif Mahmood
Karthik Nandakumar
DiffM
75
41
0
05 Apr 2024
Gen3DSR: Generalizable 3D Scene Reconstruction via Divide and Conquer from a Single View
Gen3DSR: Generalizable 3D Scene Reconstruction via Divide and Conquer from a Single View
Andreea Dogaru
M. Ozer
Bernhard Egger
3DGS
139
7
0
04 Apr 2024
Many-to-many Image Generation with Auto-regressive Diffusion Models
Many-to-many Image Generation with Auto-regressive Diffusion Models
Ying Shen
Yizhe Zhang
Shuangfei Zhai
Lifu Huang
J. Susskind
Jiatao Gu
122
6
0
03 Apr 2024
MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image
  Generation
MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation
Petru-Daniel Tudosiu
Yongxin Yang
Shifeng Zhang
Fei Chen
Jingyu Sun
Gerasimos Lampouras
Ignacio Iacobacci
Sarah Parisot
95
12
0
03 Apr 2024
GenN2N: Generative NeRF2NeRF Translation
GenN2N: Generative NeRF2NeRF Translation
Xiangyue Liu
Han Xue
Kunming Luo
Ping Tan
Li Yi
62
5
0
03 Apr 2024
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image
  Generation
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation
Haofan Wang
Matteo Spinelli
Qixun Wang
Xu Bai
Zekui Qin
Anthony Chen
DiffM
128
97
0
03 Apr 2024
Freditor: High-Fidelity and Transferable NeRF Editing by Frequency
  Decomposition
Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition
Yisheng He
Weihao Yuan
Siyu Zhu
Zilong Dong
Liefeng Bo
Qixing Huang
82
3
0
03 Apr 2024
GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from
  a Single Image
GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image
Chong Bao
Yinda Zhang
Yuan Li
Xiyu Zhang
Bangbang Yang
Hujun Bao
Marc Pollefeys
Guofeng Zhang
Zhaopeng Cui
DiffM
92
5
0
02 Apr 2024
PREGO: online mistake detection in PRocedural EGOcentric videos
PREGO: online mistake detection in PRocedural EGOcentric videos
Alessandro Flaborea
Guido Maria DÁmely di Melendugno
Leonardo Plini
Luca Scofano
Edoardo De Matteis
Antonino Furnari
G. Farinella
Yuta Kyuragi
EgoV
103
13
0
02 Apr 2024
MotionChain: Conversational Motion Controllers via Multimodal Prompts
MotionChain: Conversational Motion Controllers via Multimodal Prompts
Biao Jiang
Xin Chen
C. Zhang
Fukun Yin
Zhuoyuan Li
Gang Yu
Jiayuan Fan
VGenLRM
100
11
0
02 Apr 2024
FashionEngine: Interactive 3D Human Generation and Editing via
  Multimodal Controls
FashionEngine: Interactive 3D Human Generation and Editing via Multimodal Controls
Tao Hu
Fangzhou Hong
Zhaoxi Chen
Ziwei Liu
DiffM
89
0
0
02 Apr 2024
Evaluating Text-to-Visual Generation with Image-to-Text Generation
Evaluating Text-to-Visual Generation with Image-to-Text Generation
Zhiqiu Lin
Deepak Pathak
Baiqi Li
Jiayao Li
Xide Xia
Graham Neubig
Pengchuan Zhang
Deva Ramanan
EGVM
150
171
0
01 Apr 2024
Large Motion Model for Unified Multi-Modal Motion Generation
Large Motion Model for Unified Multi-Modal Motion Generation
Mingyuan Zhang
Daisheng Jin
Chenyang Gu
Fangzhou Hong
Zhongang Cai
...
Chongzhi Zhang
Xinying Guo
Lei Yang
Ying He
Ziwei Liu
VGen
97
31
0
01 Apr 2024
An image speaks a thousand words, but can everyone listen? On image
  transcreation for cultural relevance
An image speaks a thousand words, but can everyone listen? On image transcreation for cultural relevance
Simran Khanuja
Sathyanarayanan Ramamoorthy
Yueqi Song
Graham Neubig
DiffM
105
18
0
01 Apr 2024
Feature Splatting: Language-Driven Physics-Based Scene Synthesis and
  Editing
Feature Splatting: Language-Driven Physics-Based Scene Synthesis and Editing
Ri-Zhao Qiu
Ge Yang
Weijia Zeng
Xiaolong Wang
3DGS
69
27
0
01 Apr 2024
CLIPtone: Unsupervised Learning for Text-based Image Tone Adjustment
CLIPtone: Unsupervised Learning for Text-based Image Tone Adjustment
Hyeongmin Lee
Kyoungkook Kang
Jungseul Ok
Sunghyun Cho
CLIP
97
4
0
01 Apr 2024
Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic
  Propagation
Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic Propagation
Haofeng Liu
Chenshu Xu
Yifei Yang
Lihua Zeng
Shengfeng He
DiffM
121
28
0
01 Apr 2024
Previous
123...151617...272829
Next