ResearchTrend.AI
InstructPix2Pix: Learning to Follow Image Editing Instructions

17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,418 papers shown
Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization
Yi Gu
Zhendong Wang
Yueqin Yin
Yujia Xie
Mingyuan Zhou
100
17
0
10 Jun 2024
Tuning-Free Visual Customization via View Iterative Self-Attention Control
Xiaojie Li
Chenghao Gu
Shuzhao Xie
Yunpeng Bai
Weixiang Zhang
Zhi Wang
86
0
0
10 Jun 2024
InfoGaussian: Structure-Aware Dynamic Gaussians through Lightweight Information Shaping
Yunchao Zhang
Guandao Yang
Leonidas Guibas
Yanchao Yang
3DGS
90
1
0
09 Jun 2024
OmniControlNet: Dual-stage Integration for Conditional Image Generation
Yilin Wang
Haiyang Xu
Xiang Zhang
Zeyuan Chen
Zhizhou Sha
Zirui Wang
Zhuowen Tu
VLM
88
15
0
09 Jun 2024
Can Prompt Modifiers Control Bias? A Comparative Analysis of Text-to-Image Generative Models
P. W. Shin
Jihyun Janice Ahn
Wenpeng Yin
Jack Sampson
Vijaykrishnan Narayanan
61
3
0
09 Jun 2024
GenHeld: Generating and Editing Handheld Objects
Chaerin Min
Srinath Sridhar
132
0
0
07 Jun 2024
Varying Manifolds in Diffusion: From Time-varying Geometries to Visual Saliency
Junhao Chen
Manyi Li
Zherong Pan
Xifeng Gao
Changhe Tu
DiffM
117
2
0
07 Jun 2024
Towards Semantic Equivalence of Tokenization in Multimodal LLM
Shengqiong Wu
Hao Fei
Xiangtai Li
Jiayi Ji
Hanwang Zhang
Tat-Seng Chua
Shuicheng Yan
MLLM
165
37
0
07 Jun 2024
M&M VTO: Multi-Garment Virtual Try-On and Editing
Luyang Zhu
Yingwei Li
Nan Liu
Hao Peng
Dawei Yang
Ira Kemelmacher-Shlizerman
DiffM
79
8
0
06 Jun 2024
GenAI Arena: An Open Evaluation Platform for Generative Models
Dongfu Jiang
Max Ku
Tianle Li
Yuansheng Ni
Shizhuo Sun
Rongqi Fan
Wenhu Chen
EGVM
125
21
0
06 Jun 2024
VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval
Yueze Wang
Zheng Liu
Shitao Xiao
Bo Zhao
Yongping Xiong
114
29
0
06 Jun 2024
JIGMARK: A Black-Box Approach for Enhancing Image Watermarks against Diffusion Model Edits
Minzhou Pan
Yi Zeng
Xue Lin
Ning Yu
Cho-Jui Hsieh
Peter Henderson
Ruoxi Jia
WIGM
131
4
0
06 Jun 2024
Bayesian Power Steering: An Effective Approach for Domain Adaptation of Diffusion Models
Ding Huang
Ting Li
Jian Huang
DiffM
93
1
0
06 Jun 2024
Dream-in-Style: Text-to-3D Generation Using Stylized Score Distillation
Hubert Kompanowski
Binh-Son Hua
DiffM
109
3
0
05 Jun 2024
Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation
Yi Ma
Hongyu Liu
Haobo Wang
Heng Pan
Yingqing He
...
Ailing Zeng
Chengfei Cai
H. Shum
Wen Liu
Qifeng Chen
130
61
0
04 Jun 2024
Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting
Inkyu Shin
Qihang Yu
Xiaohui Shen
In So Kweon
Kuk-Jin Yoon
Liang-Chieh Chen
VGen, DiffM
125
1
0
04 Jun 2024
Turning Text and Imagery into Captivating Visual Video
Mingming Wang
Elijah Miller
VGen
73
0
0
03 Jun 2024
DiffUHaul: A Training-Free Method for Object Dragging in Images
Omri Avrahami
Rinon Gal
Gal Chechik
Ohad Fried
Dani Lischinski
Arash Vahdat
Weili Nie
100
15
0
03 Jun 2024
Report on Methods and Applications for Crafting 3D Humans
Lei Liu
K. Zhao
123
0
0
03 Jun 2024
Dimba: Transformer-Mamba Diffusion Models
Zhengcong Fei
Mingyuan Fan
Changqian Yu
Debang Li
Youqiang Zhang
Junshi Huang
Mamba
105
19
0
03 Jun 2024
Uni-ISP: Unifying the Learning of ISPs from Multiple Cameras
Lingen Li
Mingde Yao
Xingyu Meng
Muquan Yu
Tianfan Xue
Liang Feng
86
0
0
03 Jun 2024
Learning Temporally Consistent Video Depth from Video Diffusion Priors
Jiahao Shao
Yuanbo Yang
Hongyu Zhou
Youmin Zhang
Yujun Shen
Vitor Campagnolo Guizilini
Yue Wang
Matteo Poggi
Yiyi Liao
VGen, DiffM, MDE
123
43
0
03 Jun 2024
Bilateral Guided Radiance Field Processing
Yuehao Wang
Chaoyi Wang
Bingchen Gong
Tianfan Xue
134
10
0
01 Jun 2024
Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations
Tiancheng Shen
Jun Hao Liew
Long Mai
Lu Qi
Jiashi Feng
Jiaya Jia
DiffM
58
2
0
31 May 2024
Slight Corruption in Pre-training Data Makes Better Diffusion Models
Hao Chen
Yujin Han
Diganta Misra
Xiang Li
Kai Hu
Difan Zou
Masashi Sugiyama
Jindong Wang
Bhiksha Raj
DiffM
125
5
0
30 May 2024
ParSEL: Parameterized Shape Editing with Language
Aditya Ganeshan
Ryan Y. Huang
Xianghao Xu
R. K. Jones
Daniel E. Ritchie
KELM
81
3
0
30 May 2024
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow
Chaoyang Wang
Xiangtai Li
Lu Qi
Henghui Ding
Yunhai Tong
Ming-Hsuan Yang
DiffM
134
7
0
30 May 2024
Creating Language-driven Spatial Variations of Icon Images
Xianghao Xu
Aditya Ganeshan
K. Willis
Yewen Pu
Daniel E. Ritchie
78
0
0
30 May 2024
Personalized Interiors at Scale: Leveraging AI for Efficient and Customizable Design Solutions
Kaiwen Zhou
Tianyu Wang
77
2
0
29 May 2024
Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning
Yixiao Zhang
Yukara Ikemiya
Woosung Choi
Naoki Murata
Marco A. Martínez-Ramírez
Liwei Lin
Gus Xia
Wei-Hsiang Liao
Yuki Mitsufuji
Simon Dixon
108
12
0
28 May 2024
Multi-modal Generation via Cross-Modal In-Context Learning
Amandeep Kumar
Muzammal Naseer
Sanath Narayan
Rao Muhammad Anwer
Salman Khan
Hisham Cholakkal
MLLM
87
0
0
28 May 2024
EG4D: Explicit Generation of 4D Object without Score Distillation
Qi Sun
Zhiyang Guo
Bo Liu
Jing Nathan Yan
Shengming Yin
Wen-gang Zhou
Jing Liao
Houqiang Li
VGen, 3DGS
107
15
0
28 May 2024
The Evolution of Multimodal Model Architectures
S. Wadekar
Abhishek Chaurasia
Aman Chadha
Eugenio Culurciello
109
18
0
28 May 2024
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Shenyuan Gao
Jiazhi Yang
Li Chen
Kashyap Chitta
Yihang Qiu
Andreas Geiger
Jun Zhang
Hongyang Li
167
103
0
27 May 2024
From Text to Blueprint: Leveraging Text-to-Image Tools for Floor Plan Creation
Xiaoyu Li
Jonathan Benjamin
Xin Zhang
105
1
0
27 May 2024
PatchScaler: An Efficient Patch-Independent Diffusion Model for Super-Resolution
Yong Liu
Hang Dong
Jinshan Pan
Qingji Dong
Kai-xiang Chen
Rongxiang Zhang
Lean Fu
Fei Wang
DiffM
80
1
0
27 May 2024
Training-free Editioning of Text-to-Image Models
Jinqi Wang
Yunfei Fu
Zhangcan Ding
Bailin Deng
Yu-Kun Lai
Yipeng Qin
DiffM, VLM
66
0
0
27 May 2024
TIE: Revolutionizing Text-based Image Editing for Complex-Prompt Following and High-Fidelity Editing
Xinyu Zhang
Mengxue Kang
Fei Wei
Shuang Xu
Yuhe Liu
Lin Ma
MLLM, DiffM
75
2
0
27 May 2024
PromptFix: You Prompt and We Fix the Photo
Yongsheng Yu
Ziyun Zeng
Hang Hua
Jianlong Fu
Jiebo Luo
MLLM, DiffM, VLM
90
28
0
27 May 2024
I2VEdit: First-Frame-Guided Video Editing via Image-to-Video Diffusion Models
Wenqi Ouyang
Yi Dong
Lei Yang
Jianlou Si
Xingang Pan
VGen, DiffM
102
16
0
26 May 2024
Sp2360: Sparse-view 360 Scene Reconstruction using Cascaded 2D Diffusion Priors
Soumava Paul
Christopher Wewer
Bernt Schiele
J. E. Lenssen
3DGS
82
4
0
26 May 2024
User-Friendly Customized Generation with Multi-Modal Prompts
Linhao Zhong
Yan Hong
Wentao Chen
Binglin Zhou
Yiyi Zhang
Jianfu Zhang
Liqing Zhang
DiffM
75
1
0
26 May 2024
LEAST: "Local" text-conditioned image style transfer
Silky Singh
Surgan Jandial
Simra Shahid
Abhinav Java
92
0
0
25 May 2024
ModelLock: Locking Your Model With a Spell
Yifeng Gao
Yuhua Sun
Xingjun Ma
Zuxuan Wu
Yu-Gang Jiang
VLM
88
1
0
25 May 2024
FastDrag: Manipulate Anything in One Step
Xuanjia Zhao
Jian Guan
Congyi Fan
Dongli Xu
Youtian Lin
Haiwei Pan
Pengming Feng
DiffM
77
7
0
24 May 2024
Challenges and Opportunities in 3D Content Generation
Ke Zhao
Andreas Larsen
106
0
0
24 May 2024
Enhancing Text-to-Image Editing via Hybrid Mask-Informed Fusion
Aoxue Li
Mingyang Yi
Zhenguo Li
DiffM
79
0
0
24 May 2024
Looking Backward: Streaming Video-to-Video Translation with Feature Banks
Feng Liang
Akio Kodaira
Chenfeng Xu
Masayoshi Tomizuka
Kurt Keutzer
Diana Marculescu
DiffM, VGen
197
9
0
24 May 2024
Improved Distribution Matching Distillation for Fast Image Synthesis
Tianwei Yin
Michael Gharbi
Taesung Park
Richard Zhang
Eli Shechtman
Frédo Durand
William T. Freeman
DiffM
147
127
0
23 May 2024
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
Ling Yang
Bo-Wen Zeng
Jiaming Liu
Hong Li
Minghao Xu
Wentao Zhang
Shuicheng Yan
DiffM
86
16
0
23 May 2024