ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.09800
  4. Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions

InstructPix2Pix: Learning to Follow Image Editing Instructions

17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM
ArXivPDFHTML

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,356 papers shown
Title
InstructGIE: Towards Generalizable Image Editing
InstructGIE: Towards Generalizable Image Editing
Zichong Meng
Changdi Yang
Jun Liu
Hao Tang
Pu Zhao
Yanzhi Wang
DiffM
54
6
0
08 Mar 2024
Pix2Gif: Motion-Guided Diffusion for GIF Generation
Pix2Gif: Motion-Guided Diffusion for GIF Generation
Hitesh Kandala
Jianfeng Gao
Jianwei Yang
VGen
DiffM
38
4
0
07 Mar 2024
StableDrag: Stable Dragging for Point-based Image Editing
StableDrag: Stable Dragging for Point-based Image Editing
Yutao Cui
Xiaotong Zhao
Guozhen Zhang
Shengming Cao
Kai Ma
Limin Wang
44
11
0
07 Mar 2024
NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on
  Noise Cropping and Merging
NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging
Takahiro Shirakawa
Seiichi Uchida
DiffM
40
15
0
06 Mar 2024
Towards Understanding Cross and Self-Attention in Stable Diffusion for
  Text-Guided Image Editing
Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing
Bingyan Liu
Chengyu Wang
Tingfeng Cao
Kui Jia
Jun Huang
DiffM
48
53
0
06 Mar 2024
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
...
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
DiffM
147
1,103
0
05 Mar 2024
Doubly Abductive Counterfactual Inference for Text-based Image Editing
Doubly Abductive Counterfactual Inference for Text-based Image Editing
Xue Song
Jiequan Cui
Hanwang Zhang
Jingjing Chen
Richang Hong
Yu-Gang Jiang
DiffM
31
9
0
05 Mar 2024
RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
Priya Sundaresan
Q. Vuong
Jiayuan Gu
Peng Xu
Ted Xiao
...
Ajinkya Jain
Karol Hausman
Dorsa Sadigh
Jeannette Bohg
S. Schaal
VGen
39
26
0
05 Mar 2024
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable
  Virtual Try-on
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Yuhao Xu
Tao Gu
Weifeng Chen
Chengcai Chen
DiffM
35
52
0
04 Mar 2024
Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks
Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks
Yuhao Liu
Zhanghan Ke
Fang Liu
Nanxuan Zhao
Rynson W. H. Lau
DiffM
48
19
0
01 Mar 2024
LoMOE: Localized Multi-Object Editing via Multi-Diffusion
LoMOE: Localized Multi-Object Editing via Multi-Diffusion
Goirik Chakrabarty
Aditya Chandrasekar
Ramya Hebbalaguppe
AP Prathosh
DiffM
50
6
0
01 Mar 2024
Ask Your Distribution Shift if Pre-Training is Right for You
Ask Your Distribution Shift if Pre-Training is Right for You
Benjamin Cohen-Wang
Joshua Vendrow
Aleksander Madry
OOD
39
3
0
29 Feb 2024
Large Language Models and Games: A Survey and Roadmap
Large Language Models and Games: A Survey and Roadmap
Roberto Gallotta
Graham Todd
Marvin Zammit
Sam Earle
Antonios Liapis
Julian Togelius
Georgios N. Yannakakis
LLMAG
LM&MA
AI4CE
LRM
55
73
0
28 Feb 2024
From Summary to Action: Enhancing Large Language Models for Complex
  Tasks with Open World APIs
From Summary to Action: Enhancing Large Language Models for Complex Tasks with Open World APIs
Yulong Liu
Yunlong Yuan
Chunwei Wang
Jianhua Han
Yongqiang Ma
Li Zhang
Nanning Zheng
Hang Xu
LLMAG
45
5
0
28 Feb 2024
Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis
Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis
Yanzuo Lu
Manlin Zhang
Andy J. Ma
Xiaohua Xie
Jian-Huang Lai
DiffM
27
22
0
28 Feb 2024
CustomSketching: Sketch Concept Extraction for Sketch-based Image
  Synthesis and Editing
CustomSketching: Sketch Concept Extraction for Sketch-based Image Synthesis and Editing
Chufeng Xiao
Hongbo Fu
DiffM
43
3
0
27 Feb 2024
Diffusion Model-Based Image Editing: A Survey
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
Liangliang Cao
EGVM
71
88
0
27 Feb 2024
Placing Objects in Context via Inpainting for Out-of-distribution
  Segmentation
Placing Objects in Context via Inpainting for Out-of-distribution Segmentation
Pau de Jorge
Riccardo Volpi
P. Dokania
Philip Torr
Grégory Rogez
DiffM
60
5
0
26 Feb 2024
Intelligent Director: An Automatic Framework for Dynamic Visual
  Composition using ChatGPT
Intelligent Director: An Automatic Framework for Dynamic Visual Composition using ChatGPT
Sixiao Zheng
Jingyang Huo
Yu Wang
Yanwei Fu
VGen
DiffM
44
1
0
24 Feb 2024
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video
  Synthesis
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Willi Menapace
Aliaksandr Siarohin
Ivan Skorokhodov
Ekaterina Deyneka
Tsai-Shien Chen
...
Yuwei Fang
A. Stoliar
Elisa Ricci
Jian Ren
Sergey Tulyakov
VGen
57
57
0
22 Feb 2024
Consolidating Attention Features for Multi-view Image Editing
Consolidating Attention Features for Multi-view Image Editing
Or Patashnik
Rinon Gal
Daniel Cohen-Or
Jun-Yan Zhu
Fernando de la Torre
37
5
0
22 Feb 2024
LLMBind: A Unified Modality-Task Integration Framework
LLMBind: A Unified Modality-Task Integration Framework
Bin Zhu
Munan Ning
Peng Jin
Bin Lin
Jinfa Huang
...
Junwu Zhang
Zhenyu Tang
Mingjun Pan
Xing Zhou
Li-ming Yuan
MLLM
40
6
0
22 Feb 2024
Real-time 3D-aware Portrait Editing from a Single Image
Real-time 3D-aware Portrait Editing from a Single Image
Qingyan Bai
Zifan Shi
Yinghao Xu
Hao Ouyang
Qiuyu Wang
Ceyuan Yang
Xuan Wang
Gordon Wetzstein
Yujun Shen
Qifeng Chen
3DH
DiffM
72
9
0
21 Feb 2024
CoFRIDA: Self-Supervised Fine-Tuning for Human-Robot Co-Painting
CoFRIDA: Self-Supervised Fine-Tuning for Human-Robot Co-Painting
Peter Schaldenbrand
Gaurav Parmar
Jun-Yan Zhu
James McCann
Jean Oh
37
13
0
21 Feb 2024
UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance
  Editing
UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing
Jianhong Bai
Tianyu He
Yuchi Wang
Junliang Guo
Haoji Hu
Zuozhu Liu
Jiang Bian
VGen
33
26
0
20 Feb 2024
Robust-Wide: Robust Watermarking against Instruction-driven Image
  Editing
Robust-Wide: Robust Watermarking against Instruction-driven Image Editing
Runyi Hu
Jie Zhang
Ting Xu
Jiwei Li
Tianwei Zhang
DiffM
WIGM
51
7
0
20 Feb 2024
The Revolution of Multimodal Large Language Models: A Survey
The Revolution of Multimodal Large Language Models: A Survey
Davide Caffagni
Federico Cocchi
Luca Barsellotti
Nicholas Moratelli
Sara Sarto
Lorenzo Baraldi
Lorenzo Baraldi
Marcella Cornia
Rita Cucchiara
LRM
VLM
66
43
0
19 Feb 2024
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image
  Generation
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation
Chong Zeng
Yue Dong
Pieter Peers
Youkang Kong
Hongzhi Wu
Xin Tong
44
29
0
19 Feb 2024
On Good Practices for Task-Specific Distillation of Large Pretrained
  Visual Models
On Good Practices for Task-Specific Distillation of Large Pretrained Visual Models
Juliette Marrie
Michael Arbel
Julien Mairal
Diane Larlus
VLM
MQ
53
1
0
17 Feb 2024
Semantically-aware Neural Radiance Fields for Visual Scene
  Understanding: A Comprehensive Review
Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review
Thang-Anh-Quan Nguyen
Amine Bourki
Mátyás Macudzinski
Anthony Brunel
M. Bennamoun
48
11
0
17 Feb 2024
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations
Tsung-Wei Ke
N. Gkanatsios
Katerina Fragkiadaki
VGen
39
109
0
16 Feb 2024
PEGASUS: Personalized Generative 3D Avatars with Composable Attributes
PEGASUS: Personalized Generative 3D Avatars with Composable Attributes
Hyunsoo Cha
Byungjun Kim
Hanbyul Joo
23
4
0
16 Feb 2024
Classification Diffusion Models: Revitalizing Density Ratio Estimation
Classification Diffusion Models: Revitalizing Density Ratio Estimation
Shahar Yadin
Noam Elata
T. Michaeli
DiffM
48
1
0
15 Feb 2024
Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion
Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion
Hila Manor
T. Michaeli
DiffM
37
25
0
15 Feb 2024
DreamMatcher: Appearance Matching Self-Attention for
  Semantically-Consistent Text-to-Image Personalization
DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization
Jisu Nam
Heesu Kim
Dongjae Lee
Siyoon Jin
Seungryong Kim
Seunggyu Chang
DiffM
32
40
0
15 Feb 2024
Weatherproofing Retrieval for Localization with Generative AI and
  Geometric Consistency
Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency
Yannis Kalantidis
Mert Bulent Sariyildiz
Rafael Sampaio de Rezende
Philippe Weinzaepfel
Diane Larlus
G. Csurka
24
0
0
14 Feb 2024
Learning Continuous 3D Words for Text-to-Image Generation
Learning Continuous 3D Words for Text-to-Image Generation
Ta-Ying Cheng
Matheus Gadelha
Thibault Groueix
Matthew Fisher
R. Měch
Andrew Markham
Niki Trigoni
DiffM
43
13
0
13 Feb 2024
NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs
NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs
Michael Fischer
Zhengqin Li
Thu Nguyen-Phuoc
Aljaz Bozic
Zhao Dong
Carl S. Marshall
Tobias Ritschel
49
10
0
13 Feb 2024
Fine-Tuning Text-To-Image Diffusion Models for Class-Wise Spurious
  Feature Generation
Fine-Tuning Text-To-Image Diffusion Models for Class-Wise Spurious Feature Generation
AprilPyone Maungmaung
H. Nguyen
Hitoshi Kiya
Isao Echizen
28
6
0
13 Feb 2024
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting
Zhen Zhou
Fan Ma
Hehe Fan
Yi Yang
3DGS
40
24
0
09 Feb 2024
Animated Stickers: Bringing Stickers to Life with Video Diffusion
Animated Stickers: Bringing Stickers to Life with Video Diffusion
David Yan
Winnie Zhang
Luxin Zhang
Anmol Kalia
Dingkang Wang
...
Guan Pang
Ali K. Thabet
Peter Vajda
Amy Bearman
Licheng Yu
VGen
DiffM
67
2
0
08 Feb 2024
Real-World Robot Applications of Foundation Models: A Review
Real-World Robot Applications of Foundation Models: A Review
Kento Kawaharazuka
T. Matsushima
Andrew Gambardella
Jiaxian Guo
Chris Paxton
Andy Zeng
OffRL
VLM
LM&Ro
51
47
0
08 Feb 2024
Get What You Want, Not What You Don't: Image Content Suppression for
  Text-to-Image Diffusion Models
Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models
Senmao Li
Joost van de Weijer
Taihang Hu
Fahad Shahbaz Khan
Qibin Hou
Yaxing Wang
Jian Yang
DiffM
55
27
0
08 Feb 2024
Counterfactual Image Editing
Counterfactual Image Editing
Yushu Pan
Elias Bareinboim
BDL
CML
35
5
0
07 Feb 2024
SPAD : Spatially Aware Multiview Diffusers
SPAD : Spatially Aware Multiview Diffusers
Yash Kant
Ziyi Wu
Michael Vasilkovsky
Guocheng Qian
Jian Ren
R. A. Guler
Guohao Li
Sergey Tulyakov
Igor Gilitschenski
Aliaksandr Siarohin
DiffM
31
36
0
07 Feb 2024
GenLens: A Systematic Evaluation of Visual GenAI Model Outputs
GenLens: A Systematic Evaluation of Visual GenAI Model Outputs
Tica Lin
Hanspeter Pfister
Jui-Hsien Wang
ELM
23
1
0
06 Feb 2024
Point and Instruct: Enabling Precise Image Editing by Unifying Direct
  Manipulation and Text Instructions
Point and Instruct: Enabling Precise Image Editing by Unifying Direct Manipulation and Text Instructions
Alec Helbling
Seongmin Lee
Polo Chau
27
0
0
05 Feb 2024
DisDet: Exploring Detectability of Backdoor Attack on Diffusion Models
DisDet: Exploring Detectability of Backdoor Attack on Diffusion Models
Yang Sui
Huy Phan
Jinqi Xiao
Tian-Di Zhang
Zijie Tang
Cong Shi
Yan Wang
Yingying Chen
Bo Yuan
DiffM
AAML
32
12
0
05 Feb 2024
CNS-Edit: 3D Shape Editing via Coupled Neural Shape Optimization
CNS-Edit: 3D Shape Editing via Coupled Neural Shape Optimization
Jingyu Hu
Ka-Hei Hui
Zhengzhe Liu
Hao Zhang
Chi-Wing Fu
36
4
0
04 Feb 2024
Image Fusion via Vision-Language Model
Image Fusion via Vision-Language Model
Zixiang Zhao
Lilun Deng
Haowen Bai
Yukun Cui
Zhipeng Zhang
...
Haotong Qin
Dongdong Chen
Jiangshe Zhang
Peng Wang
Luc Van Gool
VLM
43
20
0
03 Feb 2024
Previous
123...161718...262728
Next