ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.05039
  4. Cited By
SmartMask: Context Aware High-Fidelity Mask Generation for Fine-grained
  Object Insertion and Layout Control

SmartMask: Context Aware High-Fidelity Mask Generation for Fine-grained Object Insertion and Layout Control

8 December 2023
Jaskirat Singh
Jianming Zhang
Qing Liu
Cameron Smith
Zhe Lin
Liang Zheng
    DiffM
ArXiv (abs)PDFHTML

Papers citing "SmartMask: Context Aware High-Fidelity Mask Generation for Fine-grained Object Insertion and Layout Control"

31 / 31 papers shown
Title
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing
Ming Li
Xin Gu
Fan Chen
X. Xing
Longyin Wen
Chong Chen
Sijie Zhu
DiffM
247
2
0
05 May 2025
Diffusion Model-Based Image Editing: A Survey
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
Liangliang Cao
EGVM
220
103
0
27 Feb 2024
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
Lianmin Zheng
Wei-Lin Chiang
Ying Sheng
Siyuan Zhuang
Zhanghao Wu
...
Dacheng Li
Eric Xing
Haotong Zhang
Joseph E. Gonzalez
Ion Stoica
ALMOSLMELM
452
4,444
0
09 Jun 2023
Segment Anything in High Quality
Segment Anything in High Quality
Lei Ke
Mingqiao Ye
Martin Danelljan
Yifan Liu
Yu-Wing Tai
Chi-Keung Tang
Feng Yu
VLM
109
338
0
02 Jun 2023
LayoutGPT: Compositional Visual Planning and Generation with Large
  Language Models
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models
Weixi Feng
Wanrong Zhu
Tsu-Jui Fu
Varun Jampani
Arjun Reddy Akula
Xuehai He
Sugato Basu
Xinze Wang
William Yang Wang
MLLM
86
179
0
24 May 2023
TopNet: Transformer-based Object Placement Network for Image Compositing
TopNet: Transformer-based Object Placement Network for Image Compositing
Sijie Zhu
Zhe Lin
Scott D. Cohen
Jason Kuen
Zhifei Zhang
Chen Chen
ViT
36
17
0
06 Apr 2023
Adding Conditional Control to Text-to-Image Diffusion Models
Adding Conditional Control to Text-to-Image Diffusion Models
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
184
4,180
1
10 Feb 2023
SmartBrush: Text and Shape Guided Object Inpainting with Diffusion Model
SmartBrush: Text and Shape Guided Object Inpainting with Diffusion Model
Shaoan Xie
Zhifei Zhang
Zhe Lin
Tobias Hinz
Kun Zhang
DiffM
75
247
0
09 Dec 2022
High-Fidelity Guided Image Synthesis with Latent Diffusion Models
High-Fidelity Guided Image Synthesis with Latent Diffusion Models
Jaskirat Singh
Stephen Gould
Liang Zheng
DiffM
81
42
0
30 Nov 2022
InstructPix2Pix: Learning to Follow Image Editing Instructions
InstructPix2Pix: Learning to Follow Image Editing Instructions
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
213
1,835
0
17 Nov 2022
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert
  Denoisers
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
Yogesh Balaji
Seungjun Nah
Xun Huang
Arash Vahdat
Jiaming Song
...
Timo Aila
S. Laine
Bryan Catanzaro
Tero Karras
Xuan Li
VLMMoE
177
831
0
02 Nov 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
202
1,133
0
22 Jun 2022
Blended Latent Diffusion
Blended Latent Diffusion
Omri Avrahami
Ohad Fried
Dani Lischinski
DiffM
138
390
0
06 Jun 2022
Fast Object Placement Assessment
Fast Object Placement Assessment
Li Niu
Qingyang Liu
Zhenchen Liu
Jiangtong Li
71
14
0
28 May 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLMDiffM
425
6,921
0
13 Apr 2022
High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
508
15,788
0
20 Dec 2021
GLIDE: Towards Photorealistic Image Generation and Editing with
  Text-Guided Diffusion Models
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Alex Nichol
Prafulla Dhariwal
Aditya A. Ramesh
Pranav Shyam
Pamela Mishkin
Bob McGrew
Ilya Sutskever
Mark Chen
364
3,630
0
20 Dec 2021
Blended Diffusion for Text-driven Editing of Natural Images
Blended Diffusion for Text-driven Editing of Natural Images
Omri Avrahami
Dani Lischinski
Ohad Fried
DiffM
135
954
0
29 Nov 2021
LAION-400M: Open Dataset of CLIP-Filtered 400 Million Image-Text Pairs
LAION-400M: Open Dataset of CLIP-Filtered 400 Million Image-Text Pairs
Christoph Schuhmann
Richard Vencu
Romain Beaumont
R. Kaczmarczyk
Clayton Mullis
Aarush Katta
Theo Coombes
J. Jitsev
Aran Komatsuzaki
VLMMLLMCLIP
243
1,444
0
03 Nov 2021
Making Images Real Again: A Comprehensive Survey on Deep Image
  Composition
Making Images Real Again: A Comprehensive Survey on Deep Image Composition
Li Niu
Wenyan Cong
Liu Liu
Yan Hong
Bo Zhang
Jing Liang
Liqing Zhang
VLMDiffMCoGe
90
77
0
28 Jun 2021
CLIPScore: A Reference-free Evaluation Metric for Image Captioning
CLIPScore: A Reference-free Evaluation Metric for Image Captioning
Jack Hessel
Ari Holtzman
Maxwell Forbes
Ronan Le Bras
Yejin Choi
CLIP
169
1,588
0
18 Apr 2021
Variational Transformer Networks for Layout Generation
Variational Transformer Networks for Layout Generation
Diego Martin Arroyo
Janis Postels
F. Tombari
ViTDRL
46
131
0
06 Apr 2021
You Only Need Adversarial Supervision for Semantic Image Synthesis
You Only Need Adversarial Supervision for Semantic Image Synthesis
V. Sushko
Edgar Schönfeld
Dan Zhang
Juergen Gall
Bernt Schiele
Anna Khoreva
GAN
247
189
0
08 Dec 2020
LayoutTransformer: Layout Generation and Completion with Self-attention
LayoutTransformer: Layout Generation and Completion with Self-attention
Kamal Gupta
Justin Lazarow
Alessandro Achille
Larry S. Davis
Vijay Mahadevan
Abhinav Shrivastava
ViT
91
137
0
25 Jun 2020
SEAN: Image Synthesis with Semantic Region-Adaptive Normalization
SEAN: Image Synthesis with Semantic Region-Adaptive Normalization
Peihao Zhu
Rameen Abdal
Yipeng Qin
Peter Wonka
DiffM
90
446
0
28 Nov 2019
Learning to Predict Layout-to-image Conditional Convolutions for
  Semantic Image Synthesis
Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis
Xihui Liu
Guojun Yin
Jing Shao
Xiaogang Wang
Hongsheng Li
90
207
0
15 Oct 2019
MaskGAN: Towards Diverse and Interactive Facial Image Manipulation
MaskGAN: Towards Diverse and Interactive Facial Image Manipulation
Cheng-Han Lee
Ziwei Liu
Lingyun Wu
Ping Luo
CVBM
175
1,076
0
27 Jul 2019
Learning to Generate Synthetic Data via Compositing
Learning to Generate Synthetic Data via Compositing
Shashank Tripathi
Siddhartha Chandra
Amit Agrawal
A. Tyagi
James M. Rehg
Visesh Chari
98
119
0
10 Apr 2019
Semantic Image Synthesis with Spatially-Adaptive Normalization
Semantic Image Synthesis with Spatially-Adaptive Normalization
Taesung Park
Ming-Yuan Liu
Ting-Chun Wang
Jun-Yan Zhu
172
2,695
0
18 Mar 2019
Context-Aware Synthesis and Placement of Object Instances
Context-Aware Synthesis and Placement of Object Instances
Donghoon Lee
Sifei Liu
Liang Feng
Ming-Yuan Liu
Ming-Hsuan Yang
Jan Kautz
78
117
0
06 Dec 2018
Semantic Amodal Segmentation
Semantic Amodal Segmentation
Yan Zhu
Yuandong Tian
Dimitris N. Metaxas
Piotr Dollár
VLM
93
172
0
04 Sep 2015
1