Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2109.07161
Cited By
Resolution-robust Large Mask Inpainting with Fourier Convolutions
15 September 2021
Roman Suvorov
Elizaveta Logacheva
Anton Mashikhin
Anastasia Remizova
Arsenii Ashukha
Aleksei Silvestrov
Naejin Kong
Harshith Goka
Kiwoong Park
Victor Lempitsky
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Resolution-robust Large Mask Inpainting with Fourier Convolutions"
50 / 132 papers shown
Title
MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning
Bin-Bin Gao
VLM
22
0
0
14 May 2025
GlyphMastero: A Glyph Encoder for High-Fidelity Scene Text Editing
Tong Wang
Ting Liu
Xiaochao Qu
Chengjing Wu
Luoqi Liu
Xiaolin Hu
DiffM
58
0
0
08 May 2025
Corner Cases: How Size and Position of Objects Challenge ImageNet-Trained Models
Mishal Fatima
Steffen Jung
M. Keuper
40
0
0
06 May 2025
PosterO: Structuring Layout Trees to Enable Language Models in Generalized Content-Aware Layout Generation
HsiaoYuan Hsu
Yuxin Peng
21
0
0
06 May 2025
Detecting and Mitigating Hateful Content in Multimodal Memes with Vision-Language Models
Minh-Hao Van
Xintao Wu
VLM
88
0
0
30 Apr 2025
Dream-Box: Object-wise Outlier Generation for Out-of-Distribution Detection
Brian K. S. Isaac-Medina
T. Breckon
OODD
139
0
0
25 Apr 2025
Visibility-Uncertainty-guided 3D Gaussian Inpainting via Scene Conceptional Learning
Mingxuan Cui
Qing Guo
Y. Wang
Hongkai Yu
D. Lin
Q. Zou
Ming-Ming Cheng
X. Li
3DGS
43
0
0
23 Apr 2025
AdaViP: Aligning Multi-modal LLMs via Adaptive Vision-enhanced Preference Optimization
Jinda Lu
Jinghan Li
Yuan Gao
Junkang Wu
Jiancan Wu
X. Wang
Xiangnan He
112
0
0
22 Apr 2025
PRISM: A Unified Framework for Photorealistic Reconstruction and Intrinsic Scene Modeling
Alara Dirik
Tuanfeng Y. Wang
Duygu Ceylan
Stefanos Zafeiriou
Anna Frühstück
DiffM
47
0
0
19 Apr 2025
ForgetMe: Evaluating Selective Forgetting in Generative Models
Zhenyu Yu
Mohd Yamani Inda Idris
Pei Wang
DiffM
MU
34
0
0
17 Apr 2025
Zooming In on Fakes: A Novel Dataset for Localized AI-Generated Image Detection with Forgery Amplification Approach
Lvpan Cai
Haowei Wang
Jiayi Ji
YanShu ZhouMen
Yiwei Ma
Xiaoshuai Sun
Liujuan Cao
Rongrong Ji
ViT
34
0
0
16 Apr 2025
Marmot: Multi-Agent Reasoning for Multi-Object Self-Correcting in Improving Image-Text Alignment
Jiayang Sun
H. Wang
Jie Cao
Huaibo Huang
R. He
DiffM
73
0
0
10 Apr 2025
GraspCorrect: Robotic Grasp Correction via Vision-Language Model-Guided Feedback
Sungjae Lee
Yeonjoo Hong
Kwang In KIm
48
0
0
19 Mar 2025
RETHINED: A New Benchmark and Baseline for Real-Time High-Resolution Image Inpainting On Edge Devices
Marcelo Sanchez
G. Triginer
Ignacio Sarasua
Lara Raad
C. Ballester
63
0
0
18 Mar 2025
Long-horizon Visual Instruction Generation with Logic and Attribute Self-reflection
Yucheng Suo
Fan Ma
Kaixin Shen
Linchao Zhu
Yi Yang
VLM
52
0
0
12 Mar 2025
Consistent Image Layout Editing with Diffusion Models
Tao Xia
Yudi Zhang
Ting Liu Lei Zhang
DiffM
62
1
0
09 Mar 2025
Self-Supervised Large Scale Point Cloud Completion for Archaeological Site Restoration
Aocheng Li
James Zimmer-Dauphinee
Rajesh Kalyanam
Ian Lindsay
Parker VanValkenburgh
Steven A. Wernke
Daniel G. Aliaga
3DPC
49
0
0
06 Mar 2025
InstaFace: Identity-Preserving Facial Editing with Single Image Inference
MD Wahiduzzaman Khan
Mingshan Jia
Shaolin Zhang
En Yu
Caifeng Shan
Kaska Musial-Gabrys
DiffM
54
0
0
27 Feb 2025
LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
Shuai Yang
Jing Tan
Mengchen Zhang
Tong Wu
Y. Li
Gordon Wetzstein
Ziwei Liu
D. Lin
MDE
VGen
51
6
0
24 Feb 2025
SegSub: Evaluating Robustness to Knowledge Conflicts and Hallucinations in Vision-Language Models
Peter Carragher
Nikitha Rao
Abhinand Jha
R Raghav
Kathleen M. Carley
VLM
56
0
0
19 Feb 2025
SSDD-GAN: Single-Step Denoising Diffusion GAN for Cochlear Implant Surgical Scene Completion
Yike Zhang
Eduardo Davalos
Jack H. Noble
DiffM
MedIm
72
1
0
08 Feb 2025
Dfilled: Repurposing Edge-Enhancing Diffusion for Guided DSM Void Filling
Daniel Panangian
Ksenia Bittner
DiffM
35
0
0
26 Jan 2025
PAID: A Framework of Product-Centric Advertising Image Design
Hongyu Chen
Min Zhou
Jing Jiang
Jiale Chen
Yang Lu
Bo Xiao
T. Ge
Bo Zheng
DiffM
VLM
38
0
0
24 Jan 2025
Generate E-commerce Product Background by Integrating Category Commonality and Personalized Style
Haohan Wang
Wei Feng
Yang Lu
Yaoyu Li
Zheng Zhang
Jingjing Lv
Xin Zhu
Jun-Jun Shen
DiffM
75
5
0
20 Jan 2025
GeoDiffuser: Geometry-Based Image Editing with Diffusion Models
Rahul Sajnani
Jeroen Vanbaar
Jie Min
Kapil D. Katyal
Srinath Sridhar
DiffM
54
11
0
03 Jan 2025
RORem: Training a Robust Object Remover with Human-in-the-Loop
Ruibin Li
Tao Yang
Song Guo
L. Zhang
42
3
0
01 Jan 2025
Type-R: Automatically Retouching Typos for Text-to-Image Generation
Wataru Shimoda
Naoto Inoue
Daichi Haraguchi
Hayato Mitani
S. Uchida
Kota Yamaguchi
DiffM
93
0
0
27 Nov 2024
Puzzle Similarity: A Perceptually-guided Cross-Reference Metric for Artifact Detection in 3D Scene Reconstructions
Nicolai Hermann
Jorge Condor
Piotr Didyk
3DV
85
0
0
26 Nov 2024
Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation
Chanyoung Kim
Dayun Ju
Woojung Han
Ming-Hsuan Yang
Seong Jae Hwang
VLM
VOS
79
0
0
26 Nov 2024
EG-HumanNeRF: Efficient Generalizable Human NeRF Utilizing Human Prior for Sparse View
Zhaorong Wang
Yoshihiro Kanamori
Yuki Endo
3DH
29
1
0
16 Oct 2024
FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models
Zhipei Xu
Xuanyu Zhang
Runyi Li
Zecheng Tang
Qing Huang
Jian Andrew Zhang
AAML
39
16
0
03 Oct 2024
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
Weifeng Lin
Xinyu Wei
Renrui Zhang
Le Zhuo
Shitian Zhao
...
Junlin Xie
Junlin Xie
Yu Qiao
Peng Gao
Hongsheng Li
MLLM
DiffM
57
10
0
23 Sep 2024
MVPGS: Excavating Multi-view Priors for Gaussian Splatting from Sparse Input Views
Wangze Xu
Huachen Gao
Shihe Shen
Rui Peng
Jianbo Jiao
Ronggang Wang
3DGS
23
8
0
22 Sep 2024
Leveraging Text Localization for Scene Text Removal via Text-aware Masked Image Modeling
Zixiao Wang
Hongtao Xie
Yuxin Wang
Yadong Qu
Fengjun Guo
Pengwei Liu
DiffM
33
0
0
20 Sep 2024
HateSieve: A Contrastive Learning Framework for Detecting and Segmenting Hateful Content in Multimodal Memes
Xuanyu Su
Yansong Li
Diana Inkpen
Nathalie Japkowicz
VLM
81
2
0
11 Aug 2024
WAS: Dataset and Methods for Artistic Text Segmentation
Xudong Xie
Yuzhe Li
Yang Liu
Zhifei Zhang
Zhaowen Wang
Wei Xiong
Xiang Bai
DiffM
50
2
0
31 Jul 2024
HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions
Haiyang Zhou
Xinhua Cheng
Wangbo Yu
Yonghong Tian
Li-ming Yuan
3DGS
DiffM
61
10
0
21 Jul 2024
Mobius: A High Efficient Spatial-Temporal Parallel Training Paradigm for Text-to-Video Generation Task
Yiran Yang
Jinchao Zhang
Ying Deng
Jie Zhou
DiffM
29
0
0
09 Jul 2024
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
Zhenyu Wang
Aoxue Li
Zhenguo Li
Xihui Liu
MLLM
DiffM
43
25
0
08 Jul 2024
MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices
Jianwen Jiang
Gaojie Lin
Zhengkun Rong
Chao Liang
Yongming Zhu
Jiaqi Yang
Tianyun Zhong
3DH
82
8
0
08 Jul 2024
Inpainting the Gaps: A Novel Framework for Evaluating Explanation Methods in Vision Transformers
Lokesh Badisa
Sumohana S. Channappayya
42
0
0
17 Jun 2024
Varying Manifolds in Diffusion: From Time-varying Geometries to Visual Saliency
Junhao Chen
Manyi Li
Zherong Pan
Xifeng Gao
Changhe Tu
DiffM
37
2
0
07 Jun 2024
Temporally Consistent Object Editing in Videos using Extended Attention
AmirHossein Zamani
Amir G. Aghdam
Tiberiu Popa
Eugene Belilovsky
DiffM
32
1
0
01 Jun 2024
3D StreetUnveiler with Semantic-aware 2DGS -- a simple baseline
Jingwei Xu
Yikai Wang
Yiqun Zhao
Yanwei Fu
Shenghua Gao
3DGS
62
2
0
28 May 2024
Point Resampling and Ray Transformation Aid to Editable NeRF Models
Zhenyang Li
Zilong Chen
Feifan Qu
Mingqing Wang
Yizhou Zhao
Kai Zhang
Yifan Peng
38
1
0
12 May 2024
MVIP-NeRF: Multi-view 3D Inpainting on NeRF Scenes via Diffusion Prior
Honghua Chen
Chen Change Loy
Xingang Pan
39
13
0
05 May 2024
DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos
Wen-Hsuan Chu
Lei Ke
Katerina Fragkiadaki
3DGS
VGen
25
29
0
03 May 2024
Semantically Consistent Video Inpainting with Conditional Diffusion Models
Dylan Green
William Harvey
Saeid Naderiparizi
Matthew Niedoba
Yunpeng Liu
...
Vasileios Lioutas
Setareh Dabiri
Adam Scibior
Berend Zwartsenberg
Frank D. Wood
DiffM
36
1
0
30 Apr 2024
Spatial-frequency Dual-Domain Feature Fusion Network for Low-Light Remote Sensing Image Enhancement
Zishu Yao
Guodong Fan
Jinfu Fan
Senior Member Ieee Min Gan
F. I. C. L. Philip Chen
32
14
0
26 Apr 2024
Gorgeous: Create Your Desired Character Facial Makeup from Any Ideas
Jia Wei Sii
Chee Seng Chan
DiffM
48
0
0
22 Apr 2024
1
2
3
Next