2306.01567
Cited By
v1
v2 (latest)
Segment Anything in High Quality
2 June 2023
Lei Ke
Mingqiao Ye
Martin Danelljan
Yifan Liu
Yu-Wing Tai
Chi-Keung Tang
Feng Yu
VLM
Github (3956★)
Papers citing "Segment Anything in High Quality"
50 / 214 papers shown
Title
FocalClick-XL: Towards Unified and High-quality Interactive Segmentation
Xi Chen
Hengshuang Zhao
39
0
0
17 Jun 2025
Generative 4D Scene Gaussian Splatting with Object View-Synthesis Priors
Wen-Hsuan Chu
Lei Ke
Jianmeng Liu
Mingxiao Huo
P. Tokmakov
Katerina Fragkiadaki
3DGS
48
0
0
15 Jun 2025
Stepwise Decomposition and Dual-stream Focus: A Novel Approach for Training-free Camouflaged Object Segmentation
Chao Yin
Hao Li
Kequan Yang
Jide Li
Pinpin Zhu
Xiaoqiang Li
28
0
0
07 Jun 2025
Splat and Replace: 3D Reconstruction with Repetitive Elements
Nicolás Violante
Andréas Meuleman
Alban Gauthier
F. Durand
Thibault Groueix
G. Drettakis
3DGS
42
0
0
06 Jun 2025
GaRA-SAM: Robustifying Segment Anything Model with Gated-Rank Adaptation
Sohyun Lee
Yeho Kwon
Lukas Hoyer
Suha Kwak
85
0
0
03 Jun 2025
Talk2SAM: Text-Guided Semantic Enhancement for Complex-Shaped Object Segmentation
Luka Vetoshkin
Dmitry Yudin
21
0
0
03 Jun 2025
SAM-I2V: Upgrading SAM to Support Promptable Video Segmentation with Less than 0.2% Training Cost
Haiyang Mei
Pengyu Zhang
Mike Zheng Shou
VLM
55
0
0
02 Jun 2025
Adapting Segment Anything Model for Power Transmission Corridor Hazard Segmentation
Hang Chen
Maoyuan Ye
Peng Yang
Haibin He
Juhua Liu
Bo Du
48
0
0
28 May 2025
InfoSAM: Fine-Tuning the Segment Anything Model from An Information-Theoretic Perspective
Yuanhong Zhang
Muyao Yuan
Weizhan Zhang
Tieliang Gong
Wen Wen
Jiangyong Ying
Weijie Shi
VLM
62
0
0
28 May 2025
FruitNeRF++: A Generalized Multi-Fruit Counting Method Utilizing Contrastive Learning and Neural Radiance Fields
Lukas Meyer
Andrei-Timotei Ardelean
Tim Weyrich
Marc Stamminger
49
0
0
26 May 2025
Unifying Segment Anything in Microscopy with Multimodal Large Language Model
Manyu Li
Ruian He
Zixian Zhang
Weimin Tan
Bo Yan
VLM
74
0
0
16 May 2025
Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model
Navin Ranjan
Andreas E. Savakis
MQ
VLM
151
0
0
08 May 2025
Pro2SAM: Mask Prompt to SAM with Grid Points for Weakly Supervised Object Localization
Xi Yang
Songsong Duan
Nannan Wang
Xinbo Gao
WSOL
147
1
0
08 May 2025
SAM4EM: Efficient memory-based two stage prompt-free segment anything model adapter for complex 3D neuroscience electron microscopy stacks
Uzair Shah
Marco Agus
Daniya Boges
Vanessa Chiappini
M. Alzubaidi
J. Schneider
Markus Hadwiger
Pierre J. Magistretti
Mowafa J Househ
Corrado Calì
85
0
0
30 Apr 2025
UniBiomed: A Universal Foundation Model for Grounded Biomedical Image Interpretation
Linshan Wu
Yuxiang Nie
Sunan He
Jiaxin Zhuang
Hao Chen
...
V. Vardhanabhuti
R. Chan
Yifan Peng
Pranav Rajpurkar
Hao Chen
LM&MA
MedIm
209
0
0
30 Apr 2025
SRMF: A Data Augmentation and Multimodal Fusion Approach for Long-Tail UHR Satellite Image Segmentation
Yulong Guo
Zilun Zhang
Yongheng Shang
Tiancheng Zhao
Shuiguang Deng
Yingchun Yang
Jianwei Yin
132
0
0
28 Apr 2025
AffordanceSAM: Segment Anything Once More in Affordance Grounding
Dengyang Jiang
Mengmeng Wang
Teli Ma
Haoyang Li
Yang Liu
Guang Dai
Lefei Zhang
98
0
0
22 Apr 2025
Landmark-Free Preoperative-to-Intraoperative Registration in Laparoscopic Liver Resection
Jun Zhou
Bingchen Gao
Kai Wang
Jialun Pei
Pheng-Ann Heng
Jing Qin
MedIm
116
1
0
21 Apr 2025
Weak Cube R-CNN: Weakly Supervised 3D Detection using only 2D Bounding Boxes
Andreas Lau Hansen
Lukas Wanzeck
Dim P. Papadopoulos
67
0
0
17 Apr 2025
PosterMaker: Towards High-Quality Product Poster Generation with Accurate Text Rendering
Y. Gao
Zihang Lin
Chuanbin Liu
Min Zhou
T. Ge
Bo Zheng
Hongtao Xie
DiffM
142
5
0
09 Apr 2025
UCS: A Universal Model for Curvilinear Structure Segmentation
Dianshuo Li
Li Chen
Yuhang Cao
Kai Zhu
Jun Cheng
142
0
0
05 Apr 2025
LV-MAE: Learning Long Video Representations through Masked-Embedding Autoencoders
Ilan Naiman
Emanuel Ben-Baruch
Oron Anschel
Alon Shoshan
Igor Kviatkovsky
Manoj Aggarwal
Gérard Medioni
91
0
0
04 Apr 2025
Charm: The Missing Piece in ViT fine-tuning for Image Aesthetic Assessment
Fatemeh Behrad
Tinne Tuytelaars
Johan Wagemans
ViT
122
0
0
03 Apr 2025
SCHNet: SAM Marries CLIP for Human Parsing
Kunliang Liu
Jianming Wang
Rize Jin
Wonjun Hwang
Tae-Sun Chung
VLM
137
0
0
28 Mar 2025
Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion Models
Sangwon Beak
Hyeonwoo Kim
Hanbyul Joo
110
0
0
25 Mar 2025
RP-SAM2: Refining Point Prompts for Stable Surgical Instrument Segmentation
Nuren Zhaksylyk
Ibrahim Almakky
Jay N. Paranjape
S. Vedula
S. Sikder
Vishal M. Patel
Mohammad Yaqub
85
0
0
25 Mar 2025
OCRT: Boosting Foundation Models in the Open World with Object-Concept-Relation Triad
Luyao Tang
Yuxuan Yuan
Chen Chen
Zeyu Zhang
Yue Huang
Kun Zhang
103
1
0
24 Mar 2025
RAW-Adapter: Adapting Pre-trained Visual Model to Camera RAW Images and A Benchmark
Ziteng Cui
Jianfei Yang
Tatsuya Harada
VLM
98
0
0
21 Mar 2025
M2N2V2: Multi-Modal Unsupervised and Training-free Interactive Segmentation
Markus Karmann
Peng-Tao Jiang
Bo Li
O. Urfalioglu
87
0
0
20 Mar 2025
WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes
L. Yang
Kaixin Zhu
Juanxi Tian
Bohan Zeng
Matthieu Lin
Hongjuan Pei
Wentao Zhang
Shuicheng Yan
VGen
174
0
0
17 Mar 2025
Segment Any-Quality Images with Generative Latent Space Enhancement
Guangqian Guo
Yoong Guo
Xuehui Yu
Wenbo Li
Yaoxing Wang
Shan Gao
VLM
219
0
0
16 Mar 2025
ROS-SAM: High-Quality Interactive Segmentation for Remote Sensing Moving Object
Zhe Shan
Yang Liu
Lei Zhou
C. Yan
Haoyu Wang
Xia Xie
104
4
0
15 Mar 2025
Breaking the Box: Enhancing Remote Sensing Image Segmentation with Freehand Sketches
Ying Zang
Yuncan Gao
Jiangi Zhang
Yuangi Hu
Runlong Cao
Lanyun Zhu
Qi Zhu
Deyi Ji
Renjun Xu
Tianrun Chen
109
0
0
15 Mar 2025
Unveiling the Invisible: Reasoning Complex Occlusions Amodally with AURA
Zhixuan Li
Hyunse Yoon
Sanghoon Lee
Weisi Lin
97
1
0
13 Mar 2025
OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting
Yongsheng Yu
Ziyun Zeng
Haitian Zheng
Jiebo Luo
DiffM
152
2
0
13 Mar 2025
6D Object Pose Tracking in Internet Videos for Robotic Manipulation
Georgy Ponimatkin
Martin Cífka
Tomáš Souček
Médéric Fourmy
Yann Labbé
Vladimir Petrik
Josef Sivic
90
1
0
13 Mar 2025
SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories
Muzhi Zhu
Yuzhuo Tian
Hao Chen
Chunluan Zhou
Qingpei Guo
Yongxu Liu
M. Yang
Chunhua Shen
MLLM
VLM
138
1
0
11 Mar 2025
Customized SAM 2 for Referring Remote Sensing Image Segmentation
Fu Rong
Meng Lan
Qian Zhang
Lefei Zhang
124
0
0
10 Mar 2025
Vid2Avatar-Pro: Authentic Avatar from Videos in the Wild via Universal Prior
Chen Guo
Junxuan Li
Yash Kant
Yaser Sheikh
Forrest Iandola
Chen Cao
102
3
0
03 Mar 2025
ZeroPS: High-quality Cross-modal Knowledge Transfer for Zero-Shot 3D Part Segmentation
Yuheng Xue
Nenglun Chen
Jun Liu
Wenyun Sun
3DPC
246
7
0
24 Feb 2025
Soybean pod and seed counting in both outdoor fields and indoor laboratories using unions of deep neural networks
Tianyou Jiang
Mingshun Shao
Tianyi Zhang
Xiaoyu Liu
Qun Yu
113
0
0
24 Feb 2025
MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
Fu Rong
Meng Lan
Qian Zhang
Lefei Zhang
VOS
VGen
120
1
0
23 Jan 2025
DynamicEarth: How Far are We from Open-Vocabulary Change Detection?
Kaiyu Li
Xiangyong Cao
Yupeng Deng
Chao Pang
Zepeng Xin
Deyu Meng
Zhi Wang
ObjD
159
1
0
22 Jan 2025
SkipClick: Combining Quick Responses and Low-Level Features for Interactive Segmentation in Winter Sports Contexts
Robin Schon
Julian Lorenz
Daniel Kienzle
Rainer Lienhart
89
0
0
14 Jan 2025
SAM-DA: Decoder Adapter for Efficient Medical Domain Adaptation
Javier Gamazo Tejero
Moritz Schmid
Pablo Márquez-Neila
M. Zinkernagel
Sebastian Wolf
Raphael Sznitman
MedIm
71
0
0
12 Jan 2025
Vid2Sim: Realistic and Interactive Simulation from Video for Urban Navigation
Ziyang Xie
Zhizheng Liu
Zhenghao Peng
Wayne Wu
Bolei Zhou
VGen
161
5
0
12 Jan 2025
PGP-SAM: Prototype-Guided Prompt Learning for Efficient Few-Shot Medical Image Segmentation
Zhonghao Yan
Zijin Yin
Tianyu Lin
Xiangzhu Zeng
Kongming Liang
Zhanyu Ma
VLM
MedIm
157
0
0
12 Jan 2025
TAVP: Task-Adaptive Visual Prompt for Cross-domain Few-shot Segmentation
Jiaqi Yang
Ye Huang
Jingxi Hu
Xiangjian He
Linlin Shen
Guoping Qiu
VLM
132
1
0
31 Dec 2024
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM
Yuqian Yuan
Hang Zhang
Wentong Li
Zesen Cheng
Boqiang Zhang
...
Deli Zhao
Wenqiao Zhang
Yueting Zhuang
Jianke Zhu
Lidong Bing
168
10
0
31 Dec 2024
Effective and secure federated online learning to rank
Shuyi Wang
110
0
0
26 Dec 2024