Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.02643
Cited By
Segment Anything
5 April 2023
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
Laura Gustafson
Tete Xiao
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLM
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Segment Anything"
50 / 1,373 papers shown
Title
Track Any Anomalous Object: A Granular Video Anomaly Detection Pipeline
Yuzhi Huang
Chenxin Li
H. Zhang
Zixu Lin
Yunlong Lin
...
Xinyu Liu
Jiechao Gao
Yue Huang
Xinghao Ding
Yixuan Yuan
124
0
0
05 Jun 2025
Neural Network Reprogrammability: A Unified Theme on Model Reprogramming, Prompt Tuning, and Prompt Instruction
Zesheng Ye
C. Cai
Ruijiang Dong
Jianzhong Qi
Lei Feng
Pin-Yu Chen
Feng Liu
254
0
0
05 Jun 2025
Bringing SAM to new heights: Leveraging elevation data for tree crown segmentation from drone imagery
Mélisande Teng
Arthur Ouaknine
Etienne Laliberté
Yoshua Bengio
David Rolnick
Hugo Larochelle
176
0
0
05 Jun 2025
A Large-Scale Referring Remote Sensing Image Segmentation Dataset and Benchmark
Zhigang Yang
Huiguang Yao
Linmao Tian
Xuezhi Zhao
Qiang Li
Qi. Wang
102
0
0
04 Jun 2025
Average Calibration Losses for Reliable Uncertainty in Medical Image Segmentation
Theodore Barfoot
Luis C. Garcia-Peraza-Herrera
Samet Akcay
Ben Glocker
Tom Vercauteren
UQCV
156
0
0
04 Jun 2025
Object-level Self-Distillation for Vision Pretraining
Çağlar Hızlı
Çağatay Yıldız
Pekka Marttinen
OCL
VLM
59
0
0
04 Jun 2025
A Generative Adaptive Replay Continual Learning Model for Temporal Knowledge Graph Reasoning
Zhiyu Zhang
Wei Chen
Youfang Lin
Huaiyu Wan
OffRL
CLL
128
0
0
04 Jun 2025
Robust Neural Rendering in the Wild with Asymmetric Dual 3D Gaussian Splatting
Chengqi Li
Zhihao Shi
Yangdi Lu
Wenbo He
Xiangyu Xu
3DGS
72
0
0
04 Jun 2025
SemNav: A Model-Based Planner for Zero-Shot Object Goal Navigation Using Vision-Foundation Models
Arnab Debnath
Gregory J. Stein
Jana Kosecka
LM&Ro
100
0
0
04 Jun 2025
Talk2SAM: Text-Guided Semantic Enhancement for Complex-Shaped Object Segmentation
Luka Vetoshkin
Dmitry Yudin
26
0
0
03 Jun 2025
BEVCALIB: LiDAR-Camera Calibration via Geometry-Guided Bird's-Eye View Representations
Weiduo Yuan
Jerry Li
Justin Yue
Divyank Shah
Konstantinos Karydis
Hang Qiu
75
0
0
03 Jun 2025
FORLA:Federated Object-centric Representation Learning with Slot Attention
Guiqiu Liao
M. Jogan
Eric Eaton
Daniel A. Hashimoto
FedML
81
0
0
03 Jun 2025
RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model on Referring Expressions
Bimsara Pathiraja
Maitreya Patel
Shivam Singh
Yezhou Yang
Chitta Baral
44
0
0
03 Jun 2025
From Flat to Hierarchical: Extracting Sparse Representations with Matching Pursuit
Valérie Costa
Thomas Fel
Ekdeep Singh Lubana
Bahareh Tolooshams
Demba Ba
79
0
0
03 Jun 2025
Native-Resolution Image Synthesis
Zidong Wang
Lei Bai
Xiangyu Yue
Wanli Ouyang
Yiyuan Zhang
85
0
0
03 Jun 2025
GaRA-SAM: Robustifying Segment Anything Model with Gated-Rank Adaptation
Sohyun Lee
Yeho Kwon
Lukas Hoyer
Suha Kwak
89
0
0
03 Jun 2025
Towards In-the-wild 3D Plane Reconstruction from a Single Image
Jiachen Liu
Rui Yu
Sili Chen
Sharon X. Huang
Hengkai Guo
3DV
75
1
0
03 Jun 2025
PartComposer: Learning and Composing Part-Level Concepts from Single-Image Examples
Junyu Liu
R. K. Jones
Daniel E. Ritchie
DiffM
CoGe
80
0
0
03 Jun 2025
Zero-Shot Tree Detection and Segmentation from Aerial Forest Imagery
Michelle Chen
David Russell
Amritha Pallavoor
Derek Young
Jane Wu
VLM
67
0
0
03 Jun 2025
Cross-Modal Urban Sensing: Evaluating Sound-Vision Alignment Across Street-Level and Aerial Imagery
Pengyu Chen
Xiao Huang
Teng Fei
Sicheng Wang
47
0
0
03 Jun 2025
SAMJ: Fast Image Annotation on ImageJ/Fiji via Segment Anything Model
Carlos Garcia-Lopez-de-Haro
Caterina Fuster-Barcelo
Curtis T. Rueden
Jonathan Heras
Vladimir Ulman
...
Kevin W. Eliceiri
Jean-Christophe Olivo-Marin
Jean-Yves Tinevez
Daniel Sage
A. Muñoz-Barrutia
VLM
61
0
0
03 Jun 2025
Towards Auto-Annotation from Annotation Guidelines: A Benchmark through 3D LiDAR Detection
Yechi Ma
Wei Hua
Shu Kong
72
0
0
03 Jun 2025
No Train Yet Gain: Towards Generic Multi-Object Tracking in Sports and Beyond
Tomasz Stanczyk
Seongro Yoon
François Brémond
80
0
0
02 Jun 2025
G4Seg: Generation for Inexact Segmentation Refinement with Diffusion Models
Tianjiao Zhang
Fei Zhang
Jiangchao Yao
Ya Zhang
Yanfeng Wang
DiffM
123
1
0
02 Jun 2025
Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning
Yijun Yang
Zhao-Yang Wang
Qiuping Liu
Shuwen Sun
Kang Wang
...
Zongwei Zhou
Alan Yuille
Lei Zhu
Yu Zhang
Jieneng Chen
37
0
0
02 Jun 2025
SAM-I2V: Upgrading SAM to Support Promptable Video Segmentation with Less than 0.2% Training Cost
Haiyang Mei
Pengyu Zhang
Mike Zheng Shou
VLM
55
0
0
02 Jun 2025
DualMap: Online Open-Vocabulary Semantic Mapping for Natural Language Navigation in Dynamic Changing Scenes
Jiajun Jiang
Yiming Zhu
Zirui Wu
Jie Song
84
0
0
02 Jun 2025
Ultra-High-Resolution Image Synthesis: Data, Method and Evaluation
Jinjin Zhang
Qiuyu Huang
Junjie Liu
Xiefan Guo
Di Huang
78
0
0
02 Jun 2025
CountingFruit: Real-Time 3D Fruit Counting with Language-Guided Semantic Gaussian Splatting
F. Li
Yangle Liu
Jieming Ma
Hai-Ning Liang
Yaochun Shen
Huangxiang Li
Zhijing Wu
3DGS
52
0
0
01 Jun 2025
FedRPCA: Enhancing Federated LoRA Aggregation Using Robust PCA
Divyansh Jhunjhunwala
Arian Raje
Madan Ravi Ganesh
Chaithanya Kumar Mummadi
Chaoqun Dong
Jiawei Zhou
Wan-Yi Lin
Gauri Joshi
Zhenzhen Li
56
0
0
01 Jun 2025
AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature Prompting
Yuyuan Liu
Yuanhong Chen
Chong Wang
Junlin Han
Junde Wu
Can Peng
Jingkun Chen
Yu Tian
Gustavo Carneiro
VLM
63
0
0
01 Jun 2025
Advancing from Automated to Autonomous Beamline by Leveraging Computer Vision
Baolu Li
Hongkai Yu
Huiming Sun
Jin Ma
Yuewei Lin
Lu Ma
Yonghua Du
34
0
0
01 Jun 2025
Seg2Any: Open-set Segmentation-Mask-to-Image Generation with Precise Shape and Semantic Control
Danfeng li
Hui Zhang
Sheng Wang
Jiacheng Li
Zuxuan Wu
DiffM
VLM
56
0
0
31 May 2025
XYZ-IBD: A High-precision Bin-picking Dataset for Object 6D Pose Estimation Capturing Real-world Industrial Complexity
Junwen Huang
Jizhong Liang
Jiaqi Hu
Martin Sundermeyer
Peter KT Yu
Nassir Navab
Benjamin Busam
55
0
0
31 May 2025
A Mathematical Perspective On Contrastive Learning
Ricardo Baptista
Andrew Stuart
S. D. Tran
27
0
0
30 May 2025
Object Centric Concept Bottlenecks
David Steinmann
Wolfgang Stammer
Antonia Wüst
Kristian Kersting
OCL
51
0
0
30 May 2025
Tackling View-Dependent Semantics in 3D Language Gaussian Splatting
Jiazhong Cen
Xudong Zhou
Jiemin Fang
Changsong Wen
Lingxi Xie
Xiaopeng Zhang
Wei Shen
Qi Tian
3DGS
48
0
0
30 May 2025
DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis?
Tianhong Zhou
Yin Xu
Yingtao Zhu
Chuxi Xiao
Haiyang Bian
Lei Wei
Xuegong Zhang
LM&MA
VLM
41
0
0
30 May 2025
Seeing is Not Reasoning: MVPBench for Graph-based Evaluation of Multi-path Visual Physical CoT
Zhuobai Dong
Junchao Yi
Ziyuan Zheng
Haochen Han
Xiangxi Zheng
Alex Jinpeng Wang
Fangming Liu
Linjie Li
ReLM
LRM
36
0
0
30 May 2025
Benchmarking Foundation Models for Zero-Shot Biometric Tasks
Redwan Sony
Parisa Farmanifard
Hamzeh Alzwairy
Nitish Shukla
Arun Ross
CVBM
VLM
74
0
0
30 May 2025
GenSpace: Benchmarking Spatially-Aware Image Generation
Zehan Wang
Jiayang Xu
Ziang Zhang
Tianyu Pan
Chao Du
Hengshuang Zhao
Zhou Zhao
EGVM
77
0
0
30 May 2025
KairosAD: A SAM-Based Model for Industrial Anomaly Detection on Embedded Devices
Uzair Khan
Franco Fummi
Luigi Capogrosso
36
0
0
30 May 2025
Unleashing the Power of Intermediate Domains for Mixed Domain Semi-Supervised Medical Image Segmentation
Qinghe Ma
Jian Zhang
Lei Qi
Qian Yu
Yinghuan Shi
Yang Gao
33
0
0
30 May 2025
Weakly-Supervised Affordance Grounding Guided by Part-Level Semantic Priors
Peiran Xu
Yadong Mu
116
2
0
30 May 2025
Pretraining Deformable Image Registration Networks with Random Images
Junyu Chen
Shuwen Wei
Yihao Liu
A. Carass
Yong Du
49
0
0
30 May 2025
Leveraging Intermediate Features of Vision Transformer for Face Anti-Spoofing
Mika Feng
Koichi Ito
T. Aoki
Tetsushi Ohki
M. Nishigaki
CVBM
ViT
64
0
0
30 May 2025
Segmenting France Across Four Centuries
Marta López-Rauhut
Hongyu Zhou
Mathieu Aubry
Loic Landrieu
AI4TS
38
0
0
30 May 2025
Zero-P-to-3: Zero-Shot Partial-View Images to 3D Object
Yuxuan Lin
Ruihang Chu
Zhenyu Chen
Xiao Tang
Lei Ke
...
Zhihao Li
Shiyong Liu
Xiaofei Wu
Jianzhuang Liu
Yujiu Yang
55
0
0
29 May 2025
Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners
Michal Nauman
Marek Cygan
Carmelo Sferrazza
Aviral Kumar
Pieter Abbeel
OffRL
104
0
0
29 May 2025
PixelThink: Towards Efficient Chain-of-Pixel Reasoning
Song Wang
Gongfan Fang
Lingdong Kong
Xiangtai Li
Jianyun Xu
Sheng Yang
Qiang Li
Jianke Zhu
Xinchao Wang
LRM
141
0
0
29 May 2025
Previous
1
2
3
4
5
6
...
26
27
28
Next