Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.02643
Cited By
Segment Anything
5 April 2023
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
Laura Gustafson
Tete Xiao
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLM
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Segment Anything"
50 / 1,384 papers shown
Title
Label-Efficient LiDAR Panoptic Segmentation
Ahmet Selim Çanakçı
Niclas Vodisch
Kürsat Petek
Wolfram Burgard
Abhinav Valada
3DPC
204
0
0
04 Mar 2025
Out-of-Distribution Segmentation in Autonomous Driving: Problems and State of the Art
Youssef Shoeb
Azarm Nowzad
Hanno Gottschalk
UQCV
288
2
0
04 Mar 2025
FlowPlan: Zero-Shot Task Planning with LLM Flow Engineering for Robotic Instruction Following
Zijun Lin
Chao Tang
Hanjing Ye
Kuanqi Cai
119
0
0
04 Mar 2025
Boltzmann Attention Sampling for Image Analysis with Small Objects
Theodore Zhao
Sid Kiblawi
Naoto Usuyama
Ho Hin Lee
Sam Preston
Hoifung Poon
Mu-Hsin Wei
MedIm
212
0
0
04 Mar 2025
Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance
Jiayi Zhao
Fei Teng
Kai Luo
Guoqiang Zhao
Hui Yuan
Xu Zheng
Kailun Yang
VLM
133
7
0
04 Mar 2025
WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation
Dujun Nie
Xianda Guo
Yiqun Duan
Ruijun Zhang
Long Chen
LM&Ro
382
5
0
04 Mar 2025
Every SAM Drop Counts: Embracing Semantic Priors for Multi-Modality Image Fusion and Beyond
Guanyao Wu
Haoyu Liu
Hongming Fu
Yichuan Peng
Jinyuan Liu
Xin-Yue Fan
Risheng Liu
145
0
0
03 Mar 2025
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface
Hao Tang
Chenwei Xie
Haiyang Wang
Xiaoyi Bao
Tingyu Weng
Pandeng Li
Yun Zheng
Liwei Wang
ObjD
VLM
151
2
0
03 Mar 2025
SAR-W-MixMAE: SAR Foundation Model Training Using Backscatter Power Weighting
Ali Caglayan
Nevrez Imamoglu
T. Kouyama
162
0
0
03 Mar 2025
One-shot In-context Part Segmentation
Zhenqi Dai
Ting Liu
Xinyu Zhang
Y. X. Wei
Yanning Zhang
VLM
200
1
0
03 Mar 2025
OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging
Yijie Tang
JIazhao Zhang
Yuqing Lan
Yulan Guo
Dezun Dong
Chenyang Zhu
K. Xu
443
1
0
03 Mar 2025
Dynamic Gradient Sparsification Training for Few-Shot Fine-tuning of CT Lymph Node Segmentation Foundation Model
Zihao Luo
Zijun Gao
Wenjun Liao
Shichuan Zhang
Guotai Wang
Xiangde Luo
93
0
0
02 Mar 2025
GenAnalysis: Joint Shape Analysis by Learning Man-Made Shape Generators with Deformation Regularizations
Yuezhi Yang
Haitao Yang
Kiyohiro Nakayama
Xiangru Huang
Leonidas Guibas
Qixing Huang
81
0
0
02 Mar 2025
IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis
Yun Wang
Jingchen Ni
Yong-Jin Liu
Chun Yuan
Yansong Tang
98
5
0
02 Mar 2025
Solving Instance Detection from an Open-World Perspective
Qianqian Shen
Yunhan Zhao
Nahyun Kwon
Jeeeun Kim
Yanan Li
Shu Kong
152
2
0
01 Mar 2025
Seeing A 3D World in A Grain of Sand
Yufan Zhang
Yu Ji
Yu Guo
Jinwei Ye
3DV
120
0
0
01 Mar 2025
Brain Foundation Models: A Survey on Advancements in Neural Signal Processing and Brain Discovery
Xinliang Zhou
Chenyu Liu
Zhenpeng Chen
Kun Wang
Yi Ding
Ziyu Jia
Qingsong Wen
AI4CE
123
1
0
01 Mar 2025
Theoretical Insights in Model Inversion Robustness and Conditional Entropy Maximization for Collaborative Inference Systems
Song Xia
Yi Yu
Wenhan Yang
Meiwen Ding
Zhuo Chen
Lingyu Duan
Alex C. Kot
Xudong Jiang
120
4
0
01 Mar 2025
Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning
Hanxun Yu
Wentong Li
Song Wang
Jintai Chen
Jianke Zhu
3DV
LRM
167
9
0
01 Mar 2025
Less is More? Revisiting the Importance of Frame Rate in Real-Time Zero-Shot Surgical Video Segmentation
Utku Ozbulak
Seyed Amir Mousavi
Francesca Tozzi
Nikdokht Rashidian
W. Willaert
W. D. Neve
J. Vankerschaver
93
0
0
28 Feb 2025
T2ICount: Enhancing Cross-modal Understanding for Zero-Shot Counting
Yifei Qian
Zhongliang Guo
Bowen Deng
Chun Tong Lei
Shuai Zhao
Chun Pong Lau
Xiaopeng Hong
Michael P. Pound
DiffM
212
1
0
28 Feb 2025
Spiking Transformer:Introducing Accurate Addition-Only Spiking Self-Attention for Transformer
Yufei Guo
Xiaode Liu
Y. Chen
Weihang Peng
Yuhan Zhang
Zhe Ma
MQ
133
3
0
28 Feb 2025
SeisMoLLM: Advancing Seismic Monitoring via Cross-modal Transfer with Pre-trained Large Language Model
Xinyu Wang
Feng Liu
Rui Su
Ziyi Wang
Junlin Wu
Wanli Ouyang
Lei Bai
Wanli Ouyang
VLM
316
2
0
27 Feb 2025
You Only Click Once: Single Point Weakly Supervised 3D Instance Segmentation for Autonomous Driving
Guangfeng Jiang
Jun Liu
Yongxuan Lv
Yongpeng Wu
Xianfei Li
Wenlong Liao
Tao He
Pai Peng
3DPC
124
0
0
27 Feb 2025
Vector-Quantized Vision Foundation Models for Object-Centric Learning
Rongzhen Zhao
V. Wang
Arno Solin
Joni Pajarinen
OCL
VLM
588
1
0
27 Feb 2025
Attention-Guided Integration of CLIP and SAM for Precise Object Masking in Robotic Manipulation
Muhammad A. Muttaqien
Tomohiro Motoda
Ryo Hanai
Domae Yukiyasu
61
1
0
26 Feb 2025
ArtGS: Building Interactable Replicas of Complex Articulated Objects via Gaussian Splatting
Yu Liu
Baoxiong Jia
Ruijie Lu
Junfeng Ni
Song-Chun Zhu
Siyuan Huang
3DGS
182
14
0
26 Feb 2025
A Survey on Foundation-Model-Based Industrial Defect Detection
Tianle Yang
Luyao Chang
Jiadong Yan
Jiajian Li
Zhi Wang
Ke Zhang
AI4CE
182
4
0
26 Feb 2025
Bayesian Computation in Deep Learning
Wenlong Chen
Bolian Li
Ruqi Zhang
Yingzhen Li
BDL
124
0
0
25 Feb 2025
OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference
Xiangyu Zhao
Shengyuan Ding
Zicheng Zhang
Haian Huang
Maosong Cao
...
Wenhai Wang
Guangtao Zhai
Haodong Duan
Hua Yang
Kai Chen
184
8
0
25 Feb 2025
Self-Supervised Data Generation for Precision Agriculture: Blending Simulated Environments with Real Imagery
Leonardo Saraceni
I. M. Motoi
Daniele Nardi
Thomas Alessandro Ciarfuglia
113
2
0
25 Feb 2025
VesselSAM: Leveraging SAM for Aortic Vessel Segmentation with AtrousLoRA
Adnan Iltaf
Rayan Merghani Ahmed
Bin Li
Bin Li
Shoujun Zhou
164
0
0
25 Feb 2025
A Closer Look at TabPFN v2: Understanding Its Strengths and Extending Its Capabilities
Han-Jia Ye
Si-Yang Liu
Wei-Lun Chao
138
8
0
24 Feb 2025
Introducing Visual Perception Token into Multimodal Large Language Model
Runpeng Yu
Xinyin Ma
Xinchao Wang
MLLM
LRM
179
4
0
24 Feb 2025
ZeroPS: High-quality Cross-modal Knowledge Transfer for Zero-Shot 3D Part Segmentation
Yuheng Xue
Nenglun Chen
Jun Liu
Wenyun Sun
3DPC
246
8
0
24 Feb 2025
A large-scale multicenter breast cancer DCE-MRI benchmark dataset with expert segmentations
Lidia Garrucho
C. Reidel
Kaisar Kushibar
Smriti Joshi
Richard Osuala
...
M. P. Starmans
Fredrik Strand
Oliver Díaz
Laura Igual
Karim Lekadir
158
9
0
24 Feb 2025
SpecDM: Hyperspectral Dataset Synthesis with Pixel-level Semantic Annotations
Wen Liu
Pei Yang
Wenhui Hong
Xiaoguang Mei
Jiayi Ma
DiffM
108
0
0
24 Feb 2025
UrbanSAM: Learning Invariance-Inspired Adapters for Segment Anything Models in Urban Construction
Chenyu Li
Danfeng Hong
Bing Zhang
Yuxuan Li
Gustau Camps-Valls
X. Zhu
J. Chanussot
104
6
0
24 Feb 2025
LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
Shuai Yang
Jing Tan
Mengchen Zhang
Tong Wu
Yongqian Li
Gordon Wetzstein
Ziwei Liu
Dahua Lin
MDE
VGen
175
11
0
24 Feb 2025
Gaussian Difference: Find Any Change Instance in 3D Scenes
Binbin Jiang
Rui Huang
Qingyi Zhao
Yuxiang Zhang
118
0
0
24 Feb 2025
Surgical Scene Understanding in the Era of Foundation AI Models: A Comprehensive Review
Ufaq Khan
Umair Nawaz
A. Qayyum
Shazad Ashraf
Muhammad Bilal
Junaid Qadir
150
1
0
24 Feb 2025
IBURD: Image Blending for Underwater Robotic Detection
Jungseok Hong
Sakshi Singh
Junaed Sattar
121
1
0
24 Feb 2025
Vision Foundation Models in Medical Image Analysis: Advances and Challenges
Pengchen Liang
Bin Pu
Haishan Huang
Yiwei Li
Haoran Wang
Weibo Ma
Qing Chang
VLM
MedIm
159
2
0
24 Feb 2025
DUNIA: Pixel-Sized Embeddings via Cross-Modal Alignment for Earth Observation Applications
Ibrahim Fayad
Max Zimmer
Martin Schwartz
P. Ciais
Fabian Gieseke
Gabriel Belouze
Sarah Brood
A. D. Truchis
Alexandre d’Aspremont
AI4TS
107
0
0
24 Feb 2025
MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs
Jiarui Zhang
Mahyar Khayatkhoei
P. Chhikara
Filip Ilievski
LRM
121
21
0
24 Feb 2025
Vision-LSTM: xLSTM as Generic Vision Backbone
Benedikt Alkin
M. Beck
Korbinian Poppel
Sepp Hochreiter
Johannes Brandstetter
VLM
239
53
0
24 Feb 2025
MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations
Benedikt Alkin
Lukas Miklautz
Sepp Hochreiter
Johannes Brandstetter
VLM
276
9
0
24 Feb 2025
Tidiness Score-Guided Monte Carlo Tree Search for Visual Tabletop Rearrangement
Hogun Kee
Wooseok Oh
Minjae Kang
Hyemin Ahn
Songhwai Oh
102
0
0
24 Feb 2025
Soybean pod and seed counting in both outdoor fields and indoor laboratories using unions of deep neural networks
Tianyou Jiang
Mingshun Shao
Tianyi Zhang
Xiaoyu Liu
Qun Yu
114
0
0
24 Feb 2025
Anatomy-Informed Deep Learning and Radiomics for Automated Neurofibroma Segmentation in Whole-Body MRI
Georgii Kolokolnikov
Marie-Lena Schmalhofer
Lennart Well
Said Farschtschi
Victor-Felix Mautner
Inka Ristow
Rene Werner
AI4CE
92
0
0
24 Feb 2025
Previous
1
2
3
...
9
10
11
...
26
27
28
Next