Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2306.14289
Cited By
v1
v2 (latest)
Faster Segment Anything: Towards Lightweight SAM for Mobile Applications
25 June 2023
Chaoning Zhang
Dongshen Han
Yu Qiao
Jung Uk Kim
Sung-Ho Bae
Seungkyu Lee
Choong Seon Hong
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Github (5194★)
Papers citing
"Faster Segment Anything: Towards Lightweight SAM for Mobile Applications"
50 / 70 papers shown
Title
History-Augmented Vision-Language Models for Frontier-Based Zero-Shot Object Navigation
Mobin Habibpour
Fatemeh Afghah
LM&Ro
15
0
0
19 Jun 2025
A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects
Guohuan Xie
Syed Ariff Syed Hesham
Wenya Guo
Bing Li
Ming-Ming Cheng
Guolei Sun
Yun-Hai Liu
29
0
0
16 Jun 2025
Uncertainty-Informed Active Perception for Open Vocabulary Object Goal Navigation
Utkarsh Bajpai
Julius Ruckin
Cyrill Stachniss
Marija Popović
23
0
0
16 Jun 2025
On the development of an AI performance and behavioural measures for teaching and classroom management
Andreea I. Niculescu
Jochen Ehnen
Chen Yi
Du Jiawei
Tay Chiat Pin
...
Teh Kah Kuan
Tran Huy Dat
John Komar
Gi Soong Chee
Kenneth Kwok
13
0
0
11 Jun 2025
Segment This Thing: Foveated Tokenization for Efficient Point-Prompted Segmentation
Tanner Schmidt
Richard Newcombe
VLM
30
0
0
10 Jun 2025
SAM-I2V: Upgrading SAM to Support Promptable Video Segmentation with Less than 0.2% Training Cost
Haiyang Mei
Pengyu Zhang
Mike Zheng Shou
VLM
49
0
0
02 Jun 2025
DualMap: Online Open-Vocabulary Semantic Mapping for Natural Language Navigation in Dynamic Changing Scenes
Jiajun Jiang
Yiming Zhu
Zirui Wu
Jie Song
71
0
0
02 Jun 2025
KairosAD: A SAM-Based Model for Industrial Anomaly Detection on Embedded Devices
Uzair Khan
Franco Fummi
Luigi Capogrosso
27
0
0
30 May 2025
Stairway to Success: Zero-Shot Floor-Aware Object-Goal Navigation via LLM-Driven Coarse-to-Fine Exploration
Zeying Gong
Rong Li
Tianshuai Hu
Ronghe Qiu
Lingdong Kong
Lingfeng Zhang
Yiyi Ding
Leying Zhang
Junwei Liang
58
0
0
29 May 2025
Mobi-
π
π
π
: Mobilizing Your Robot Learning Policy
Jingyun Yang
Isabella Huang
Brandon Vu
Max Bajracharya
Rika Antonova
Jeannette Bohg
45
0
0
29 May 2025
InfoSAM: Fine-Tuning the Segment Anything Model from An Information-Theoretic Perspective
Yuanhong Zhang
Muyao Yuan
Weizhan Zhang
Tieliang Gong
Wen Wen
Jiangyong Ying
Weijie Shi
VLM
41
0
0
28 May 2025
Universal Domain Adaptation for Semantic Segmentation
Seun-An Choe
Keon-Hee Park
J. Choi
Gyeong-Moon Park
78
0
0
28 May 2025
AoP-SAM: Automation of Prompts for Efficient Segmentation
Yi Chen
Mu-Young Son
Chuanbo Hua
Joo-Young Kim
VLM
94
0
0
17 May 2025
Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model
Navin Ranjan
Andreas E. Savakis
MQ
VLM
145
0
0
08 May 2025
CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting
Huawei Sun
Bora Kunter Sahin
Georg Stettinger
Maximilian Bernhard
Matthias Schubert
Robert Wille
149
0
0
06 May 2025
Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation
Feng Xue
Wenzhuang Xu
Guofeng Zhong
Anlong Minga
N. Sebe
134
0
0
01 May 2025
ApexNav: An Adaptive Exploration Strategy for Zero-Shot Object Navigation with Target-centric Semantic Fusion
Mingjie Zhang
Yuheng Du
Chengkai Wu
Jinni Zhou
Zhenchao Qi
Jun Ma
Boyu Zhou
218
0
0
20 Apr 2025
IAAO: Interactive Affordance Learning for Articulated Objects in 3D Environments
Can Zhang
G. Lee
91
0
0
09 Apr 2025
LV-MAE: Learning Long Video Representations through Masked-Embedding Autoencoders
Ilan Naiman
Emanuel Ben-Baruch
Oron Anschel
Alon Shoshan
Igor Kviatkovsky
Manoj Aggarwal
Gérard Medioni
87
0
0
04 Apr 2025
Customized SAM 2 for Referring Remote Sensing Image Segmentation
Fu Rong
Meng Lan
Qian Zhang
Lefei Zhang
114
0
0
10 Mar 2025
SAQ-SAM: Semantically-Aligned Quantization for Segment Anything Model
Jing Zhang
Zhiyu Li
Qingyi Gu
MQ
VLM
78
0
0
09 Mar 2025
A Token-level Text Image Foundation Model for Document Understanding
Tongkun Guan
Zining Wang
Pei Fu
Zhengtao Guo
Wei Shen
...
Chen Duan
Hao Sun
Qianyi Jiang
Junfeng Luo
Xiaokang Yang
VLM
182
2
0
04 Mar 2025
ZeroPS: High-quality Cross-modal Knowledge Transfer for Zero-Shot 3D Part Segmentation
Yuheng Xue
Nenglun Chen
Jun Liu
Wenyun Sun
3DPC
242
7
0
24 Feb 2025
Vision Foundation Models in Medical Image Analysis: Advances and Challenges
Pengchen Liang
Bin Pu
Haishan Huang
Yiwei Li
Haoran Wang
Weibo Ma
Qing Chang
VLM
MedIm
144
1
0
24 Feb 2025
Imit Diff: Semantics Guided Diffusion Transformer with Dual Resolution Fusion for Imitation Learning
Yuhang Dong
Haizhou Ge
Yupei Zeng
Jing Zhang
Beiwen Tian
...
Yufei Jia
Ruixiang Wang
Ran Yi
Guyue Zhou
Longhua Ma
105
1
0
11 Feb 2025
Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models
Tongkun Liu
Bing Li
Xiao Jin
Yupeng Shi
Qiuying Li
Xiang Wei
133
0
0
03 Feb 2025
MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
Fu Rong
Meng Lan
Qian Zhang
Lefei Zhang
VOS
VGen
114
1
0
23 Jan 2025
DynamicEarth: How Far are We from Open-Vocabulary Change Detection?
Kaiyu Li
Xiangyong Cao
Yupeng Deng
Chao Pang
Zepeng Xin
Deyu Meng
Zhi Wang
ObjD
151
1
0
22 Jan 2025
SkipClick: Combining Quick Responses and Low-Level Features for Interactive Segmentation in Winter Sports Contexts
Robin Schon
Julian Lorenz
Daniel Kienzle
Rainer Lienhart
83
0
0
14 Jan 2025
Dr. Tongue: Sign-Oriented Multi-label Detection for Remote Tongue Diagnosis
Yiliang Chen
Steven SC Ho
Cheng Xu
Yao Jie Xie
Wing-Fai Yeung
Shengfeng He
Jing Qin
LM&MA
94
0
0
06 Jan 2025
Rethinking Detecting Salient and Camouflaged Objects in Unconstrained Scenes
Zhangjun Zhou
Yiping Li
Chunlin Zhong
Jianuo Huang
Jialun Pei
He Tang
He Tang
125
0
0
14 Dec 2024
Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning
Yuxiang Lu
Shengcao Cao
Yu-Xiong Wang
124
1
0
18 Oct 2024
Order-aware Interactive Segmentation
Bin Wang
Anwesa Choudhuri
Meng Zheng
Zhongpai Gao
Benjamin Planche
Andong Deng
Qin Liu
Terrence Chen
Ulas Bagci
Ziyan Wu
VLM
468
1
0
16 Oct 2024
Zero-Shot Pupil Segmentation with SAM 2: A Case Study of Over 14 Million Images
Virmarie Maquiling
Sean Anthony Byrne
D. Niehorster
Marco Carminati
Enkelejda Kasneci
VLM
118
2
0
11 Oct 2024
On Efficient Variants of Segment Anything Model: A Survey
Xiaorui Sun
Jing Liu
Jikang Cheng
Xiaofeng Zhu
Ping Hu
VLM
143
7
0
07 Oct 2024
Playful DoggyBot: Learning Agile and Precise Quadrupedal Locomotion
Xin Duan
Ziwen Zhuang
Hang Zhao
Soeren Schwertfeger
125
2
0
30 Sep 2024
Autonomous Exploration and Semantic Updating of Large-Scale Indoor Environments with Mobile Robots
Sai Haneesh Allu
Itay Kadosh
Tyler Summers
Yu Xiang
108
0
0
23 Sep 2024
PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images
Nanqing Liu
Xun Xu
Yongyi Su
Haojie Zhang
Heng-Chao Li
VLM
125
17
0
20 Sep 2024
Towards Generalizable Scene Change Detection
Jaewoo Kim
Uehwan Kim
135
0
0
10 Sep 2024
Target-Oriented Object Grasping via Multimodal Human Guidance
Pengwei Xie
Siang Chen
Dingchang Hu
Yixiang Dai
Kaiqin Yang
Guijin Wang
107
4
0
20 Aug 2024
The Collection of a Human Robot Collaboration Dataset for Cooperative Assembly in Glovebox Environments
Shivansh Sharma
Mathew Huang
Sanat Nair
Alan Wen
Christina Petlowany
Juston Moore
Selma Wanna
Mitch Pryor
158
0
0
19 Jul 2024
VCP-CLIP: A visual context prompting model for zero-shot anomaly segmentation
Zhen Qu
Xian Tao
Mukesh Prasad
Fei Shen
Zhengtao Zhang
Xinyi Gong
Guiguang Ding
VLM
108
16
0
17 Jul 2024
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Huiguo He
Huan Yang
Zixi Tuo
Yuan Zhou
Qiuyue Wang
Yuhang Zhang
Zeyu Liu
Wenhao Huang
Hongyang Chao
Jian Yin
DiffM
VGen
200
17
0
17 Jul 2024
Lite-SAM Is Actually What You Need for Segment Everything
Jianhai Fu
Yuanjie Yu
Ningchuan Li
Yi Zhang
Qichao Chen
Jianping Xiong
Jun Yin
Zhiyu Xiang
VLM
94
4
0
12 Jul 2024
Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge
Sriram Yenamandra
Arun Ramachandran
Mukul Khanna
Karmesh Yadav
Jay Vakil
...
Z. Kira
Dhruv Batra
Roozbeh Mottaghi
Yonatan Bisk
Chris Paxton
LM&Ro
96
7
0
09 Jul 2024
EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model
Yuxuan Zhang
Tianheng Cheng
Lianghui Zhu
Lei Liu
Heng Liu
Longjin Ran
Xiaoxin Chen
Xiaoxin Chen
Wenyu Liu
Xinggang Wang
VLM
196
31
0
28 Jun 2024
Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model
Haobo Yuan
Xiangtai Li
Lu Qi
Tao Zhang
Ming-Hsuan Yang
Shuicheng Yan
Chen Change Loy
VLM
118
10
0
27 Jun 2024
Aligning in a Compact Space: Contrastive Knowledge Distillation between Heterogeneous Architectures
Hongjun Wu
Li Xiao
Xingkuo Zhang
Yining Miao
105
1
0
28 May 2024
Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation
Ya Lu
Jishnu Jaykumar
Yunhui Guo
Nicholas Ruozzi
Yu Xiang
VLM
ISeg
151
5
0
28 May 2024
PTQ4SAM: Post-Training Quantization for Segment Anything
Chengtao Lv
Hong Chen
Jinyang Guo
Yifu Ding
Xianglong Liu
VLM
MQ
85
16
0
06 May 2024
1
2
Next