ResearchTrend.AI
Faster Segment Anything: Towards Lightweight SAM for Mobile Applications

25 June 2023
Chaoning Zhang
Dongshen Han
Yu Qiao
Jung Uk Kim
Sung-Ho Bae
Seungkyu Lee
Choong Seon Hong
    VLM

Papers citing "Faster Segment Anything: Towards Lightweight SAM for Mobile Applications"

50 / 57 papers shown
Title
AoP-SAM: Automation of Prompts for Efficient Segmentation
Yi Chen
Mu-Young Son
Chuanbo Hua
Joo-Young Kim
VLM
2
0
0
17 May 2025
GA3CE: Unconstrained 3D Gaze Estimation with Gaze-Aware 3D Context Encoding
Yuki Kawana
Shintaro Shiba
Quan Kong
Norimasa Kobori
17
0
0
15 May 2025
Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model
Navin Ranjan
Andreas E. Savakis
MQ
VLM
68
0
0
08 May 2025
CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting
Huawei Sun
Bora Kunter Sahin
Georg Stettinger
Maximilian Bernhard
Matthias Schubert
Robert Wille
49
0
0
06 May 2025
Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation
Feng Xue
Wenzhuang Xu
Guofeng Zhong
Anlong Ming
N. Sebe
65
0
0
01 May 2025
MoSAM: Motion-Guided Segment Anything Model with Spatial-Temporal Memory Selection
Q. Yang
Yuan Yao
Miaomiao Cui
Liefeng Bo
VLM
61
0
0
30 Apr 2025
SAM-Guided Robust Representation Learning for One-Shot 3D Medical Image Segmentation
Jia Wang
Yunan Mei
Jiarui Liu
Xin Fan
44
0
0
29 Apr 2025
ApexNav: An Adaptive Exploration Strategy for Zero-Shot Object Navigation with Target-centric Semantic Fusion
Mingjie Zhang
Yuheng Du
Chengkai Wu
Jinni Zhou
Zhenchao Qi
Jun Ma
Boyu Zhou
34
0
0
20 Apr 2025
Customized SAM 2 for Referring Remote Sensing Image Segmentation
Fu Rong
Meng Lan
Qian Zhang
Lefei Zhang
47
0
0
10 Mar 2025
Bayesian Fields: Task-driven Open-Set Semantic Gaussian Splatting
Dominic Maggio
Luca Carlone
171
0
0
07 Mar 2025
A Token-level Text Image Foundation Model for Document Understanding
Tongkun Guan
Zining Wang
Pei Fu
Zhengtao Guo
Wei-Ming Shen
...
Chen Duan
Hao Sun
Qianyi Jiang
Junfeng Luo
Xiaokang Yang
VLM
45
1
0
04 Mar 2025
ZeroPS: High-quality Cross-modal Knowledge Transfer for Zero-Shot 3D Part Segmentation
Yuheng Xue
Nenglun Chen
Jun Liu
Wenyun Sun
3DPC
69
7
0
24 Feb 2025
Vision Foundation Models in Medical Image Analysis: Advances and Challenges
Pengchen Liang
Bin Pu
Haishan Huang
Yiwei Li
Haoran Wang
Weibo Ma
Qing Chang
VLM
MedIm
106
0
0
24 Feb 2025
Imit Diff: Semantics Guided Diffusion Transformer with Dual Resolution Fusion for Imitation Learning
Yuhang Dong
Haizhou Ge
Yupei Zeng
Jun Zhang
Beiwen Tian
...
Yufei Jia
Ruixiang Wang
Ran Yi
Guyue Zhou
Longhua Ma
56
0
0
11 Feb 2025
Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models
Tongkun Liu
Bing Li
Xiao Jin
Yupeng Shi
Qiuying Li
Xiang Wei
64
0
0
03 Feb 2025
MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
Fu Rong
Meng Lan
Qian Zhang
Lefei Zhang
VOS
VGen
73
1
0
23 Jan 2025
DynamicEarth: How Far are We from Open-Vocabulary Change Detection?
Kaiyu Li
Xiangyong Cao
Yupeng Deng
Chao Pang
Zepeng Xin
Deyu Meng
Zhi Wang
ObjD
69
1
0
22 Jan 2025
Dr. Tongue: Sign-Oriented Multi-label Detection for Remote Tongue Diagnosis
Yiliang Chen
Steven SC Ho
Cheng Xu
Yao Jie Xie
Wing-Fai Yeung
Shengfeng He
Jing Qin
LM&MA
28
0
0
06 Jan 2025
Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning
Yuxiang Lu
Shengcao Cao
Yu-xiong Wang
55
1
0
18 Oct 2024
BlabberSeg: Real-Time Embedded Open-Vocabulary Aerial Segmentation
Haechan Mark Bong
Ricardo de Azambuja
Giovanni Beltrame
VLM
38
0
0
16 Oct 2024
Order-aware Interactive Segmentation
Bin Wang
Anwesa Choudhuri
Meng Zheng
Zhongpai Gao
Benjamin Planche
Andong Deng
Qin Liu
Terrence Chen
Ulas Bagci
Ziyan Wu
VLM
170
1
0
16 Oct 2024
Zero-Shot Pupil Segmentation with SAM 2: A Case Study of Over 14 Million Images
Virmarie Maquiling
Sean Anthony Byrne
D. Niehorster
Marco Carminati
Enkelejda Kasneci
VLM
45
0
0
11 Oct 2024
Playful DoggyBot: Learning Agile and Precise Quadrupedal Locomotion
Xin Duan
Ziwen Zhuang
Hang Zhao
Soeren Schwertfeger
52
2
0
30 Sep 2024
SAMEdge: An Edge-cloud Video Analytics Architecture for the Segment Anything Model
Rui Lu
Siping Shi
Yanting Liu
Dan Wang
VLM
32
2
0
23 Sep 2024
Autonomous Exploration and Semantic Updating of Large-Scale Indoor Environments with Mobile Robots
Sai Haneesh Allu
Itay Kadosh
Tyler Summers
Yu Xiang
24
0
0
23 Sep 2024
PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images
Nanqing Liu
Xun Xu
Yongyi Su
Haojie Zhang
Heng-Chao Li
VLM
43
14
0
20 Sep 2024
Swin-LiteMedSAM: A Lightweight Box-Based Segment Anything Model for Large-Scale Medical Image Datasets
Ruochen Gao
Donghang Lyu
Marius Staring
VLM
MedIm
39
3
0
11 Sep 2024
Towards Generalizable Scene Change Detection
Jaewoo Kim
Uehwan Kim
50
0
0
10 Sep 2024
FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation
Xi Chen
Haosen Yang
Sheng Jin
Xiatian Zhu
H. Yao
VLM
29
3
0
05 Sep 2024
The Collection of a Human Robot Collaboration Dataset for Cooperative Assembly in Glovebox Environments
Shivansh Sharma
Mathew Huang
Sanat Nair
Alan Wen
Christina Petlowany
Juston Moore
Selma Wanna
Mitch Pryor
41
0
0
19 Jul 2024
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Huiguo He
Huan Yang
Zixi Tuo
Yuan Zhou
Qiuyue Wang
Yuhang Zhang
Zeyu Liu
Wenhao Huang
Hongyang Chao
Jian Yin
DiffM
VGen
62
12
0
17 Jul 2024
Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge
Sriram Yenamandra
Arun Ramachandran
Mukul Khanna
Karmesh Yadav
Jay Vakil
...
Z. Kira
Dhruv Batra
Roozbeh Mottaghi
Yonatan Bisk
Chris Paxton
LM&Ro
62
6
0
09 Jul 2024
EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model
Yuxuan Zhang
Tianheng Cheng
Lianghui Zhu
Lei Liu
Heng Liu
Longjin Ran
Xiaoxin Chen
Wenyu Liu
Xinggang Wang
VLM
61
25
0
28 Jun 2024
XAMI -- A Benchmark Dataset for Artefact Detection in XMM-Newton Optical Images
Elisabeta-Iulia Dima
Pablo Gómez
Sandor Kruk
Peter Kretschmar
Simon Rosen
Călin-Adrian Popa
31
0
0
25 Jun 2024
Balancing Performance and Efficiency in Zero-shot Robotic Navigation
Dmytro Kuzmenko
N. Shvai
LM&Ro
34
0
0
05 Jun 2024
Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation
Ya Lu
Jishnu Jaykumar
Yunhui Guo
Nicholas Ruozzi
Yu Xiang
VLM
ISeg
58
4
0
28 May 2024
Innovative Integration of Visual Foundation Model with a Robotic Arm on a Mobile Platform
Shimian Zhang
Qiuhong Lu
34
1
0
29 Apr 2024
How to build the best medical image segmentation algorithm using foundation models: a comprehensive empirical study with Segment Anything Model
Han Gu
Haoyu Dong
Jichen Yang
Maciej Mazurowski
MedIm
VLM
80
14
0
15 Apr 2024
VisionGPT: Vision-Language Understanding Agent Using Generalized Multimodal Framework
Chris Kelly
Luhui Hu
Bang Yang
Yu Tian
Deshun Yang
Cindy Yang
Zaoshan Huang
Zihao Li
Jiayin Hu
Yuexian Zou
37
9
0
14 Mar 2024
SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution
Chengcheng Wang
Zhiwei Hao
Yehui Tang
Jianyuan Guo
Yujie Yang
Kai Han
Yunhe Wang
40
6
0
27 Feb 2024
Subobject-level Image Tokenization
Delong Chen
Samuel Cahyawijaya
Jianfeng Liu
Baoyuan Wang
Pascale Fung
VLM
OCL
54
7
0
22 Feb 2024
SAGD: Boundary-Enhanced Segment Anything in 3D Gaussian via Gaussian Decomposition
Xu Hu
Yuxi Wang
Lue Fan
Junsong Fan
Junran Peng
Zhen Lei
Qing Li
Zhaoxiang Zhang
3DGS
42
8
0
31 Jan 2024
TriSAM: Tri-Plane SAM for zero-shot cortical blood vessel segmentation in VEM images
Jia Wan
Wanhua Li
Jason Ken Adhinarta
Atmadeep Banerjee
Evelina Sjostedt
Jingpeng Wu
J. Lichtman
Hanspeter Pfister
D. Wei
34
6
0
25 Jan 2024
TinySAM: Pushing the Envelope for Efficient Segment Anything Model
Han Shu
Wenshuo Li
Yehui Tang
Yiman Zhang
Yihao Chen
Houqiang Li
Yunhe Wang
Xinghao Chen
VLM
44
19
0
21 Dec 2023
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Yunyang Xiong
Bala Varadarajan
Lemeng Wu
Xiaoyu Xiang
Fanyi Xiao
...
Dilin Wang
Fei Sun
Forrest N. Iandola
Raghuraman Krishnamoorthi
Vikas Chandra
VLM
42
140
0
01 Dec 2023
MOSAIC: Multi-Object Segmented Arbitrary Stylization Using CLIP
Prajwal Ganugula
Y. Kumar
N. Reddy
Prabhath Chellingi
A. Thakur
Neeraj Kasera
C. S. Anand
CLIP
DiffM
11
3
0
24 Sep 2023
Tracking Anything with Decoupled Video Segmentation
Ho Kei Cheng
Seoung Wug Oh
Brian L. Price
Alexander Schwing
Joon-Young Lee
VOS
VLM
43
121
0
07 Sep 2023
Large Language Models and Foundation Models in Smart Agriculture: Basics, Opportunities, and Challenges
Jiajia Li
Mingle Xu
Lirong Xiang
Dong Chen
Weichao Zhuang
Xunyuan Yin
Zhao Li
39
3
0
13 Aug 2023
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Muhammad Awais
Muzammal Naseer
Salman Khan
Rao Muhammad Anwer
Hisham Cholakkal
M. Shah
Ming Yang
Fahad Shahbaz Khan
VLM
38
118
0
25 Jul 2023
Boosting Federated Learning Convergence with Prototype Regularization
Yu Qiao
Huy Q. Le
Choong Seon Hong
FedML
24
6
0
20 Jul 2023