ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.12781
  4. Cited By
SAM2 for Image and Video Segmentation: A Comprehensive Survey

SAM2 for Image and Video Segmentation: A Comprehensive Survey

17 March 2025
Zhang Jiaxing
Tang Hao
    VLM
ArXiv (abs)PDFHTML

Papers citing "SAM2 for Image and Video Segmentation: A Comprehensive Survey"

50 / 126 papers shown
Title
SAMAug: Point Prompt Augmentation for Segment Anything Model
SAMAug: Point Prompt Augmentation for Segment Anything Model
Haixing Dai
Chong Ma
Zhiling Yan
Zheng Liu
Enze Shi
...
Wen Liu
Quanzheng Li
Lichao Sun
Shu Zhang Tianming Liu
Xiang Li
VLM
116
43
0
03 Jul 2023
One-Prompt to Segment All Medical Images
One-Prompt to Segment All Medical Images
Junde Wu
Jiayuan Zhu
Yueming Jin
Min Xu
VLMMedIm
92
32
0
17 May 2023
A Comprehensive Survey on Segment Anything Model for Vision and Beyond
A Comprehensive Survey on Segment Anything Model for Vision and Beyond
Chunhui Zhang
Li Liu
Yawen Cui
Guanjie Huang
Weilin Lin
Yiqian Yang
Yuehong Hu
VLM
89
100
0
14 May 2023
A Survey on Segment Anything Model (SAM): Vision Foundation Model Meets
  Prompt Engineering
A Survey on Segment Anything Model (SAM): Vision Foundation Model Meets Prompt Engineering
Chaoning Zhang
Fachrina Dewi Puspitasari
Sheng Zheng
Chenghao Li
Yu Qiao
...
Caiyan Qin
François Rameau
Lik-Hang Lee
Sung-Ho Bae
Choong Seon Hong
VLM
156
66
0
12 May 2023
Towards Segment Anything Model (SAM) for Medical Image Segmentation: A
  Survey
Towards Segment Anything Model (SAM) for Medical Image Segmentation: A Survey
Yichi Zhang
Rushi Jiao
MedImVLM
88
27
0
05 May 2023
Personalize Segment Anything Model with One Shot
Personalize Segment Anything Model with One Shot
Renrui Zhang
Zhengkai Jiang
Ziyu Guo
Shilin Yan
Junting Pan
Xianzheng Ma
Hao Dong
Peng Gao
Hongsheng Li
MLLMVLM
105
218
0
04 May 2023
UniverSeg: Universal Medical Image Segmentation
UniverSeg: Universal Medical Image Segmentation
V. Butoi
Jose Javier Gonzalez Ortiz
Tianyu Ma
M. Sabuncu
John Guttag
Adrian Dalca
84
86
0
12 Apr 2023
SegGPT: Segmenting Everything In Context
SegGPT: Segmenting Everything In Context
Xinlong Wang
Xiaosong Zhang
Yue Cao
Wen Wang
Chunhua Shen
Tiejun Huang
VOSMLLMVLM
103
207
0
06 Apr 2023
Segment Anything
Segment Anything
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
...
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLMVLM
397
7,421
0
05 Apr 2023
Focused and Collaborative Feedback Integration for Interactive Image
  Segmentation
Focused and Collaborative Feedback Integration for Interactive Image Segmentation
Qiaoqiao Wei
Hui Zhang
Jun-hai Yong
56
22
0
21 Mar 2023
Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey
Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey
Tianlin Li
Guangyao Chen
Guangwu Qian
Pengcheng Gao
Xiaoyong Wei
Yaowei Wang
Yonghong Tian
Wen Gao
AI4CEVLM
139
213
0
20 Feb 2023
A Neuromorphic Dataset for Object Segmentation in Indoor Cluttered
  Environment
A Neuromorphic Dataset for Object Segmentation in Indoor Cluttered Environment
Xiaoqian Huang
Sanket Kachole
Abdulla Ayyad
F. B. Naeini
Dimitrios Makris
Yahya Zweiri
3DV3DPC
55
10
0
13 Feb 2023
MOSE: A New Dataset for Video Object Segmentation in Complex Scenes
MOSE: A New Dataset for Video Object Segmentation in Complex Scenes
Henghui Ding
Chang Liu
Shuting He
Xudong Jiang
Philip Torr
S. Bai
VOS
108
145
0
03 Feb 2023
Breaking the "Object" in Video Object Segmentation
Breaking the "Object" in Video Object Segmentation
P. Tokmakov
Jie Li
Adrien Gaidon
VOS
74
40
0
12 Dec 2022
EBHI-Seg: A Novel Enteroscope Biopsy Histopathological Haematoxylin and
  Eosin Image Dataset for Image Segmentation Tasks
EBHI-Seg: A Novel Enteroscope Biopsy Histopathological Haematoxylin and Eosin Image Dataset for Image Segmentation Tasks
Li Shi
Xirong Li
W. Hua
Hao Chen
Jing Chen
...
Hongzan Sun
M. Grzegorzek
Shouliang Qi
Yueyang Teng
Chen Li
75
21
0
01 Dec 2022
LVOS: A Benchmark for Long-term Video Object Segmentation
LVOS: A Benchmark for Long-term Video Object Segmentation
Li Hong
Wen-Chao Chen
Zhongying Liu
Wei Zhang
Pinxue Guo
Zhaoyu Chen
Wenqiang Zhang
VOS
113
50
0
18 Nov 2022
MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic
  Model
MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic Model
Junde Wu
Rao Fu
Huihui Fang
Yu Zhang
Yehui Yang
Haoyi Xiong
Huiying Liu
Yanwu Xu
MedImVLMDiffM
211
253
0
01 Nov 2022
Decoupling Features in Hierarchical Propagation for Video Object
  Segmentation
Decoupling Features in Hierarchical Propagation for Video Object Segmentation
Zongxin Yang
Yi Yang
VOS
101
157
0
18 Oct 2022
EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations
EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations
Ahmad Darkhalil
Dandan Shan
Bin Zhu
Jian Ma
Amlan Kar
Richard E. L. Higgins
Sanja Fidler
David Fouhey
Dima Damen
VOS
118
104
0
26 Sep 2022
BURST: A Benchmark for Unifying Object Recognition, Segmentation and
  Tracking in Video
BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video
A. Athar
Jonathon Luiten
P. Voigtlaender
Tarasha Khurana
Achal Dave
Bastian Leibe
Deva Ramanan
VOSVLM
99
60
0
25 Sep 2022
SOCRATES: A Stereo Camera Trap for Monitoring of Biodiversity
SOCRATES: A Stereo Camera Trap for Monitoring of Biodiversity
T. Haucke
H. Kühl
Volker Steinhage
88
11
0
19 Sep 2022
TotalSegmentator: robust segmentation of 104 anatomical structures in CT
  images
TotalSegmentator: robust segmentation of 104 anatomical structures in CT images
Jakob Wasserthal
H. Breit
Manfred T. Meyer
M. Pradella
Daniel Hinck
...
Daniel Boll
Joshy Cyriac
Shan Yang
M. Bach
Martin Segeroth
OOD
71
759
0
11 Aug 2022
Fine-Grained Egocentric Hand-Object Segmentation: Dataset, Model, and
  Applications
Fine-Grained Egocentric Hand-Object Segmentation: Dataset, Model, and Applications
Lingzhi Zhang
Shenghao Zhou
Simon Stent
Jianbo Shi
EgoV
91
63
0
07 Aug 2022
XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin
  Memory Model
XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
Ho Kei Cheng
Alex Schwing
VLMVOS
107
410
0
14 Jul 2022
Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in
  Robotic Surgery
Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery
Yuehao Wang
Yonghao Long
Siu Hin Fan
Qingxu Dou
MedIm
89
127
0
30 Jun 2022
CIRDataset: A large-scale Dataset for Clinically-Interpretable lung
  nodule Radiomics and malignancy prediction
CIRDataset: A large-scale Dataset for Clinically-Interpretable lung nodule Radiomics and malignancy prediction
Wookjin Choi
N. Dahiya
Saad Nadeem
13
11
0
29 Jun 2022
AMOS: A Large-Scale Abdominal Multi-Organ Benchmark for Versatile
  Medical Image Segmentation
AMOS: A Large-Scale Abdominal Multi-Organ Benchmark for Versatile Medical Image Segmentation
Yuanfeng Ji
Haotian Bai
Jie Yang
Chongjian Ge
Ye Zhu
...
Zhuguo Li
Lingyan Zhang
Wanling Ma
Xiang Wan
Ping Luo
OOD
85
320
0
16 Jun 2022
ISLES 2022: A multi-center magnetic resonance imaging stroke lesion
  segmentation dataset
ISLES 2022: A multi-center magnetic resonance imaging stroke lesion segmentation dataset
M. H. Petzsche
Ezequiel de la Rosa
U. Hanning
Roland Wiest
Waldo Enrique Valenzuela Pinilla
...
T. Boeckh-Behrens
M. Berndt
B. Ikenberg
Benedikt Wiestler
Jan S. Kirschke
OOD
63
122
0
14 Jun 2022
Recurrent Dynamic Embedding for Video Object Segmentation
Recurrent Dynamic Embedding for Video Object Segmentation
Mingxing Li
Liucheng Hu
Zhiwei Xiong
Bang Zhang
Pan Pan
Dong Liu
VOS
145
64
0
08 May 2022
Instance Segmentation for Autonomous Log Grasping in Forestry Operations
Instance Segmentation for Autonomous Log Grasping in Forestry Operations
Jean-Michel Fortin
Olivier Gamache
Vincent Grondin
F. Pomerleau
Philippe Giguère
122
24
0
03 Mar 2022
FUSeg: The Foot Ulcer Segmentation Challenge
FUSeg: The Foot Ulcer Segmentation Challenge
Chuanbo Wang
Amirreza Mahbod
Isabella Ellinger
Adrian Galdran
Sandeep Gopalakrishnan
J. Niezgoda
Zeyun Yu
106
39
0
02 Jan 2022
3D Instance Segmentation of MVS Buildings
3D Instance Segmentation of MVS Buildings
Jiazhou Chen
Yanghui Xu
Shufang Lu
Ronghua Liang
Liangliang Nan
ISeg3DV
58
24
0
18 Dec 2021
Florence: A New Foundation Model for Computer Vision
Florence: A New Foundation Model for Computer Vision
Lu Yuan
Dongdong Chen
Yi-Ling Chen
Noel Codella
Xiyang Dai
...
Zhen Xiao
Jianwei Yang
Michael Zeng
Luowei Zhou
Pengchuan Zhang
VLM
147
908
0
22 Nov 2021
Masked Autoencoders Are Scalable Vision Learners
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViTTPM
485
7,837
0
11 Nov 2021
iShape: A First Step Towards Irregular Shape Instance Segmentation
iShape: A First Step Towards Irregular Shape Instance Segmentation
Lei Yang
Yan Wei
Yisheng He
Wei Sun
Zhenhang Huang
Haibin Huang
Haoqiang Fan
65
14
0
30 Sep 2021
A Survey on Deep Learning Technique for Video Segmentation
A Survey on Deep Learning Technique for Video Segmentation
Tianfei Zhou
Fatih Porikli
David J. Crandall
Luc Van Gool
Wenguan Wang
VOS
118
240
0
02 Jul 2021
Rethinking Space-Time Networks with Improved Memory Coverage for
  Efficient Video Object Segmentation
Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation
Ho Kei Cheng
Yu-Wing Tai
Chi-Keung Tang
VOS
93
285
0
09 Jun 2021
A multi-centre polyp detection and segmentation dataset for
  generalisability assessment
A multi-centre polyp detection and segmentation dataset for generalisability assessment
Sharib Ali
Debesh Jha
N. Ghatwary
S. Realdon
R. Cannizzaro
...
Andreas Petlund
Pål Halvorsen
J. Rittscher
Thomas de Lange
J. East
84
88
0
08 Jun 2021
ZeroWaste Dataset: Towards Deformable Object Segmentation in Cluttered
  Scenes
ZeroWaste Dataset: Towards Deformable Object Segmentation in Cluttered Scenes
D. Bashkirova
M. Abdelfattah
Ziliang Zhu
James Akl
Fadi M. Alladkani
Ping Hu
Vitaly Ablavsky
B. Çalli
Sarah Adel Bargal
Kate Saenko
82
53
0
04 Jun 2021
Unidentified Video Objects: A Benchmark for Dense, Open-World
  Segmentation
Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation
Weiyao Wang
Matt Feiszli
Heng Wang
Du Tran
VOS
78
127
0
10 Apr 2021
COVID-19 Infection Localization and Severity Grading from Chest X-ray
  Images
COVID-19 Infection Localization and Severity Grading from Chest X-ray Images
Anas Tahir
M. Chowdhury
Amith Khandakar
Tawsifur Rahman
Yazan Qiblawey
...
Somaya Al-Madeed
K. Hameed
Tahir Hamid
S. Mahmud
Maymouna Ezeddin
96
147
0
14 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIPVLM
1.0K
29,926
0
26 Feb 2021
TransUNet: Transformers Make Strong Encoders for Medical Image
  Segmentation
TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation
Jieneng Chen
Yongyi Lu
Qihang Yu
Xiangde Luo
Ehsan Adeli
Yan Wang
Le Lu
Alan Yuille
Yuyin Zhou
ViTMedIm
100
3,506
0
08 Feb 2021
Occluded Video Instance Segmentation: A Benchmark
Occluded Video Instance Segmentation: A Benchmark
Jiyang Qi
Yan Gao
Yao Hu
Xinggang Wang
Xiaoyu Liu
Xiang Bai
Serge Belongie
Alan Yuille
Philip Torr
S. Bai
VOSVLM
94
140
0
02 Feb 2021
A Survey on Visual Transformer
A Survey on Visual Transformer
Kai Han
Yunhe Wang
Hanting Chen
Xinghao Chen
Jianyuan Guo
...
Chunjing Xu
Yixing Xu
Zhaohui Yang
Yiman Zhang
Dacheng Tao
ViT
215
2,251
0
23 Dec 2020
Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene
  Understanding
Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding
Mike Roberts
Jason Ramapuram
Anurag Ranjan
Atulit Kumar
Miguel Angel Bautista
Nathan Paczan
Russ Webb
Joshua M. Susskind
171
393
0
04 Nov 2020
AbdomenCT-1K: Is Abdominal Organ Segmentation A Solved Problem?
AbdomenCT-1K: Is Abdominal Organ Segmentation A Solved Problem?
Jun Ma
Yao Zhang
Song Gu
Cheng Zhu
Cheng Ge
...
Shangqing Liu
Yunpeng Wang
Yuhui Li
Jian He
Xiaoping Yang
SSegOOD
119
346
0
28 Oct 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at
  Scale
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
684
41,563
0
22 Oct 2020
TrashCan: A Semantically-Segmented Dataset towards Visual Detection of
  Marine Debris
TrashCan: A Semantically-Segmented Dataset towards Visual Detection of Marine Debris
Jungseok Hong
Michael Fulton
Junaed Sattar
64
89
0
16 Jul 2020
A Survey on Instance Segmentation: State of the art
A Survey on Instance Segmentation: State of the art
A. M. Hafiz
G. M. Bhat
SSegISeg
84
436
0
28 Jun 2020
Previous
123
Next