Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.11968
Cited By
Track Anything: Segment Anything Meets Videos
24 April 2023
Jinyu Yang
Mingqi Gao
Zhe Li
Shanghua Gao
Fang Wang
Fengcai Zheng
VOS
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Track Anything: Segment Anything Meets Videos"
50 / 160 papers shown
Title
Pro2SAM: Mask Prompt to SAM with Grid Points for Weakly Supervised Object Localization
Xi Yang
Songsong Duan
Nannan Wang
Xinbo Gao
WSOL
78
0
0
08 May 2025
Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model
Navin Ranjan
Andreas E. Savakis
MQ
VLM
70
0
0
08 May 2025
A Survey on 3D Reconstruction Techniques in Plant Phenotyping: From Classical Methods to Neural Radiance Fields (NeRF), 3D Gaussian Splatting (3DGS), and Beyond
Jiajian Li
Xinda Qi
Seyed Hamidreza Nabaei
M. Liu
Dong Chen
Xin Zhang
Xunyuan Yin
Zhen Li
56
0
0
30 Apr 2025
Vivid4D: Improving 4D Reconstruction from Monocular Video by Video Inpainting
Jiaxin Huang
Sheng Miao
BangBnag Yang
Yuewen Ma
Yiyi Liao
VGen
MDE
36
0
0
15 Apr 2025
Robust SAM: On the Adversarial Robustness of Vision Foundation Models
Jiahuan Long
Zhengqin Xu
Tingsong Jiang
Wen Yao
Shuai Jia
Chao Ma
Xiaoqian Chen
AAML
VLM
39
1
0
11 Apr 2025
Parameter-Free Fine-tuning via Redundancy Elimination for Vision Foundation Models
Jiahuan Long
Tingsong Jiang
Wen Yao
Yizhe Xiong
Zhengqin Xu
Shuai Jia
Chao Ma
29
0
0
11 Apr 2025
HiMoR: Monocular Deformable Gaussian Reconstruction with Hierarchical Motion Representation
Yiming Liang
Tianhan Xu
Yuta Kikuchi
48
0
0
08 Apr 2025
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting
Yunlong Tang
Jing Bi
Chao Huang
Susan Liang
Daiki Shimada
...
Jinxi He
Liu He
Zeliang Zhang
Jiebo Luo
Chenliang Xu
49
0
0
07 Apr 2025
S^4M: Boosting Semi-Supervised Instance Segmentation with SAM
Heeji Yoon
Heeseong Shin
Eunbeen Hong
Hyunwook Choi
Hansang Cho
Daun Jeong
Seungryong Kim
33
0
0
07 Apr 2025
Hybrid Global-Local Representation with Augmented Spatial Guidance for Zero-Shot Referring Image Segmentation
Ting Liu
Siyuan Li
51
0
0
01 Apr 2025
Multimodal Data Integration for Sustainable Indoor Gardening: Tracking Anyplant with Time Series Foundation Model
Seyed Hamidreza Nabaei
Zeyang Zheng
Dong Chen
Arsalan Heydarian
41
0
0
27 Mar 2025
BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting
Yiren Lu
Yunlai Zhou
Disheng Liu
Tuo Liang
Yu Yin
3DGS
65
1
0
20 Mar 2025
E-Values Expand the Scope of Conformal Prediction
Etienne Gauthier
Francis Bach
Michael I. Jordan
52
2
0
17 Mar 2025
E-SAM: Training-Free Segment Every Entity Model
Weiming Zhang
Dingwen Xiao
Lei Chen
Lin Wang
VLM
62
0
0
15 Mar 2025
ROS-SAM: High-Quality Interactive Segmentation for Remote Sensing Moving Object
Zhe Shan
Yang Liu
Lei Zhou
C. Yan
Haoyu Wang
Xia Xie
54
1
0
15 Mar 2025
CRTrack: Low-Light Semi-Supervised Multi-object Tracking Based on Consistency Regularization
Zijing Zhao
Jianlong Yu
Lin Zhang
Shunli Zhang
45
0
0
24 Feb 2025
UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI
Fangwei Zhong
Kui Wu
Churan Wang
Hao Chen
Hai Ci
Zhoujun Li
Yizhou Wang
VGen
44
0
0
31 Dec 2024
First-frame Supervised Video Polyp Segmentation via Propagative and Semantic Dual-teacher Network
Qiang Hu
Mei Liu
Qiang Li
Zhiwei Wang
90
0
0
21 Dec 2024
Expanded Comprehensive Robotic Cholecystectomy Dataset (CRCD)
K. Oh
Leonardo Borgioli
Alberto Mangano
Valentina Valle
Marco Di Pangrazio
...
Luciano Ambrosini
Alvaro Ducas
Milos Zefran
Liaohai Chen
P. Giulianotti
80
1
0
16 Dec 2024
RoDyGS: Robust Dynamic Gaussian Splatting for Casual Videos
Yoonwoo Jeong
Junmyeong Lee
Hoseung Choi
Minsu Cho
3DGS
94
0
0
04 Dec 2024
RELOCATE: A Simple Training-Free Baseline for Visual Query Localization Using Region-Based Representations
Savya Khosla
S. Vallecorsa
Alex Schwing
Derek Hoiem
69
0
0
02 Dec 2024
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models
Rundi Wu
Ruiqi Gao
Ben Poole
Alex Trevithick
Changxi Zheng
Jonathan T. Barron
Aleksander Holyñski
VGen
93
24
0
27 Nov 2024
There is no SAMantics! Exploring SAM as a Backbone for Visual Understanding Tasks
Miguel Espinosa
Chenhongyi Yang
Linus Ericsson
Jingyu Sun
Elliot J. Crowley
VLM
80
0
0
22 Nov 2024
Segment Anything in Light Fields for Real-Time Applications via Constrained Prompting
Nikolai Goncharov
Donald G. Dansereau
VLM
75
1
0
21 Nov 2024
Towards Unbiased and Robust Spatio-Temporal Scene Graph Generation and Anticipation
Rohith Peddi
Saurabh
Ayush Abhay Shrivastava
Parag Singla
Vibhav Gogate
87
0
0
20 Nov 2024
QuadWBG: Generalizable Quadrupedal Whole-Body Grasping
Jilong Wang
Javokhirbek Rajabov
Chaoyi Xu
Yiming Zheng
He Wang
49
1
0
11 Nov 2024
SpectroMotion: Dynamic 3D Reconstruction of Specular Scenes
Cheng-De Fan
Chen-Wei Chang
Yi-Ruei Liu
Jie-Ying Lee
Jiun-Long Huang
Yu-Chee Tseng
Yu-Lun Liu
3DGS
70
4
0
22 Oct 2024
BYOCL: Build Your Own Consistent Latent with Hierarchical Representative Latent Clustering
Jiayue Dai
Yunya Wang
Yihan Fang
Yuetong Chen
Butian Xiong
VLM
29
0
0
19 Oct 2024
SAMReg: SAM-enabled Image Registration with ROI-based Correspondence
Shiqi Huang
Tingfa Xu
Z. Shen
Shaheer U. Saeed
Wen Yan
D. Barratt
Yipeng Hu
31
1
0
17 Oct 2024
SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators
Rasoul Shafipour
David Harrison
Maxwell Horton
Jeffrey Marker
Houman Bedayat
Sachin Mehta
Mohammad Rastegari
Mahyar Najibi
Saman Naderiparizi
MQ
62
0
0
14 Oct 2024
Zero-Shot Pupil Segmentation with SAM 2: A Case Study of Over 14 Million Images
Virmarie Maquiling
Sean Anthony Byrne
D. Niehorster
Marco Carminati
Enkelejda Kasneci
VLM
50
0
0
11 Oct 2024
VideoSAM: Open-World Video Segmentation
Pinxue Guo
Zixu Zhao
Jianxiong Gao
Chongruo Wu
Tong He
Zheng Zhang
Tianjun Xiao
Wenqiang Zhang
VOS
36
0
0
11 Oct 2024
OmniPose6D: Towards Short-Term Object Pose Tracking in Dynamic Scenes from Monocular RGB
Yunzhi Lin
Yipu Zhao
Fu-Jen Chu
Xingyu Chen
Weiyao Wang
Hao Tang
Patricio A. Vela
Matt Feiszli
Kevin J. Liang
29
0
0
09 Oct 2024
On Efficient Variants of Segment Anything Model: A Survey
Xiaorui Sun
Jing Liu
H. Shen
Xiaofeng Zhu
Ping Hu
VLM
56
4
0
07 Oct 2024
DarkSAM: Fooling Segment Anything Model to Segment Nothing
Ziqi Zhou
Yufei Song
Minghui Li
Shengshan Hu
Xianlong Wang
Leo Yu Zhang
Dezhong Yao
Hai Jin
44
11
0
26 Sep 2024
Walker: Self-supervised Multiple Object Tracking by Walking on Temporal Appearance Graphs
Mattia Segu
Luigi Piccinelli
Siyuan Li
Luc Van Gool
Fisher Yu
Bernt Schiele
VOT
51
2
0
25 Sep 2024
Underwater Camouflaged Object Tracking Meets Vision-Language SAM2
Chunhui Zhang
Li Liu
Guanjie Huang
Hao-Kai Wen
Xi Zhou
Xi Zhou
Shiming Ge
Yanfeng Wang
33
1
0
25 Sep 2024
Foundation Models for Amodal Video Instance Segmentation in Automated Driving
Jasmin Breitenstein
Franz Jünger
Andreas Bär
Tim Fingscheidt
45
1
0
21 Sep 2024
Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution
Zuyan Liu
Yuhao Dong
Ziwei Liu
Winston Hu
Jiwen Lu
Yongming Rao
ObjD
88
55
0
19 Sep 2024
Tracking Virtual Meetings in the Wild: Re-identification in Multi-Participant Virtual Meetings
Oriel Perl
Ido Leshem
Uria Franko
Yuval Goldman
28
0
0
15 Sep 2024
Towards Generalizable Scene Change Detection
Jaewoo Kim
Uehwan Kim
50
0
0
10 Sep 2024
Semantically Controllable Augmentations for Generalizable Robot Learning
Zoey Chen
Zhao Mandi
Homanga Bharadhwaj
Mohit Sharma
Shuran Song
Abhishek Gupta
Vikash Kumar
LM&Ro
39
5
0
02 Sep 2024
SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners
Ziyu Guo
Renrui Zhang
Xiangyang Zhu
Chengzhuo Tong
Peng Gao
Chunyuan Li
Pheng-Ann Heng
VGen
3DPC
46
13
0
29 Aug 2024
GoodSAM++: Bridging Domain and Capacity Gaps via Segment Anything Model for Panoramic Semantic Segmentation
Weiming Zhang
Yexin Liu
Xu Zheng
Lin Wang
VLM
52
6
0
17 Aug 2024
Surgical SAM 2: Real-time Segment Anything in Surgical Video by Efficient Frame Pruning
Haofeng Liu
Erli Zhang
Junde Wu
Mingxuan Hong
Yueming Jin
MedIm
53
14
0
15 Aug 2024
A Training-Free Framework for Video License Plate Tracking and Recognition with Only One-Shot
Haoxuan Ding
Qi. Wang
Junyu Gao
Qiang Li
VLM
37
0
0
11 Aug 2024
Embodied Uncertainty-Aware Object Segmentation
Xiaolin Fang
Leslie Pack Kaelbling
Tomás Lozano-Pérez
27
5
0
08 Aug 2024
Fast Sprite Decomposition from Animated Graphics
Tomoyuki Suzuki
Kotaro Kikuchi
Kota Yamaguchi
44
1
0
07 Aug 2024
SAM 2: Segment Anything in Images and Videos
Nikhila Ravi
Valentin Gabeur
Yuan-Ting Hu
Ronghang Hu
Chaitanya K. Ryali
...
Nicolas Carion
Chao-Yuan Wu
Ross B. Girshick
Piotr Dollár
Christoph Feichtenhofer
VLM
MLLM
55
744
0
01 Aug 2024
Strike the Balance: On-the-Fly Uncertainty based User Interactions for Long-Term Video Object Segmentation
Stéphane Vujasinović
Kamil Dreczkowski
Sebastian Bullinger
Norbert Scherer-Negenborn
Michael Arens
VOS
31
1
0
31 Jul 2024
1
2
3
4
Next