Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.11968
Cited By
Track Anything: Segment Anything Meets Videos
24 April 2023
Jinyu Yang
Mingqi Gao
Zhe Li
Shanghua Gao
Fang Wang
Fengcai Zheng
VOS
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Track Anything: Segment Anything Meets Videos"
50 / 160 papers shown
Title
Segment Anything for Videos: A Systematic Survey
Chunhui Zhang
Yawen Cui
Weilin Lin
Guanjie Huang
Yan Rong
Li Liu
Shiguang Shan
VLM
52
6
0
31 Jul 2024
Theia: Distilling Diverse Vision Foundation Models for Robot Learning
Jinghuan Shang
Karl Schmeckpeper
Brandon B. May
M. Minniti
Tarik Kelestemur
David Watkins
Laura Herlant
VLM
41
23
0
29 Jul 2024
ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding
Zhen Chen
Zongmin Zhang
Wenwu Guo
Xingjian Luo
Long Bai
Jinlin Wu
Hongliang Ren
Hongbin Liu
43
5
0
28 Jul 2024
SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation
Pengfei Chen
Lingxi Xie
Xinyue Huo
Xuehui Yu
Xiaopeng Zhang
Yingfei Sun
Zhenjun Han
Qi Tian
VLM
70
1
0
23 Jul 2024
WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding
Quan Kong
Yuki Kawana
Rajat Saini
Ashutosh Kumar
Jingjing Pan
...
Yohei Ozao
Balázs Opra
D. Anastasiu
Yoichi Sato
Norimasa Kobori
VGen
44
8
0
22 Jul 2024
Shape of Motion: 4D Reconstruction from a Single Video
Qianqian Wang
Vickie Ye
Hang Gao
Jake Austin
Zhengqi Li
Angjoo Kanazawa
VGen
65
65
0
18 Jul 2024
Weakly-supervised Autism Severity Assessment in Long Videos
Abid Ali
Mahmoud Ali
J. Odobez
Camilla Barbini
Séverine Dubuisson
Francois Bremond
Susanne Thümmler
25
0
0
12 Jul 2024
OSN: Infinite Representations of Dynamic 3D Scenes from Monocular Videos
Ziyang Song
Jinxi Li
Bo Yang
37
0
0
08 Jul 2024
Dynamic Gaussian Marbles for Novel View Synthesis of Casual Monocular Videos
Colton Stearns
Adam W. Harley
Mikaela Uy
Florian Dubost
Federico Tombari
Gordon Wetzstein
Leonidas J. Guibas
3DGS
50
30
0
26 Jun 2024
Diffusion Model-Based Video Editing: A Survey
Wenhao Sun
Rong-Cheng Tu
Jingyi Liao
Dacheng Tao
VGen
71
22
0
26 Jun 2024
ALPS: An Auto-Labeling and Pre-training Scheme for Remote Sensing Segmentation With Segment Anything Model
Song Zhang
Qingzhong Wang
Junyi Liu
Haoyi Xiong
48
1
0
16 Jun 2024
RobustSAM: Segment Anything Robustly on Degraded Images
Wei-Ting Chen
Yu-Jiet Vong
Sy-Yen Kuo
Sizhuo Ma
Jian Wang
VLM
53
10
0
13 Jun 2024
Training-Free Robust Interactive Video Object Segmentation
Xiaoli Wei
Zhaoqing Wang
Yandong Guo
Chunxia Zhang
Tongliang Liu
Mingming Gong
VLM
VOS
49
1
0
08 Jun 2024
Matching Anything by Segmenting Anything
Siyuan Li
Lei Ke
Martin Danelljan
Luigi Piccinelli
Mattia Segu
Luc Van Gool
Fisher Yu
VOS
45
22
0
06 Jun 2024
World Models for General Surgical Grasping
Hongbin Lin
Bin Li
Chun Wai Wong
Juan Rojas
Xiangyu Chu
K. W. S. Au
38
3
0
28 May 2024
RealityEffects: Augmenting 3D Volumetric Videos with Object-Centric Annotation and Dynamic Visual Effects
Jian Liao
Kevin Van
Zhijie Xia
Ryo Suzuki
VGen
37
2
0
28 May 2024
Segmentation of Maya hieroglyphs through fine-tuned foundation models
Fnu Shivam
Megan Leight
Mary Kate Kelly
Claire Davis
Kelsey Clodfelter
Jacob Thrasher
Yenumula Reddy
P. Gyawali
50
0
0
26 May 2024
CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments
Yang Zhou
Long Quang
Carlos Nieto-Granda
Giuseppe Loianno
21
2
0
23 May 2024
PIR: Remote Sensing Image-Text Retrieval with Prior Instruction Representation Learning
Jiancheng Pan
Muyuan Ma
Qing Ma
Cong Bai
Shengyong Chen
32
8
0
16 May 2024
PTQ4SAM: Post-Training Quantization for Segment Anything
Chengtao Lv
Hong Chen
Jinyang Guo
Yifu Ding
Xianglong Liu
VLM
MQ
36
14
0
06 May 2024
UnSAMFlow: Unsupervised Optical Flow Guided by Segment Anything Model
Shuai Yuan
Lei Luo
Zhuo Hui
Can Pu
Xiaoyu Xiang
Rakesh Ranjan
D. Demandolx
46
4
0
04 May 2024
MoPEFT: A Mixture-of-PEFTs for the Segment Anything Model
Rajat Sahay
Andreas E. Savakis
MoE
49
0
0
01 May 2024
What Foundation Models can Bring for Robot Learning in Manipulation : A Survey
Dingzhe Li
Yixiang Jin
A. Yong
Hongze Yu
Jun Shi
Xiaoshuai Hao
Peng Hao
Huaping Liu
Gang Hua
Bin Fang
AI4CE
LM&Ro
76
13
0
28 Apr 2024
Beyond Pixel-Wise Supervision for Medical Image Segmentation: From Traditional Models to Foundation Models
Yuyan Shi
Jialu Ma
Jin Yang
Shasha Wang
Yichi Zhang
MedIm
VLM
26
2
0
20 Apr 2024
REACTO: Reconstructing Articulated Objects from a Single Video
Chaoyue Song
Jiacheng Wei
Chuan-Sheng Foo
Guosheng Lin
Fayao Liu
37
14
0
17 Apr 2024
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL
Fangwei Zhong
Kui Wu
Hai Ci
Churan Wang
Hao Chen
OffRL
39
2
0
15 Apr 2024
Practical Region-level Attack against Segment Anything Models
Yifan Shen
Zhengyuan Li
Gang Wang
VLM
53
9
0
12 Apr 2024
GoodSAM: Bridging Domain and Capacity Gaps via Segment Anything Model for Distortion-aware Panoramic Semantic Segmentation
Weiming Zhang
Yexin Liu
Xueye Zheng
Lin Wang
54
11
0
25 Mar 2024
LocalStyleFool: Regional Video Style Transfer Attack Using Segment Anything Model
Yuxin Cao
Jinghao Li
Xi Xiao
Derui Wang
Minhui Xue
Hao Ge
Wei Liu
Guangwu Hu
AAML
44
1
0
18 Mar 2024
PosSAM: Panoptic Open-vocabulary Segment Anything
VS Vibashan
Shubhankar Borse
Hyojin Park
Debasmit Das
Vishal M. Patel
Munawar Hayat
Fatih Porikli
VLM
MLLM
48
6
0
14 Mar 2024
SAM-Lightening: A Lightweight Segment Anything Model with Dilated Flash Attention to Achieve 30 times Acceleration
Yanfei Song
Bangzheng Pu
Peng Wang
Hongxu Jiang
Dong Dong
Yongxiang Cao
Yiqing Shen
VLM
48
12
0
14 Mar 2024
R
2
\text{R}^2
R
2
-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations
Xiang Li
Kai Qiu
Jinglu Wang
Xiaohao Xu
Rita Singh
Kashu Yamazaki
Hao Chen
Xiaonan Huang
Bhiksha Raj
VOS
45
2
0
07 Mar 2024
SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising
Tao Zhou
Wenhan Luo
Qi Ye
Zhiguo Shi
Jiming Chen
VLM
55
3
0
07 Mar 2024
A Simple-but-effective Baseline for Training-free Class-Agnostic Counting
Yuhao Lin
Hai-Ming Xu
Lingqiao Liu
Javen Qinfeng Shi
31
1
0
03 Mar 2024
Spurious Feature Eraser: Stabilizing Test-Time Adaptation for Vision-Language Foundation Model
Huan Ma
Yan Zhu
Changqing Zhang
Peilin Zhao
Baoyuan Wu
Long-Kai Huang
Qinghua Hu
Bing Wu
VLM
69
2
0
01 Mar 2024
VRP-SAM: SAM with Visual Reference Prompt
Yanpeng Sun
Jiahui Chen
Shan Zhang
Xinyu Zhang
Qiang Chen
Gang Zhang
Errui Ding
Jingdong Wang
Zechao Li
54
32
0
27 Feb 2024
BLO-SAM: Bi-level Optimization Based Overfitting-Preventing Finetuning of SAM
Li Zhang
Youwei Liang
Ruiyi Zhang
Amirhosein Javadi
Pengtao Xie
VLM
26
8
0
26 Feb 2024
Real-World Robot Applications of Foundation Models: A Review
Kento Kawaharazuka
T. Matsushima
Andrew Gambardella
Jiaxian Guo
Chris Paxton
Andy Zeng
OffRL
VLM
LM&Ro
51
47
0
08 Feb 2024
EfficientViT-SAM: Accelerated Segment Anything Model Without Accuracy Loss
Zhuoyang Zhang
Han Cai
Song Han
VLM
29
3
0
07 Feb 2024
A Survey on Robotics with Foundation Models: toward Embodied AI
Zhiyuan Xu
Kun Wu
Junjie Wen
Jinming Li
Ning Liu
Zhengping Che
Jian Tang
AI4CE
LRM
LM&Ro
33
24
0
04 Feb 2024
Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
Maoyuan Ye
Jing Zhang
Juhua Liu
Chenyu Liu
Baocai Yin
Cong Liu
Bo Du
Dacheng Tao
VLM
37
11
0
31 Jan 2024
TriSAM: Tri-Plane SAM for zero-shot cortical blood vessel segmentation in VEM images
Jia Wan
Wanhua Li
Jason Ken Adhinarta
Atmadeep Banerjee
Evelina Sjostedt
Jingpeng Wu
J. Lichtman
Hanspeter Pfister
D. Wei
34
6
0
25 Jan 2024
Boosting Few-Shot Semantic Segmentation Via Segment Anything Model
Chen-Bin Feng
Qi Lai
Kangdao Liu
Houcheng Su
Chi-Man Vong
21
3
0
18 Jan 2024
TACO: Benchmarking Generalizable Bimanual Tool-ACtion-Object Understanding
Yun-Hai Liu
Haolin Yang
Xu Si
Ling Liu
Zipeng Li
Yuxiang Zhang
Yebin Liu
Li Yi
68
23
0
16 Jan 2024
UV-SAM: Adapting Segment Anything Model for Urban Village Identification
Xin Zhang
Yu Liu
Yuming Lin
Qingmin Liao
Yong Li
VLM
32
36
0
16 Jan 2024
Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities
Xu Yan
Haiming Zhang
Yingjie Cai
Jingming Guo
Weichao Qiu
...
Lihui Jiang
Wei Zhang
Hongbo Zhang
Dengxin Dai
Bingbing Liu
63
18
0
16 Jan 2024
SamLP: A Customized Segment Anything Model for License Plate Detection
Haoxuan Ding
Junyuan Gao
Yuan. Yuan
Qi. Wang
MLLM
VLM
36
7
0
12 Jan 2024
LangSplat: 3D Language Gaussian Splatting
Minghan Qin
Wanhua Li
Jiawei Zhou
Haoqian Wang
Hanspeter Pfister
VLM
3DGS
30
184
0
26 Dec 2023
V-STRONG: Visual Self-Supervised Traversability Learning for Off-road Navigation
Sanghun Jung
JoonHo Lee
Xiangyun Meng
Byron Boots
Alexander Lambert
50
28
0
26 Dec 2023
Semantic-aware SAM for Point-Prompted Instance Segmentation
Zhaoyang Wei
Pengfei Chen
Xuehui Yu
Guorong Li
Jianbin Jiao
Zhenjun Han
VLM
40
6
0
26 Dec 2023
Previous
1
2
3
4
Next