Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2101.08833
Cited By
SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation
21 January 2021
Brendan Duke
Abdalla Ahmed
Christian Wolf
P. Aarabi
Graham W. Taylor
VOS
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation"
40 / 40 papers shown
Title
OpenAVS: Training-Free Open-Vocabulary Audio Visual Segmentation with Foundational Models
Shengkai Chen
Yifang Yin
Jinming Cao
Shili Xiang
Zhenguang Liu
Roger Zimmermann
VOS
VLM
48
0
0
30 Apr 2025
MoSAM: Motion-Guided Segment Anything Model with Spatial-Temporal Memory Selection
Q. Yang
Yuan Yao
Miaomiao Cui
Liefeng Bo
VLM
61
0
0
30 Apr 2025
Learning Spatial-Semantic Features for Robust Video Object Segmentation
Xin Li
Deshui Miao
Zhenyu He
Yue Wang
Huchuan Lu
Ming Yang
VOS
59
4
0
10 Jul 2024
RMem: Restricted Memory Banks Improve Video Object Segmentation
Junbao Zhou
Ziqi Pang
Yu-xiong Wang
VOS
63
7
0
12 Jun 2024
Enhancing Multimodal Unified Representations for Cross Modal Generalization
Hai Huang
Yan Xia
Shengpeng Ji
Shulei Wang
Hanting Wang
Minghui Fang
Jieming Zhu
Zhenhua Dong
Sashuai Zhou
Zhou Zhao
31
6
0
08 Mar 2024
Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention
Quang-Trung Truong
Duc Thanh Nguyen
Binh-Son Hua
Sai-Kit Yeung
VOS
34
1
0
25 Jan 2024
Multimodal Variational Auto-encoder based Audio-Visual Segmentation
Yuxin Mao
Jing Zhang
Mochu Xiang
Yiran Zhong
Yuchao Dai
40
34
0
12 Oct 2023
Cross-modal Cognitive Consensus guided Audio-Visual Segmentation
Zhaofeng Shi
Qingbo Wu
Fanman Meng
Linfeng Xu
Hongliang Li
VOS
33
3
0
10 Oct 2023
Segmenting the motion components of a video: A long-term unsupervised model
E. Meunier
P. Bouthemy
24
0
0
02 Oct 2023
Contrastive Conditional Latent Diffusion for Audio-visual Segmentation
Yuxin Mao
Jing Zhang
Mochu Xiang
Yun-Qiu Lv
Yiran Zhong
Yuchao Dai
DiffM
43
28
0
31 Jul 2023
Hierarchical Spatiotemporal Transformers for Video Object Segmentation
Jun-Sang Yoo
H. Lee
Seung‐Won Jung
VOS
37
1
0
17 Jul 2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
Shilin Yan
Renrui Zhang
Ziyu Guo
Wenchao Chen
Wei Zhang
Hongyang Li
Yu Qiao
Hao Dong
Zhongjiang He
Peng Gao
VOS
22
30
0
25 May 2023
Transavs: End-To-End Audio-Visual Segmentation With Transformer
Yuhang Ling
Yuxi Li
Zhenye Gan
Jiangning Zhang
M. Chi
Yabiao Wang
VOS
ViT
37
1
0
12 May 2023
Co-attention Propagation Network for Zero-Shot Video Object Segmentation
Gensheng Pei
Yazhou Yao
Fumin Shen
Daniel Huang
Xing-Rui Huang
Hengtao Shen
VOS
38
11
0
08 Apr 2023
Online Lane Graph Extraction from Onboard Video
Y. Can
Alexander Liniger
D. Paudel
Luc Van Gool
34
2
0
03 Apr 2023
MOSE: A New Dataset for Video Object Segmentation in Complex Scenes
Henghui Ding
Chang Liu
Shuting He
Xudong Jiang
Philip Torr
S. Bai
VOS
27
132
0
03 Feb 2023
Look Before You Match: Instance Understanding Matters in Video Object Segmentation
Junke Wang
Dongdong Chen
Zuxuan Wu
Chong Luo
Chuanxin Tang
Xiyang Dai
Yucheng Zhao
Yujia Xie
Lu Yuan
Yu-Gang Jiang
VOS
36
39
0
13 Dec 2022
Breaking the "Object" in Video Object Segmentation
P. Tokmakov
Jie Li
Adrien Gaidon
VOS
29
39
0
12 Dec 2022
Video Object of Interest Segmentation
Siyuan Zhou
Chunru Zhan
Biao Wang
T. Ge
Yuning Jiang
Li Niu
VOS
28
0
0
06 Dec 2022
Grafting Vision Transformers
Jong Sung Park
Kumara Kahatapitiya
Donghyun Kim
Shivchander Sudalairaj
Quanfu Fan
Michael S. Ryoo
ViT
29
2
0
28 Oct 2022
Per-Clip Video Object Segmentation
Kwanyong Park
Sanghyun Woo
Seoung Wug Oh
In So Kweon
Joon-Young Lee
VLM
VOS
32
50
0
03 Aug 2022
BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation
Ye Yu
Jialing Yuan
Gaurav Mittal
Fuxin Li
Mei Chen
VOS
45
28
0
01 Aug 2022
Region Aware Video Object Segmentation with Deep Motion Modeling
Bo Miao
Bennamoun
Yongsheng Gao
Ajmal Mian
VOS
29
16
0
21 Jul 2022
Hierarchical Feature Alignment Network for Unsupervised Video Object Segmentation
Gensheng Pei
Fumin Shen
Yazhou Yao
G. Xie
Zhenmin Tang
Jinhui Tang
VOS
28
51
0
18 Jul 2022
Learning Quality-aware Dynamic Memory for Video Object Segmentation
Yong Liu
R. Yu
Fei Yin
Xinyuan Zhao
Wei-Ye Zhao
Weihao Xia
Yujiu Yang
VOS
32
47
0
16 Jul 2022
Audio-Visual Segmentation
Jinxing Zhou
Jianyuan Wang
Jianwei Zhang
Weixuan Sun
Jing Zhang
Stan Birchfield
Dan Guo
Lingpeng Kong
Meng Wang
Yiran Zhong
VOS
33
110
0
11 Jul 2022
Recurrent Dynamic Embedding for Video Object Segmentation
Mingxing Li
Liucheng Hu
Zhiwei Xiong
Bang Zhang
Pan Pan
Dong Liu
VOS
67
61
0
08 May 2022
Temporal Context for Robust Maritime Obstacle Detection
Lojze Žust
Matej Kristan
26
14
0
10 Mar 2022
Hybrid Tracker with Pixel and Instance for Video Panoptic Segmentation
Weicai Ye
Xinyue Lan
Gewei Su
Hujun Bao
Zhaopeng Cui
Guofeng Zhang
VOS
50
3
0
02 Mar 2022
UniFormer: Unifying Convolution and Self-attention for Visual Recognition
Kunchang Li
Yali Wang
Junhao Zhang
Peng Gao
Guanglu Song
Yu Liu
Hongsheng Li
Yu Qiao
ViT
162
360
0
24 Jan 2022
Video Transformers: A Survey
Javier Selva
A. S. Johansen
Sergio Escalera
Kamal Nasrollahi
T. Moeslund
Albert Clapés
ViT
22
103
0
16 Jan 2022
Siamese Network with Interactive Transformer for Video Object Segmentation
Meng Lan
Jing Zhang
Fengxiang He
Lefei Zhang
ViT
21
36
0
28 Dec 2021
Reliable Propagation-Correction Modulation for Video Object Segmentation
Xiaohao Xu
Jinglu Wang
Xiao Li
Yan Lu
VOS
43
61
0
06 Dec 2021
SWAT: Spatial Structure Within and Among Tokens
Kumara Kahatapitiya
Michael S. Ryoo
25
6
0
26 Nov 2021
Pixel-Level Bijective Matching for Video Object Segmentation
Suhwan Cho
Heansung Lee
Minjung Kim
Sungjun Jang
Sangyoun Lee
VOS
51
23
0
04 Oct 2021
Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer
Yifan Xu
Zhijie Zhang
Mengdan Zhang
Kekai Sheng
Ke Li
Weiming Dong
Liqing Zhang
Changsheng Xu
Xing Sun
ViT
32
201
0
03 Aug 2021
A Survey on Deep Learning Technique for Video Segmentation
Tianfei Zhou
Fatih Porikli
David J. Crandall
Luc Van Gool
Wenguan Wang
VOS
34
232
0
02 Jul 2021
Can An Image Classifier Suffice For Action Recognition?
Quanfu Fan
Chun-Fu Chen
Chen
Rameswar Panda
ViT
29
33
0
26 Jun 2021
TransVOS: Video Object Segmentation with Transformers
Jianbiao Mei
Mengmeng Wang
Yen-Yu Lin
Yi Yuan
Yong Liu
ViT
11
28
0
01 Jun 2021
VisQA: X-raying Vision and Language Reasoning in Transformers
Theo Jaunet
Corentin Kervadec
Romain Vuillemot
G. Antipov
M. Baccouche
Christian Wolf
16
26
0
02 Apr 2021
1