ResearchTrend.AI
YouTube-VOS: Sequence-to-Sequence Video Object Segmentation


3 September 2018
N. Xu
L. Yang
Yuchen Fan
Jianchao Yang
Dingcheng Yue
Yuchen Liang
Brian L. Price
Scott D. Cohen
Thomas Huang
    VOS

Papers citing "YouTube-VOS: Sequence-to-Sequence Video Object Segmentation"

50 / 238 papers shown
LVOS: A Benchmark for Large-scale Long-term Video Object Segmentation
Lingyi Hong
Zhongying Liu
Wenchao Chen
Chenzhi Tan
Yuang Feng
...
Jinglun Li
Zhaoyu Chen
Shuyong Gao
Wei Zhang
Wenqiang Zhang
VLM, VOS
78
14
0
30 Apr 2024
Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised Video Object Segmentation
Gensheng Pei
Yazhou Yao
Jianbo Jiao
Wenguan Wang
Liqiang Nie
Jinhui Tang
VOS
98
1
0
21 Apr 2024
Multilateral Temporal-view Pyramid Transformer for Video Inpainting Detection
Ying Zhang
Yuezun Li
Bo Peng
Jiaran Zhou
Huiyu Zhou
Junyu Dong
76
0
0
17 Apr 2024
Efficient Video Object Segmentation via Modulated Cross-Attention Memory
Abdelrahman M. Shaker
Syed Talal Wasim
Martin Danelljan
Salman Khan
Ming-Hsuan Yang
Fahad Shahbaz Khan
VOS
65
4
0
26 Mar 2024
Exploring Dynamic Transformer for Efficient Object Tracking
Jiawen Zhu
Xin Chen
Haiwen Diao
Shuai Li
Jun-Yan He
Chenyang Li
Bin Luo
Dong Wang
Huchuan Lu
146
3
0
26 Mar 2024
CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation
Wenqi Zhu
Jiale Cao
Jin Xie
Shuangming Yang
Yanwei Pang
VLM, CLIP
115
3
0
19 Mar 2024
Object Segmentation-Assisted Inter Prediction for Versatile Video Coding
Zhuoyuan Li
Zikun Yuan
Li Li
Dong Liu
Xiaohu Tang
Feng Wu
VOS
77
11
0
18 Mar 2024
IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation
Yizhi Song
Zhifei Zhang
Zhe Lin
Scott D. Cohen
Brian L. Price
Jianming Zhang
Soo Ye Kim
He Zhang
Wei Xiong
Daniel G. Aliaga
DiffM
102
41
0
15 Mar 2024
OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning
Lingyi Hong
Shilin Yan
Renrui Zhang
Wanyun Li
Xinyu Zhou
...
Kaixun Jiang
Yiting Chen
Jinglun Li
Zhaoyu Chen
Wenqiang Zhang
VLM
82
51
0
14 Mar 2024
VideoMAC: Video Masked Autoencoders Meet ConvNets
Gensheng Pei
Tao Chen
XiRuo Jiang
Huafeng Liu
Zeren Sun
Yazhou Yao
VGen
108
10
0
29 Feb 2024
UniVS: Unified and Universal Video Segmentation with Prompts as Queries
Ming-hui Li
Shuai Li
Xindong Zhang
Lei Zhang
VOS
105
18
0
28 Feb 2024
Reimagining Reality: A Comprehensive Survey of Video Inpainting Techniques
Shreyank N. Gowda
Yash Thakre
Shashank Narayana Gowda
Xiaobo Jin
77
0
0
31 Jan 2024
Towards Language-Driven Video Inpainting via Multimodal Large Language Models
Jianzong Wu
Xiangtai Li
Chenyang Si
Shangchen Zhou
Jingkang Yang
...
Yining Li
Kai Chen
Yunhai Tong
Ziwei Liu
Chen Change Loy
VGen, DiffM, MLLM
121
17
0
18 Jan 2024
Deep Learning-based Image and Video Inpainting: A Survey
Weize Quan
Jiaxi Chen
Yanli Liu
Dong-Ming Yan
Peter Wonka
3DV
78
39
0
07 Jan 2024
VASE: Object-Centric Appearance and Shape Manipulation of Real Videos
E. Peruzzo
Vidit Goel
Dejia Xu
Xingqian Xu
Yi Ding
Zhangyang Wang
Humphrey Shi
N. Sebe
LM&Ro, VGen, DiffM
126
12
0
04 Jan 2024
Hierarchical Graph Pattern Understanding for Zero-Shot VOS
Gensheng Pei
Fumin Shen
Yazhou Yao
Tao Chen
Xian-Sheng Hua
Jikang Cheng
VOS
71
3
0
15 Dec 2023
TAM-VT: Transformation-Aware Multi-scale Video Transformer for Segmentation and Tracking
Raghav Goyal
Wan-Cyuan Fan
Mennatullah Siam
Leonid Sigal
VOS
82
3
0
13 Dec 2023
Semi-supervised Active Learning for Video Action Detection
Aayush Singh
A. J. Rana
Akash Kumar
Shruti Vyas
Yogesh S Rawat
102
9
0
12 Dec 2023
SimulFlow: Simultaneously Extracting Feature and Identifying Target for Unsupervised Video Object Segmentation
Lingyi Hong
Wei Zhang
Shuyong Gao
Hong Lu
Wenqiang Zhang
VOS
86
9
0
30 Nov 2023
Flow-Guided Diffusion for Video Inpainting
Bohai Gu
Yongsheng Yu
Hengrui Fan
Libo Zhang
VGen, DiffM
102
12
0
26 Nov 2023
Sketch-based Video Object Segmentation: Benchmark and Analysis
Ruolin Yang
Da Li
Conghui Hu
Timothy M. Hospedales
Honggang Zhang
Yi-Zhe Song
VOS
75
1
0
13 Nov 2023
Exploiting Inductive Biases in Video Modeling through Neural CDEs
Johnathan Chiu
Samuel Duffield
Max Hunter Gordon
Kaelan Donatella
Maxwell Aifer
Andi Gu
82
1
0
08 Nov 2023
Learning the What and How of Annotation in Video Object Segmentation
Thanos Delatolas
Vicky S. Kalogeiton
Dim P. Papadopoulos
VOS
54
13
0
08 Nov 2023
SpVOS: Efficient Video Object Segmentation with Triple Sparse Convolution
Weihao Lin
Tao Chen
Chong Yu
VOS
80
3
0
23 Oct 2023
FocDepthFormer: Transformer with LSTM for Depth Estimation from Focus
Xueyang Kang
Fengze Han
A. Fayjie
Dong Gong
MDE, ViT
87
1
0
17 Oct 2023
Zero-Shot Open-Vocabulary Tracking with Large Pre-Trained Models
Wen-Hsuan Chu
Adam W. Harley
P. Tokmakov
Achal Dave
Leonidas Guibas
Katerina Fragkiadaki
VLM
123
7
0
10 Oct 2023
From Text to Mask: Localizing Entities Using the Attention of Text-to-Image Diffusion Models
Changming Xiao
Qi Yang
Feng Zhou
Changshui Zhang
84
17
0
08 Sep 2023
ProPainter: Improving Propagation and Transformer for Video Inpainting
Shangchen Zhou
Chongyi Li
Kelvin C. K. Chan
Chen Change Loy
ViT
122
105
0
07 Sep 2023
Online Overexposed Pixels Hallucination in Videos with Adaptive Reference Frame Selection
Yazhou Xing
Amrita Mazumdar
Anjul Patney
Chao Liu
Hongxu Yin
Qifeng Chen
Jan Kautz
I. Frosio
80
1
0
29 Aug 2023
UMMAFormer: A Universal Multimodal-adaptive Transformer Framework for Temporal Forgery Localization
Rui Zhang
Hongxia Wang
Ming-han Du
Hanqing Liu
Yangqiaoyu Zhou
Q. Zeng
93
24
0
28 Aug 2023
RefEgo: Referring Expression Comprehension Dataset from First-Person Perception of Ego4D
Shuhei Kurita
Naoki Katsura
Eri Onami
EgoV
89
14
0
23 Aug 2023
Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos
Rui Qian
Shuangrui Ding
Xian Liu
Dahua Lin
101
16
0
19 Aug 2023
ResQ: Residual Quantization for Video Perception
Davide Abati
H. Yahia
Markus Nagel
A. Habibian
MQ
40
2
0
18 Aug 2023
Isomer: Isomerous Transformer for Zero-shot Video Object Segmentation
Yichen Yuan
Yifan Wang
Lijun Wang
Xiaoqi Zhao
Huchuan Lu
Yu Wang
Wei Su
Lei Zhang
VOS
86
11
0
13 Aug 2023
Learning Referring Video Object Segmentation from Weak Annotation
Wangbo Zhao
Ke Nan
Songyang Zhang
Kai-xiang Chen
Dahua Lin
Yang You
VOS
68
2
0
04 Aug 2023
OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation
Dongming Wu
Tiancai Wang
Yuang Zhang
Xiangyu Zhang
Jianbing Shen
VOS
84
40
0
18 Jul 2023
Deficiency-Aware Masked Transformer for Video Inpainting
Yongsheng Yu
Hengrui Fan
Libo Zhang
VGen
63
9
0
17 Jul 2023
Hierarchical Spatiotemporal Transformers for Video Object Segmentation
Jun-Sang Yoo
H. Lee
Seung‐Won Jung
VOS
69
1
0
17 Jul 2023
Test-Time Training on Video Streams
Renhao Wang
Yu Sun
Yossi Gandelsman
Xinlei Chen
Alexei A. Efros
Xiaolong Wang
TTA, ViT, 3DGS
157
21
0
11 Jul 2023
Bidirectional Correlation-Driven Inter-Frame Interaction Transformer for Referring Video Object Segmentation
Meng Lan
Fu Rong
Zuchao Li
Wei Yu
Lefei Zhang
VOS
117
7
0
02 Jul 2023
Dense Video Object Captioning from Disjoint Supervision
Xingyi Zhou
Anurag Arnab
Chen Sun
Cordelia Schmid
105
3
0
20 Jun 2023
Meta-Personalizing Vision-Language Models to Find Named Instances in Video
Chun-Hsiao Yeh
Bryan C. Russell
Josef Sivic
Fabian Caba Heilbron
Simon Jenni
VLM, MLLM
101
11
0
16 Jun 2023
Tracking through Containers and Occluders in the Wild
Basile Van Hoorick
P. Tokmakov
Simon Stent
Jie Li
Carl Vondrick
100
14
0
04 May 2023
Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping
Long Lian
Zhirong Wu
Stella X. Yu
VOS
54
12
0
17 Apr 2023
Boosting Video Object Segmentation via Space-time Correspondence Learning
Yurong Zhang
Liulei Li
Wenguan Wang
Rong Xie
Li Song
Wenjun Zhang
VOS
65
33
0
13 Apr 2023
Co-attention Propagation Network for Zero-Shot Video Object Segmentation
Gensheng Pei
Yazhou Yao
Fumin Shen
Daniel Huang
Xing-Rui Huang
Hengtao Shen
VOS
91
12
0
08 Apr 2023
Exemplar-based Video Colorization with Long-term Spatiotemporal Dependency
Si-Yu Chen
Xueming Li
Xianlin Zhang
Mingdao Wang
Yu Zhang
Jiatong Han
Yue Zhang
75
10
0
27 Mar 2023
Reliability-Hierarchical Memory Network for Scribble-Supervised Video Object Segmentation
Zikun Zhou
Kaige Mao
Wenjie Pei
Hongpeng Wang
Yaowei Wang
Zhenyu He
VOS
64
1
0
25 Mar 2023
ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D Data
Hao-Xin Zhao
Junsong Chen
Lijun Wang
Huchuan Lu
77
10
0
24 Mar 2023
Visual Prompt Multi-Modal Tracking
Jiawen Zhu
Simiao Lai
Xin Chen
D. Wang
Huchuan Lu
VLM
91
168
0
20 Mar 2023