ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2408.10627
  4. Cited By
Rethinking Video Segmentation with Masked Video Consistency: Did the
  Model Learn as Intended?

Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended?

20 August 2024
Chen Liang
Qiang Guo
Xiaochao Qu
Luoqi Liu
Ting Liu
    VOS
ArXivPDFHTML

Papers citing "Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended?"

15 / 15 papers shown
Title
DVIS++: Improved Decoupled Framework for Universal Video Segmentation
DVIS++: Improved Decoupled Framework for Universal Video Segmentation
Tao Zhang
Xingye Tian
Yikang Zhou
Shunping Ji
Xuebo Wang
Xin Tao
Yuanhui Zhang
Pengfei Wan
Zhong-ming Wang
Yu Wu
27
20
0
20 Dec 2023
Video-kMaX: A Simple Unified Approach for Online and Near-Online Video
  Panoptic Segmentation
Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation
Inkyu Shin
Dahun Kim
Qihang Yu
Jun Xie
Hong-Seok Kim
Bradley Green
In So Kweon
Kuk-Jin Yoon
Liang-Chieh Chen
VLM
78
18
0
10 Apr 2023
TarViS: A Unified Approach for Target-based Video Segmentation
TarViS: A Unified Approach for Target-based Video Segmentation
A. Athar
Alexander Hermans
Jonathon Luiten
Deva Ramanan
Bastian Leibe
VOS
43
29
0
06 Jan 2023
mc-BEiT: Multi-choice Discretization for Image BERT Pre-training
mc-BEiT: Multi-choice Discretization for Image BERT Pre-training
Xiaotong Li
Yixiao Ge
Kun Yi
Zixuan Hu
Ying Shan
Ling-yu Duan
53
39
0
29 Mar 2022
Masked Feature Prediction for Self-Supervised Visual Pre-Training
Masked Feature Prediction for Self-Supervised Visual Pre-Training
Chen Wei
Haoqi Fan
Saining Xie
Chaoxia Wu
Alan Yuille
Christoph Feichtenhofer
ViT
125
661
0
16 Dec 2021
Masked-attention Mask Transformer for Universal Image Segmentation
Masked-attention Mask Transformer for Universal Image Segmentation
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
178
2,315
0
02 Dec 2021
PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers
PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers
Xiaoyi Dong
Jianmin Bao
Ting Zhang
Dongdong Chen
Weiming Zhang
Lu Yuan
Dong Chen
Fang Wen
Nenghai Yu
Baining Guo
ViT
81
241
0
24 Nov 2021
K-Net: Towards Unified Image Segmentation
K-Net: Towards Unified Image Segmentation
Wenwei Zhang
Jiangmiao Pang
Kai-xiang Chen
Chen Change Loy
ISeg
51
361
0
28 Jun 2021
BEiT: BERT Pre-Training of Image Transformers
BEiT: BERT Pre-Training of Image Transformers
Hangbo Bao
Li Dong
Songhao Piao
Furu Wei
ViT
166
2,785
0
15 Jun 2021
SegFormer: Simple and Efficient Design for Semantic Segmentation with
  Transformers
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers
Enze Xie
Wenhai Wang
Zhiding Yu
Anima Anandkumar
J. Álvarez
Ping Luo
ViT
130
4,934
0
31 May 2021
Occluded Video Instance Segmentation: A Benchmark
Occluded Video Instance Segmentation: A Benchmark
Jiyang Qi
Yan Gao
Yao Hu
Xinggang Wang
Xiaoyu Liu
Xiang Bai
Serge Belongie
Alan Yuille
Philip Torr
S. Bai
VOS
VLM
38
137
0
02 Feb 2021
An Image is Worth 16x16 Words: Transformers for Image Recognition at
  Scale
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
314
40,217
0
22 Oct 2020
Language Models are Few-Shot Learners
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
461
41,106
0
28 May 2020
Video Instance Segmentation
Video Instance Segmentation
Linjie Yang
Yuchen Fan
N. Xu
VOS
ISeg
66
503
0
12 May 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
882
93,936
0
11 Oct 2018
1