ResearchTrend.AI
End-to-End Video Instance Segmentation with Transformers

30 November 2020
Yuqing Wang
Zhaoliang Xu
Xinlong Wang
Chunhua Shen
Baoshan Cheng
Hao Shen
Huaxia Xia
    ViT
arXiv: 2011.14503

Papers citing "End-to-End Video Instance Segmentation with Transformers"

50 / 161 papers shown
SAM2MOT: A Novel Paradigm of Multi-Object Tracking by Segmentation
Junjie Jiang
Zelin Wang
Manqi Zhao
Yin Li
Dongsheng Jiang
41
0
0
06 Apr 2025
Video-based Traffic Light Recognition by Rockchip RV1126 for Autonomous Driving
Miao Fan
Xuxu Kong
Shengtong Xu
Haoyi Xiong
Xiangzeng Liu
ViT
46
0
0
31 Mar 2025
Segment Any-Quality Images with Generative Latent Space Enhancement
Guangqian Guo
Yoong Guo
Xuehui Yu
Wenbo Li
Yaoxing Wang
Shan Gao
VLM
77
0
0
16 Mar 2025
Transformer Based Self-Context Aware Prediction for Few-Shot Anomaly Detection in Videos
Gargi V. Pillai
Ashish Verma
Debashis Sen
ViT
34
7
0
02 Mar 2025
Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation
Yunzhi Zhuge
Hongyu Gu
Lu Zhang
Jinqing Qi
Huchuan Lu
VOS
67
2
0
14 Jan 2025
Order-aware Interactive Segmentation
Bin Wang
Anwesa Choudhuri
Meng Zheng
Zhongpai Gao
Benjamin Planche
Andong Deng
Qin Liu
Terrence Chen
Ulas Bagci
Ziyan Wu
VLM
127
1
0
16 Oct 2024
SwinShadow: Shifted Window for Ambiguous Adjacent Shadow Detection
Yonghui Wang
Shaokai Liu
Li Li
Wengang Zhou
Houqiang Li
ViT
46
1
0
07 Aug 2024
ViLLa: Video Reasoning Segmentation with Large Language Model
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
Kun Wang
Yu Qiao
Hengshuang Zhao
VOS
LRM
65
2
0
18 Jul 2024
Neural-based Video Compression on Solar Dynamics Observatory Images
Atefeh Khoshkhahtinat
Ali Zafari
P. Mehta
Nasser M. Nasrabadi
Barbara J. Thompson
M. Kirk
D. D. Silva
46
0
0
12 Jul 2024
Matching Anything by Segmenting Anything
Siyuan Li
Lei Ke
Martin Danelljan
Luigi Piccinelli
Mattia Segu
Luc Van Gool
Fisher Yu
VOS
37
22
0
06 Jun 2024
DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation
Volodymyr Fedynyak
Yaroslav Romanus
Bohdan Hlovatskyi
Bohdan Sydor
Oles Dobosevych
Igor Babin
Roman Riazantsev
VOS
40
3
0
11 May 2024
Two in One Go: Single-stage Emotion Recognition with Decoupled Subject-context Transformer
Xinpeng Li
Teng Wang
Jian Zhao
Shuyi Mao
Jinbao Wang
Feng Zheng
Xiaojiang Peng
Xuelong Li
28
1
0
26 Apr 2024
Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Zixin Zhu
Xuelu Feng
Dongdong Chen
Junsong Yuan
Chunming Qiao
Gang Hua
DiffM
39
7
0
18 Mar 2024
Motion-Corrected Moving Average: Including Post-Hoc Temporal Information for Improved Video Segmentation
R. Mendel
Tobias Rueckert
Dirk Wilhelm
Daniel Rueckert
Christoph Palm
22
0
0
05 Mar 2024
End-to-End Human Instance Matting
Qinglin Liu
Shengping Zhang
Quanling Meng
Bineng Zhong
Peiqiang Liu
H. Yao
3DH
37
5
0
03 Mar 2024
UniVS: Unified and Universal Video Segmentation with Prompts as Queries
Ming-hui Li
Shuai Li
Xindong Zhang
Lei Zhang
VOS
41
16
0
28 Feb 2024
Multi-Object Tracking by Hierarchical Visual Representations
Jinkun Cao
Jiangmiao Pang
Kris M. Kitani
OCL
47
0
0
24 Feb 2024
TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
Kun Wang
Yu Qiao
Hengshuang Zhao
24
2
0
11 Dec 2023
Improved TokenPose with Sparsity
Anning Li
ViT
34
0
0
16 Nov 2023
Improving Robustness for Vision Transformer with a Simple Dynamic Scanning Augmentation
Shashank Kotyan
Danilo Vasconcellos Vargas
ViT
22
2
0
01 Nov 2023
Audio-Visual Instance Segmentation
Ruohao Guo
Yaru Chen
Yanyu Qi
Wenzhen Yue
Dantong Niu
...
Wenzhen Yue
Ji Shi
Qixun Wang
Peiliang Zhang
Buwen Liang
VLM
VOS
28
2
0
28 Oct 2023
TCOVIS: Temporally Consistent Online Video Instance Segmentation
Junlong Li
Ting Yu
Yongming Rao
Jie Zhou
Jiwen Lu
38
12
0
21 Sep 2023
DeNoising-MOT: Towards Multiple Object Tracking with Severe Occlusions
Teng Fu
Xiaocong Wang
Haiyang Yu
Ke Niu
Bin Li
Xiangyang Xue
VOT
ViT
31
6
0
09 Sep 2023
Tracking Anything with Decoupled Video Segmentation
Ho Kei Cheng
Seoung Wug Oh
Brian L. Price
Alexander Schwing
Joon-Young Lee
VOS
VLM
32
121
0
07 Sep 2023
Temporal Collection and Distribution for Referring Video Object Segmentation
Jiajin Tang
Ge Zheng
Sibei Yang
VOS
36
14
0
07 Sep 2023
NOVIS: A Case for End-to-End Near-Online Video Instance Segmentation
Tim Meinhardt
Matt Feiszli
Yuchen Fan
Laura Leal-Taixe
Rakesh Ranjan
ViT
19
5
0
29 Aug 2023
Hierarchical Spatiotemporal Transformers for Video Object Segmentation
Jun-Sang Yoo
H. Lee
Seung‐Won Jung
VOS
26
1
0
17 Jul 2023
Multiscale Memory Comparator Transformer for Few-Shot Video Segmentation
Mennatullah Siam
R. Karim
Henghui Zhao
Richard P. Wildes
VOS
23
2
0
15 Jul 2023
Bidirectional Correlation-Driven Inter-Frame Interaction Transformer for Referring Video Object Segmentation
Meng Lan
Fu Rong
Zuchao Li
Wei Yu
L. Zhang
VOS
29
5
0
02 Jul 2023
DVIS: Decoupled Video Instance Segmentation Framework
Tao Zhang
Xingye Tian
Yuehua Wu
Shunping Ji
Xuebo Wang
Yuan Zhang
Pengfei Wan
23
44
0
06 Jun 2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
Shilin Yan
Renrui Zhang
Ziyu Guo
Wenchao Chen
Wei Zhang
Hongyang Li
Yu Qiao
Hao Dong
Zhongjiang He
Peng Gao
VOS
20
30
0
25 May 2023
Type-to-Track: Retrieve Any Object via Prompt-based Tracking
Pha Nguyen
Kha Gia Quach
Kris M. Kitani
Khoa Luu
39
18
0
22 May 2023
End-to-End Spatio-Temporal Action Localisation with Video Transformers
A. Gritsenko
Xuehan Xiong
Josip Djolonga
Mostafa Dehghani
Chen Sun
Mario Lucic
Cordelia Schmid
Anurag Arnab
ViT
32
13
0
24 Apr 2023
Co-attention Propagation Network for Zero-Shot Video Object Segmentation
Gensheng Pei
Yazhou Yao
Fumin Shen
Daniel Huang
Xing-Rui Huang
Hengtao Shen
VOS
30
11
0
08 Apr 2023
RGBT Tracking via Progressive Fusion Transformer with Dynamically Guided Learning
Yabin Zhu
Chenglong Li
Xiao Wang
Jin Tang
Zhixiang Huang
26
7
0
26 Mar 2023
MDQE: Mining Discriminative Query Embeddings to Segment Occluded Instances on Challenging Videos
Minghan Li
Shuai Li
Wangmeng Xiang
Lei Zhang
31
9
0
25 Mar 2023
Capturing the motion of every joint: 3D human pose and shape estimation with independent tokens
Sen Yang
Wen Heng
Gang Liu
Guozhong Luo
Wankou Yang
Gang Yu
3DH
ViT
18
11
0
01 Mar 2023
Knowledge Augmented Relation Inference for Group Activity Recognition
Xianglong Lang
Zhuming Wang
Zun Li
Meng-Syue Tian
Ge Shi
Lifang Wu
Liang Wang
16
3
0
28 Feb 2023
Offline-to-Online Knowledge Distillation for Video Instance Segmentation
H. Kim
Seunghun Lee
Sunghoon Im
OffRL
36
3
0
15 Feb 2023
TarViS: A Unified Approach for Target-based Video Segmentation
A. Athar
Alexander Hermans
Jonathon Luiten
Deva Ramanan
Bastian Leibe
VOS
23
29
0
06 Jan 2023
A New Perspective to Boost Vision Transformer for Medical Image Classification
Yuexiang Li
Yawen Huang
Nanjun He
Kai Ma
Yefeng Zheng
ViT
MedIm
21
3
0
03 Jan 2023
Edge Enhanced Image Style Transfer via Transformers
Chi Zhang
Jun Yang
Zaiyan Dai
Peng-Xia Cao
11
10
0
02 Jan 2023
Tracking by Associating Clips
Sanghyun Woo
Kwanyong Park
Seoung Wug Oh
In So Kweon
Joon-Young Lee
VOT
27
9
0
20 Dec 2022
MIMO Is All You Need : A Strong Multi-In-Multi-Out Baseline for Video Prediction
Shuliang Ning
Mengcheng Lan
Yanran Li
Chaofeng Chen
Qian Chen
Xunlai Chen
Xiaoguang Han
Shuguang Cui
28
20
0
09 Dec 2022
Video Object of Interest Segmentation
Siyuan Zhou
Chunru Zhan
Biao Wang
T. Ge
Yuning Jiang
Li Niu
VOS
20
0
0
06 Dec 2022
Prototype as Query for Few Shot Semantic Segmentation
Leilei Cao
Yibo Guo
Ye Yuan
Qiangguo Jin
ViT
25
10
0
27 Nov 2022
Unifying Tracking and Image-Video Object Detection
Peirong Liu
Rui Wang
Pengchuan Zhang
Omid Poursaeed
Yipin Zhou
Xuefei Cao
Sreya Dutta Roy
Ashish Shah
Ser-Nam Lim
13
0
0
20 Nov 2022
Multi-Camera Multi-Object Tracking on the Move via Single-Stage Global Association Approach
Pha Nguyen
Kha Gia Quach
C. Duong
S. L. Phung
Ngan Le
Khoa Luu
44
12
0
17 Nov 2022
Robust Online Video Instance Segmentation with Track Queries
Zitong Zhan
Daniel McKee
Svetlana Lazebnik
23
9
0
16 Nov 2022
Grafting Vision Transformers
Jong Sung Park
Kumara Kahatapitiya
Donghyun Kim
Shivchander Sudalairaj
Quanfu Fan
Michael S. Ryoo
ViT
26
2
0
28 Oct 2022