2201.05047
TransVOD: End-to-End Video Object Detection with Spatial-Temporal Transformers
13 January 2022
Qianyu Zhou
Hefei Ling
Lu He
Li Niu
Guangliang Cheng
Yunhai Tong
Lizhuang Ma
Liqing Zhang
ViT
Papers citing
"TransVOD: End-to-End Video Object Detection with Spatial-Temporal Transformers"
28 / 28 papers shown
Title
Video-based Traffic Light Recognition by Rockchip RV1126 for Autonomous Driving
Miao Fan
Xuxu Kong
Shengtong Xu
Haoyi Xiong
Xiangzeng Liu
ViT
49
0
0
31 Mar 2025
Infrared Small Target Detection in Satellite Videos: A New Dataset and A Novel Recurrent Feature Refinement Framework
Xinyi Ying
Li Liu
Zaipin Lin
Yangsi Shi
Y. Wang
Ruojing Li
Xu Cao
Boyang Li
Shilin Zhou
Wei An
135
1
0
21 Feb 2025
GloTSFormer: Global Video Text Spotting Transformer
Hang Wang
Yanjie Wang
Yang Li
Can Huang
37
0
0
08 Jan 2024
Diverse Target and Contribution Scheduling for Domain Generalization
Shaocong Long
Qianyu Zhou
Soham Dan
Lizhuang Ma
Yuan Luo
62
8
0
28 Sep 2023
SSVOD: Semi-Supervised Video Object Detection with Sparse Annotations
Tanvir Mahmud
Chun-Hao Liu
Burhaneddin Yaman
Diana Marculescu
15
4
0
04 Sep 2023
NOVIS: A Case for End-to-End Near-Online Video Instance Segmentation
Tim Meinhardt
Matt Feiszli
Yuchen Fan
Laura Leal-Taixe
Rakesh Ranjan
ViT
19
5
0
29 Aug 2023
Object Detection Difficulty: Suppressing Over-aggregation for Faster and Better Video Object Detection
Bin Zhang
Sen Wang
Yifan Liu
Brano Kusy
Xue Li
Jiajun Liu
ObjD
42
0
0
22 Aug 2023
Sketch-based Video Object Localization
Sangmin Woo
So-Yeong Jeon
Jinyoung Park
Minji Son
Sumin Lee
Changick Kim
19
0
0
02 Apr 2023
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation
Yue Han
Jiangning Zhang
Zhucun Xue
Chao Xu
Xintian Shen
Yabiao Wang
Chengjie Wang
Yong Liu
Xiangtai Li
37
17
0
03 Jan 2023
PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation
Xiangtai Li
Shilin Xu
Yibo Yang
Haobo Yuan
Guangliang Cheng
Yu Tong
Zhouchen Lin
Ming-Hsuan Yang
Dacheng Tao
ViT
42
21
0
03 Jan 2023
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
Jianzong Wu
Xiangtai Li
Henghui Ding
Xia Li
Guangliang Cheng
Yu Tong
Chen Change Loy
VLM
85
31
0
02 Jan 2023
Convolution-enhanced Evolving Attention Networks
Yujing Wang
Yaming Yang
Zhuowan Li
Jiangang Bai
Mingliang Zhang
Xiangtai Li
Jiahao Yu
Ce Zhang
Gao Huang
Yu Tong
ViT
27
6
0
16 Dec 2022
BoxMask: Revisiting Bounding Box Supervision for Video Object Detection
K. Hashmi
A. Pagani
D. Stricker
Muhammad Zeshan Afzal
VOS
45
10
0
12 Oct 2022
Spatio-Temporal Learnable Proposals for End-to-End Video Object Detection
K. Hashmi
D. Stricker
Muhammad Zeshan Afzal
21
7
0
05 Oct 2022
INT: Towards Infinite-frames 3D Detection with An Efficient Framework
Jianyun Xu
Zhenwei Miao
Da Zhang
Hongyu Pan
Kai Liu
Peihan Hao
Jun Zhu
Zhengyang Sun
Hongming Li
Xin Zhan
41
17
0
30 Sep 2022
PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection
Han Wang
Jun Tang
Xiaodong Liu
Shanyan Guan
Rong Xie
Li-Na Song
ViT
36
25
0
06 Sep 2022
TogetherNet: Bridging Image Restoration and Object Detection Together via Dynamic Enhancement Learning
Yongzhen Wang
Xu Yan
Kaiwen Zhang
Lina Gong
H. Xie
F. Wang
Mingqiang Wei
18
29
0
03 Sep 2022
Tracking Objects as Pixel-wise Distributions
Zelin Zhao
Ze Wu
Yueqing Zhuang
Boxun Li
Jiaya Jia
VOT
31
54
0
12 Jul 2022
Fashionformer: A simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition
Shilin Xu
Xiangtai Li
Jingbo Wang
Guangliang Cheng
Yunhai Tong
Dacheng Tao
ViT
23
27
0
10 Apr 2022
PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation
Haobo Yuan
Xiangtai Li
Yibo Yang
Guangliang Cheng
Jing Zhang
Yunhai Tong
Lefei Zhang
Dacheng Tao
MDE
44
42
0
05 Dec 2021
TF-Blender: Temporal Feature Blender for Video Object Detection
Yiming Cui
Liqi Yan
Zhiwen Cao
Dongfang Liu
ViT
53
100
0
12 Aug 2021
Self-Adversarial Disentangling for Specific Domain Adaptation
Qianyu Zhou
Qiqi Gu
Jiangmiao Pang
Xuequan Lu
Lizhuang Ma
69
49
0
08 Aug 2021
End-to-End Video Object Detection with Spatial-Temporal Transformers
Lu He
Qianyu Zhou
Hefei Ling
Li Niu
Guangliang Cheng
Xiao Li
Wenxuan Liu
Yu Tong
Lizhuang Ma
Liqing Zhang
ViT
59
96
0
23 May 2021
TrackFormer: Multi-Object Tracking with Transformers
Tim Meinhardt
A. Kirillov
Laura Leal-Taixe
Christoph Feichtenhofer
VOT
232
743
0
07 Jan 2021
TransTrack: Multiple Object Tracking with Transformer
Pei Sun
Jinkun Cao
Yi-Xin Jiang
Rufeng Zhang
Enze Xie
Zehuan Yuan
Changhu Wang
Ping Luo
ViT
VOT
264
566
0
31 Dec 2020
Memory Enhanced Global-Local Aggregation for Video Object Detection
Yihong Chen
Yue Cao
Han Hu
Liwei Wang
112
261
0
26 Mar 2020
Relation Distillation Networks for Video Object Detection
Jiajun Deng
Yingwei Pan
Ting Yao
Wen-gang Zhou
Houqiang Li
Tao Mei
ObjD
108
191
0
26 Aug 2019
Object Detection in 20 Years: A Survey
Zhengxia Zou
Keyan Chen
Zhenwei Shi
Yuhong Guo
Jieping Ye
VLM
ObjD
AI4TS
32
2,285
0
13 May 2019