Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.02574
Cited By
Spatio-temporal Prompting Network for Robust Video Feature Extraction
4 February 2024
Guanxiong Sun
Chi Wang
Zhaoyu Zhang
Jiankang Deng
Stefanos Zafeiriou
Yang Hua
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Spatio-temporal Prompting Network for Robust Video Feature Extraction"
38 / 38 papers shown
Title
Expanding Language-Image Pretrained Models for General Video Recognition
Bolin Ni
Houwen Peng
Minghao Chen
Songyang Zhang
Gaofeng Meng
Jianlong Fu
Shiming Xiang
Haibin Ling
VLM
CLIP
ViT
106
325
0
04 Aug 2022
MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training
De-An Huang
Zhiding Yu
Anima Anandkumar
VLM
93
81
0
03 Aug 2022
In Defense of Online Models for Video Instance Segmentation
Junfeng Wu
Qihao Liu
Yi Jiang
S. Bai
Alan Yuille
Xiang Bai
73
110
0
21 Jul 2022
ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning
Junting Pan
Ziyi Lin
Xiatian Zhu
Jing Shao
Hongsheng Li
90
204
0
27 Jun 2022
VITA: Video Instance Segmentation via Object Token Association
Miran Heo
Sukjun Hwang
Seoung Wug Oh
Joon-Young Lee
Seon Joo Kim
VOS
68
92
0
09 Jun 2022
Visual Prompt Tuning
Menglin Jia
Luming Tang
Bor-Chun Chen
Claire Cardie
Serge Belongie
Bharath Hariharan
Ser-Nam Lim
VLM
VPVLM
153
1,627
0
23 Mar 2022
MixFormer: End-to-End Tracking with Iterative Mixed Attention
Yutao Cui
Jiang Cheng
Limin Wang
Gangshan Wu
VOT
114
473
0
21 Mar 2022
Prompting Visual-Language Models for Efficient Video Understanding
Chen Ju
Tengda Han
Kunhao Zheng
Ya Zhang
Weidi Xie
VPVLM
VLM
83
379
0
08 Dec 2021
Masked-attention Mask Transformer for Universal Image Segmentation
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
248
2,374
0
02 Dec 2021
MLP-Mixer: An all-MLP Architecture for Vision
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
...
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
418
2,674
0
04 May 2021
The Power of Scale for Parameter-Efficient Prompt Tuning
Brian Lester
Rami Al-Rfou
Noah Constant
VPVLM
573
4,047
0
18 Apr 2021
Learning Spatio-Temporal Transformer for Visual Tracking
Bin Yan
Houwen Peng
Jianlong Fu
Dong Wang
Huchuan Lu
ViT
70
724
0
31 Mar 2021
CvT: Introducing Convolutions to Vision Transformers
Haiping Wu
Bin Xiao
Noel Codella
Mengchen Liu
Xiyang Dai
Lu Yuan
Lei Zhang
ViT
152
1,910
0
29 Mar 2021
Transformer Tracking
Xin Chen
Bin Yan
Jiawen Zhu
Dong Wang
Xiaoyun Yang
Huchuan Lu
ViT
69
957
0
29 Mar 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
415
4,953
0
24 Feb 2021
End-to-End Video Instance Segmentation with Transformers
Yuqing Wang
Zhaoliang Xu
Xinlong Wang
Chunhua Shen
Baoshan Cheng
Hao Shen
Huaxia Xia
ViT
79
691
0
30 Nov 2020
Computing Systems for Autonomous Driving: State-of-the-Art and Challenges
Liangkai Liu
Sidi Lu
Ren Zhong
Baofu Wu
Yongtao Yao
Qingyan Zhang
Weisong Shi
78
276
0
30 Sep 2020
Large-Scale Adversarial Training for Vision-and-Language Representation Learning
Zhe Gan
Yen-Chun Chen
Linjie Li
Chen Zhu
Yu Cheng
Jingjing Liu
ObjD
VLM
70
498
0
11 Jun 2020
End-to-End Object Detection with Transformers
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT
3DV
PINN
415
13,048
0
26 May 2020
Probabilistic Regression for Visual Tracking
Martin Danelljan
Luc Van Gool
Radu Timofte
BDL
82
520
0
27 Mar 2020
Memory Enhanced Global-Local Aggregation for Video Object Detection
Yihong Chen
Yue Cao
Han Hu
Liwei Wang
167
262
0
26 Mar 2020
Siam R-CNN: Visual Tracking by Re-Detection
P. Voigtlaender
Jonathon Luiten
Philip Torr
Bastian Leibe
106
514
0
28 Nov 2019
Relation Distillation Networks for Video Object Detection
Jiajun Deng
Yingwei Pan
Ting Yao
Wen-gang Zhou
Houqiang Li
Tao Mei
ObjD
154
191
0
26 Aug 2019
Sequence Level Semantics Aggregation for Video Object Detection
Haiping Wu
Yuntao Chen
Naiyan Wang
Zhaoxiang Zhang
83
204
0
15 Jul 2019
A Survey of Autonomous Driving: Common Practices and Emerging Technologies
Ekim Yurtsever
Jacob Lambert
Alexander Carballo
K. Takeda
93
1,381
0
12 Jun 2019
Video Instance Segmentation
Linjie Yang
Yuchen Fan
N. Xu
VOS
ISeg
85
508
0
12 May 2019
Fast Online Object Tracking and Segmentation: A Unifying Approach
Qiang Wang
Li Zhang
Luca Bertinetto
Weiming Hu
Philip Torr
VOS
70
1,205
0
12 Dec 2018
GOT-10k: A Large High-Diversity Benchmark for Generic Object Tracking in the Wild
Lianghua Huang
Xin Zhao
Kaiqi Huang
89
1,344
0
29 Oct 2018
Relation Networks for Object Detection
Han Hu
Jiayuan Gu
Zheng Zhang
Jifeng Dai
Yichen Wei
ObjD
119
1,223
0
30 Nov 2017
Non-local Neural Networks
Xinyu Wang
Ross B. Girshick
Abhinav Gupta
Kaiming He
OffRL
289
8,906
0
21 Nov 2017
End-to-end representation learning for Correlation Filter based tracking
Jack Valmadre
Luca Bertinetto
João F. Henriques
Andrea Vedaldi
Philip Torr
88
1,399
0
20 Apr 2017
Flow-Guided Feature Aggregation for Video Object Detection
Xizhou Zhu
Yujie Wang
Jifeng Dai
Lu Yuan
Yichen Wei
101
621
0
29 Mar 2017
Mask R-CNN
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
ObjD
352
27,195
0
20 Mar 2017
Feature Pyramid Networks for Object Detection
Nayeon Lee
Piotr Dollár
Ross B. Girshick
Kaiming He
Bharath Hariharan
Serge J. Belongie
ObjD
474
22,108
0
09 Dec 2016
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
Ramprasaath R. Selvaraju
Michael Cogswell
Abhishek Das
Ramakrishna Vedantam
Devi Parikh
Dhruv Batra
FAtt
312
20,023
0
07 Oct 2016
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
413
10,494
0
21 Jul 2016
Fully-Convolutional Siamese Networks for Object Tracking
Luca Bertinetto
Jack Valmadre
João F. Henriques
Andrea Vedaldi
Philip Torr
VOT
77
3,879
0
30 Jun 2016
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
1.7K
39,547
0
01 Sep 2014
1