Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.05141
Cited By
v1
v2 (latest)
EffOWT: Transfer Visual Language Models to Open-World Tracking Efficiently and Effectively
7 April 2025
Bingyang Wang
Kaer Huang
Bin Li
Yiqiang Yan
Lulu Zhang
Huchuan Lu
You He
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"EffOWT: Transfer Visual Language Models to Open-World Tracking Efficiently and Effectively"
38 / 38 papers shown
Title
NetTrack: Tracking Highly Dynamic Objects with a Net
Guang-Zheng Zheng
Shijie Lin
Haobo Zuo
Changhong Fu
Jia Pan
103
11
0
17 Mar 2024
UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory
Haiwen Diao
Bo Wan
Yanzhe Zhang
Xuecong Jia
Huchuan Lu
Long Chen
VLM
81
19
0
28 Aug 2023
Video OWL-ViT: Temporally-consistent open-world localization in video
G. Heigold
Matthias Minderer
A. Gritsenko
Alex Bewley
Daniel Keysers
Mario Luvcić
Feng Yu
Thomas Kipf
VLM
92
14
0
22 Aug 2023
Strip-MLP: Efficient Token Interaction for Vision MLP
Guiping Cao
Shengda Luo
Wen-Fong Huang
X. Lan
D. Jiang
Yaowei Wang
Jianguo Zhang
108
12
0
21 Jul 2023
OVTrack: Open-Vocabulary Multiple Object Tracking
Siyuan Li
Tobias Fischer
Lei Ke
Henghui Ding
Martin Danelljan
Feng Yu
DiffM
120
46
0
17 Apr 2023
DINOv2: Learning Robust Visual Features without Supervision
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
...
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
VLM
CLIP
SSL
555
3,536
0
14 Apr 2023
Your representations are in the network: composable and parallel adaptation for large scale models
Yonatan Dukler
Alessandro Achille
Hao Yang
Varsha Vivek
Luca Zancato
Benjamin Bowman
Avinash Ravichandran
Charless C. Fowlkes
A. Swaminathan
Stefano Soatto
97
3
0
07 Mar 2023
Side Adapter Network for Open-Vocabulary Semantic Segmentation
Mengde Xu
Zheng Zhang
Fangyun Wei
Han Hu
Xiang Bai
VLM
89
273
0
23 Feb 2023
Visual Query Tuning: Towards Effective Usage of Intermediate Representations for Parameter and Memory Efficient Transfer Learning
Cheng-Hao Tu
Zheda Mai
Wei-Lun Chao
64
48
0
06 Dec 2022
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning
Yi-Lin Sung
Jaemin Cho
Joey Tianyi Zhou
VLM
101
246
0
13 Jun 2022
Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking
Jinkun Cao
Jiangmiao Pang
Xinshuo Weng
Rawal Khirodkar
Kris Kitani
VOT
139
505
0
27 Mar 2022
Visual Prompt Tuning
Menglin Jia
Luming Tang
Bor-Chun Chen
Claire Cardie
Serge Belongie
Bharath Hariharan
Ser-Nam Lim
VLM
VPVLM
245
1,660
0
23 Mar 2022
Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework
Botao Ye
Hong Chang
Bingpeng Ma
Shiguang Shan
Xilin Chen
ViT
141
491
0
22 Mar 2022
Head2Toe: Utilizing Intermediate Representations for Better Transfer Learning
Utku Evci
Vincent Dumoulin
Hugo Larochelle
Michael C. Mozer
147
86
0
10 Jan 2022
RegionCLIP: Region-based Language-Image Pretraining
Yiwu Zhong
Jianwei Yang
Pengchuan Zhang
Chunyuan Li
Noel Codella
...
Luowei Zhou
Xiyang Dai
Lu Yuan
Yin Li
Jianfeng Gao
VLM
CLIP
157
585
0
16 Dec 2021
Grounded Language-Image Pre-training
Liunian Harold Li
Pengchuan Zhang
Haotian Zhang
Jianwei Yang
Chunyuan Li
...
Lu Yuan
Lei Zhang
Lei Li
Kai-Wei Chang
Jianfeng Gao
ObjD
VLM
186
1,073
0
07 Dec 2021
OW-DETR: Open-world Detection Transformer
Akshita Gupta
Sanath Narayan
K. J. Joseph
Salman Khan
Fahad Shahbaz Khan
M. Shah
ViT
98
175
0
02 Dec 2021
ByteTrack: Multi-Object Tracking by Associating Every Detection Box
Yifu Zhang
Pei Sun
Yi Jiang
Dongdong Yu
Fucheng Weng
Zehuan Yuan
Ping Luo
Wenyu Liu
Xinggang Wang
VOT
210
1,412
0
13 Oct 2021
MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer
Sachin Mehta
Mohammad Rastegari
ViT
305
1,306
0
05 Oct 2021
Mobile-Former: Bridging MobileNet and Transformer
Yinpeng Chen
Xiyang Dai
Dongdong Chen
Mengchen Liu
Xiaoyi Dong
Lu Yuan
Zicheng Liu
ViT
282
494
0
12 Aug 2021
PVT v2: Improved Baselines with Pyramid Vision Transformer
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
AI4TS
190
1,705
0
25 Jun 2021
Opening up Open-World Tracking
Yang Liu
Idil Esen Zulfikar
Jonathon Luiten
Achal Dave
Deva Ramanan
Bastian Leibe
Aljosa Osep
Laura Leal-Taixé
114
54
0
22 Apr 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
540
21,854
0
25 Mar 2021
Towards Open World Object Detection
K. J. Joseph
Salman Khan
Fahad Shahbaz Khan
V. Balasubramanian
ObjD
116
464
0
03 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
1.1K
30,116
0
26 Feb 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
562
3,917
0
11 Feb 2021
TransTrack: Multiple Object Tracking with Transformer
Pei Sun
Jinkun Cao
Yi Jiang
Rufeng Zhang
Enze Xie
Zehuan Yuan
Changhu Wang
Ping Luo
ViT
VOT
322
589
0
31 Dec 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
795
41,946
0
22 Oct 2020
Quasi-Dense Similarity Learning for Multiple Object Tracking
Jiangmiao Pang
Linlu Qiu
Xia Li
Haofeng Chen
Qi Li
Trevor Darrell
Feng Yu
VOT
203
375
0
11 Jun 2020
TAO: A Large-Scale Benchmark for Tracking Any Object
Achal Dave
Tarasha Khurana
P. Tokmakov
Cordelia Schmid
Deva Ramanan
96
180
0
20 May 2020
MOT20: A benchmark for multi object tracking in crowded scenes
Patrick Dendorfer
Hamid Rezatofighi
Anton Milan
Javen Qinfeng Shi
Zorah Lähner
Ian Reid
Stefan Roth
Konrad Schindler
Laura Leal-Taixé
VOT
249
658
0
19 Mar 2020
Towards Real-Time Multi-Object Tracking
Zhongdao Wang
Liang Zheng
Yixuan Liu
Yali Li
Shengjin Wang
VOT
326
887
0
27 Sep 2019
BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning
Feng Yu
Haofeng Chen
Xin Wang
Wenqi Xian
Yingying Chen
Fangchen Liu
Vashisht Madhavan
Trevor Darrell
VLM
357
2,171
0
12 May 2018
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
1.0K
133,589
0
12 Jun 2017
MOT16: A Benchmark for Multi-Object Tracking
Anton Milan
Laura Leal-Taixe
Ian Reid
Stefan Roth
Konrad Schindler
VOT
253
1,812
0
02 Mar 2016
Simple Online and Realtime Tracking
Alex Bewley
Zongyuan Ge
Lionel Ott
F. Ramos
B. Upcroft
VOT
157
3,127
0
02 Feb 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.7K
195,301
0
10 Dec 2015
Microsoft COCO: Common Objects in Context
Nayeon Lee
Michael Maire
Serge J. Belongie
Lubomir Bourdev
Ross B. Girshick
James Hays
Pietro Perona
Deva Ramanan
C. L. Zitnick
Piotr Dollár
ObjD
507
44,016
0
01 May 2014
1