ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.08408
  4. Cited By
OVTrack: Open-Vocabulary Multiple Object Tracking

OVTrack: Open-Vocabulary Multiple Object Tracking

17 April 2023
Siyuan Li
Tobias Fischer
Lei Ke
Henghui Ding
Martin Danelljan
Feng Yu
    DiffM
ArXiv (abs)PDFHTML

Papers citing "OVTrack: Open-Vocabulary Multiple Object Tracking"

50 / 53 papers shown
Title
EffOWT: Transfer Visual Language Models to Open-World Tracking Efficiently and Effectively
EffOWT: Transfer Visual Language Models to Open-World Tracking Efficiently and Effectively
Bingyang Wang
Kaer Huang
Bin Li
Yiqiang Yan
Lulu Zhang
Huchuan Lu
You He
VLM
148
0
0
07 Apr 2025
OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer
OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer
Jinyang Li
En Yu
Sijia Chen
Wenbing Tao
144
2
0
13 Mar 2025
Omnidirectional Multi-Object Tracking
Omnidirectional Multi-Object Tracking
Kai Luo
Hao-miao Shi
Sheng Wu
Fei Teng
Mengfei Duan
Chang Huang
Yansen Wang
Kaiwei Wang
Kailun Yang
169
1
0
06 Mar 2025
Multiple Object Tracking as ID Prediction
Multiple Object Tracking as ID Prediction
Ruopeng Gao
Yijun Zhang
Limin Wang
180
16
0
25 Mar 2024
Tracking by Associating Clips
Tracking by Associating Clips
Sanghyun Woo
Kwanyong Park
Seoung Wug Oh
In So Kweon
Joon-Young Lee
VOT
67
9
0
20 Dec 2022
BURST: A Benchmark for Unifying Object Recognition, Segmentation and
  Tracking in Video
BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video
A. Athar
Jonathon Luiten
P. Voigtlaender
Tarasha Khurana
Achal Dave
Bastian Leibe
Deva Ramanan
VOSVLM
99
60
0
25 Sep 2022
Tracking Every Thing in the Wild
Tracking Every Thing in the Wild
Siyuan Li
Martin Danelljan
Henghui Ding
Thomas E. Huang
Feng Yu
90
43
0
26 Jul 2022
Learning to Prompt for Open-Vocabulary Object Detection with
  Vision-Language Model
Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model
Yu Du
Fangyun Wei
Zihe Zhang
Miaojing Shi
Yue Gao
Guoqi Li
VPVLMVLM
81
335
0
28 Mar 2022
Global Tracking Transformers
Global Tracking Transformers
Xingyi Zhou
Tianwei Yin
V. Koltun
Philipp Krahenbuhl
VOT
101
138
0
24 Mar 2022
RePaint: Inpainting using Denoising Diffusion Probabilistic Models
RePaint: Inpainting using Denoising Diffusion Probabilistic Models
Andreas Lugmayr
Martin Danelljan
Andrés Romero
Feng Yu
Radu Timofte
Luc Van Gool
DiffM
355
1,427
0
24 Jan 2022
Detecting Twenty-thousand Classes using Image-level Supervision
Detecting Twenty-thousand Classes using Image-level Supervision
Xingyi Zhou
Rohit Girdhar
Armand Joulin
Phillip Krahenbuhl
Ishan Misra
CLIPVLM
113
618
0
07 Jan 2022
RegionCLIP: Region-based Language-Image Pretraining
RegionCLIP: Region-based Language-Image Pretraining
Yiwu Zhong
Jianwei Yang
Pengchuan Zhang
Chunyuan Li
Noel Codella
...
Luowei Zhou
Xiyang Dai
Lu Yuan
Yin Li
Jianfeng Gao
VLMCLIP
151
580
0
16 Dec 2021
MOTSynth: How Can Synthetic Data Help Pedestrian Detection and Tracking?
MOTSynth: How Can Synthetic Data Help Pedestrian Detection and Tracking?
Matteo Fabbri
Guillem Brasó
Gianluca Maugeri
Orcun Cetintas
Riccardo Gasparini
Aljosa Osep
Simone Calderara
Laura Leal-Taixe
Rita Cucchiara
ViT
113
109
0
21 Aug 2021
Aligning Pretraining for Detection via Object-Level Contrastive Learning
Aligning Pretraining for Detection via Object-Level Contrastive Learning
Fangyun Wei
Yue Gao
Zhirong Wu
Han Hu
Stephen Lin
ObjD
70
148
0
04 Jun 2021
MOTR: End-to-End Multiple-Object Tracking with Transformer
MOTR: End-to-End Multiple-Object Tracking with Transformer
Fangao Zeng
Bin Dong
Cheng Chen
Tiancai Wang
Xinming Zhang
Yichen Wei
VOT
77
519
0
07 May 2021
DriveGAN: Towards a Controllable High-Quality Neural Simulation
DriveGAN: Towards a Controllable High-Quality Neural Simulation
S. Kim
Jonah Philion
Antonio Torralba
Sanja Fidler
89
119
0
30 Apr 2021
Open-vocabulary Object Detection via Vision and Language Knowledge
  Distillation
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation
Xiuye Gu
Nayeon Lee
Weicheng Kuo
Huayu Chen
VLMObjD
300
921
0
28 Apr 2021
Monocular Quasi-Dense 3D Object Tracking
Monocular Quasi-Dense 3D Object Tracking
Hou-Ning Hu
Yung-Hsu Yang
Tobias Fischer
Trevor Darrell
Feng Yu
Min Sun
3DPC
80
117
0
12 Mar 2021
Zero-Shot Text-to-Image Generation
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
420
5,005
0
24 Feb 2021
1st Place Solution to ECCV-TAO-2020: Detect and Represent Any Object for
  Tracking
1st Place Solution to ECCV-TAO-2020: Detect and Represent Any Object for Tracking
Fei Du
Boao Xu
Jiasheng Tang
Yuqi Zhang
F. Wang
Hao Li
83
19
0
20 Jan 2021
GeoSim: Realistic Video Simulation via Geometry-Aware Composition for
  Self-Driving
GeoSim: Realistic Video Simulation via Geometry-Aware Composition for Self-Driving
Yun Chen
Frieda Rong
Shivam Duggal
Shenlong Wang
Xinchen Yan
S. Manivasagam
Shangjie Xue
Ersin Yumer
R. Urtasun
57
100
0
16 Jan 2021
TrackFormer: Multi-Object Tracking with Transformers
TrackFormer: Multi-Object Tracking with Transformers
Tim Meinhardt
A. Kirillov
Laura Leal-Taixe
Christoph Feichtenhofer
VOT
281
775
0
07 Jan 2021
TransTrack: Multiple Object Tracking with Transformer
TransTrack: Multiple Object Tracking with Transformer
Pei Sun
Jinkun Cao
Yi Jiang
Rufeng Zhang
Enze Xie
Zehuan Yuan
Changhu Wang
Ping Luo
ViTVOT
316
587
0
31 Dec 2020
Open-Vocabulary Object Detection Using Captions
Open-Vocabulary Object Detection Using Captions
Alireza Zareian
Kevin Dela Rosa
Derek Hao Hu
Shih-Fu Chang
VLMObjD
139
433
0
20 Nov 2020
MOTChallenge: A Benchmark for Single-Camera Multiple Target Tracking
MOTChallenge: A Benchmark for Single-Camera Multiple Target Tracking
Patrick Dendorfer
Aljosa Osep
Anton Milan
Konrad Schindler
Daniel Cremers
Ian Reid
Stefan Roth
Laura Leal-Taixé
VOT
71
267
0
15 Oct 2020
Quasi-Dense Similarity Learning for Multiple Object Tracking
Quasi-Dense Similarity Learning for Multiple Object Tracking
Jiangmiao Pang
Linlu Qiu
Xia Li
Haofeng Chen
Qi Li
Trevor Darrell
Feng Yu
VOT
176
373
0
11 Jun 2020
TAO: A Large-Scale Benchmark for Tracking Any Object
TAO: A Large-Scale Benchmark for Tracking Any Object
Achal Dave
Tarasha Khurana
P. Tokmakov
Cordelia Schmid
Deva Ramanan
75
180
0
20 May 2020
FairMOT: On the Fairness of Detection and Re-Identification in Multiple
  Object Tracking
FairMOT: On the Fairness of Detection and Re-Identification in Multiple Object Tracking
Yifu Zhang
Chunyu Wang
Xinggang Wang
Wenjun Zeng
Wenyu Liu
VOT
132
1,350
0
04 Apr 2020
Tracking Objects as Points
Tracking Objects as Points
Xingyi Zhou
V. Koltun
Philipp Krahenbuhl
VOT3DPC
85
1,068
0
02 Apr 2020
A Simple Framework for Contrastive Learning of Visual Representations
A Simple Framework for Contrastive Learning of Visual Representations
Ting-Li Chen
Simon Kornblith
Mohammad Norouzi
Geoffrey E. Hinton
SSL
395
18,897
0
13 Feb 2020
Learning a Neural Solver for Multiple Object Tracking
Learning a Neural Solver for Multiple Object Tracking
Guillem Brasó
Laura Leal-Taixé
VOT
85
401
0
16 Dec 2019
Scalability in Perception for Autonomous Driving: Waymo Open Dataset
Scalability in Perception for Autonomous Driving: Waymo Open Dataset
Pei Sun
Henrik Kretzschmar
Xerxes Dotiwalla
Aurelien Chouard
Vijaysai Patnaik
...
Shuyang Cheng
Yu Zhang
Jonathon Shlens
Zhifeng Chen
Dragomir Anguelov
149
2,907
0
10 Dec 2019
Momentum Contrast for Unsupervised Visual Representation Learning
Momentum Contrast for Unsupervised Visual Representation Learning
Kaiming He
Haoqi Fan
Yuxin Wu
Saining Xie
Ross B. Girshick
SSL
216
12,136
0
13 Nov 2019
LVIS: A Dataset for Large Vocabulary Instance Segmentation
LVIS: A Dataset for Large Vocabulary Instance Segmentation
Agrim Gupta
Piotr Dollár
Ross B. Girshick
ISegVLM
111
1,379
0
08 Aug 2019
Video Instance Segmentation
Video Instance Segmentation
Linjie Yang
Yuchen Fan
N. Xu
VOSISeg
88
510
0
12 May 2019
Res2Net: A New Multi-scale Backbone Architecture
Res2Net: A New Multi-scale Backbone Architecture
Shanghua Gao
Ming-Ming Cheng
Kai Zhao
Xinyu Zhang
Ming-Hsuan Yang
Philip Torr
126
2,404
0
02 Apr 2019
Tracking without bells and whistles
Tracking without bells and whistles
Philipp Bergmann
Tim Meinhardt
Laura Leal-Taixe
VOT
123
913
0
13 Mar 2019
Towards Segmenting Anything That Moves
Towards Segmenting Anything That Moves
Achal Dave
P. Tokmakov
Deva Ramanan
86
87
0
11 Feb 2019
Combined Image- and World-Space Tracking in Traffic Scenes
Combined Image- and World-Space Tracking in Traffic Scenes
Aljosa Osep
Wolfgang Mehner
Markus Mathias
Bastian Leibe
VOT
45
135
0
19 Sep 2018
BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning
BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning
Feng Yu
Haofeng Chen
Xin Wang
Wenqi Xian
Yingying Chen
Fangchen Liu
Vashisht Madhavan
Trevor Darrell
VLM
346
2,158
0
12 May 2018
Simple Baselines for Human Pose Estimation and Tracking
Simple Baselines for Human Pose Estimation and Tracking
Bin Xiao
Haiping Wu
Yichen Wei
3DHVOT
128
1,793
0
17 Apr 2018
Beyond Pixels: Leveraging Geometry and Shape Cues for Online
  Multi-Object Tracking
Beyond Pixels: Leveraging Geometry and Shape Cues for Online Multi-Object Tracking
Sarthak Sharma
J. Ansari
Krishna Murthy Jatavallabhula
K. M. Krishna
VOT
67
169
0
26 Feb 2018
Detect to Track and Track to Detect
Detect to Track and Track to Detect
Christoph Feichtenhofer
A. Pinz
Andrew Zisserman
VOT
123
562
0
11 Oct 2017
Simple Online and Realtime Tracking with a Deep Association Metric
Simple Online and Realtime Tracking with a Deep Association Metric
N. Wojke
Alex Bewley
Dietrich Paulus
VOT
400
3,552
0
21 Mar 2017
Mask R-CNN
Mask R-CNN
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
ObjD
387
27,275
0
20 Mar 2017
YOLO9000: Better, Faster, Stronger
YOLO9000: Better, Faster, Stronger
Joseph Redmon
Ali Farhadi
VLMObjD
183
15,641
0
25 Dec 2016
Feature Pyramid Networks for Object Detection
Feature Pyramid Networks for Object Detection
Nayeon Lee
Piotr Dollár
Ross B. Girshick
Kaiming He
Bharath Hariharan
Serge J. Belongie
ObjD
491
22,158
0
09 Dec 2016
Playing for Data: Ground Truth from Computer Games
Playing for Data: Ground Truth from Computer Games
Stephan R. Richter
Vibhav Vineet
Stefan Roth
V. Koltun
VLM
124
2,013
0
07 Aug 2016
Learning by tracking: Siamese CNN for robust target association
Learning by tracking: Siamese CNN for robust target association
Laura Leal-Taixé
Cristian Canton Ferrer
Konrad Schindler
55
427
0
26 Apr 2016
Online Multi-Target Tracking Using Recurrent Neural Networks
Online Multi-Target Tracking Using Recurrent Neural Networks
Anton Milan
S. Hamid Rezatofighi
A. Dick
Ian Reid
Konrad Schindler
VOT
94
516
0
13 Apr 2016
12
Next