15
0

GoTrack: Generic 6DoF Object Pose Refinement and Tracking

Main:8 Pages
12 Figures
Bibliography:3 Pages
4 Tables
Appendix:9 Pages
Abstract

We introduce GoTrack, an efficient and accurate CAD-based method for 6DoF object pose refinement and tracking, which can handle diverse objects without any object-specific training. Unlike existing tracking methods that rely solely on an analysis-by-synthesis approach for model-to-frame registration, GoTrack additionally integrates frame-to-frame registration, which saves compute and stabilizes tracking. Both types of registration are realized by optical flow estimation. The model-to-frame registration is noticeably simpler than in existing methods, relying only on standard neural network blocks (a transformer is trained on top of DINOv2) and producing reliable pose confidence scores without a scoring network. For the frame-to-frame registration, which is an easier problem as consecutive video frames are typically nearly identical, we employ a light off-the-shelf optical flow model. We demonstrate that GoTrack can be seamlessly combined with existing coarse pose estimation methods to create a minimal pipeline that reaches state-of-the-art RGB-only results on standard benchmarks for 6DoF object pose estimation and tracking. Our source code and trained models are publicly available atthis https URL

View on arXiv
@article{nguyen2025_2506.07155,
  title={ GoTrack: Generic 6DoF Object Pose Refinement and Tracking },
  author={ Van Nguyen Nguyen and Christian Forster and Sindi Shkodrani and Vincent Lepetit and Bugra Tekin and Cem Keskin and Tomas Hodan },
  journal={arXiv preprint arXiv:2506.07155},
  year={ 2025 }
}
Comments on this paper