O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation

Abstract

Online construction of open-ended language scenes is crucial for robotic applications that require open-vocabulary interactive scene understanding. Recently, neural implicit representations have provided a promising direction for online interactive mapping. However, integrating open-vocabulary scene understanding into online neural implicit mapping still faces three challenges: the lack of local scene updating ability, blurry spatial hierarchical semantic segmentation, and difficulty in maintaining multi-view consistency. To this end, we propose O2V-Mapping, which uses voxel-based language and geometric features to create an open-vocabulary field, thus allowing local updates during the online training process. Additionally, we leverage a foundation model for image segmentation to extract language features on object-level entities, achieving clear segmentation boundaries and hierarchical semantic features. To preserve the consistency of 3D object properties across different viewpoints, we propose a spatial adaptive voxel adjustment mechanism and a multi-view weight selection method. Extensive experiments on open-vocabulary object localization and semantic segmentation demonstrate that O2V-Mapping achieves online construction of language scenes while improving accuracy, outperforming the previous SOTA method.

@article{tie2025_2404.06836,
  title={O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation},
  author={Muer Tie and Julong Wei and Zhengjun Wang and Ke Wu and Shansuai Yuan and Kaizhao Zhang and Jie Jia and Jieru Zhao and Zhongxue Gan and Wenchao Ding},
  journal={arXiv preprint arXiv:2404.06836},
  year={2025}
}