ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.06092
  4. Cited By
CLIP-Loc: Multi-modal Landmark Association for Global Localization in
  Object-based Maps

CLIP-Loc: Multi-modal Landmark Association for Global Localization in Object-based Maps

8 February 2024
Shigemichi Matsuzaki
Takuma Sugino
Kazuhito Tanaka
Zijun Sha
Shintaro Nakaoka
Shintaro Yoshizawa
Kazuhiro Shintani
    VLM
ArXivPDFHTML

Papers citing "CLIP-Loc: Multi-modal Landmark Association for Global Localization in Object-based Maps"

21 / 21 papers shown
Title
Towards Global Localization using Multi-Modal Object-Instance Re-Identification
Towards Global Localization using Multi-Modal Object-Instance Re-Identification
Aneesh Chavan
Vaibhav Agrawal
Vineeth Bhat
Sarthak Chittawar
Siddharth Srivastava
Chetan Arora
K. M. Krishna
135
0
0
18 Sep 2024
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic
  Control
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Anthony Brohan
Noah Brown
Justice Carbajal
Yevgen Chebotar
Xi Chen
...
Ted Xiao
Peng Xu
Sichun Xu
Tianhe Yu
Brianna Zitkovich
LM&Ro
LRM
121
1,231
0
28 Jul 2023
An Object SLAM Framework for Association, Mapping, and High-Level Tasks
An Object SLAM Framework for Association, Mapping, and High-Level Tasks
Yanmin Wu
Yunzhou Zhang
Delong Zhu
Zhiqiang Deng
Wenkai Sun
Xin Chen
Jian Zhang
67
38
0
12 May 2023
FM-Loc: Using Foundation Models for Improved Vision-based Localization
FM-Loc: Using Foundation Models for Improved Vision-based Localization
Reihaneh Mirjalili
Michael Krawez
Wolfram Burgard
VLM
63
15
0
14 Apr 2023
SQ-SLAM: Monocular Semantic SLAM Based on Superquadric Object
  Representation
SQ-SLAM: Monocular Semantic SLAM Based on Superquadric Object Representation
Xiaoye Han
L. Yang
67
10
0
22 Sep 2022
OA-SLAM: Leveraging Objects for Camera Relocalization in Visual SLAM
OA-SLAM: Leveraging Objects for Camera Relocalization in Visual SLAM
Matthieu Zins
Gilles Simon
M. Berger
62
36
0
17 Sep 2022
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language,
  Vision, and Action
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action
Dhruv Shah
B. Osinski
Brian Ichter
Sergey Levine
LM&Ro
213
458
0
10 Jul 2022
CoWs on Pasture: Baselines and Benchmarks for Language-Driven Zero-Shot
  Object Navigation
CoWs on Pasture: Baselines and Benchmarks for Language-Driven Zero-Shot Object Navigation
S. Gadre
Mitchell Wortsman
Gabriel Ilharco
Ludwig Schmidt
Shuran Song
CLIP
LM&Ro
109
150
0
20 Mar 2022
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple
  Sequence-to-Sequence Learning Framework
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Peng Wang
An Yang
Rui Men
Junyang Lin
Shuai Bai
Zhikang Li
Jianxin Ma
Chang Zhou
Jingren Zhou
Hongxia Yang
MLLM
ObjD
141
873
0
07 Feb 2022
SO-SLAM: Semantic Object SLAM with Scale Proportional and Symmetrical
  Texture Constraints
SO-SLAM: Semantic Object SLAM with Scale Proportional and Symmetrical Texture Constraints
Ziwei Liao
Yutong Hu
Jiadong Zhang
Xianyu Qi
Xiaoyu Zhang
Wei Wang
37
59
0
10 Sep 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
871
29,372
0
26 Feb 2021
An Image is Worth 16x16 Words: Transformers for Image Recognition at
  Scale
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
606
40,961
0
22 Oct 2020
ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial
  and Multi-Map SLAM
ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial and Multi-Map SLAM
C. Campos
Richard Elvira
J. Rodríguez
José M.M. Montiel
Juan D. Tardós
70
2,868
0
23 Jul 2020
Language Models are Few-Shot Learners
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
741
41,894
0
28 May 2020
MAGSAC++, a fast, reliable and accurate robust estimator
MAGSAC++, a fast, reliable and accurate robust estimator
Dániel Baráth
Jana Noskova
Maksym Ivashechkin
Jirí Matas
52
247
0
11 Dec 2019
Monocular Object and Plane SLAM in Structured Environments
Monocular Object and Plane SLAM in Structured Environments
Shichao Yang
Sebastian Scherer
56
87
0
10 Sep 2018
CubeSLAM: Monocular 3D Object SLAM
CubeSLAM: Monocular 3D Object SLAM
Shichao Yang
Sebastian Scherer
56
386
0
01 Jun 2018
QuadricSLAM: Dual Quadrics from Object Detections as Landmarks in
  Object-oriented SLAM
QuadricSLAM: Dual Quadrics from Object Detections as Landmarks in Object-oriented SLAM
Lachlan Nicholson
Michael Milford
Niko Sünderhauf
51
284
0
10 Apr 2018
ORB-SLAM2: an Open-Source SLAM System for Monocular, Stereo and RGB-D
  Cameras
ORB-SLAM2: an Open-Source SLAM System for Monocular, Stereo and RGB-D Cameras
Raul Mur-Artal
Juan D. Tardós
331
5,439
0
20 Oct 2016
ORB-SLAM: a Versatile and Accurate Monocular SLAM System
ORB-SLAM: a Versatile and Accurate Monocular SLAM System
Raul Mur-Artal
José M.M. Montiel
Juan D. Tardós
106
6,390
0
03 Feb 2015
Microsoft COCO: Common Objects in Context
Microsoft COCO: Common Objects in Context
Nayeon Lee
Michael Maire
Serge J. Belongie
Lubomir Bourdev
Ross B. Girshick
James Hays
Pietro Perona
Deva Ramanan
C. L. Zitnick
Piotr Dollár
ObjD
408
43,638
0
01 May 2014
1