ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2207.04429
  4. Cited By
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language,
  Vision, and Action

LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action

10 July 2022
Dhruv Shah
B. Osinski
Brian Ichter
Sergey Levine
    LM&Ro
ArXivPDFHTML

Papers citing "LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action"

30 / 80 papers shown
Title
CLIP-Loc: Multi-modal Landmark Association for Global Localization in
  Object-based Maps
CLIP-Loc: Multi-modal Landmark Association for Global Localization in Object-based Maps
Shigemichi Matsuzaki
Takuma Sugino
Kazuhito Tanaka
Zijun Sha
Shintaro Nakaoka
Shintaro Yoshizawa
Kazuhiro Shintani
VLM
15
5
0
08 Feb 2024
Sketch-Guided Constrained Decoding for Boosting Blackbox Large Language
  Models without Logit Access
Sketch-Guided Constrained Decoding for Boosting Blackbox Large Language Models without Logit Access
Saibo Geng
Berkay Döner
Chris Wendler
Martin Josifoski
Robert West
38
3
0
18 Jan 2024
Learning-To-Rank Approach for Identifying Everyday Objects Using a
  Physical-World Search Engine
Learning-To-Rank Approach for Identifying Everyday Objects Using a Physical-World Search Engine
Kanta Kaneda
Shunya Nagashima
Ryosuke Korekata
Motonari Kambara
Komei Sugiura
35
6
0
26 Dec 2023
Evaluation of Large Language Models for Decision Making in Autonomous
  Driving
Evaluation of Large Language Models for Decision Making in Autonomous Driving
Kotaro Tanahashi
Yuichi Inoue
Yu Yamaguchi
Hidetatsu Yaginuma
Daiki Shiotsuka
...
Koki Igari
Tsukasa Horinouchi
Kento Tokuhiro
Yugo Tokuchi
Shunsuke Aoki
21
11
0
11 Dec 2023
A Survey on Multimodal Large Language Models for Autonomous Driving
A Survey on Multimodal Large Language Models for Autonomous Driving
Can Cui
Yunsheng Ma
Xu Cao
Wenqian Ye
Yang Zhou
...
Xinrui Yan
Shuqi Mei
Jianguo Cao
Ziran Wang
Chao Zheng
41
249
0
21 Nov 2023
A Graph-to-Text Approach to Knowledge-Grounded Response Generation in Human-Robot Interaction
A Graph-to-Text Approach to Knowledge-Grounded Response Generation in Human-Robot Interaction
Nicholas Walker
Stefan Ultes
Pierre Lison
LM&Ro
46
1
0
03 Nov 2023
Advances in Embodied Navigation Using Large Language Models: A Survey
Advances in Embodied Navigation Using Large Language Models: A Survey
Jinzhou Lin
Han Gao
Xuxiang Feng
Rongtao Xu
Changwei Wang
Man Zhang
Li Guo
Shibiao Xu
LM&Ro
LLMAG
66
9
0
01 Nov 2023
Vision and Language Navigation in the Real World via Online Visual
  Language Mapping
Vision and Language Navigation in the Real World via Online Visual Language Mapping
Chengguang Xu
Hieu T. Nguyen
Christopher Amato
Lawson L. S. Wong
25
9
0
16 Oct 2023
ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and
  Planning
ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning
Yuanyi Zhong
Alihusein Kuwajerwala
Sacha Morin
Krishna Murthy Jatavallabhula
Bipasha Sen
...
Celso Miguel de Melo
Joshua B. Tenenbaum
Antonio Torralba
Florian Shkurti
Liam Paull
LM&Ro
31
166
0
28 Sep 2023
Semantic Scene Difference Detection in Daily Life Patroling by Mobile
  Robots using Pre-Trained Large-Scale Vision-Language Model
Semantic Scene Difference Detection in Daily Life Patroling by Mobile Robots using Pre-Trained Large-Scale Vision-Language Model
Yoshiki Obinata
Kento Kawaharazuka
Naoaki Kanazawa
N. Yamaguchi
Naoto Tsukamoto
Iori Yanokura
Shingo Kitagawa
Koki Shinjo
K. Okada
Masayuki Inaba
LM&Ro
15
6
0
28 Sep 2023
LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language
  Model as an Agent
LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent
Jianing Yang
Xuweiyi Chen
Shengyi Qian
Nikhil Madaan
Madhavan Iyengar
David Fouhey
Joyce Chai
LM&Ro
LLMAG
31
84
0
21 Sep 2023
HiCRISP: An LLM-based Hierarchical Closed-Loop Robotic Intelligent
  Self-Correction Planner
HiCRISP: An LLM-based Hierarchical Closed-Loop Robotic Intelligent Self-Correction Planner
Chenlin Ming
Jiacheng Lin
Pangkit Fong
Han Wang
Xiaoming Duan
Jianping He
20
1
0
21 Sep 2023
Optimal Scene Graph Planning with Large Language Model Guidance
Optimal Scene Graph Planning with Large Language Model Guidance
Zhirui Dai
Arash Asgharivaskasi
T. Duong
Shusen Lin
Maria-Elizabeth Tzes
George Pappas
Nikolay A. Atanasov
LM&Ro
34
17
0
17 Sep 2023
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic
  Control
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Anthony Brohan
Noah Brown
Justice Carbajal
Yevgen Chebotar
Xi Chen
...
Ted Xiao
Peng-Tao Xu
Sichun Xu
Tianhe Yu
Brianna Zitkovich
LM&Ro
LRM
30
1,091
0
28 Jul 2023
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Muhammad Awais
Muzammal Naseer
Salman Khan
Rao Muhammad Anwer
Hisham Cholakkal
M. Shah
Ming Yang
F. Khan
VLM
29
118
0
25 Jul 2023
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large
  Language Models
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
Gengze Zhou
Yicong Hong
Qi Wu
ELM
LM&Ro
LLMAG
LRM
23
140
0
26 May 2023
Multimodal Web Navigation with Instruction-Finetuned Foundation Models
Multimodal Web Navigation with Instruction-Finetuned Foundation Models
Hiroki Furuta
Kuang-Huei Lee
Ofir Nachum
Yutaka Matsuo
Aleksandra Faust
S. Gu
Izzeddin Gur
LM&Ro
36
90
0
19 May 2023
Semantic Anomaly Detection with Large Language Models
Semantic Anomaly Detection with Large Language Models
Amine Elhafsi
Rohan Sinha
Christopher Agia
Edward Schmerling
I. Nesnas
Marco Pavone
31
64
0
18 May 2023
FM-Loc: Using Foundation Models for Improved Vision-based Localization
FM-Loc: Using Foundation Models for Improved Vision-based Localization
Reihaneh Mirjalili
Michael Krawez
Wolfram Burgard
VLM
33
15
0
14 Apr 2023
ERRA: An Embodied Representation and Reasoning Architecture for
  Long-horizon Language-conditioned Manipulation Tasks
ERRA: An Embodied Representation and Reasoning Architecture for Long-horizon Language-conditioned Manipulation Tasks
Chao Zhao
Shuai Yuan
Chunli Jiang
Junhao Cai
Hongyu Yu
M. Y. Wang
Qifeng Chen
LM&Ro
24
14
0
05 Apr 2023
The Ladder in Chaos: A Simple and Effective Improvement to General DRL
  Algorithms by Policy Path Trimming and Boosting
The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting
Hongyao Tang
M. Zhang
Jianye Hao
23
1
0
02 Mar 2023
ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object
  Navigation
ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation
KAI-QING Zhou
Kai Zheng
Connor Pryor
Yilin Shen
Hongxia Jin
Lise Getoor
X. Wang
23
107
0
30 Jan 2023
A Data-Efficient Visual-Audio Representation with Intuitive Fine-tuning
  for Voice-Controlled Robots
A Data-Efficient Visual-Audio Representation with Intuitive Fine-tuning for Voice-Controlled Robots
Peixin Chang
Shuijing Liu
Tianchen Ji
Neeloy Chakraborty
Kaiwen Hong
Katherine Driggs-Campbell
43
3
0
23 Jan 2023
Don't Generate, Discriminate: A Proposal for Grounding Language Models
  to Real-World Environments
Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments
Yu Gu
Xiang Deng
Yu-Chuan Su
LLMAG
28
52
0
19 Dec 2022
GNM: A General Navigation Model to Drive Any Robot
GNM: A General Navigation Model to Drive Any Robot
Dhruv Shah
A. Sridhar
Arjun Bhorkar
Noriaki Hirose
Sergey Levine
19
103
0
07 Oct 2022
VIMA: General Robot Manipulation with Multimodal Prompts
VIMA: General Robot Manipulation with Multimodal Prompts
Yunfan Jiang
Agrim Gupta
Zichen Zhang
Guanzhi Wang
Yongqiang Dou
Yanjun Chen
Li Fei-Fei
Anima Anandkumar
Yuke Zhu
Linxi Fan
LM&Ro
24
335
0
06 Oct 2022
ViKiNG: Vision-Based Kilometer-Scale Navigation with Geographic Hints
ViKiNG: Vision-Based Kilometer-Scale Navigation with Geographic Hints
Dhruv Shah
Sergey Levine
132
66
0
23 Feb 2022
Open-vocabulary Object Detection via Vision and Language Knowledge
  Distillation
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation
Xiuye Gu
Tsung-Yi Lin
Weicheng Kuo
Yin Cui
VLM
ObjD
225
898
0
28 Apr 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy
  Text Supervision
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
298
3,693
0
11 Feb 2021
Scaling Local Control to Large-Scale Topological Navigation
Scaling Local Control to Large-Scale Topological Navigation
Xiangyun Meng
Nathan D. Ratliff
Yu Xiang
D. Fox
95
61
0
26 Sep 2019
Previous
12