ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.07798
  4. Cited By
OVRL-V2: A simple state-of-art baseline for ImageNav and ObjectNav

OVRL-V2: A simple state-of-art baseline for ImageNav and ObjectNav

14 March 2023
Karmesh Yadav
Arjun Majumdar
Ram Ramrakhya
Naoki Yokoyama
Alexei Baevski
Z. Kira
Oleksandr Maksymets
Dhruv Batra
    ViT
ArXivPDFHTML

Papers citing "OVRL-V2: A simple state-of-art baseline for ImageNav and ObjectNav"

44 / 44 papers shown
Title
Multimodal Perception for Goal-oriented Navigation: A Survey
Multimodal Perception for Goal-oriented Navigation: A Survey
I-Tak Ieong
Hao Tang
LM&Ro
LRM
33
0
0
22 Apr 2025
Dexterous Manipulation through Imitation Learning: A Survey
Dexterous Manipulation through Imitation Learning: A Survey
Shan An
Ziyu Meng
Chao Tang
Yue Zhou
Tengyu Liu
...
Yao Mu
Ran Song
Wei Zhang
Zeng-Guang Hou
H. Zhang
51
0
0
04 Apr 2025
Image-Goal Navigation Using Refined Feature Guidance and Scene Graph Enhancement
Zhicheng Feng
Xieyuanli Chen
Chenghao Shi
Lun Luo
Z. Chen
Yun Liu
Huimin Lu
48
0
0
14 Mar 2025
UniGoal: Towards Universal Zero-shot Goal-oriented Navigation
UniGoal: Towards Universal Zero-shot Goal-oriented Navigation
Hang Yin
Xiuwei Xu
Lingqing Zhao
Zehua Wang
Jie Zhou
Jiwen Lu
114
2
0
13 Mar 2025
WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation
WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation
Dujun Nie
Xianda Guo
Yiqun Duan
Ruijun Zhang
Long Chen
LM&Ro
156
2
0
04 Mar 2025
Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments
Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments
Luca Barsellotti
Roberto Bigazzi
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
95
1
0
20 Feb 2025
HM3D-OVON: A Dataset and Benchmark for Open-Vocabulary Object Goal
  Navigation
HM3D-OVON: A Dataset and Benchmark for Open-Vocabulary Object Goal Navigation
Naoki Yokoyama
Ram Ramrakhya
Abhishek Das
Dhruv Batra
Sehoon Ha
31
10
0
22 Sep 2024
BEINGS: Bayesian Embodied Image-goal Navigation with Gaussian Splatting
BEINGS: Bayesian Embodied Image-goal Navigation with Gaussian Splatting
Wugang Meng
Tianfu Wu
Huan Yin
Fumin Zhang
3DGS
34
0
0
16 Sep 2024
NOLO: Navigate Only Look Once
NOLO: Navigate Only Look Once
Mengyu Bu
Shuhao Gu
Yang Feng
EgoV
43
1
0
02 Aug 2024
Theia: Distilling Diverse Vision Foundation Models for Robot Learning
Theia: Distilling Diverse Vision Foundation Models for Robot Learning
Jinghuan Shang
Karl Schmeckpeper
Brandon B. May
M. Minniti
Tarik Kelestemur
David Watkins
Laura Herlant
VLM
34
23
0
29 Jul 2024
Two-Stage Depth Enhanced Learning with Obstacle Map For Object
  Navigation
Two-Stage Depth Enhanced Learning with Obstacle Map For Object Navigation
Yanwei Zheng
Shaopu Feng
Bowen Huang
Changrui Li
Xiao Zhang
Dongxiao Yu
36
0
0
20 Jun 2024
CoNav: A Benchmark for Human-Centered Collaborative Navigation
CoNav: A Benchmark for Human-Centered Collaborative Navigation
Changhao Li
Xinyu Sun
Peihao Chen
Jugang Fan
Zixu Wang
Yanxia Liu
Jinhui Zhu
Chuang Gan
Mingkui Tan
56
1
0
04 Jun 2024
Transformers for Image-Goal Navigation
Transformers for Image-Goal Navigation
Nikhilanj Pelluri
ViT
32
0
0
23 May 2024
Pre-trained Text-to-Image Diffusion Models Are Versatile Representation
  Learners for Control
Pre-trained Text-to-Image Diffusion Models Are Versatile Representation Learners for Control
Gunshi Gupta
Karmesh Yadav
Y. Gal
Dhruv Batra
Z. Kira
Cong Lu
Tim G. J. Rudner
39
7
0
09 May 2024
LOC-ZSON: Language-driven Object-Centric Zero-Shot Object Retrieval and
  Navigation
LOC-ZSON: Language-driven Object-Centric Zero-Shot Object Retrieval and Navigation
Tianrui Guan
Yurou Yang
Harry Cheng
Muyuan Lin
Richard Kim
R. Madhivanan
Arnie Sen
Dinesh Manocha
LM&Ro
44
8
0
08 May 2024
GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation
GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation
Mukul Khanna
Ram Ramrakhya
Gunjan Chhablani
Sriram Yenamandra
Théophile Gervet
Matthew Chang
Z. Kira
Devendra Singh Chaplot
Dhruv Batra
Roozbeh Mottaghi
LM&Ro
59
23
0
09 Apr 2024
Prioritized Semantic Learning for Zero-shot Instance Navigation
Prioritized Semantic Learning for Zero-shot Instance Navigation
Xander Sun
Louis Lau
Hoyard Zhi
Ronghe Qiu
Junwei Liang
37
8
0
18 Mar 2024
GaussNav: Gaussian Splatting for Visual Navigation
GaussNav: Gaussian Splatting for Visual Navigation
Xiaohan Lei
Min Wang
Wen-gang Zhou
Houqiang Li
3DGS
32
12
0
18 Mar 2024
Learning Generalizable Feature Fields for Mobile Manipulation
Learning Generalizable Feature Fields for Mobile Manipulation
Ri-Zhao Qiu
Yafei Hu
Ge Yang
Yuchen Song
Yang Fu
...
Jiteng Mu
Ruihan Yang
Nikolay Atanasov
Sebastian Scherer
Xiaolong Wang
40
27
0
12 Mar 2024
DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in
  Dynamic Environments
DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments
Ji Ma
Hongming Dai
Yao Mu
Pengying Wu
Hao Wang
Xiaowei Chi
Yang Fei
Shanghang Zhang
Chang-rui Liu
49
6
0
29 Feb 2024
Instance-aware Exploration-Verification-Exploitation for Instance
  ImageGoal Navigation
Instance-aware Exploration-Verification-Exploitation for Instance ImageGoal Navigation
X. Lei
Min Wang
Wen-gang Zhou
Li Li
Houqiang Li
46
5
0
25 Feb 2024
Task-conditioned adaptation of visual features in multi-task policy
  learning
Task-conditioned adaptation of visual features in multi-task policy learning
Pierre Marza
L. Matignon
Olivier Simonin
Christian Wolf
45
2
0
12 Feb 2024
Vision-Language Models Provide Promptable Representations for
  Reinforcement Learning
Vision-Language Models Provide Promptable Representations for Reinforcement Learning
William Chen
Oier Mees
Aviral Kumar
Sergey Levine
VLM
LM&Ro
42
23
0
05 Feb 2024
Learning to navigate efficiently and precisely in real environments
Learning to navigate efficiently and precisely in real environments
G. Bono
Hervé Poirier
L. Antsfeld
G. Monaci
Boris Chidlovskii
Christian Wolf
21
2
0
25 Jan 2024
VLFM: Vision-Language Frontier Maps for Zero-Shot Semantic Navigation
VLFM: Vision-Language Frontier Maps for Zero-Shot Semantic Navigation
Naoki Yokoyama
Sehoon Ha
Dhruv Batra
Jiuguang Wang
Bernadette Bucher
LM&Ro
26
78
0
06 Dec 2023
Selective Visual Representations Improve Convergence and Generalization
  for Embodied AI
Selective Visual Representations Improve Convergence and Generalization for Embodied AI
Ainaz Eftekhar
Kuo-Hao Zeng
Jiafei Duan
Ali Farhadi
Aniruddha Kembhavi
Ranjay Krishna
27
13
0
07 Nov 2023
Exploitation-Guided Exploration for Semantic Embodied Navigation
Exploitation-Guided Exploration for Semantic Embodied Navigation
Justin Wasserman
Girish Chowdhary
Abhinav Gupta
Unnat Jain
21
1
0
06 Nov 2023
Advances in Embodied Navigation Using Large Language Models: A Survey
Advances in Embodied Navigation Using Large Language Models: A Survey
Jinzhou Lin
Han Gao
Xuxiang Feng
Rongtao Xu
Changwei Wang
Man Zhang
Li Guo
Shibiao Xu
LM&Ro
LLMAG
66
9
0
01 Nov 2023
Navigation with Large Language Models: Semantic Guesswork as a Heuristic
  for Planning
Navigation with Large Language Models: Semantic Guesswork as a Heuristic for Planning
Dhruv Shah
Michael Equi
B. Osinski
Fei Xia
Brian Ichter
Sergey Levine
3DV
LM&Ro
14
91
0
16 Oct 2023
FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation
FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation
Xinyu Sun
Peihao Chen
Jugang Fan
Thomas H. Li
Jian Chen
Mingkui Tan
32
12
0
11 Oct 2023
What do we learn from a large-scale study of pre-trained visual
  representations in sim and real environments?
What do we learn from a large-scale study of pre-trained visual representations in sim and real environments?
Sneha Silwal
Karmesh Yadav
Tingfan Wu
Jay Vakil
Arjun Majumdar
...
Dhruv Batra
Aravind Rajeswaran
Mrinal Kalakrishnan
Franziska Meier
Oleksandr Maksymets
SSL
LM&Ro
34
5
0
03 Oct 2023
End-to-End (Instance)-Image Goal Navigation through Correspondence as an
  Emergent Phenomenon
End-to-End (Instance)-Image Goal Navigation through Correspondence as an Emergent Phenomenon
G. Bono
L. Antsfeld
Boris Chidlovskii
Zhi Zheng
Christian Wolf
3DV
26
9
0
28 Sep 2023
HabiCrowd: A High Performance Simulator for Crowd-Aware Visual
  Navigation
HabiCrowd: A High Performance Simulator for Crowd-Aware Visual Navigation
Vuong Dinh An
Toan Tien Nguyen
Minh Nhat Vu
Baoru Huang
Dzung Nguyen
H. Binh
T. Vo
Anh Nguyen
40
5
0
20 Jun 2023
MOPA: Modular Object Navigation with PointGoal Agents
MOPA: Modular Object Navigation with PointGoal Agents
Sonia Raychaudhuri
Tommaso Campari
Unnat Jain
Manolis Savva
Angel X. Chang
3DPC
24
8
0
07 Apr 2023
Navigating to Objects Specified by Images
Navigating to Objects Specified by Images
Jacob Krantz
Théophile Gervet
Karmesh Yadav
Austin S. Wang
Chris Paxton
Roozbeh Mottaghi
Dhruv Batra
Jitendra Malik
Stefan Lee
Devendra Singh Chaplot
44
36
0
03 Apr 2023
Where are we in the search for an Artificial Visual Cortex for Embodied
  Intelligence?
Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence?
Arjun Majumdar
Karmesh Yadav
Sergio Arnaud
Yecheng Jason Ma
Claire Chen
...
Dhruv Batra
Yixin Lin
Oleksandr Maksymets
Aravind Rajeswaran
Franziska Meier
LM&Ro
19
172
0
31 Mar 2023
Habitat-Matterport 3D Semantics Dataset
Habitat-Matterport 3D Semantics Dataset
Karmesh Yadav
Ram Ramrakhya
Santhosh Kumar Ramakrishnan
Theo Gervet
John Turner
...
Angel X. Chang
Dhruv Batra
Manolis Savva
Alexander Clegg
Devendra Singh Chaplot
3DV
MDE
89
83
0
11 Oct 2022
Real-World Robot Learning with Masked Visual Pre-training
Real-World Robot Learning with Masked Visual Pre-training
Ilija Radosavovic
Tete Xiao
Stephen James
Pieter Abbeel
Jitendra Malik
Trevor Darrell
SSL
156
239
0
06 Oct 2022
Masked World Models for Visual Control
Masked World Models for Visual Control
Younggyo Seo
Danijar Hafner
Hao Liu
Fangchen Liu
Stephen James
Kimin Lee
Pieter Abbeel
OffRL
87
146
0
28 Jun 2022
ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings
ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings
Arjun Majumdar
Gunjan Aggarwal
Bhavika Devnani
Judy Hoffman
Dhruv Batra
LM&Ro
149
149
0
24 Jun 2022
Masked Autoencoders Are Scalable Vision Learners
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
305
7,443
0
11 Nov 2021
No RL, No Simulation: Learning to Navigate without Navigating
No RL, No Simulation: Learning to Navigate without Navigating
Meera Hahn
Devendra Singh Chaplot
Shubham Tulsiani
Mustafa Mukadam
James M. Rehg
Abhinav Gupta
75
71
0
18 Oct 2021
Curious Representation Learning for Embodied Intelligence
Curious Representation Learning for Embodied Intelligence
Yilun Du
Chuang Gan
Phillip Isola
SSL
LM&Ro
112
40
0
03 May 2021
Emerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
317
5,785
0
29 Apr 2021
1