Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2503.03208
Cited By
Embodied Escaping: End-to-End Reinforcement Learning for Robot Navigation in Narrow Environment
5 March 2025
Han Zheng
Jing Zhang
Mingyang Jiang
Peiyuan Liu
Danni Liu
Tong Qin
Ming Yang
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Embodied Escaping: End-to-End Reinforcement Learning for Robot Navigation in Narrow Environment"
12 / 12 papers shown
Title
Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking
Roland Stolz
Hanna Krasowski
Jakob Thumm
Michael Eichelbeck
Philipp Gassert
Matthias Althoff
CLL
44
3
0
06 Jun 2024
HOPE: A Reinforcement Learning-based Hybrid Policy Path Planner for Diverse Parking Scenarios
Mingyang Jiang
Yueyuan Li
Songan Zhang
Siyuan Chen
Chunxiang Wang
Ming Yang
97
4
0
31 May 2024
Efficient Reinforcement Learning of Task Planners for Robotic Palletization through Iterative Action Masking Learning
Zheng Wu
Yichuan Li
Wei Zhan
Changliu Liu
Yun-Hui Liu
Masayoshi Tomizuka
61
5
0
07 Apr 2024
NoMaD: Goal Masked Diffusion Policies for Navigation and Exploration
A. Sridhar
Dhruv Shah
Catherine Glossop
Sergey Levine
103
126
0
11 Oct 2023
Multi-vehicle Conflict Resolution in Highly Constrained Spaces by Merging Optimal Control and Reinforcement Learning
Xu Shen
Francesco Borrelli
28
3
0
02 Nov 2022
Arena-Rosnav: Towards Deployment of Deep-Reinforcement-Learning-Based Obstacle Avoidance into Conventional Autonomous Navigation Systems
Linh Kästner
Teham Buiyan
Xinlin Zhao
Lei Jiao
Zhengcheng Shen
Jens Lambrecht
36
47
0
08 Apr 2021
A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
Shengyi Huang
Santiago Ontañón
78
320
0
25 Jun 2020
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
314
8,396
0
04 Jan 2018
Optimization-Based Collision Avoidance
Xiaojing Zhang
Alexander Liniger
Francesco Borrelli
47
335
0
09 Nov 2017
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
122
2,821
0
19 Aug 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
526
19,237
0
20 Jul 2017
ROS Navigation Tuning Guide
Kaiyu Zheng
42
84
0
27 Jun 2017
1