Safe Exploration by Solving Early Terminated MDP

9 July 2021

Papers citing "Safe Exploration by Solving Early Terminated MDP"

3 / 3 papers shown

Title
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond Hao Sun OffRL 34 21 0 09 Oct 2023
Novel Policy Seeking with Constrained Optimization Hao Sun Zhenghao Peng Bo Dai Jian Guo Dahua Lin Bolei Zhou 24 13 0 21 May 2020
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks Chelsea Finn Pieter Abbeel Sergey Levine OOD 463 11,715 0 09 Mar 2017