Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2107.04200
Cited By
Safe Exploration by Solving Early Terminated MDP
9 July 2021
Hao Sun
Ziping Xu
Meng Fang
Zhenghao Peng
Jiadong Guo
Bo Dai
Bolei Zhou
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Safe Exploration by Solving Early Terminated MDP"
3 / 3 papers shown
Title
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Hao Sun
OffRL
34
21
0
09 Oct 2023
Novel Policy Seeking with Constrained Optimization
Hao Sun
Zhenghao Peng
Bo Dai
Jian Guo
Dahua Lin
Bolei Zhou
24
13
0
21 May 2020
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
463
11,715
0
09 Mar 2017
1