DOPE: Doubly Optimistic and Pessimistic Exploration for Safe
  Reinforcement Learning
v1v2v3 (latest)

DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning

Papers citing "DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning"