Learning to Make Predictions In Partially Observable Environments Without a Generative Model

16 January 2014

Papers citing "Learning to Make Predictions In Partially Observable Environments Without a Generative Model"

2 / 2 papers shown

Title
The Optimal Reward Baseline for Gradient-Based Reinforcement Learning Lex Weaver Nigel Tao 113 246 0 10 Jan 2013
Predictive State Representations: A New Theory for Modeling Dynamical Systems Satinder Singh Michael R. James Matthew R. Rudary AI4TS AI4CE 75 288 0 11 Jul 2012