Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1401.3870
Cited By
Learning to Make Predictions In Partially Observable Environments Without a Generative Model
16 January 2014
Erik Talvitie
Satinder Singh
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Learning to Make Predictions In Partially Observable Environments Without a Generative Model"
2 / 2 papers shown
Title
The Optimal Reward Baseline for Gradient-Based Reinforcement Learning
Lex Weaver
Nigel Tao
113
246
0
10 Jan 2013
Predictive State Representations: A New Theory for Modeling Dynamical Systems
Satinder Singh
Michael R. James
Matthew R. Rudary
AI4TS
AI4CE
75
288
0
11 Jul 2012
1