Distributed Policy Evaluation Under Multiple Behavior Strategies

30 December 2013

Papers citing "Distributed Policy Evaluation Under Multiple Behavior Strategies"

Title | Authors | Tag | Counts | Date
A primal-dual perspective for distributed TD-learning | Han-Dong Lim, Donghwan Lee | | 81 1 0 | 01 Oct 2023
Backstepping Temporal Difference Learning | Han-Dong Lim, Dong-hwan Lee | OffRL | 67 2 0 | 20 Feb 2023
On the Learning Behavior of Adaptive Networks - Part I: Transient Analysis | Jianshu Chen, Ali H. Sayed | | 86 132 0 | 29 Dec 2013
Asynchronous Adaptation and Learning over Networks - Part II: Performance Analysis | Xiaochuan Zhao, Ali H. Sayed | | 78 43 0 | 19 Dec 2013
Distributed Pareto Optimization via Diffusion Strategies | Jianshu Chen, Ali H. Sayed | | 98 173 0 | 13 Aug 2012
Performance Limits for Distributed Estimation Over LMS Adaptive Networks | Xiaochuan Zhao, Ali H. Sayed | | 63 140 0 | 17 Jun 2012
Diffusion Adaptation over Networks | Ali H. Sayed | | 86 447 0 | 18 May 2012
$QD$-Learning: A Collaborative Distributed Strategy for Multi-Agent Reinforcement Learning Through Consensus + Innovations | S. Kar, José M. F. Moura, H. Vincent Poor | | 94 189 0 | 30 Apr 2012
Multi-timescale Nexting in a Reinforcement Learning Robot | Joseph Modayil, Adam White, R. Sutton | | 170 130 0 | 06 Dec 2011
Diffusion Adaptation Strategies for Distributed Optimization and Learning over Networks | Jianshu Chen, Ali H. Sayed | | 96 654 0 | 31 Oct 2011
Should one compute the Temporal Difference fix point or minimize the Bellman Residual? The unified oblique projection view | B. Scherrer | | 78 102 0 | 19 Nov 2010
Predictive State Temporal Difference Learning | Byron Boots, Geoffrey J. Gordon | | 109 48 0 | 30 Oct 2010