ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1312.7606
  4. Cited By
Distributed Policy Evaluation Under Multiple Behavior Strategies

Distributed Policy Evaluation Under Multiple Behavior Strategies

30 December 2013
Sergio Valcarcel Macua
Jianshu Chen
S. Zazo
Ali H. Sayed
ArXivPDFHTML

Papers citing "Distributed Policy Evaluation Under Multiple Behavior Strategies"

12 / 12 papers shown
Title
A primal-dual perspective for distributed TD-learning
A primal-dual perspective for distributed TD-learning
Han-Dong Lim
Donghwan Lee
81
1
0
01 Oct 2023
Backstepping Temporal Difference Learning
Backstepping Temporal Difference Learning
Han-Dong Lim
Dong-hwan Lee
OffRL
67
2
0
20 Feb 2023
On the Learning Behavior of Adaptive Networks - Part I: Transient
  Analysis
On the Learning Behavior of Adaptive Networks - Part I: Transient Analysis
Jianshu Chen
Ali H. Sayed
86
132
0
29 Dec 2013
Asynchronous Adaptation and Learning over Networks - Part II:
  Performance Analysis
Asynchronous Adaptation and Learning over Networks - Part II: Performance Analysis
Xiaochuan Zhao
Ali H. Sayed
78
43
0
19 Dec 2013
Distributed Pareto Optimization via Diffusion Strategies
Distributed Pareto Optimization via Diffusion Strategies
Jianshu Chen
Ali H. Sayed
98
173
0
13 Aug 2012
Performance Limits for Distributed Estimation Over LMS Adaptive Networks
Performance Limits for Distributed Estimation Over LMS Adaptive Networks
Xiaochuan Zhao
Ali H. Sayed
63
140
0
17 Jun 2012
Diffusion Adaptation over Networks
Diffusion Adaptation over Networks
Ali H. Sayed
86
447
0
18 May 2012
$QD$-Learning: A Collaborative Distributed Strategy for Multi-Agent
  Reinforcement Learning Through Consensus + Innovations
QDQDQD-Learning: A Collaborative Distributed Strategy for Multi-Agent Reinforcement Learning Through Consensus + Innovations
S. Kar
José M. F. Moura
H. Vincent Poor
94
189
0
30 Apr 2012
Multi-timescale Nexting in a Reinforcement Learning Robot
Multi-timescale Nexting in a Reinforcement Learning Robot
Joseph Modayil
Adam White
R. Sutton
170
130
0
06 Dec 2011
Diffusion Adaptation Strategies for Distributed Optimization and
  Learning over Networks
Diffusion Adaptation Strategies for Distributed Optimization and Learning over Networks
Jianshu Chen
Ali H. Sayed
96
654
0
31 Oct 2011
Should one compute the Temporal Difference fix point or minimize the
  Bellman Residual? The unified oblique projection view
Should one compute the Temporal Difference fix point or minimize the Bellman Residual? The unified oblique projection view
B. Scherrer
78
102
0
19 Nov 2010
Predictive State Temporal Difference Learning
Predictive State Temporal Difference Learning
Byron Boots
Geoffrey J. Gordon
109
48
0
30 Oct 2010
1