ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.03654
  4. Cited By
Beyond the One Step Greedy Approach in Reinforcement Learning

Beyond the One Step Greedy Approach in Reinforcement Learning

10 February 2018
Yonathan Efroni
Gal Dalal
B. Scherrer
Shie Mannor
    OffRL
ArXivPDFHTML

Papers citing "Beyond the One Step Greedy Approach in Reinforcement Learning"

10 / 10 papers shown
Title
Counting Hours, Counting Losses: The Toll of Unpredictable Work Schedules on Financial Security
Counting Hours, Counting Losses: The Toll of Unpredictable Work Schedules on Financial Security
Pegah Nokhiz
Aravinda Kanchana Ruwanpathirana
Aditya Bhaskara
Suresh Venkatasubramanian
25
0
0
10 Apr 2025
On-line Policy Improvement using Monte-Carlo Search
On-line Policy Improvement using Monte-Carlo Search
Gerald Tesauro
Gregory R. Galperin
92
270
0
09 Jan 2025
A New Policy Iteration Algorithm For Reinforcement Learning in Zero-Sum
  Markov Games
A New Policy Iteration Algorithm For Reinforcement Learning in Zero-Sum Markov Games
Anna Winnicki
R. Srikant
34
1
0
17 Mar 2023
Planning and Learning with Adaptive Lookahead
Planning and Learning with Adaptive Lookahead
Aviv A. Rosenberg
Assaf Hallak
Shie Mannor
Gal Chechik
Gal Dalal
19
7
0
28 Jan 2022
The Role of Lookahead and Approximate Policy Evaluation in Reinforcement
  Learning with Linear Value Function Approximation
The Role of Lookahead and Approximate Policy Evaluation in Reinforcement Learning with Linear Value Function Approximation
Anna Winnicki
Joseph Lubars
Michael Livesay
R. Srikant
20
3
0
28 Sep 2021
Improve Agents without Retraining: Parallel Tree Search with Off-Policy
  Correction
Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Assaf Hallak
Gal Dalal
Steven Dalton
I. Frosio
Shie Mannor
Gal Chechik
OffRL
OnRL
35
9
0
04 Jul 2021
Heuristic-Guided Reinforcement Learning
Heuristic-Guided Reinforcement Learning
Ching-An Cheng
Andrey Kolobov
Adith Swaminathan
OffRL
30
61
0
05 Jun 2021
Discount Factor as a Regularizer in Reinforcement Learning
Discount Factor as a Regularizer in Reinforcement Learning
Ron Amit
Ron Meir
K. Ciosek
OffRL
17
71
0
04 Jul 2020
Trajectory-wise Control Variates for Variance Reduction in Policy
  Gradient Methods
Trajectory-wise Control Variates for Variance Reduction in Policy Gradient Methods
Ching-An Cheng
Xinyan Yan
Byron Boots
14
22
0
08 Aug 2019
Deep Policies for Width-Based Planning in Pixel Domains
Deep Policies for Width-Based Planning in Pixel Domains
Miquel Junyent
Anders Jonsson
Vicencc Gómez
36
10
0
12 Apr 2019
1