Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2209.13966
Cited By
SoftTreeMax: Policy Gradient with Tree Search
28 September 2022
Gal Dalal
Assaf Hallak
Shie Mannor
Gal Chechik
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SoftTreeMax: Policy Gradient with Tree Search"
2 / 2 papers shown
Title
Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Assaf Hallak
Gal Dalal
Steven Dalton
I. Frosio
Shie Mannor
Gal Chechik
OffRL
OnRL
35
9
0
04 Jul 2021
On the Convergence and Sample Efficiency of Variance-Reduced Policy Gradient Method
Junyu Zhang
Chengzhuo Ni
Zheng Yu
Csaba Szepesvári
Mengdi Wang
44
67
0
17 Feb 2021
1