Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2007.02141
Cited By
Off-Policy Exploitability-Evaluation in Two-Player Zero-Sum Markov Games
4 July 2020
Kenshi Abe
Yusuke Kaneko
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Off-Policy Exploitability-Evaluation in Two-Player Zero-Sum Markov Games"
2 / 2 papers shown
Title
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes
Nathan Kallus
Masatoshi Uehara
OffRL
66
185
0
22 Aug 2019
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Philip S. Thomas
Emma Brunskill
OffRL
162
573
0
04 Apr 2016
1