Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.12359
Cited By
Targeted Search Control in AlphaZero for Effective Policy Improvement
23 February 2023
Alexandre Trudeau
Michael Bowling
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Targeted Search Control in AlphaZero for Effective Policy Improvement"
3 / 3 papers shown
Title
First return, then explore
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
71
360
0
27 Apr 2020
OpenSpiel: A Framework for Reinforcement Learning in Games
Marc Lanctot
Edward Lockhart
Jean-Baptiste Lespiau
V. Zambaldi
Satyaki Upadhyay
...
Julian Schrittwieser
Thomas W. Anthony
Edward Hughes
Ivo Danihelka
Jonah Ryan-Davis
OffRL
92
250
0
26 Aug 2019
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
125
12,227
0
19 Dec 2013
1