Title
Learning to Drive in a Day Alex Kendall Jeffrey Hawke David Janz Przemyslaw Mazur Daniele Reda John M. Allen Vinh-Dieu Lam Alex Bewley Amar Shah 42 643 0 01 Jul 2018
RUDDER: Return Decomposition for Delayed Rewards Jose A. Arjona-Medina Michael Gillhofer Michael Widrich Thomas Unterthiner Johannes Brandstetter Sepp Hochreiter 33 213 0 20 Jun 2018
$ML + FV = $\heartsuit$? A Survey on the Application of Machine Learning to Formal Verification$ ML + FV = $\heartsuit$ ? A Survey on the Application of Machine Learning to Formal Verification Moussa Amrani L. Lucio Adrien Bibal 30 5 0 10 Jun 2018
Re-evaluating Evaluation David Balduzzi K. Tuyls Julien Perolat T. Graepel MoMe 30 97 0 07 Jun 2018
Fast Exploration with Simplified Models and Approximately Optimistic Planning in Model Based Reinforcement Learning Ramtin Keramati Jay Whang Patrick Cho Emma Brunskill OffRL 29 7 0 01 Jun 2018
Fast Policy Learning through Imitation and Reinforcement Ching-An Cheng Xinyan Yan Nolan Wagener Byron Boots 26 83 0 26 May 2018
A0C: Alpha Zero in Continuous Action Space Thomas M. Moerland Joost Broekens Aske Plaat Catholijn M. Jonker 16 48 0 24 May 2018
Bandit-Based Monte Carlo Optimization for Nearest Neighbors Vivek Bagaria Tavor Z. Baharav G. Kamath David Tse 13 12 0 21 May 2018
AGI Safety Literature Review Tom Everitt G. Lea Marcus Hutter AI4CE 36 115 0 03 May 2018
AI safety via debate G. Irving Paul Christiano Dario Amodei 204 203 0 02 May 2018
Monte Carlo Q-learning for General Game Playing Hui Wang M. Emmerich Aske Plaat GP 13 20 0 16 Feb 2018
ProofWatch: Watchlist Guidance for Large Theories in E Z. Goertzel Jan Jakubuv S. Schulz Josef Urban LRM 28 13 0 12 Feb 2018
Tunneling Neural Perception and Logic Reasoning through Abductive Learning Wang-Zhou Dai Qiu-Ling Xu Yang Yu Zhi-Hua Zhou LRM AI4CE 32 22 0 04 Feb 2018
Deep Reinforcement Learning using Capsules in Advanced Game Environments Per-Arne Andersen 18 16 0 29 Jan 2018
Innateness, AlphaZero, and Artificial Intelligence G. Marcus 30 73 0 17 Jan 2018
Building a Conversational Agent Overnight with Dialogue Self-Play Pararth Shah Dilek Z. Hakkani-Tür Gokhan Tur Abhinav Rastogi Ankur Bapna Neha Nayak Kennard Larry Heck 45 193 0 15 Jan 2018
Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes Igor Adamski R. Adamski T. Grel Adam Jedrych Kamil Kaczmarek Henryk Michalewski OffRL 41 37 0 09 Jan 2018