Title
Using Unity to Help Solve Intelligence Tom Ward Andrew Bolt Nik Hemmings Simon Carter Manuel Sanchez ... Jay Lemmon J. Coe Piotr Trochim T. Handley Adrian Bolton 66 18 0 18 Nov 2020
Phoebe: Reuse-Aware Online Caching with Reinforcement Learning for Emerging Storage Models Nan Wu Pengcheng Li 53 7 0 13 Nov 2020
Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences Bowen Baker LRM 69 37 0 10 Nov 2020
Playing optical tweezers with deep reinforcement learning: in virtual, physical and augmented environments M. Praeger Yunhui Xie J. Grant-Jacob R. Eason B. Mills 64 12 0 05 Nov 2020
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping Yujing Hu Weixun Wang Hangtian Jia Yixiang Wang Yingfeng Chen Jianye Hao Feng Wu Changjie Fan OffRL 101 180 0 05 Nov 2020
Meta-trained agents implement Bayes-optimal agents Vladimir Mikulik Grégoire Delétang Tom McGrath Tim Genewein Miljan Martic Shane Legg Pedro A. Ortega OOD FedML 88 43 0 21 Oct 2020
Negotiating Team Formation Using Deep Reinforcement Learning Yoram Bachrach Richard Everett Edward Hughes Angeliki Lazaridou Joel Z Leibo Marc Lanctot Michael Bradley Johanson Wojciech M. Czarnecki T. Graepel 109 36 0 20 Oct 2020
Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration Xavier Puig Tianmin Shu Shuang Li Zilin Wang Yuan-Hong Liao J. Tenenbaum Sanja Fidler Antonio Torralba LM&Ro 163 130 0 19 Oct 2020
Multi-Agent Collaboration via Reward Attribution Decomposition Tianjun Zhang Huazhe Xu Xiaolong Wang Yi Wu Kurt Keutzer Joseph E. Gonzalez Yuandong Tian 73 40 0 16 Oct 2020
RODE: Learning Roles to Decompose Multi-Agent Tasks Tonghan Wang Tarun Gupta Anuj Mahajan Bei Peng Shimon Whiteson Chongjie Zhang OffRL 118 211 0 04 Oct 2020
Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban Peter Karkus M. Berk Mirza A. Guez Andrew Jaegle Timothy Lillicrap Lars Buesing N. Heess T. Weber OffRL 67 8 0 03 Oct 2020
Learning to Play against Any Mixture of Opponents Max O. Smith Thomas W. Anthony Yongzhao Wang Michael P. Wellman OffRL 75 9 0 29 Sep 2020
Symbolic Relational Deep Reinforcement Learning based on Graph Neural Networks and Autoregressive Policy Decomposition Jaromír Janisch Tomávs Pevný Viliam Lisý AI4CE 91 3 0 25 Sep 2020
Decoupling Representation Learning from Reinforcement Learning Adam Stooke Kimin Lee Pieter Abbeel Michael Laskin SSL DRL 380 346 0 14 Sep 2020
Efficient Competitive Self-Play Policy Optimization Yuanyi Zhong Yuanshuo Zhou Jian Peng 13 2 0 13 Sep 2020
RLCFR: Minimize Counterfactual Regret by Deep Reinforcement Learning Huale Li Xinyu Wang Fengwei Jia Yifan Li Yulin Wu Jia-jia Zhang Shuhan Qi 29 6 0 10 Sep 2020
A community-powered search of machine learning strategy space to find NMR property prediction models Lars A. Bratholm W. Gerrard Brandon M. Anderson Shaojie Bai Sunghwan Choi ... A. Torrubia Devin Willmott C. Butts David R. Glowacki Kaggle participants 46 17 0 13 Aug 2020
Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey Aske Plaat W. Kosters Mike Preuss BDL OffRL 127 17 0 11 Aug 2020
Off-Policy Multi-Agent Decomposed Policy Gradients Yihan Wang Beining Han Tonghan Wang Heng Dong Chongjie Zhang 102 181 0 24 Jul 2020
Dynamic Relational Inference in Multi-Agent Trajectories Ruichao Xiao Manish Kumar Singh Rose Yu 125 2 0 16 Jul 2020
A Cordial Sync: Going Beyond Marginal Policies for Multi-Agent Embodied Tasks Unnat Jain Luca Weihs Eric Kolve Ali Farhadi Svetlana Lazebnik Aniruddha Kembhavi Alex Schwing 86 58 0 09 Jul 2020
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods Adam Stooke Joshua Achiam Pieter Abbeel 112 302 0 08 Jul 2020
Deep Reinforcement Learning and its Neuroscientific Implications M. Botvinick Jane X. Wang Will Dabney Kevin J. Miller Z. Kurth-Nelson OffRL AI4CE 97 176 0 07 Jul 2020
Scaling Imitation Learning in Minecraft Artemij Amiranashvili Nicolai Dorka Wolfram Burgard V. Koltun Thomas Brox MLAU 53 15 0 06 Jul 2020
Perception-Prediction-Reaction Agents for Deep Reinforcement Learning Adam Stooke Valentin Dalibard Siddhant M. Jayakumar Wojciech M. Czarnecki Max Jaderberg 53 1 0 26 Jun 2020
A Benchmarking Framework for Interactive 3D Applications in the Cloud Tianyi Liu Sen He Sunzhou Huang Danny H. K. Tsang Lingjia Tang Jason Mars Wei Wang 18 11 0 23 Jun 2020
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning Aleksei Petrenko Zhehui Huang T. Kumar Gaurav Sukhatme V. Koltun 103 105 0 21 Jun 2020
Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games Stephen Marcus McAleer John Lanier Roy Fox Pierre Baldi 63 78 0 15 Jun 2020
Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning Filippos Christianos Lukas Schafer Stefano V. Albrecht 150 170 0 12 Jun 2020
Learning to Incentivize Other Learning Agents Jiachen Yang Ang Li Mehrdad Farajtabar P. Sunehag Edward Hughes H. Zha 102 70 0 10 Jun 2020
Learning to Play No-Press Diplomacy with Best Response Policy Iteration Thomas W. Anthony Tom Eccles Andrea Tacchetti János Kramár I. Gemp ... Richard Everett Roman Werpachowski Satinder Singh T. Graepel Yoram Bachrach 106 43 0 08 Jun 2020
A Comparison of Self-Play Algorithms Under a Generalized Framework Daniel Hernández Kevin Denamganai Sam Devlin Spyridon Samothrakis James Alfred Walker 61 12 0 08 Jun 2020
Causality and Batch Reinforcement Learning: Complementary Approaches To Planning In Unknown Domains James Bannon Bradford T. Windsor Wenbo Song Tao Li CML OOD OffRL 73 20 0 03 Jun 2020
Manipulating the Distributions of Experience used for Self-Play Learning in Expert Iteration Dennis J. N. J. Soemers Éric Piette Matthew Stephenson C. Browne OffRL 72 6 0 30 May 2020
Accelerating Deep Neuroevolution on Distributed FPGAs for Reinforcement Learning Problems Alexis Asseman Nicolas Antoine A. Ozcan 29 4 0 10 May 2020
Navigating the Landscape of Multiplayer Games Shayegan Omidshafiei K. Tuyls Wojciech M. Czarnecki Francisco C. Santos Mark Rowland ... Paul Muller Julien Perolat Bart De Vylder A. Gruslys Rémi Munos 32 2 0 04 May 2020
The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies Stephan Zheng Alexander R. Trott Sunil Srinivasa Nikhil Naik Melvin Gruesbeck David C. Parkes R. Socher 60 136 0 28 Apr 2020
Real World Games Look Like Spinning Tops Wojciech M. Czarnecki Gauthier Gidel Brendan D. Tracey K. Tuyls Shayegan Omidshafiei David Balduzzi Max Jaderberg 75 101 0 20 Apr 2020
Energy-Based Imitation Learning Minghuan Liu Tairan He Minkai Xu Weinan Zhang 118 48 0 20 Apr 2020
Meta-Learning in Neural Networks: A Survey Timothy M. Hospedales Antreas Antoniou P. Micaelli Amos Storkey OOD 420 2,005 0 11 Apr 2020
CURL: Contrastive Unsupervised Representations for Reinforcement Learning A. Srinivas Michael Laskin Pieter Abbeel SSL DRL OffRL 148 1,097 0 08 Apr 2020
How Do You Act? An Empirical Study to Understand Behavior of Deep Reinforcement Learning Agents Richard Meyes Moritz Schneider Tobias Meisen 55 2 0 07 Apr 2020
Obstacle Tower Without Human Demonstrations: How Far a Deep Feed-Forward Network Goes with Reinforcement Learning Marco Pleines J. Jitsev Mike Preuss Frank Zimmer 49 2 0 01 Apr 2020
Fiber: A Platform for Efficient Development and Distributed Training for Reinforcement Learning and Population-Based Methods Jiale Zhi Rui Wang Jeff Clune Kenneth O. Stanley OffRL 72 12 0 25 Mar 2020
Multi-Agent Reinforcement Learning for Problems with Combined Individual and Team Reward Hassam Sheikh Ladislau Bölöni 105 37 0 24 Mar 2020
Decentralized MCTS via Learned Teammate Models A. Czechowski F. Oliehoek 451 19 0 19 Mar 2020
Accelerating and Improving AlphaZero Using Population Based Training Ti-Rong Wu Ting Han Wei I-Chen Wu 41 17 0 13 Mar 2020
Human AI interaction loop training: New approach for interactive reinforcement learning N. Navidi 41 6 0 09 Mar 2020
FormulaZero: Distributionally Robust Online Adaptation via Offline Population Synthesis Aman Sinha Matthew O'Kelly Hongrui Zheng Rahul Mangharam John C. Duchi Russ Tedrake OffRL 132 27 0 09 Mar 2020
RIDE: Rewarding Impact-Driven Exploration for Procedurally-Generated Environments Roberta Raileanu Tim Rocktaschel 88 174 0 27 Feb 2020