Learning more skills through optimistic exploration

Learning more skills through optimistic exploration

29 July 2021

David Warde-Farley

Papers citing "Learning more skills through optimistic exploration"

17 / 17 papers shown

Title
Do's and Don'ts: Learning Desirable Skills with Instruction Videos Hyunseung Kim ByungKun Lee Hojoon Lee Dongyoon Hwang Donghu Kim Jaegul Choo 120 1 0 01 Jun 2024
Deep Reinforcement Learning at the Edge of the Statistical Precipice Rishabh Agarwal Max Schwarzer Pablo Samuel Castro Aaron Courville Marc G. Bellemare OffRL 110 671 0 30 Aug 2021
Relative Variational Intrinsic Control Kate Baumli David Warde-Farley Steven Hansen Volodymyr Mnih 61 43 0 14 Dec 2020
One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL Saurabh Kumar Aviral Kumar Sergey Levine Chelsea Finn OffRL 62 94 0 27 Oct 2020
Automatic Curriculum Learning through Value Disagreement Yunzhi Zhang Pieter Abbeel Lerrel Pinto 70 107 0 17 Jun 2020
RIDE: Rewarding Impact-Driven Exploration for Procedurally-Generated Environments Roberta Raileanu Tim Rocktaschel 67 173 0 27 Feb 2020
Dota 2 with Large Scale Deep Reinforcement Learning OpenAI OpenAI : Christopher Berner Greg Brockman Brooke Chan ... Szymon Sidor Ilya Sutskever Jie Tang Filip Wolski Susan Zhang GNN VLM CLL AI4CE LRM 166 1,824 0 13 Dec 2019
MAVEN: Multi-Agent Variational Exploration Anuj Mahajan Tabish Rashid Mikayel Samvelyan Shimon Whiteson DRL 184 362 0 16 Oct 2019
Solving Rubik's Cube with a Robot Hand OpenAI Ilge Akkaya Marcin Andrychowicz Maciek Chociej Ma-teusz Litwin ... Peter Welinder Lilian Weng Qiming Yuan Wojciech Zaremba Lei Zhang ODL 116 1,230 0 16 Oct 2019
Unsupervised State Representation Learning in Atari Ankesh Anand Evan Racah Sherjil Ozair Yoshua Bengio Marc-Alexandre Côté R. Devon Hjelm SSL 56 255 0 19 Jun 2019
Self-Supervised Exploration via Disagreement Deepak Pathak Dhiraj Gandhi Abhinav Gupta SSL 81 382 0 10 Jun 2019
Unsupervised Control Through Non-Parametric Discriminative Rewards David Warde-Farley T. Wiele Tejas D. Kulkarni Catalin Ionescu Steven Hansen Volodymyr Mnih DRL OffRL SSL 81 177 0 28 Nov 2018
Model-Based Active Exploration Pranav Shyam Wojciech Ja'skowski Faustino J. Gomez 86 179 0 29 Oct 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures L. Espeholt Hubert Soyer Rémi Munos Karen Simonyan Volodymyr Mnih ... Vlad Firoiu Tim Harley Iain Dunning Shane Legg Koray Kavukcuoglu 218 1,600 0 05 Feb 2018
Deep Exploration via Randomized Value Functions Ian Osband Benjamin Van Roy Daniel Russo Zheng Wen 89 306 0 22 Mar 2017
Unifying Count-Based Exploration and Intrinsic Motivation Marc G. Bellemare S. Srinivasan Georg Ostrovski Tom Schaul D. Saxton Rémi Munos 174 1,478 0 06 Jun 2016
The Arcade Learning Environment: An Evaluation Platform for General Agents Marc G. Bellemare Yavar Naddaf J. Veness Michael Bowling 117 3,006 0 19 Jul 2012