ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.11592
  4. Cited By
Playing hard exploration games by watching YouTube

Playing hard exploration games by watching YouTube

29 May 2018
Y. Aytar
Tobias Pfaff
David Budden
T. Paine
Ziyun Wang
Nando de Freitas
ArXivPDFHTML

Papers citing "Playing hard exploration games by watching YouTube"

10 / 60 papers shown
Title
Hyperbolic Embeddings for Learning Options in Hierarchical Reinforcement Learning
Saket Tiwari
M. Prannoy
16
2
0
04 Dec 2018
Scalable agent alignment via reward modeling: a research direction
Scalable agent alignment via reward modeling: a research direction
Jan Leike
David M. Krueger
Tom Everitt
Miljan Martic
Vishal Maini
Shane Legg
34
396
0
19 Nov 2018
Episodic Curiosity through Reachability
Episodic Curiosity through Reachability
Nikolay Savinov
Anton Raichuk
Raphaël Marinier
Damien Vincent
Marc Pollefeys
Timothy Lillicrap
Sylvain Gelly
14
266
0
04 Oct 2018
SmartChoices: Hybridizing Programming and Machine Learning
SmartChoices: Hybridizing Programming and Machine Learning
Victor Carbune
Thierry Coppey
A. Daryin
Thomas Deselaers
Nikhil Sarda
J. Yagnik
16
2
0
01 Oct 2018
Sample Efficient Adaptive Text-to-Speech
Sample Efficient Adaptive Text-to-Speech
Yutian Chen
Yannis Assael
Brendan Shillingford
David Budden
Scott E. Reed
...
Ben Laurie
Çağlar Gülçehre
Aaron van den Oord
Oriol Vinyals
Nando de Freitas
35
149
0
27 Sep 2018
Expert-augmented actor-critic for ViZDoom and Montezumas Revenge
Expert-augmented actor-critic for ViZDoom and Montezumas Revenge
Michal Garmulewicz
Henryk Michalewski
Piotr Milos
16
8
0
10 Sep 2018
RUDDER: Return Decomposition for Delayed Rewards
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
30
212
0
20 Jun 2018
Fast Exploration with Simplified Models and Approximately Optimistic
  Planning in Model Based Reinforcement Learning
Fast Exploration with Simplified Models and Approximately Optimistic Planning in Model Based Reinforcement Learning
Ramtin Keramati
Jay Whang
Patrick Cho
Emma Brunskill
OffRL
23
7
0
01 Jun 2018
Imitating Latent Policies from Observation
Imitating Latent Policies from Observation
Ashley D. Edwards
Himanshu Sahni
Yannick Schroecker
Charles Isbell
34
137
0
21 May 2018
Not to Cry Wolf: Distantly Supervised Multitask Learning in Critical
  Care
Not to Cry Wolf: Distantly Supervised Multitask Learning in Critical Care
Patrick Schwab
E. Keller
C. Muroi
David J. Mack
C. Strässle
W. Karlen
26
23
0
14 Feb 2018
Previous
12