ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.03969
  4. Cited By
Explore, Exploit or Listen: Combining Human Feedback and Policy Model to
  Speed up Deep Reinforcement Learning in 3D Worlds

Explore, Exploit or Listen: Combining Human Feedback and Policy Model to Speed up Deep Reinforcement Learning in 3D Worlds

12 September 2017
Zhiyu Lin
Brent Harrison
A. Keech
Mark O. Riedl
ArXivPDFHTML

Papers citing "Explore, Exploit or Listen: Combining Human Feedback and Policy Model to Speed up Deep Reinforcement Learning in 3D Worlds"

14 / 14 papers shown
Title
Beyond Following: Mixing Active Initiative into Computational Creativity
Beyond Following: Mixing Active Initiative into Computational Creativity
Zhiyu Lin
Upol Ehsan
Rohan Agarwal
Samihan Dani
Vidushi Vashishth
Mark O. Riedl
46
0
0
06 Sep 2024
A Framework for Learning from Demonstration with Minimal Human Effort
A Framework for Learning from Demonstration with Minimal Human Effort
Marc Rigter
Bruno Lacerda
Nick Hawes
31
28
0
15 Jun 2023
Reinforcement Learning With Reward Machines in Stochastic Games
Reinforcement Learning With Reward Machines in Stochastic Games
Jueming Hu
Jean-Raphael Gaglione
Yanze Wang
Zhe Xu
Ufuk Topcu
Yongming Liu
18
1
0
27 May 2023
Reinforcement Learning for UAV control with Policy and Reward Shaping
Reinforcement Learning for UAV control with Policy and Reward Shaping
Cristian Millán-Arias
R. Contreras
Francisco Cruz
Bruno José Torres Fernandes
OffRL
23
1
0
06 Dec 2022
A Brief Guide to Designing and Evaluating Human-Centered Interactive
  Machine Learning
A Brief Guide to Designing and Evaluating Human-Centered Interactive Machine Learning
K. Mathewson
P. Pilarski
HAI
21
4
0
20 Apr 2022
Training Value-Aligned Reinforcement Learning Agents Using a Normative
  Prior
Training Value-Aligned Reinforcement Learning Agents Using a Normative Prior
Md Sultan al Nahian
Spencer Frazier
Brent Harrison
Mark O. Riedl
27
18
0
19 Apr 2021
Improving Deep Reinforcement Learning in Minecraft with Action Advice
Improving Deep Reinforcement Learning in Minecraft with Action Advice
Spencer Frazier
Mark O. Riedl
19
29
0
02 Aug 2019
Why Build an Assistant in Minecraft?
Why Build an Assistant in Minecraft?
Arthur Szlam
Jonathan Gray
Kavya Srinet
Yacine Jernite
Armand Joulin
...
Siddharth Goyal
Demi Guo
Dan Rothermel
C. L. Zitnick
Jason Weston
LLMAG
30
29
0
22 Jul 2019
A Human-Centered Approach to Interactive Machine Learning
A Human-Centered Approach to Interactive Machine Learning
K. Mathewson
16
7
0
15 May 2019
Meta-learners' learning dynamics are unlike learners'
Meta-learners' learning dynamics are unlike learners'
Neil C. Rabinowitz
OffRL
31
16
0
03 May 2019
Robot Learning via Human Adversarial Games
Robot Learning via Human Adversarial Games
Jiali Duan
Qian Wang
Lerrel Pinto
C.-C. Jay Kuo
Stefanos Nikolaidis
AAML
SSL
22
7
0
02 Mar 2019
Parenting: Safe Reinforcement Learning from Human Input
Parenting: Safe Reinforcement Learning from Human Input
Christopher Frye
Ilya Feige
14
7
0
18 Feb 2019
Reward learning from human preferences and demonstrations in Atari
Reward learning from human preferences and demonstrations in Atari
Borja Ibarz
Jan Leike
Tobias Pohlen
G. Irving
Shane Legg
Dario Amodei
33
387
0
15 Nov 2018
Shared Autonomy via Deep Reinforcement Learning
Shared Autonomy via Deep Reinforcement Learning
S. Reddy
Anca Dragan
Sergey Levine
21
177
0
06 Feb 2018
1