Here's What I've Learned: Asking Questions that Reveal Reward Learning

2 July 2021

Papers citing "Here's What I've Learned: Asking Questions that Reveal Reward Learning"

24 / 24 papers shown

Title
VIEW: Visual Imitation Learning with Waypoints Ananth Jonnavittula Sagar Parekh Dylan P. Losey SSL 132 10 0 27 Apr 2024
LIMIT: Learning Interfaces to Maximize Information Transfer Benjamin A. Christie Dylan P. Losey 51 5 0 17 Apr 2023
I Know What You Meant: Learning Human Objectives by (Under)estimating Their Choice Set Ananth Jonnavittula Dylan P. Losey 32 16 0 11 Nov 2020
Learning Reward Functions from Diverse Sources of Human Feedback: Optimally Integrating Demonstrations and Preferences Erdem Biyik Dylan P. Losey Malayandi Palan Nicholas C. Landolfi Gleb Shevchuk Dorsa Sadigh 48 118 0 24 Jun 2020
Active Preference Learning using Maximum Regret Nils Wilde Dana Kulić Stephen L. Smith GP 30 44 0 08 May 2020
Interactive Robot Training for Non-Markov Tasks Ankit J. Shah Samir Wadhwania J. Shah 28 15 0 04 Mar 2020
Reward-rational (implicit) choice: A unifying formalism for reward learning Hong Jun Jeon S. Milli Anca Dragan 62 177 0 12 Feb 2020
When Humans Aren't Optimal: Robots that Collaborate with Risk-Aware Humans Minae Kwon Erdem Biyik Aditi Talati Karan Bhasin Dylan P. Losey Dorsa Sadigh 159 98 0 13 Jan 2020
Four Years in Review: Statistical Practices of Likert Scales in Human-Robot Interaction Studies Mariah L. Schrum Michael Johnson Muyleng Ghuy Matthew C. Gombolay 20 68 0 09 Jan 2020
Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI Alejandro Barredo Arrieta Natalia Díaz Rodríguez Javier Del Ser Adrien Bennetot Siham Tabik ... S. Gil-Lopez Daniel Molina Richard Benjamins Raja Chatila Francisco Herrera XAI 116 6,254 0 22 Oct 2019
Asking Easy Questions: A User-Friendly Approach to Active Reward Learning Erdem Biyik Malayandi Palan Nicholas C. Landolfi Dylan P. Losey Dorsa Sadigh 39 116 0 10 Oct 2019
Preference-Based Learning for Exoskeleton Gait Optimization Maegan Tucker Ellen R. Novoseller Claudia K. Kann Yanan Sui Yisong Yue J. W. Burdick Aaron D. Ames 98 90 0 26 Sep 2019
Robots that Take Advantage of Human Trust Dylan P. Losey Dorsa Sadigh 126 17 0 12 Sep 2019
Active Learning within Constrained Environments through Imitation of an Expert Questioner Kalesha Bullard Yannick Schroecker Sonia Chernova 102 19 0 01 Jul 2019
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations Daniel S. Brown Wonjoon Goo P. Nagarajan S. Niekum 71 355 0 12 Apr 2019
An Algorithmic Perspective on Imitation Learning Takayuki Osa Joni Pajarinen Gerhard Neumann J. Andrew Bagnell Pieter Abbeel Jan Peters 83 843 0 16 Nov 2018
Reward learning from human preferences and demonstrations in Atari Borja Ibarz Jan Leike Tobias Pohlen G. Irving Shane Legg Dario Amodei 82 393 0 15 Nov 2018
Expressing Robot Incapability Minae Kwon Sandy H. Huang Anca Dragan 39 125 0 18 Oct 2018
Learning from Richer Human Guidance: Augmenting Comparison-Based Learning with Feature Queries Chandrayee Basu M. Singhal Anca Dragan 55 57 0 05 Feb 2018
Planning with Verbal Communication for Human-Robot Collaboration Stefanos Nikolaidis Minae Kwon Jodi Forlizzi S. Srinivasa 28 66 0 14 Jun 2017
Deep reinforcement learning from human preferences Paul Christiano Jan Leike Tom B. Brown Miljan Martic Shane Legg Dario Amodei 153 3,296 0 12 Jun 2017
Enabling Robots to Communicate their Objectives Sandy H. Huang David Held Pieter Abbeel Anca Dragan 48 161 0 11 Feb 2017
Early Detection of Combustion Instabilities using Deep Convolutional Selective Autoencoders on Hi-speed Flame Video Chandrayee Basu Qian Yang M. Singhal Anca Dragan 74 174 0 25 Mar 2016
Learning Preferences for Manipulation Tasks from Online Coactive Feedback Ashesh Jain Shikhar Sharma Thorsten Joachims Ashutosh Saxena 67 116 0 05 Jan 2016