ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2107.01995
  4. Cited By
Here's What I've Learned: Asking Questions that Reveal Reward Learning

Here's What I've Learned: Asking Questions that Reveal Reward Learning

2 July 2021
Soheil Habibian
Ananth Jonnavittula
Dylan P. Losey
ArXivPDFHTML

Papers citing "Here's What I've Learned: Asking Questions that Reveal Reward Learning"

24 / 24 papers shown
Title
VIEW: Visual Imitation Learning with Waypoints
VIEW: Visual Imitation Learning with Waypoints
Ananth Jonnavittula
Sagar Parekh
Dylan P. Losey
SSL
132
10
0
27 Apr 2024
LIMIT: Learning Interfaces to Maximize Information Transfer
LIMIT: Learning Interfaces to Maximize Information Transfer
Benjamin A. Christie
Dylan P. Losey
51
5
0
17 Apr 2023
I Know What You Meant: Learning Human Objectives by (Under)estimating
  Their Choice Set
I Know What You Meant: Learning Human Objectives by (Under)estimating Their Choice Set
Ananth Jonnavittula
Dylan P. Losey
32
16
0
11 Nov 2020
Learning Reward Functions from Diverse Sources of Human Feedback:
  Optimally Integrating Demonstrations and Preferences
Learning Reward Functions from Diverse Sources of Human Feedback: Optimally Integrating Demonstrations and Preferences
Erdem Biyik
Dylan P. Losey
Malayandi Palan
Nicholas C. Landolfi
Gleb Shevchuk
Dorsa Sadigh
48
118
0
24 Jun 2020
Active Preference Learning using Maximum Regret
Active Preference Learning using Maximum Regret
Nils Wilde
Dana Kulić
Stephen L. Smith
GP
30
44
0
08 May 2020
Interactive Robot Training for Non-Markov Tasks
Interactive Robot Training for Non-Markov Tasks
Ankit J. Shah
Samir Wadhwania
J. Shah
28
15
0
04 Mar 2020
Reward-rational (implicit) choice: A unifying formalism for reward
  learning
Reward-rational (implicit) choice: A unifying formalism for reward learning
Hong Jun Jeon
S. Milli
Anca Dragan
62
177
0
12 Feb 2020
When Humans Aren't Optimal: Robots that Collaborate with Risk-Aware
  Humans
When Humans Aren't Optimal: Robots that Collaborate with Risk-Aware Humans
Minae Kwon
Erdem Biyik
Aditi Talati
Karan Bhasin
Dylan P. Losey
Dorsa Sadigh
159
98
0
13 Jan 2020
Four Years in Review: Statistical Practices of Likert Scales in
  Human-Robot Interaction Studies
Four Years in Review: Statistical Practices of Likert Scales in Human-Robot Interaction Studies
Mariah L. Schrum
Michael Johnson
Muyleng Ghuy
Matthew C. Gombolay
20
68
0
09 Jan 2020
Explainable Artificial Intelligence (XAI): Concepts, Taxonomies,
  Opportunities and Challenges toward Responsible AI
Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI
Alejandro Barredo Arrieta
Natalia Díaz Rodríguez
Javier Del Ser
Adrien Bennetot
Siham Tabik
...
S. Gil-Lopez
Daniel Molina
Richard Benjamins
Raja Chatila
Francisco Herrera
XAI
116
6,254
0
22 Oct 2019
Asking Easy Questions: A User-Friendly Approach to Active Reward
  Learning
Asking Easy Questions: A User-Friendly Approach to Active Reward Learning
Erdem Biyik
Malayandi Palan
Nicholas C. Landolfi
Dylan P. Losey
Dorsa Sadigh
39
116
0
10 Oct 2019
Preference-Based Learning for Exoskeleton Gait Optimization
Preference-Based Learning for Exoskeleton Gait Optimization
Maegan Tucker
Ellen R. Novoseller
Claudia K. Kann
Yanan Sui
Yisong Yue
J. W. Burdick
Aaron D. Ames
98
90
0
26 Sep 2019
Robots that Take Advantage of Human Trust
Robots that Take Advantage of Human Trust
Dylan P. Losey
Dorsa Sadigh
126
17
0
12 Sep 2019
Active Learning within Constrained Environments through Imitation of an
  Expert Questioner
Active Learning within Constrained Environments through Imitation of an Expert Questioner
Kalesha Bullard
Yannick Schroecker
Sonia Chernova
102
19
0
01 Jul 2019
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement
  Learning from Observations
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations
Daniel S. Brown
Wonjoon Goo
P. Nagarajan
S. Niekum
71
355
0
12 Apr 2019
An Algorithmic Perspective on Imitation Learning
An Algorithmic Perspective on Imitation Learning
Takayuki Osa
Joni Pajarinen
Gerhard Neumann
J. Andrew Bagnell
Pieter Abbeel
Jan Peters
83
843
0
16 Nov 2018
Reward learning from human preferences and demonstrations in Atari
Reward learning from human preferences and demonstrations in Atari
Borja Ibarz
Jan Leike
Tobias Pohlen
G. Irving
Shane Legg
Dario Amodei
82
393
0
15 Nov 2018
Expressing Robot Incapability
Expressing Robot Incapability
Minae Kwon
Sandy H. Huang
Anca Dragan
39
125
0
18 Oct 2018
Learning from Richer Human Guidance: Augmenting Comparison-Based
  Learning with Feature Queries
Learning from Richer Human Guidance: Augmenting Comparison-Based Learning with Feature Queries
Chandrayee Basu
M. Singhal
Anca Dragan
55
57
0
05 Feb 2018
Planning with Verbal Communication for Human-Robot Collaboration
Planning with Verbal Communication for Human-Robot Collaboration
Stefanos Nikolaidis
Minae Kwon
Jodi Forlizzi
S. Srinivasa
28
66
0
14 Jun 2017
Deep reinforcement learning from human preferences
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
153
3,296
0
12 Jun 2017
Enabling Robots to Communicate their Objectives
Enabling Robots to Communicate their Objectives
Sandy H. Huang
David Held
Pieter Abbeel
Anca Dragan
48
161
0
11 Feb 2017
Early Detection of Combustion Instabilities using Deep Convolutional
  Selective Autoencoders on Hi-speed Flame Video
Early Detection of Combustion Instabilities using Deep Convolutional Selective Autoencoders on Hi-speed Flame Video
Chandrayee Basu
Qian Yang
M. Singhal
Anca Dragan
74
174
0
25 Mar 2016
Learning Preferences for Manipulation Tasks from Online Coactive
  Feedback
Learning Preferences for Manipulation Tasks from Online Coactive Feedback
Ashesh Jain
Shikhar Sharma
Thorsten Joachims
Ashutosh Saxena
67
116
0
05 Jan 2016
1