Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.03137
Cited By
Cooperative Inverse Reinforcement Learning
9 June 2016
Dylan Hadfield-Menell
Anca Dragan
Pieter Abbeel
Stuart J. Russell
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Cooperative Inverse Reinforcement Learning"
13 / 13 papers shown
Title
Redefining Superalignment: From Weak-to-Strong Alignment to Human-AI Co-Alignment to Sustainable Symbiotic Society
Yi Zeng
Yijiao Wang
Enmeng Lu
Dongcheng Zhao
Bing Han
...
Chao Liu
Yaodong Yang
Yi Zeng
Boyuan Chen
Jinyu Fan
110
0
0
24 Apr 2025
Learning to Assist Humans without Inferring Rewards
Vivek Myers
Evan Ellis
Sergey Levine
Benjamin Eysenbach
Anca Dragan
79
3
0
17 Jan 2025
Solving Hierarchical Information-Sharing Dec-POMDPs: An Extensive-Form Game Approach
Johan Peralez
Aurélien Delage
Olivier Buffet
J. Dibangoye
59
1
0
03 Jan 2025
Problem Solving Through Human-AI Preference-Based Cooperation
Subhabrata Dutta
Timo Kaufmann
Goran Glavaš
Ivan Habernal
Kristian Kersting
Frauke Kreuter
Mira Mezini
Iryna Gurevych
Eyke Hüllermeier
Hinrich Schuetze
109
1
0
14 Aug 2024
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
70
6
0
06 Aug 2024
GOMA: Proactive Embodied Cooperative Communication via Goal-Oriented Mental Alignment
Lance Ying
Kunal Jha
Shivam Aarya
Joshua B. Tenenbaum
Antonio Torralba
Tianmin Shu
58
14
0
17 Mar 2024
Active teacher selection for reinforcement learning from human feedback
Rachel Freedman
Justin Svegliato
K. H. Wray
Stuart J. Russell
82
6
0
23 Oct 2023
Learning Formal Specifications from Membership and Preference Queries
Ameesh Shah
Marcell Vazquez-Chanlatte
Sebastian Junges
Sanjit A. Seshia
45
5
0
19 Jul 2023
Proportional Aggregation of Preferences for Sequential Decision Making
Nikhil Chandak
Shashwat Goel
Dominik Peters
71
12
0
26 Jun 2023
Reinforcement Learning with a Corrupted Reward Channel
Tom Everitt
Victoria Krakovna
Laurent Orseau
Marcus Hutter
Shane Legg
66
100
0
23 May 2017
Computational Rationalization: The Inverse Equilibrium Problem
Kevin Waugh
Brian Ziebart
J. Andrew Bagnell
37
82
0
15 Aug 2013
The Complexity of Decentralized Control of Markov Decision Processes
D. Bernstein
S. Zilberstein
N. Immerman
41
1,588
0
16 Jan 2013
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
134
3,196
0
02 Nov 2010
1