Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.16517
Cited By
RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space
21 October 2024
Jingdi Chen
Hanhan Zhou
Yongsheng Mei
Carlee Joe-Wong
Gina Adam
Nathaniel D. Bastian
Tian-Shing Lan
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space"
16 / 16 papers shown
Title
RGMComm: Return Gap Minimization via Discrete Communications in Multi-Agent Reinforcement Learning
Jingdi Chen
Tian-Shing Lan
Carlee Joe-Wong
56
15
0
07 Aug 2023
AccMER: Accelerating Multi-Agent Experience Replay with Cache Locality-aware Prioritization
Kailash Gogineni
Yongsheng Mei
Peng Wei
Tian-Shing Lan
Guru Venkataramani
72
13
0
31 May 2023
MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization
Yongsheng Mei
Hanhan Zhou
Tian-Shing Lan
Guru Venkataramani
Peng Wei
104
39
0
21 Feb 2023
Scalable Bayesian Inverse Reinforcement Learning
Alex J. Chan
M. Schaar
OffRL
BDL
81
67
0
12 Feb 2021
CDT: Cascading Decision Trees for Explainable Reinforcement Learning
Zihan Ding
Pablo Hernandez-Leal
G. Ding
Changjian Li
Ruitong Huang
44
21
0
15 Nov 2020
What Did You Think Would Happen? Explaining Agent Behaviour Through Intended Outcomes
Herman Yau
Chris Russell
Simon Hadfield
FAtt
LRM
42
38
0
10 Nov 2020
A Scalable MIP-based Method for Learning Optimal Multivariate Decision Trees
Haoran Zhu
Pavankumar Murali
Dzung Phan
Lam M. Nguyen
Jayant Kalagnanam
151
37
0
06 Nov 2020
Inverse Active Sensing: Modeling and Understanding Timely Decision-Making
Daniel Jarrett
M. Schaar
66
19
0
25 Jun 2020
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
GP
OffRL
223
1,377
0
15 Apr 2020
Soft Actor-Critic Algorithms and Applications
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
...
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
136
2,445
0
13 Dec 2018
Verifiable Reinforcement Learning via Policy Extraction
Osbert Bastani
Yewen Pu
Armando Solar-Lezama
OffRL
129
338
0
22 May 2018
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Ryan J. Lowe
Yi Wu
Aviv Tamar
J. Harb
Pieter Abbeel
Igor Mordatch
140
4,501
0
07 Jun 2017
Interpretable classifiers using rules and Bayesian analysis: Building a better stroke prediction model
Benjamin Letham
Cynthia Rudin
Tyler H. McCormick
D. Madigan
FAtt
72
743
0
05 Nov 2015
ranger: A Fast Implementation of Random Forests for High Dimensional Data in C++ and R
Marvin N. Wright
A. Ziegler
267
2,787
0
18 Aug 2015
The Complexity of Decentralized Control of Markov Decision Processes
D. Bernstein
S. Zilberstein
N. Immerman
108
1,592
0
16 Jan 2013
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
231
3,232
0
02 Nov 2010
1