Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1710.11424
Cited By
Regret Minimization for Partially Observable Deep Reinforcement Learning
31 October 2017
Peter H. Jin
Kurt Keutzer
Sergey Levine
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Regret Minimization for Partially Observable Deep Reinforcement Learning"
13 / 13 papers shown
Title
A Survey on Self-play Methods in Reinforcement Learning
Ruize Zhang
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
51
8
0
02 Aug 2024
Bridging the Gap between Discrete Agent Strategies in Game Theory and Continuous Motion Planning in Dynamic Environments
Hongrui Zheng
Zhijun Zhuang
Stephanie Wu
Shuo Yang
Rahul Mangharam
30
1
0
17 Mar 2024
ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning
Yongsheng Mei
Hanhan Zhou
Tian-Shing Lan
35
11
0
11 Feb 2023
Let's Collaborate: Regret-based Reactive Synthesis for Robotic Manipulation
Karan Muvvala
Peter Amorese
Morteza Lahijanian
24
12
0
14 Mar 2022
Dual Behavior Regularized Reinforcement Learning
Chapman Siu
Jason M. Traish
R. Xu
OffRL
17
1
0
19 Sep 2021
Accelerating the Learning of TAMER with Counterfactual Explanations
Jakob Karalus
F. Lindner
OffRL
29
4
0
03 Aug 2021
Optimize Neural Fictitious Self-Play in Regret Minimization Thinking
Yuxuan Chen
Li Zhang
Shijian Li
Gang Pan
18
2
0
22 Apr 2021
Adversarial jamming attacks and defense strategies via adaptive deep reinforcement learning
Feng Wang
Chen Zhong
M. C. Gursoy
Senem Velipasalar
AAML
18
8
0
12 Jul 2020
DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
Eric Steinberger
Adam Lerer
Noam Brown
30
53
0
18 Jun 2020
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
37
188
0
23 Dec 2019
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
S. Srinivasan
Marc Lanctot
V. Zambaldi
Julien Perolat
K. Tuyls
Rémi Munos
Michael Bowling
6
148
0
21 Oct 2018
Expert-augmented actor-critic for ViZDoom and Montezumas Revenge
Michal Garmulewicz
Henryk Michalewski
Piotr Milos
16
8
0
10 Sep 2018
Online Convex Optimization for Sequential Decision Processes and Extensive-Form Games
Gabriele Farina
Christian Kroer
T. Sandholm
16
59
0
10 Sep 2018
1