Regret Minimization for Partially Observable Deep Reinforcement Learning

31 October 2017

Papers citing "Regret Minimization for Partially Observable Deep Reinforcement Learning"

13 / 13 papers shown

Title
A Survey on Self-play Methods in Reinforcement Learning Ruize Zhang Zelai Xu Chengdong Ma Chao Yu Weijuan Tu ... Deheng Ye Wenbo Ding Yaodong Yang Yu Wang Yu Wang SyDa SSL OnRL 51 8 0 02 Aug 2024
Bridging the Gap between Discrete Agent Strategies in Game Theory and Continuous Motion Planning in Dynamic Environments Hongrui Zheng Zhijun Zhuang Stephanie Wu Shuo Yang Rahul Mangharam 30 1 0 17 Mar 2024
ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning Yongsheng Mei Hanhan Zhou Tian-Shing Lan 35 11 0 11 Feb 2023
Let's Collaborate: Regret-based Reactive Synthesis for Robotic Manipulation Karan Muvvala Peter Amorese Morteza Lahijanian 24 12 0 14 Mar 2022
Dual Behavior Regularized Reinforcement Learning Chapman Siu Jason M. Traish R. Xu OffRL 17 1 0 19 Sep 2021
Accelerating the Learning of TAMER with Counterfactual Explanations Jakob Karalus F. Lindner OffRL 29 4 0 03 Aug 2021
Optimize Neural Fictitious Self-Play in Regret Minimization Thinking Yuxuan Chen Li Zhang Shijian Li Gang Pan 18 2 0 22 Apr 2021
Adversarial jamming attacks and defense strategies via adaptive deep reinforcement learning Feng Wang Chen Zhong M. C. Gursoy Senem Velipasalar AAML 18 8 0 12 Jul 2020
DREAM: Deep Regret minimization with Advantage baselines and Model-free learning Eric Steinberger Adam Lerer Noam Brown 30 53 0 18 Jun 2020
A Survey of Deep Reinforcement Learning in Video Games Kun Shao Zhentao Tang Yuanheng Zhu Nannan Li Dongbin Zhao OffRL AI4TS 37 188 0 23 Dec 2019
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments S. Srinivasan Marc Lanctot V. Zambaldi Julien Perolat K. Tuyls Rémi Munos Michael Bowling 6 148 0 21 Oct 2018
Expert-augmented actor-critic for ViZDoom and Montezumas Revenge Michal Garmulewicz Henryk Michalewski Piotr Milos 16 8 0 10 Sep 2018
Online Convex Optimization for Sequential Decision Processes and Extensive-Form Games Gabriele Farina Christian Kroer T. Sandholm 16 59 0 10 Sep 2018