Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2007.13544
Cited By
Combining Deep Reinforcement Learning and Search for Imperfect-Information Games
27 July 2020
Noam Brown
A. Bakhtin
Adam Lerer
Qucheng Gong
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Combining Deep Reinforcement Learning and Search for Imperfect-Information Games"
30 / 30 papers shown
Title
Automated Meta Prompt Engineering for Alignment with the Theory of Mind
Aaron Baughman
Rahul Agarwal
Eduardo Morales
Gozde Akay
36
0
0
13 May 2025
Approximating Nash Equilibria in General-Sum Games via Meta-Learning
David Sychrovský
Christopher Solinas
Revan MacQueen
Kevin Wang
James Wright
Nathan R Sturtevant
Michael Bowling
21
0
0
26 Apr 2025
Learning Nash Equilibrial Hamiltonian for Two-Player Collision-Avoiding Interactions
Lei Zhang
Siddharth Das
Tanner Merry
Wenlong Zhang
Yi Ren
54
0
0
10 Mar 2025
Two-Player Zero-Sum Differential Games with One-Sided Information
Mukesh Ghimire
Z. Xu
Yi Ren
SyDa
95
0
0
17 Feb 2025
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling
Bradley Brown
Jordan Juravsky
Ryan Ehrlich
Ronald Clark
Quoc V. Le
Christopher Ré
Azalia Mirhoseini
ALM
LRM
95
224
0
03 Jan 2025
Beyond Autoregression: Fast LLMs via Self-Distillation Through Time
Justin Deschenaux
Çağlar Gülçehre
47
2
0
28 Oct 2024
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
51
8
0
02 Aug 2024
A Meta-Game Evaluation Framework for Deep Multiagent Reinforcement Learning
Zun Li
Michael P. Wellman
34
1
0
30 Apr 2024
RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning
Boning Li
Zhixuan Fang
Longbo Huang
20
0
0
07 Mar 2024
State-Constrained Zero-Sum Differential Games with One-Sided Information
Mukesh Ghimire
Lei Zhang
Zhenni Xu
Yi Ren
44
2
0
05 Mar 2024
Deciphering Digital Detectives: Understanding LLM Behaviors and Capabilities in Multi-Agent Mystery Games
Dekun Wu
Haochen Shi
Zhiyuan Sun
Bang Liu
LLMAG
29
16
0
01 Dec 2023
Zero-sum Polymatrix Markov Games: Equilibrium Collapse and Efficient Computation of Nash Equilibria
Fivos Kalogiannis
Ioannis Panageas
34
8
0
23 May 2023
Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious Play
Wei Xi
Yongxin Zhang
Changnan Xiao
Xuefeng Huang
Shihong Deng
Haowei Liang
Jie Chen
Peng Sun
OffRL
50
8
0
07 Mar 2023
Learning not to Regret
David Sychrovský
Michal Sustr
Elnaz Davoodi
Michael Bowling
Marc Lanctot
Martin Schmid
34
3
0
02 Mar 2023
Single-Call Stochastic Extragradient Methods for Structured Non-monotone Variational Inequalities: Improved Analysis under Weaker Conditions
S. Choudhury
Eduard A. Gorbunov
Nicolas Loizou
27
13
0
27 Feb 2023
DITTO: Offline Imitation Learning with World Models
Branton DeMoss
Paul Duckworth
Nick Hawes
Ingmar Posner
Ingmar Posner
OffRL
21
18
0
06 Feb 2023
Abstracting Imperfect Information Away from Two-Player Zero-Sum Games
Samuel Sokota
Ryan DÓrazio
Chun Kai Ling
David J. Wu
J. Zico Kolter
Noam Brown
27
4
0
22 Jan 2023
DanZero: Mastering GuanDan Game with Reinforcement Learning
Yudong Lu
Jian Zhao
Youpeng Zhao
Wen-gang Zhou
Houqiang Li
11
6
0
31 Oct 2022
Efficiently Computing Nash Equilibria in Adversarial Team Markov Games
Fivos Kalogiannis
Ioannis Anagnostides
Ioannis Panageas
Emmanouil-Vasileios Vlatakis-Gkaragkounis
Vaggos Chatziafratis
S. Stavroulakis
39
13
0
03 Aug 2022
Self-Explaining Deviations for Coordination
Hengyuan Hu
Samuel Sokota
David J. Wu
A. Bakhtin
Andrei Lupu
Brandon Cui
Jakob N. Foerster
27
2
0
13 Jul 2022
Approximating Discontinuous Nash Equilibrial Values of Two-Player General-Sum Differential Games
Lei Zhang
Mukesh Ghimire
Wenlong Zhang
Zhenni Xu
Yi Ren
27
7
0
05 Jul 2022
Improving Bidding and Playing Strategies in the Trick-Taking game Wizard using Deep Q-Networks
Jonas Schumacher
Marco Pleines
32
0
0
27 May 2022
Symphony: Learning Realistic and Diverse Agents for Autonomous Driving Simulation
Maximilian Igl
Daewoo Kim
Alex Kuefler
Paul Mougin
Punit Shah
K. Shiarlis
Drago Anguelov
Mark Palatucci
Brandyn White
Shimon Whiteson
35
64
0
06 May 2022
Explainable Biomedical Recommendations via Reinforcement Learning Reasoning on Knowledge Graphs
G. Edwards
Sebastian Nilsson
Benedek Rozemberczki
Eliseo Papa
23
12
0
20 Nov 2021
Deep Synoptic Monte Carlo Planning in Reconnaissance Blind Chess
Gregory Clark
30
9
0
05 Oct 2021
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning
Daochen Zha
Jingru Xie
Wenye Ma
Sheng Zhang
Xiangru Lian
Xia Hu
Ji Liu
14
117
0
11 Jun 2021
Common Information Belief based Dynamic Programs for Stochastic Zero-sum Games with Competing Teams
D. Kartik
A. Nayyar
U. Mitra
8
12
0
11 Feb 2021
Solving Common-Payoff Games with Approximate Policy Iteration
Samuel Sokota
Edward Lockhart
Finbarr Timbers
Elnaz Davoodi
Ryan DÓrazio
Neil Burch
Martin Schmid
Michael Bowling
Marc Lanctot
42
22
0
11 Jan 2021
Joint Policy Search for Multi-agent Collaboration with Imperfect Information
Yuandong Tian
Qucheng Gong
Tina Jiang
29
19
0
14 Aug 2020
Rethinking Formal Models of Partially Observable Multiagent Decision Making
Vojtěch Kovařík
Martin Schmid
Neil Burch
Michael Bowling
Viliam Lisý
OffRL
14
54
0
26 Jun 2019
1