ResearchTrend.AI
Combining Deep Reinforcement Learning and Search for Imperfect-Information Games

27 July 2020
Noam Brown
A. Bakhtin
Adam Lerer
Qucheng Gong
arXiv: 2007.13544

Papers citing "Combining Deep Reinforcement Learning and Search for Imperfect-Information Games"

30 / 30 papers shown
Automated Meta Prompt Engineering for Alignment with the Theory of Mind
Aaron Baughman
Rahul Agarwal
Eduardo Morales
Gozde Akay
36
0
0
13 May 2025
Approximating Nash Equilibria in General-Sum Games via Meta-Learning
David Sychrovský
Christopher Solinas
Revan MacQueen
Kevin Wang
James Wright
Nathan R Sturtevant
Michael Bowling
21
0
0
26 Apr 2025
Learning Nash Equilibrial Hamiltonian for Two-Player Collision-Avoiding Interactions
Lei Zhang
Siddharth Das
Tanner Merry
Wenlong Zhang
Yi Ren
54
0
0
10 Mar 2025
Two-Player Zero-Sum Differential Games with One-Sided Information
Mukesh Ghimire
Z. Xu
Yi Ren
SyDa
95
0
0
17 Feb 2025
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling
Bradley Brown
Jordan Juravsky
Ryan Ehrlich
Ronald Clark
Quoc V. Le
Christopher Ré
Azalia Mirhoseini
ALM
LRM
95
224
0
03 Jan 2025
Beyond Autoregression: Fast LLMs via Self-Distillation Through Time
Justin Deschenaux
Çağlar Gülçehre
47
2
0
28 Oct 2024
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
SyDa
SSL
OnRL
51
8
0
02 Aug 2024
A Meta-Game Evaluation Framework for Deep Multiagent Reinforcement Learning
Zun Li
Michael P. Wellman
34
1
0
30 Apr 2024
RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning
Boning Li
Zhixuan Fang
Longbo Huang
20
0
0
07 Mar 2024
State-Constrained Zero-Sum Differential Games with One-Sided Information
Mukesh Ghimire
Lei Zhang
Zhenni Xu
Yi Ren
44
2
0
05 Mar 2024
Deciphering Digital Detectives: Understanding LLM Behaviors and Capabilities in Multi-Agent Mystery Games
Dekun Wu
Haochen Shi
Zhiyuan Sun
Bang Liu
LLMAG
29
16
0
01 Dec 2023
Zero-sum Polymatrix Markov Games: Equilibrium Collapse and Efficient Computation of Nash Equilibria
Fivos Kalogiannis
Ioannis Panageas
34
8
0
23 May 2023
Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious Play
Wei Xi
Yongxin Zhang
Changnan Xiao
Xuefeng Huang
Shihong Deng
Haowei Liang
Jie Chen
Peng Sun
OffRL
50
8
0
07 Mar 2023
Learning not to Regret
David Sychrovský
Michal Sustr
Elnaz Davoodi
Michael Bowling
Marc Lanctot
Martin Schmid
34
3
0
02 Mar 2023
Single-Call Stochastic Extragradient Methods for Structured Non-monotone Variational Inequalities: Improved Analysis under Weaker Conditions
S. Choudhury
Eduard A. Gorbunov
Nicolas Loizou
27
13
0
27 Feb 2023
DITTO: Offline Imitation Learning with World Models
Branton DeMoss
Paul Duckworth
Nick Hawes
Ingmar Posner
OffRL
21
18
0
06 Feb 2023
Abstracting Imperfect Information Away from Two-Player Zero-Sum Games
Samuel Sokota
Ryan D'Orazio
Chun Kai Ling
David J. Wu
J. Zico Kolter
Noam Brown
27
4
0
22 Jan 2023
DanZero: Mastering GuanDan Game with Reinforcement Learning
Yudong Lu
Jian Zhao
Youpeng Zhao
Wen-gang Zhou
Houqiang Li
11
6
0
31 Oct 2022
Efficiently Computing Nash Equilibria in Adversarial Team Markov Games
Fivos Kalogiannis
Ioannis Anagnostides
Ioannis Panageas
Emmanouil-Vasileios Vlatakis-Gkaragkounis
Vaggos Chatziafratis
S. Stavroulakis
39
13
0
03 Aug 2022
Self-Explaining Deviations for Coordination
Hengyuan Hu
Samuel Sokota
David J. Wu
A. Bakhtin
Andrei Lupu
Brandon Cui
Jakob N. Foerster
27
2
0
13 Jul 2022
Approximating Discontinuous Nash Equilibrial Values of Two-Player General-Sum Differential Games
Lei Zhang
Mukesh Ghimire
Wenlong Zhang
Zhenni Xu
Yi Ren
27
7
0
05 Jul 2022
Improving Bidding and Playing Strategies in the Trick-Taking game Wizard using Deep Q-Networks
Jonas Schumacher
Marco Pleines
32
0
0
27 May 2022
Symphony: Learning Realistic and Diverse Agents for Autonomous Driving Simulation
Maximilian Igl
Daewoo Kim
Alex Kuefler
Paul Mougin
Punit Shah
K. Shiarlis
Drago Anguelov
Mark Palatucci
Brandyn White
Shimon Whiteson
35
64
0
06 May 2022
Explainable Biomedical Recommendations via Reinforcement Learning Reasoning on Knowledge Graphs
G. Edwards
Sebastian Nilsson
Benedek Rozemberczki
Eliseo Papa
23
12
0
20 Nov 2021
Deep Synoptic Monte Carlo Planning in Reconnaissance Blind Chess
Gregory Clark
30
9
0
05 Oct 2021
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning
Daochen Zha
Jingru Xie
Wenye Ma
Sheng Zhang
Xiangru Lian
Xia Hu
Ji Liu
14
117
0
11 Jun 2021
Common Information Belief based Dynamic Programs for Stochastic Zero-sum Games with Competing Teams
D. Kartik
A. Nayyar
U. Mitra
8
12
0
11 Feb 2021
Solving Common-Payoff Games with Approximate Policy Iteration
Samuel Sokota
Edward Lockhart
Finbarr Timbers
Elnaz Davoodi
Ryan D'Orazio
Neil Burch
Martin Schmid
Michael Bowling
Marc Lanctot
42
22
0
11 Jan 2021
Joint Policy Search for Multi-agent Collaboration with Imperfect Information
Yuandong Tian
Qucheng Gong
Tina Jiang
29
19
0
14 Aug 2020
Rethinking Formal Models of Partially Observable Multiagent Decision Making
Vojtěch Kovařík
Martin Schmid
Neil Burch
Michael Bowling
Viliam Lisý
OffRL
14
54
0
26 Jun 2019