ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1609.05559
  4. Cited By
Opponent Modeling in Deep Reinforcement Learning

Opponent Modeling in Deep Reinforcement Learning

18 September 2016
He He
Jordan L. Boyd-Graber
Kevin Kwok
Hal Daumé III
    BDL
ArXivPDFHTML

Papers citing "Opponent Modeling in Deep Reinforcement Learning"

46 / 46 papers shown
Title
The Curse of Diversity in Ensemble-Based Exploration
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin
P. DÓro
Evgenii Nikishin
Rameswar Panda
42
1
0
07 May 2024
Contrastive learning-based agent modeling for deep reinforcement
  learning
Contrastive learning-based agent modeling for deep reinforcement learning
Wenhao Ma
Yu-Cheng Chang
Jie Yang
Yu-Kai Wang
Chin-Teng Lin
OffRL
29
0
0
30 Dec 2023
(Ir)rationality in AI: State of the Art, Research Challenges and Open Questions
(Ir)rationality in AI: State of the Art, Research Challenges and Open Questions
Olivia Macmillan-Scott
Mirco Musolesi
40
1
0
28 Nov 2023
Reward Shaping for Improved Learning in Real-time Strategy Game Play
Reward Shaping for Improved Learning in Real-time Strategy Game Play
John Kliem
Prithviraj Dasgupta
OffRL
19
1
0
27 Nov 2023
All by Myself: Learning Individualized Competitive Behaviour with a
  Contrastive Reinforcement Learning optimization
All by Myself: Learning Individualized Competitive Behaviour with a Contrastive Reinforcement Learning optimization
Pablo V. A. Barros
A. Sciutti
SSL
31
3
0
02 Oct 2023
Double Deep Q-Learning in Opponent Modeling
Double Deep Q-Learning in Opponent Modeling
Yangtianze Tao
J. Doe
26
3
0
24 Nov 2022
Value-based CTDE Methods in Symmetric Two-team Markov Game: from
  Cooperation to Team Competition
Value-based CTDE Methods in Symmetric Two-team Markov Game: from Cooperation to Team Competition
Pascal Leroy
J. Pisane
D. Ernst
25
3
0
21 Nov 2022
TiAda: A Time-scale Adaptive Algorithm for Nonconvex Minimax
  Optimization
TiAda: A Time-scale Adaptive Algorithm for Nonconvex Minimax Optimization
Xiang Li
Junchi Yang
Niao He
26
8
0
31 Oct 2022
DanZero: Mastering GuanDan Game with Reinforcement Learning
DanZero: Mastering GuanDan Game with Reinforcement Learning
Yudong Lu
Jian Zhao
Youpeng Zhao
Wen-gang Zhou
Houqiang Li
11
6
0
31 Oct 2022
Coordination with Humans via Strategy Matching
Coordination with Humans via Strategy Matching
Michelle Zhao
Reid G. Simmons
H. Admoni
16
8
0
27 Oct 2022
An Opponent-Aware Reinforcement Learning Method for Team-to-Team
  Multi-Vehicle Pursuit via Maximizing Mutual Information Indicator
An Opponent-Aware Reinforcement Learning Method for Team-to-Team Multi-Vehicle Pursuit via Maximizing Mutual Information Indicator
Qinwen Wang
Xinhang Li
Zheng Yuan
Yiying Yang
Chen Xu
Lin Zhang
32
2
0
24 Oct 2022
Classifying Ambiguous Identities in Hidden-Role Stochastic Games with
  Multi-Agent Reinforcement Learning
Classifying Ambiguous Identities in Hidden-Role Stochastic Games with Multi-Agent Reinforcement Learning
Shijie Han
Siyuan Li
Bo An
Wei Zhao
P. Liu
35
0
0
24 Oct 2022
Centralized Training with Hybrid Execution in Multi-Agent Reinforcement
  Learning
Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning
Pedro P. Santos
Diogo S. Carvalho
Miguel Vasco
Alberto Sardinha
Pedro A. Santos
Ana Paiva
Francisco S. Melo
21
1
0
12 Oct 2022
The Neural Race Reduction: Dynamics of Abstraction in Gated Networks
The Neural Race Reduction: Dynamics of Abstraction in Gated Networks
Andrew M. Saxe
Shagun Sodhani
Sam Lewallen
AI4CE
30
34
0
21 Jul 2022
Offline Equilibrium Finding
Offline Equilibrium Finding
Shuxin Li
Xinrun Wang
Youzhi Zhang
Jakub Cerny
Pengdeng Li
Hau Chan
Bo An
OffRL
46
2
0
12 Jul 2022
A Survey on Model-based Reinforcement Learning
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
50
101
0
19 Jun 2022
When Physics Meets Machine Learning: A Survey of Physics-Informed
  Machine Learning
When Physics Meets Machine Learning: A Survey of Physics-Informed Machine Learning
Chuizheng Meng
Sungyong Seo
Defu Cao
Sam Griesemer
Yan Liu
PINN
AI4CE
44
57
0
31 Mar 2022
Learning to Infer Belief Embedded Communication
Learning to Infer Belief Embedded Communication
Guo Ye
Han Liu
B. Sengupta
21
0
0
15 Mar 2022
Learning to Cooperate with Unseen Agent via Meta-Reinforcement Learning
Learning to Cooperate with Unseen Agent via Meta-Reinforcement Learning
Rujikorn Charakorn
P. Manoonpong
Nat Dilokthanakul
33
5
0
05 Nov 2021
ToM2C: Target-oriented Multi-agent Communication and Cooperation with
  Theory of Mind
ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind
Yuan-Fang Wang
Fangwei Zhong
Jing Xu
Yizhou Wang
LLMAG
19
68
0
15 Oct 2021
Influencing Towards Stable Multi-Agent Interactions
Influencing Towards Stable Multi-Agent Interactions
Woodrow Z. Wang
Andy Shih
Annie Xie
Dorsa Sadigh
38
35
0
05 Oct 2021
Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise
  Rollouts
Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts
Weinan Zhang
Xihuai Wang
Jian Shen
Ming Zhou
27
35
0
07 May 2021
Opponent Learning Awareness and Modelling in Multi-Objective Normal Form
  Games
Opponent Learning Awareness and Modelling in Multi-Objective Normal Form Games
Roxana Rădulescu
T. Verstraeten
Yijie Zhang
Patrick Mannion
D. Roijers
A. Nowé
25
14
0
14 Nov 2020
Learning Latent Representations to Influence Multi-Agent Interaction
Learning Latent Representations to Influence Multi-Agent Interaction
Annie Xie
Dylan P. Losey
R. Tolsma
Chelsea Finn
Dorsa Sadigh
DRL
21
132
0
12 Nov 2020
Improving Dialog Systems for Negotiation with Personality Modeling
Improving Dialog Systems for Negotiation with Personality Modeling
Runzhe Yang
Jingxiao Chen
Karthik Narasimhan
27
46
0
20 Oct 2020
What About Inputing Policy in Value Function: Policy Representation and
  Policy-extended Value Function Approximator
What About Inputing Policy in Value Function: Policy Representation and Policy-extended Value Function Approximator
Hongyao Tang
Zhaopeng Meng
Jianye Hao
Chong Chen
D. Graves
...
Hangyu Mao
Wulong Liu
Yaodong Yang
Wenyuan Tao
Li Wang
OffRL
14
7
0
19 Oct 2020
A Generative Machine Learning Approach to Policy Optimization in
  Pursuit-Evasion Games
A Generative Machine Learning Approach to Policy Optimization in Pursuit-Evasion Games
Shiva Navabi
Osonde A. Osoba
18
2
0
04 Oct 2020
Learning to Play against Any Mixture of Opponents
Learning to Play against Any Mixture of Opponents
Max O. Smith
Thomas W. Anthony
Yongzhao Wang
Michael P. Wellman
OffRL
33
9
0
29 Sep 2020
Agent Modelling under Partial Observability for Deep Reinforcement
  Learning
Agent Modelling under Partial Observability for Deep Reinforcement Learning
Georgios Papoudakis
Filippos Christianos
Stefano V. Albrecht
24
61
0
16 Jun 2020
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in
  Cooperative Tasks
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks
Georgios Papoudakis
Filippos Christianos
Lukas Schafer
Stefano V. Albrecht
OffRL
26
220
0
14 Jun 2020
Efficient Use of heuristics for accelerating XCS-based Policy Learning
  in Markov Games
Efficient Use of heuristics for accelerating XCS-based Policy Learning in Markov Games
Hao Chen
Chang Wang
Jian Huang
Jianxing Gong
13
5
0
26 May 2020
Enhanced Rolling Horizon Evolution Algorithm with Opponent Model
  Learning: Results for the Fighting Game AI Competition
Enhanced Rolling Horizon Evolution Algorithm with Opponent Model Learning: Results for the Fighting Game AI Competition
Zhentao Tang
Yuanheng Zhu
Dongbin Zhao
Simon Lucas
21
28
0
31 Mar 2020
FormulaZero: Distributionally Robust Online Adaptation via Offline
  Population Synthesis
FormulaZero: Distributionally Robust Online Adaptation via Offline Population Synthesis
Aman Sinha
Matthew O'Kelly
Hongrui Zheng
Rahul Mangharam
John C. Duchi
Russ Tedrake
OffRL
66
26
0
09 Mar 2020
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and
  Algorithms
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Kaipeng Zhang
Zhuoran Yang
Tamer Basar
58
1,181
0
24 Nov 2019
What Question Answering can Learn from Trivia Nerds
What Question Answering can Learn from Trivia Nerds
Jordan L. Boyd-Graber
Benjamin Borschinger
24
36
0
31 Oct 2019
Stabilizing Generative Adversarial Networks: A Survey
Stabilizing Generative Adversarial Networks: A Survey
Maciej Wiatrak
Stefano V. Albrecht
A. Nystrom
GAN
29
84
0
30 Sep 2019
OpenSpiel: A Framework for Reinforcement Learning in Games
OpenSpiel: A Framework for Reinforcement Learning in Games
Marc Lanctot
Edward Lockhart
Jean-Baptiste Lespiau
V. Zambaldi
Satyaki Upadhyay
...
Julian Schrittwieser
Thomas W. Anthony
Edward Hughes
Ivo Danihelka
Jonah Ryan-Davis
OffRL
22
248
0
26 Aug 2019
Arena: A General Evaluation Platform and Building Toolkit for
  Multi-Agent Intelligence
Arena: A General Evaluation Platform and Building Toolkit for Multi-Agent Intelligence
Yuhang Song
Andrzej Wojcicki
Thomas Lukasiewicz
Jianyi Wang
Abi Aryan
Zhenghua Xu
Mai Xu
Zihan Ding
Lianlong Wu
AI4CE
ELM
27
33
0
17 May 2019
A Regularized Opponent Model with Maximum Entropy Objective
A Regularized Opponent Model with Maximum Entropy Objective
Zheng Tian
Ying Wen
Zhichen Gong
Faiz Punakkath
Shihao Zou
Jun Wang
30
30
0
17 May 2019
Infer Your Enemies and Know Yourself, Learning in Real-Time Bidding with
  Partially Observable Opponents
Infer Your Enemies and Know Yourself, Learning in Real-Time Bidding with Partially Observable Opponents
Manxing Du
Alexander I. Cowen-Rivers
Ying Wen
Phu Sakulwongtana
Jun Wang
M. Brorsson
R. State
16
1
0
28 Feb 2019
Actor-Critic Policy Optimization in Partially Observable Multiagent
  Environments
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
S. Srinivasan
Marc Lanctot
V. Zambaldi
Julien Perolat
K. Tuyls
Rémi Munos
Michael Bowling
8
148
0
21 Oct 2018
A Survey and Critique of Multiagent Deep Reinforcement Learning
A Survey and Critique of Multiagent Deep Reinforcement Learning
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
OffRL
32
550
0
12 Oct 2018
Actor-Attention-Critic for Multi-Agent Reinforcement Learning
Actor-Attention-Critic for Multi-Agent Reinforcement Learning
Shariq Iqbal
Fei Sha
6
738
0
05 Oct 2018
Towards Efficient Detection and Optimal Response against Sophisticated
  Opponents
Towards Efficient Detection and Optimal Response against Sophisticated Opponents
Tianpei Yang
Zhaopeng Meng
Jianye Hao
Chongjie Zhang
Yan Zheng
Ze Zheng
AAML
13
12
0
12 Sep 2018
On Gradient-Based Learning in Continuous Games
On Gradient-Based Learning in Continuous Games
Eric Mazumdar
Lillian J. Ratliff
S. Shankar Sastry
11
134
0
16 Apr 2018
Multiagent Bidirectionally-Coordinated Nets: Emergence of Human-level
  Coordination in Learning to Play StarCraft Combat Games
Multiagent Bidirectionally-Coordinated Nets: Emergence of Human-level Coordination in Learning to Play StarCraft Combat Games
Peng Peng
Ying Wen
Yaodong Yang
Quan Yuan
Zhenkun Tang
Haitao Long
Jun Wang
15
333
0
29 Mar 2017
1