Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1811.02483
Cited By
Deep Reinforcement Learning for Green Security Games with Real-Time Information
6 November 2018
Yufei Wang
Zheyuan Ryan Shi
Lantao Yu
Yi Wu
Rohit Singh
Lucas Joppa
Fei Fang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Reinforcement Learning for Green Security Games with Real-Time Information"
13 / 13 papers shown
Title
A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
Marc Lanctot
V. Zambaldi
A. Gruslys
Angeliki Lazaridou
K. Tuyls
Julien Perolat
David Silver
T. Graepel
70
628
0
02 Nov 2017
Safe and Nested Subgame Solving for Imperfect-Information Games
Noam Brown
Tuomas Sandholm
32
182
0
08 May 2017
Multi-agent Reinforcement Learning in Sequential Social Dilemmas
Joel Z Leibo
V. Zambaldi
Marc Lanctot
J. Marecki
T. Graepel
32
606
0
10 Feb 2017
DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker
Matej Moravcík
Martin Schmid
Neil Burch
Viliam Lisý
Dustin Morrill
Nolan Bard
Trevor Davis
Kevin Waugh
Michael Bradley Johanson
Michael Bowling
BDL
41
904
0
06 Jan 2017
SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
Lantao Yu
Weinan Zhang
Jun Wang
Yong Yu
GAN
37
2,397
0
18 Sep 2016
Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Jakob N. Foerster
Yannis Assael
Nando de Freitas
Shimon Whiteson
92
1,600
0
21 May 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
150
8,805
0
04 Feb 2016
Dueling Network Architectures for Deep Reinforcement Learning
Ziyun Wang
Tom Schaul
Matteo Hessel
H. V. Hasselt
Marc Lanctot
Nando de Freitas
OffRL
54
3,742
0
20 Nov 2015
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
98
7,590
0
22 Sep 2015
Action-Conditional Video Prediction using Deep Networks in Atari Games
Junhyuk Oh
Xiaoxiao Guo
Honglak Lee
Richard L. Lewis
Satinder Singh
64
852
0
31 Jul 2015
Deep Recurrent Q-Learning for Partially Observable MDPs
Matthew J. Hausknecht
Peter Stone
69
1,668
0
23 Jul 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
453
149,474
0
22 Dec 2014
On the difficulty of training Recurrent Neural Networks
Razvan Pascanu
Tomas Mikolov
Yoshua Bengio
ODL
95
5,318
0
21 Nov 2012
1