Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2503.12098
Cited By
v1
v2 (latest)
Eval-PPO: Building an Efficient Threat Evaluator Using Proximal Policy Optimization
15 March 2025
Wuzhou Sun
Siyi Li
Qingxiang Zou
Zixing Liao
AAML
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Eval-PPO: Building an Efficient Threat Evaluator Using Proximal Policy Optimization"
11 / 11 papers shown
Title
DanZero+: Dominating the GuanDan Game through Reinforcement Learning
Youpeng Zhao
Yudong Lu
Jian Zhao
Wen-gang Zhou
Houqiang Li
70
6
0
05 Dec 2023
Voyager: An Open-Ended Embodied Agent with Large Language Models
Guanzhi Wang
Yuqi Xie
Yunfan Jiang
Ajay Mandlekar
Chaowei Xiao
Yuke Zhu
Linxi Fan
Anima Anandkumar
LM&Ro
SyDa
162
841
0
25 May 2023
Navigates Like Me: Understanding How People Evaluate Human-Like AI in Video Games
Stephanie Milani
Arthur Juliani
Ida Momennejad
Raluca Georgescu
Jaroslaw Rzepecki
Alison Shaw
Gavin Costello
Fei Fang
Sam Devlin
Katja Hofmann
58
11
0
02 Mar 2023
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
Michael Ahn
Anthony Brohan
Noah Brown
Yevgen Chebotar
Omar Cortes
...
Ted Xiao
Peng Xu
Sichun Xu
Mengyuan Yan
Andy Zeng
LM&Ro
195
1,988
0
04 Apr 2022
Counter-Strike Deathmatch with Large-Scale Behavioural Cloning
Tim Pearce
Jun Zhu
79
47
0
09 Apr 2021
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
169
1,838
0
13 Dec 2019
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
547
19,296
0
20 Jul 2017
Value-Decomposition Networks For Cooperative Multi-Agent Learning
P. Sunehag
Guy Lever
A. Gruslys
Wojciech M. Czarnecki
V. Zambaldi
...
Marc Lanctot
Nicolas Sonnerat
Joel Z Leibo
K. Tuyls
T. Graepel
75
1,013
0
16 Jun 2017
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
207
8,881
0
04 Feb 2016
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
330
13,289
0
09 Sep 2015
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
132
12,269
0
19 Dec 2013
1