ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.12098
  4. Cited By
Eval-PPO: Building an Efficient Threat Evaluator Using Proximal Policy Optimization
v1v2 (latest)

Eval-PPO: Building an Efficient Threat Evaluator Using Proximal Policy Optimization

15 March 2025
Wuzhou Sun
Siyi Li
Qingxiang Zou
Zixing Liao
    AAML
ArXiv (abs)PDFHTML

Papers citing "Eval-PPO: Building an Efficient Threat Evaluator Using Proximal Policy Optimization"

11 / 11 papers shown
Title
DanZero+: Dominating the GuanDan Game through Reinforcement Learning
DanZero+: Dominating the GuanDan Game through Reinforcement Learning
Youpeng Zhao
Yudong Lu
Jian Zhao
Wen-gang Zhou
Houqiang Li
73
6
0
05 Dec 2023
Voyager: An Open-Ended Embodied Agent with Large Language Models
Voyager: An Open-Ended Embodied Agent with Large Language Models
Guanzhi Wang
Yuqi Xie
Yunfan Jiang
Ajay Mandlekar
Chaowei Xiao
Yuke Zhu
Linxi Fan
Anima Anandkumar
LM&RoSyDa
162
841
0
25 May 2023
Navigates Like Me: Understanding How People Evaluate Human-Like AI in
  Video Games
Navigates Like Me: Understanding How People Evaluate Human-Like AI in Video Games
Stephanie Milani
Arthur Juliani
Ida Momennejad
Raluca Georgescu
Jaroslaw Rzepecki
Alison Shaw
Gavin Costello
Fei Fang
Sam Devlin
Katja Hofmann
58
11
0
02 Mar 2023
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
Michael Ahn
Anthony Brohan
Noah Brown
Yevgen Chebotar
Omar Cortes
...
Ted Xiao
Peng Xu
Sichun Xu
Mengyuan Yan
Andy Zeng
LM&Ro
195
1,988
0
04 Apr 2022
Counter-Strike Deathmatch with Large-Scale Behavioural Cloning
Counter-Strike Deathmatch with Large-Scale Behavioural Cloning
Tim Pearce
Jun Zhu
79
47
0
09 Apr 2021
Dota 2 with Large Scale Deep Reinforcement Learning
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNNVLMCLLAI4CELRM
169
1,838
0
13 Dec 2019
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
547
19,296
0
20 Jul 2017
Value-Decomposition Networks For Cooperative Multi-Agent Learning
Value-Decomposition Networks For Cooperative Multi-Agent Learning
P. Sunehag
Guy Lever
A. Gruslys
Wojciech M. Czarnecki
V. Zambaldi
...
Marc Lanctot
Nicolas Sonnerat
Joel Z Leibo
K. Tuyls
T. Graepel
81
1,013
0
16 Jun 2017
Asynchronous Methods for Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
207
8,881
0
04 Feb 2016
Continuous control with deep reinforcement learning
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
330
13,289
0
09 Sep 2015
Playing Atari with Deep Reinforcement Learning
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
132
12,269
0
19 Dec 2013
1