ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.16473
  4. Cited By
Policy Optimization via Adv2: Adversarial Learning on Advantage Functions

Policy Optimization via Adv2: Adversarial Learning on Advantage Functions

25 October 2023
Matthieu Jonckheere
Chiara Mignacco
Gilles Stoltz
ArXivPDFHTML

Papers citing "Policy Optimization via Adv2: Adversarial Learning on Advantage Functions"

3 / 3 papers shown
Title
Hierarchical Orchestra of Policies
Hierarchical Orchestra of Policies
Thomas P Cannon
Özgür Simsek
CLL
34
0
0
05 Nov 2024
Narrowing the Gap between Adversarial and Stochastic MDPs via Policy Optimization
Narrowing the Gap between Adversarial and Stochastic MDPs via Policy Optimization
D. Tiapkin
Evgenii Chzhen
Gilles Stoltz
74
0
0
08 Jul 2024
Ensemble Reinforcement Learning: A Survey
Ensemble Reinforcement Learning: A Survey
Yanjie Song
Ponnuthurai Nagaratnam Suganthan
Witold Pedrycz
Junwei Ou
Yongming He
Y. Chen
Yutong Wu
OffRL
46
39
0
05 Mar 2023
1