ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1710.10866
  4. Cited By
Unifying Value Iteration, Advantage Learning, and Dynamic Policy
  Programming

Unifying Value Iteration, Advantage Learning, and Dynamic Policy Programming

30 October 2017
Tadashi Kozuno
E. Uchibe
Kenji Doya
ArXiv (abs)PDFHTML

Papers citing "Unifying Value Iteration, Advantage Learning, and Dynamic Policy Programming"

3 / 3 papers shown
Title
Smoothing Advantage Learning
Smoothing Advantage Learning
Yaozhong Gan
Zhe Zhang
Xiaoyang Tan
AAML
31
2
0
20 Mar 2022
Robust Action Gap Increasing with Clipped Advantage Learning
Robust Action Gap Increasing with Clipped Advantage Learning
Zhe Zhang
Yaozhong Gan
Xiaoyang Tan
50
2
0
20 Mar 2022
Gap-Increasing Policy Evaluation for Efficient and Noise-Tolerant
  Reinforcement Learning
Gap-Increasing Policy Evaluation for Efficient and Noise-Tolerant Reinforcement Learning
Tadashi Kozuno
Dongqi Han
Kenji Doya
OffRL
43
2
0
18 Jun 2019
1