ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.09536
  4. Cited By
What About Inputing Policy in Value Function: Policy Representation and
  Policy-extended Value Function Approximator

What About Inputing Policy in Value Function: Policy Representation and Policy-extended Value Function Approximator

19 October 2020
Hongyao Tang
Zhaopeng Meng
Jianye Hao
Cheng Chen
D. Graves
Dong Li
Changmin Yu
Hangyu Mao
Wulong Liu
Yaodong Yang
Wenyuan Tao
Li Wang
    OffRL
ArXivPDFHTML

Papers citing "What About Inputing Policy in Value Function: Policy Representation and Policy-extended Value Function Approximator"

5 / 5 papers shown
Title
Towards A Unified Policy Abstraction Theory and Representation Learning
  Approach in Markov Decision Processes
Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes
M. Zhang
Hongyao Tang
Jianye Hao
Yan Zheng
OffRL
28
0
0
16 Sep 2022
Controlling Overestimation Bias with Truncated Mixture of Continuous
  Distributional Quantile Critics
Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics
Arsenii Kuznetsov
Pavel Shvechikov
Alexander Grishin
Dmitry Vetrov
136
185
0
08 May 2020
Graph Convolutional Policy Network for Goal-Directed Molecular Graph
  Generation
Graph Convolutional Policy Network for Goal-Directed Molecular Graph Generation
Jiaxuan You
Bowen Liu
Rex Ying
Vijay S. Pande
J. Leskovec
GNN
206
885
0
07 Jun 2018
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp
  Minima
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
296
2,890
0
15 Sep 2016
Off-Policy Actor-Critic
Off-Policy Actor-Critic
T. Degris
Martha White
R. Sutton
OffRL
CML
163
220
0
22 May 2012
1