ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2206.01011
  4. Cited By
Policy Gradient Algorithms with Monte Carlo Tree Learning for Non-Markov
  Decision Processes

Policy Gradient Algorithms with Monte Carlo Tree Learning for Non-Markov Decision Processes

2 June 2022
Tetsuro Morimura
Kazuhiro Ota
Kenshi Abe
Peinan Zhang
    OffRL
ArXivPDFHTML

Papers citing "Policy Gradient Algorithms with Monte Carlo Tree Learning for Non-Markov Decision Processes"

2 / 2 papers shown
Title
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
369
12,003
0
04 Mar 2022
Graph Convolutional Policy Network for Goal-Directed Molecular Graph
  Generation
Graph Convolutional Policy Network for Goal-Directed Molecular Graph Generation
Jiaxuan You
Bowen Liu
Rex Ying
Vijay S. Pande
J. Leskovec
GNN
206
885
0
07 Jun 2018
1