ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.11234
  4. Cited By
Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning

Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning

15 October 2024
Jiayu Chen
Wentse Chen
Jeff Schneider
    OffRL
ArXivPDFHTML

Papers citing "Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning"

35 / 35 papers shown
Title
SOReL and TOReL: Two Methods for Fully Offline Reinforcement Learning
SOReL and TOReL: Two Methods for Fully Offline Reinforcement Learning
Mattie Fellows
Clarisse Wibault
Uljad Berdica
Johannes Forkel
Jakob Foerster
Michael A. Osborne
OffRL
OnRL
20
0
0
28 May 2025
Policy-Driven World Model Adaptation for Robust Offline Model-based Reinforcement Learning
Policy-Driven World Model Adaptation for Robust Offline Model-based Reinforcement Learning
Jiayu Chen
Aravind Venugopal
Jeff Schneider
OffRL
44
0
0
19 May 2025
Variational Offline Multi-agent Skill Discovery
Variational Offline Multi-agent Skill Discovery
Jiayu Chen
Bhargav Ganguly
Tian-Shing Lan
OffRL
84
3
0
26 May 2024
Full Shot Predictions for the DIII-D Tokamak via Deep Recurrent Networks
Full Shot Predictions for the DIII-D Tokamak via Deep Recurrent Networks
I. Char
Youngseog Chung
J. Abbate
E. Kolemen
Jeff Schneider
57
5
0
18 Apr 2024
Deep Generative Models for Offline Policy Learning: Tutorial, Survey,
  and Perspectives on Future Directions
Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions
Jiayu Chen
Bhargav Ganguly
Yang Xu
Yongsheng Mei
Tian-Shing Lan
Vaneet Aggarwal
OffRL
21
9
0
21 Feb 2024
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General
  Sequential Decision Scenarios
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
Yazhe Niu
Yuan Pu
Zhenjie Yang
Xueyan Li
Tong Zhou
Jiyuan Ren
Shuai Hu
Hongsheng Li
Yu Liu
114
13
0
12 Oct 2023
ContraBAR: Contrastive Bayes-Adaptive Deep RL
ContraBAR: Contrastive Bayes-Adaptive Deep RL
Era Choshen
Aviv Tamar
BDL
OffRL
28
8
0
04 Jun 2023
Hierarchical Deep Counterfactual Regret Minimization
Hierarchical Deep Counterfactual Regret Minimization
Jiayu Chen
Tian-Shing Lan
Vaneet Aggarwal
47
3
0
27 May 2023
A Unified Algorithm Framework for Unsupervised Discovery of Skills based
  on Determinantal Point Process
A Unified Algorithm Framework for Unsupervised Discovery of Skills based on Determinantal Point Process
Jiayu Chen
Vaneet Aggarwal
Tian-Shing Lan
39
3
0
01 Dec 2022
Model-Based Offline Reinforcement Learning with Pessimism-Modulated
  Dynamics Belief
Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief
Kaiyang Guo
Yunfeng Shao
Yanhui Geng
OffRL
39
25
0
13 Oct 2022
Conservative Bayesian Model-Based Value Expansion for Offline Policy
  Optimization
Conservative Bayesian Model-Based Value Expansion for Offline Policy Optimization
Jihwan Jeong
Xiaoyu Wang
Michael Gimelfarb
Hyunwoo J. Kim
Baher Abdulhai
Scott Sanner
OffRL
99
11
0
07 Oct 2022
Offline RL Policies Should be Trained to be Adaptive
Offline RL Policies Should be Trained to be Adaptive
Dibya Ghosh
Anurag Ajay
Pulkit Agrawal
Sergey Levine
OffRL
54
46
0
05 Jul 2022
Mastering Atari Games with Limited Data
Mastering Atari Games with Limited Data
Weirui Ye
Shao-Wei Liu
Thanard Kurutach
Pieter Abbeel
Yang Gao
VLM
87
228
0
30 Oct 2021
Revisiting Design Choices in Offline Model-Based Reinforcement Learning
Revisiting Design Choices in Offline Model-Based Reinforcement Learning
Cong Lu
Philip J. Ball
Jack Parker-Holder
Michael A. Osborne
Stephen J. Roberts
OffRL
40
54
0
08 Oct 2021
Model-Based Offline Planning with Trajectory Pruning
Model-Based Offline Planning with Trajectory Pruning
Xianyuan Zhan
Xiangyu Zhu
Haoran Xu
OffRL
65
36
0
16 May 2021
Learning and Planning in Complex Action Spaces
Learning and Planning in Complex Action Spaces
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
M. Barekatain
Simon Schmitt
David Silver
44
78
0
13 Apr 2021
Online and Offline Reinforcement Learning by Planning with a Learned
  Model
Online and Offline Reinforcement Learning by Planning with a Learned Model
Julian Schrittwieser
Thomas Hubert
Amol Mandhane
M. Barekatain
Ioannis Antonoglou
David Silver
OffRL
53
114
0
13 Apr 2021
COMBO: Conservative Offline Model-Based Policy Optimization
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
261
425
0
16 Feb 2021
On the role of planning in model-based deep reinforcement learning
On the role of planning in model-based deep reinforcement learning
Jessica B. Hamrick
A. Friesen
Feryal M. P. Behbahani
A. Guez
Fabio Viola
Sims Witherspoon
Thomas W. Anthony
Lars Buesing
Petar Velickovic
T. Weber
OffRL
41
65
0
08 Nov 2020
Bayes-Adaptive Deep Model-Based Policy Optimisation
Bayes-Adaptive Deep Model-Based Policy Optimisation
Tai Hoang
Ngo Anh Vien
BDL
31
1
0
29 Oct 2020
Model-Based Offline Planning
Model-Based Offline Planning
Arthur Argenson
Gabriel Dulac-Arnold
OffRL
40
152
0
12 Aug 2020
Conservative Q-Learning for Offline Reinforcement Learning
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRL
OnRL
84
1,780
0
08 Jun 2020
MOPO: Model-based Offline Policy Optimization
MOPO: Model-based Offline Policy Optimization
Tianhe Yu
G. Thomas
Lantao Yu
Stefano Ermon
James Zou
Sergey Levine
Chelsea Finn
Tengyu Ma
OffRL
56
759
0
27 May 2020
MOReL : Model-Based Offline Reinforcement Learning
MOReL : Model-Based Offline Reinforcement Learning
Rahul Kidambi
Aravind Rajeswaran
Praneeth Netrapalli
Thorsten Joachims
OffRL
59
662
0
12 May 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on
  Open Problems
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
471
1,994
0
04 May 2020
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
GP
OffRL
169
1,338
0
15 Apr 2020
Behavior Regularized Offline Reinforcement Learning
Behavior Regularized Offline Reinforcement Learning
Yifan Wu
George Tucker
Ofir Nachum
OffRL
53
675
0
26 Nov 2019
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRL
OnRL
71
1,044
0
03 Jun 2019
Model-Based Value Estimation for Efficient Model-Free Reinforcement
  Learning
Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning
Vladimir Feinberg
Alvin Wan
Ion Stoica
Michael I. Jordan
Joseph E. Gonzalez
Sergey Levine
OffRL
45
317
0
28 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
180
8,236
0
04 Jan 2018
Mastering Chess and Shogi by Self-Play with a General Reinforcement
  Learning Algorithm
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
David Silver
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
Matthew Lai
...
D. Kumaran
T. Graepel
Timothy Lillicrap
Karen Simonyan
Demis Hassabis
87
1,755
0
05 Dec 2017
Simple and Scalable Predictive Uncertainty Estimation using Deep
  Ensembles
Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles
Balaji Lakshminarayanan
Alexander Pritzel
Charles Blundell
UQCV
BDL
458
5,748
0
05 Dec 2016
Monte Carlo Bayesian Reinforcement Learning
Monte Carlo Bayesian Reinforcement Learning
Yi Wang
K. Won
David Hsu
Wee Sun Lee
OffRL
59
45
0
27 Jun 2012
A Bayesian Sampling Approach to Exploration in Reinforcement Learning
A Bayesian Sampling Approach to Exploration in Reinforcement Learning
J. Asmuth
Lihong Li
Michael L. Littman
A. Nouri
David Wingate
BDL
62
189
0
09 May 2012
Variance-Based Rewards for Approximate Bayesian Reinforcement Learning
Variance-Based Rewards for Approximate Bayesian Reinforcement Learning
Jonathan Sorg
Satinder Singh
Richard L. Lewis
OffRL
60
69
0
15 Mar 2012
1