When to use parametric models in reinforcement learning?

12 June 2019
Hado van Hasselt
Matteo Hessel
John Aslanides

Papers citing "When to use parametric models in reinforcement learning?"

50 / 124 papers shown
EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning
Siddharth Aravindan
Dixant Mittal
Wee Sun Lee
BDL
79
0
0
17 Jan 2025
CALE: Continuous Arcade Learning Environment
Jesse Farebrother
Pablo Samuel Castro
ELM
38
0
0
31 Oct 2024
Uncovering RL Integration in SSL Loss: Objective-Specific Implications for Data-Efficient RL
Ömer Veysel Çağatan
Barış Akgün
OffRL
41
0
0
22 Oct 2024
Reinforcement Learning From Imperfect Corrective Actions And Proxy Rewards
Zhaohui Jiang
Xuening Feng
Paul Weng
Yifei Zhu
Yan Song
Tianze Zhou
Yujing Hu
Tangjie Lv
Changjie Fan
46
0
0
08 Oct 2024
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Ghada Sokar
J. Obando-Ceron
Aaron Courville
Hugo Larochelle
Pablo Samuel Castro
MoE
171
2
0
02 Oct 2024
Offline Model-Based Reinforcement Learning with Anti-Exploration
Padmanaba Srinivasan
William J. Knottenbelt
OffRL
57
0
0
20 Aug 2024
ProSpec RL: Plan Ahead, then Execute
Liangliang Liu
Huiyu Duan
Liu Yang
Rujia Shen
Yi Lin
Chaoran Kong
Lian Yan
P. Callet
OffRL
40
0
0
31 Jul 2024
The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning
Andrew Patterson
Samuel Neumann
Raksha Kumaraswamy
Martha White
Adam White
26
2
0
26 Jul 2024
Investigating the Interplay of Prioritized Replay and Generalization
Parham Mohammad Panahi
Andrew Patterson
Martha White
Adam White
58
0
0
12 Jul 2024
Generalizing soft actor-critic algorithms to discrete action spaces
Le Zhang
Yong Gu
Xin Zhao
Yanshuo Zhang
Shu Zhao
Yifei Jin
Xinxin Wu
34
0
0
08 Jul 2024
On the consistency of hyper-parameter selection in value-based deep reinforcement learning
J. Obando-Ceron
J. G. Araújo
Aaron Courville
Pablo Samuel Castro
48
7
0
25 Jun 2024
A New View on Planning in Online Reinforcement Learning
Kevin Roice
Parham Mohammad Panahi
Scott M. Jordan
Adam White
Martha White
OffRL
28
0
0
03 Jun 2024
Partial Models for Building Adaptive Model-Based Reinforcement Learning Agents
Safa Alver
Ali Rahimi-Kalahroudi
Doina Precup
32
1
0
27 May 2024
Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning
Aditya A. Ramesh
Kenny Young
Louis Kirsch
Jürgen Schmidhuber
34
1
0
06 May 2024
In value-based deep reinforcement learning, a pruned network is a good network
J. Obando-Ceron
Aaron Courville
Pablo Samuel Castro
OffRL
50
18
0
19 Feb 2024
Mixtures of Experts Unlock Parameter Scaling for Deep RL
J. Obando-Ceron
Ghada Sokar
Timon Willi
Clare Lyle
Jesse Farebrother
Jakob N. Foerster
Gintare Karolina Dziugaite
Doina Precup
Pablo Samuel Castro
63
31
0
13 Feb 2024
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent
Yingru Li
Jiawei Xu
Lei Han
Zhi-Quan Luo
BDL
OffRL
26
5
0
05 Feb 2024
Unsupervised Salient Patch Selection for Data-Efficient Reinforcement Learning
Zhaohui Jiang
Paul Weng
OffRL
27
0
0
10 Jan 2024
Towards Control-Centric Representations in Reinforcement Learning from Images
Chen Liu
Hongyu Zang
Xin Li
Yong Heng
Yifei Wang
Zhen Fang
Yisen Wang
Mingzhong Wang
30
0
0
25 Oct 2023
Mind the Model, Not the Agent: The Primacy Bias in Model-based RL
Zhongjian Qiao
Jiafei Lyu
Xiu Li
24
3
0
23 Oct 2023
STORM: Efficient Stochastic Transformer based World Models for Reinforcement Learning
Weipu Zhang
Gang Wang
Jian Sun
Yetian Yuan
Gao Huang
63
34
0
14 Oct 2023
Small batch deep reinforcement learning
J. Obando-Ceron
Marc G. Bellemare
Pablo Samuel Castro
VLM
34
14
0
05 Oct 2023
Enhancing data efficiency in reinforcement learning: a novel imagination mechanism based on mesh information propagation
Zihang Wang
Maowei Jiang
AI4CE
31
0
0
25 Sep 2023
BarlowRL: Barlow Twins for Data-Efficient Reinforcement Learning
Ömer Veysel Çağatan
Barış Akgün
BDL
OffRL
35
3
0
08 Aug 2023
PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning
Hojoon Lee
Hanseul Cho
Hyunseung Kim
Daehoon Gwak
Joonkee Kim
Jaegul Choo
Se-Young Yun
Chulhee Yun
OffRL
82
26
0
19 Jun 2023
Deep Generative Models for Decision-Making and Control
Michael Janner
36
1
0
15 Jun 2023
What model does MuZero learn?
Jinke He
Thomas M. Moerland
F. Oliehoek
33
4
0
01 Jun 2023
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Max Schwarzer
J. Obando-Ceron
Aaron Courville
Marc G. Bellemare
Rishabh Agarwal
Pablo Samuel Castro
OffRL
54
85
0
30 May 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
36
9
0
29 May 2023
On the Value of Myopic Behavior in Policy Reuse
Kang Xu
Chenjia Bai
Shuang Qiu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
36
1
0
28 May 2023
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
Toshinori Kitamura
Tadashi Kozuno
Yunhao Tang
Nino Vieillard
Michal Valko
...
Olivier Pietquin
M. Geist
Csaba Szepesvári
Wataru Kumagai
Yutaka Matsuo
OffRL
32
3
0
22 May 2023
On First-Order Meta-Reinforcement Learning with Moreau Envelopes
Taha Toghani
Sebastian Perez-Salazar
César A. Uribe
37
2
0
20 May 2023
Sample-efficient Model-based Reinforcement Learning for Quantum Control
Irtaza Khalid
C. Weidner
E. Jonckheere
Sophie G. Shermer
F. Langbein
19
10
0
19 Apr 2023
Transformer-based World Models Are Happy With 100k Interactions
Jan Robine
Marc Höftmann
Tobias Uelwer
Stefan Harmeling
OffRL
27
71
0
13 Mar 2023
RePreM: Representation Pre-training with Masked Model for Reinforcement Learning
Yuanying Cai
Chuheng Zhang
Wei Shen
Xuyun Zhang
Wenjie Ruan
Longbo Huang
OffRL
32
4
0
03 Mar 2023
The Dormant Neuron Phenomenon in Deep Reinforcement Learning
Ghada Sokar
Rishabh Agarwal
Pablo Samuel Castro
Utku Evci
CLL
51
90
0
24 Feb 2023
Understanding the effect of varying amounts of replay per step
A. Paul
Videh Raj Nema
8
0
0
20 Feb 2023
Learning a model is paramount for sample efficiency in reinforcement learning control of PDEs
Stefan Werner
Sebastian Peitz
41
9
0
14 Feb 2023
Predictable MDP Abstraction for Unsupervised Model-Based RL
Seohong Park
Sergey Levine
24
9
0
08 Feb 2023
Model-based Offline Reinforcement Learning with Local Misspecification
Kefan Dong
Yannis Flet-Berliac
Allen Nie
Emma Brunskill
OffRL
18
4
0
26 Jan 2023
The Benefits of Model-Based Generalization in Reinforcement Learning
Kenny Young
Aditya A. Ramesh
Louis Kirsch
Jürgen Schmidhuber
OffRL
28
12
0
04 Nov 2022
Model-based Reinforcement Learning with a Hamiltonian Canonical ODE Network
Yao Feng
Yuhong Jiang
Hang Su
Dong Yan
Jun Zhu
20
1
0
02 Nov 2022
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktäschel
Edward Grefenstette
OffRL
62
9
0
23 Oct 2022
Reinforcement Learning with Automated Auxiliary Loss Search
Tairan He
Yuge Zhang
Kan Ren
Minghuan Liu
Che Wang
Weinan Zhang
Yuqing Yang
Dongsheng Li
38
16
0
12 Oct 2022
Hyperbolic Deep Reinforcement Learning
Edoardo Cetin
B. Chamberlain
Michael M. Bronstein
Jonathan J. Hunt
50
21
0
04 Oct 2022
Pretraining the Vision Transformer using self-supervised methods for vision based Deep Reinforcement Learning
Manuel Goulão
Arlindo L. Oliveira
ViT
45
6
0
22 Sep 2022
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Litian Liang
Yaosheng Xu
Stephen Marcus McAleer
Dailin Hu
Alexander Ihler
Pieter Abbeel
Roy Fox
OOD
27
16
0
16 Sep 2022
Human-level Atari 200x faster
Steven Kapturowski
Victor Campos
Ray Jiang
Nemanja Rakićević
Hado van Hasselt
Charles Blundell
Adria Puigdomenech Badia
OffRL
52
28
0
15 Sep 2022
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Shuang Qiu
Lingxiao Wang
Chenjia Bai
Zhuoran Yang
Zhaoran Wang
SSL
OffRL
26
32
0
29 Jul 2022
Compositional Generalization in Grounded Language Learning via Induced Model Sparsity
Sam Spilsbury
Alexander Ilin
16
7
0
06 Jul 2022