ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.03072
  4. Cited By
Generalized Hidden Parameter MDPs Transferable Model-based RL in a
  Handful of Trials

Generalized Hidden Parameter MDPs Transferable Model-based RL in a Handful of Trials

8 February 2020
Christian F. Perez
F. Such
Theofanis Karaletsos
ArXivPDFHTML

Papers citing "Generalized Hidden Parameter MDPs Transferable Model-based RL in a Handful of Trials"

15 / 15 papers shown
Title
Policy Resilience to Environment Poisoning Attacks on Reinforcement
  Learning
Policy Resilience to Environment Poisoning Attacks on Reinforcement Learning
Hang Xu
Xinghua Qu
Zinovi Rabinovich
34
1
0
24 Apr 2023
On the Benefits of Leveraging Structural Information in Planning Over
  the Learned Model
On the Benefits of Leveraging Structural Information in Planning Over the Learned Model
Jiajun Shen
K. Kuwaranancharoen
R. Ayoub
Pietro Mercati
S. Sundaram
OffRL
27
0
0
15 Mar 2023
A Survey of Meta-Reinforcement Learning
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
42
124
0
19 Jan 2023
A model-based approach to meta-Reinforcement Learning: Transformers and
  tree search
A model-based approach to meta-Reinforcement Learning: Transformers and tree search
Brieuc Pinon
Jean-Charles Delvenne
Raphaël Jungers
OffRL
37
3
0
24 Aug 2022
Factored Adaptation for Non-Stationary Reinforcement Learning
Factored Adaptation for Non-Stationary Reinforcement Learning
Fan Feng
Erdun Gao
Kun Zhang
Sara Magliacane
CML
OffRL
49
32
0
30 Mar 2022
GalilAI: Out-of-Task Distribution Detection using Causal Active
  Experimentation for Safe Transfer RL
GalilAI: Out-of-Task Distribution Detection using Causal Active Experimentation for Safe Transfer RL
Sumedh Anand Sontakke
Stephen Iota
Zizhao Hu
Arash Mehrjou
Laurent Itti
Bernhard Schölkopf
OODD
20
2
0
29 Oct 2021
Block Contextual MDPs for Continual Learning
Block Contextual MDPs for Continual Learning
Shagun Sodhani
Franziska Meier
Joelle Pineau
Amy Zhang
CLL
41
26
0
13 Oct 2021
Compositional Q-learning for electrolyte repletion with imbalanced
  patient sub-populations
Compositional Q-learning for electrolyte repletion with imbalanced patient sub-populations
Aishwarya Mandyam
Andrew Jones
Jiayu Yao
K. Laudanski
Barbara E. Engelhardt
OffRL
29
0
0
06 Oct 2021
Toward AI Assistants That Let Designers Design
Toward AI Assistants That Let Designers Design
Sebastiaan De Peuter
Antti Oulasvirta
Samuel Kaski
AI4CE
29
19
0
22 Jul 2021
Contrastive Behavioral Similarity Embeddings for Generalization in
  Reinforcement Learning
Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning
Rishabh Agarwal
Marlos C. Machado
Pablo Samuel Castro
Marc G. Bellemare
OffRL
29
164
0
13 Jan 2021
MELD: Meta-Reinforcement Learning from Images via Latent State Models
MELD: Meta-Reinforcement Learning from Images via Latent State Models
Tony Zhao
Anusha Nagabandi
Kate Rakelly
Chelsea Finn
Sergey Levine
OffRL
32
36
0
26 Oct 2020
Causal Curiosity: RL Agents Discovering Self-supervised Experiments for
  Causal Representation Learning
Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning
Sumedh Anand Sontakke
Arash Mehrjou
Laurent Itti
Bernhard Schölkopf
CML
25
60
0
07 Oct 2020
Learning Robust State Abstractions for Hidden-Parameter Block MDPs
Learning Robust State Abstractions for Hidden-Parameter Block MDPs
Amy Zhang
Shagun Sodhani
Khimya Khetarpal
Joelle Pineau
31
5
0
14 Jul 2020
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
493
11,727
0
09 Mar 2017
Simple and Scalable Predictive Uncertainty Estimation using Deep
  Ensembles
Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles
Balaji Lakshminarayanan
Alexander Pritzel
Charles Blundell
UQCV
BDL
276
5,695
0
05 Dec 2016
1