1711.05726
Markov Decision Processes with Continuous Side Information
15 November 2017
Aditya Modi
Nan Jiang
Satinder Singh
Ambuj Tewari
OffRL
Papers citing "Markov Decision Processes with Continuous Side Information"
13 / 13 papers shown
Title
Pessimism Principle Can Be Effective: Towards a Framework for Zero-Shot Transfer Reinforcement Learning
Chi Zhang
Ziying Jia
George Atia
Sihong He
Yue Wang
95
0
0
24 May 2025
On the Fly Adaptation of Behavior Tree-Based Policies through Reinforcement Learning
M. Iannotta
J. A. Stork
Erik Schaffernicht
Todor Stoyanov
59
0
0
08 Mar 2025
Model-Based Transfer Learning for Contextual Reinforcement Learning
Jung-Hoon Cho
Vindula Jayawardana
Sirui Li
Cathy Wu
OffRL
146
0
0
08 Aug 2024
Sample Complexity of Reinforcement Learning using Linearly Combined Model Ensembles
Aditya Modi
Nan Jiang
Ambuj Tewari
Satinder Singh
68
132
0
23 Oct 2019
Transfer Learning Across Patient Variations with Hidden Parameter Markov Decision Processes
Taylor W. Killian
George Konidaris
Finale Doshi-Velez
OOD
40
9
0
01 Dec 2016
Sample Complexity of Episodic Fixed-Horizon Reinforcement Learning
Christoph Dann
Emma Brunskill
74
249
0
29 Oct 2015
Contextual Markov Decision Processes
Assaf Hallak
Dotan Di Castro
Shie Mannor
89
248
0
08 Feb 2015
Online learning in MDPs with side information
Yasin Abbasi-Yadkori
Gergely Neu
OffRL
55
18
0
26 Jun 2014
Clustering Markov Decision Processes For Continual Transfer
M. H. Mahmud
Majd Hawasly
Benjamin Rosman
S. Ramamoorthy
OffRL
77
22
0
15 Nov 2013
Sample Complexity of Multi-task Reinforcement Learning
Emma Brunskill
Lihong Li
86
138
0
26 Sep 2013
Exploring compact reinforcement-learning representations with linear regression
Thomas J. Walsh
I. Szita
Carlos Diuk
Michael L. Littman
OffRL
222
114
0
09 May 2012
A Contextual-Bandit Approach to Personalized News Article Recommendation
Lihong Li
Wei Chu
John Langford
Robert Schapire
471
2,955
0
28 Feb 2010
Contextual Bandits with Similarity Information
Aleksandrs Slivkins
461
450
0
23 Jul 2009