2212.07486
Cited By
Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction
14 December 2022
Brahma S. Pavse
Josiah P. Hanna
OffRL
Papers citing
"Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction"
34 / 34 papers shown
Offline Reinforcement Learning Under Value and Density-Ratio Realizability: The Power of Gaps
Jinglin Chen
Nan Jiang
OffRL
61
35
0
25 Mar 2022
Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism
Ming Yin
Yaqi Duan
Mengdi Wang
Yu Wang
OffRL
52
66
0
11 Mar 2022
A Theory of Abstraction in Reinforcement Learning
David Abel
OffRL
16
29
0
01 Mar 2022
Offline Reinforcement Learning with Realizability and Single-policy Concentrability
Wenhao Zhan
Baihe Huang
Audrey Huang
Nan Jiang
Jason D. Lee
OffRL
128
107
0
09 Feb 2022
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
Michael Ruogu Zhang
T. Paine
Ofir Nachum
Cosmin Paduraru
George Tucker
Ziyun Wang
Mohammad Norouzi
OffRL
51
46
0
28 Apr 2021
Benchmarks for Deep Off-Policy Evaluation
Justin Fu
Mohammad Norouzi
Ofir Nachum
George Tucker
Ziyun Wang
...
Yutian Chen
Aviral Kumar
Cosmin Paduraru
Sergey Levine
T. Paine
ELM
OffRL
50
102
0
30 Mar 2021
Instabilities of Offline RL with Pre-Trained Neural Representation
Ruosong Wang
Yifan Wu
Ruslan Salakhutdinov
Sham Kakade
OffRL
58
42
0
08 Mar 2021
Hyperparameter Selection for Offline Reinforcement Learning
T. Paine
Cosmin Paduraru
Andrea Michi
Çağlar Gülçehre
Konrad Zolna
Alexander Novikov
Ziyun Wang
Nando de Freitas
GP
OffRL
90
147
0
17 Jul 2020
Off-Policy Evaluation via the Regularized Lagrangian
Mengjiao Yang
Ofir Nachum
Bo Dai
Lihong Li
Dale Schuurmans
OffRL
12
115
0
07 Jul 2020
Learning Invariant Representations for Reinforcement Learning without Reconstruction
Amy Zhang
R. McAllister
Roberto Calandra
Y. Gal
Sergey Levine
OOD
SSL
75
469
0
18 Jun 2020
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
GP
OffRL
161
1,338
0
15 Apr 2020
Batch Stationary Distribution Estimation
Junfeng Wen
Bo Dai
Lihong Li
Dale Schuurmans
OffRL
51
22
0
02 Mar 2020
GenDICE: Generalized Offline Estimation of Stationary Values
Ruiyi Zhang
Bo Dai
Lihong Li
Dale Schuurmans
OffRL
99
173
0
21 Feb 2020
GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values
Shangtong Zhang
Bo Liu
Shimon Whiteson
OffRL
14
103
0
29 Jan 2020
Asymptotically Efficient Off-Policy Evaluation for Tabular Reinforcement Learning
Ming Yin
Yu Wang
OffRL
79
82
0
29 Jan 2020
Scalable methods for computing state similarity in deterministic Markov Decision Processes
Pablo Samuel Castro
29
136
0
21 Nov 2019
Empirical Study of Off-Policy Policy Evaluation for Reinforcement Learning
Cameron Voloshin
Hoang Minh Le
Nan Jiang
Yisong Yue
OffRL
35
154
0
15 Nov 2019
Minimax Weight and Q-Function Learning for Off-Policy Evaluation
Masatoshi Uehara
Jiawei Huang
Nan Jiang
OffRL
61
186
0
28 Oct 2019
Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling
Yao Liu
Pierre-Luc Bacon
Emma Brunskill
OffRL
46
46
0
15 Oct 2019
DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections
Ofir Nachum
Yinlam Chow
Bo Dai
Lihong Li
OffRL
60
332
0
10 Jun 2019
Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling
Tengyang Xie
Yifei Ma
Yu Wang
OffRL
64
181
0
08 Jun 2019
DeepMDP: Learning Continuous Latent Space Models for Representation Learning
Carles Gelada
Saurabh Kumar
Jacob Buckman
Ofir Nachum
Marc G. Bellemare
BDL
45
283
0
06 Jun 2019
Batch Policy Learning under Constraints
Hoang Minh Le
Cameron Voloshin
Yisong Yue
OffRL
40
328
0
20 Mar 2019
Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift
Carles Gelada
Marc G. Bellemare
OffRL
34
97
0
27 Jan 2019
Breaking the Curse of Horizon: Infinite-Horizon Off-Policy Estimation
Qiang Liu
Lihong Li
Ziyang Tang
Dengyong Zhou
OffRL
62
354
0
29 Oct 2018
Importance Sampling Policy Evaluation with an Estimated Behavior Policy
Josiah P. Hanna
S. Niekum
Peter Stone
OffRL
23
67
0
04 Jun 2018
Representation Balancing MDPs for Off-Policy Policy Evaluation
Yao Liu
Omer Gottesman
Aniruddh Raghu
Matthieu Komorowski
A. Faisal
Finale Doshi-Velez
Emma Brunskill
OffRL
39
75
0
23 May 2018
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
183
18,685
0
20 Jul 2017
Consistent On-Line Off-Policy Evaluation
Assaf Hallak
Shie Mannor
OffRL
41
93
0
23 Feb 2017
Learning from Conditional Distributions via Dual Embeddings
Bo Dai
Niao He
Yunpeng Pan
Byron Boots
Le Song
45
21
0
15 Jul 2016
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
160
5,048
0
05 Jun 2016
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Philip S. Thomas
Emma Brunskill
OffRL
132
573
0
04 Apr 2016
Doubly Robust Off-policy Value Evaluation for Reinforcement Learning
Nan Jiang
Lihong Li
OffRL
90
621
0
11 Nov 2015
Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping
R. Sutton
Csaba Szepesvári
A. Geramifard
Michael Bowling
OffRL
48
203
0
13 Jun 2012