arXiv: 2210.01800
Bayesian Q-learning With Imperfect Expert Demonstrations
1 October 2022
Fengdi Che
Xiru Zhu
Doina Precup
David Meger
Gregory Dudek
Papers citing
"Bayesian Q-learning With Imperfect Expert Demonstrations"
15 / 15 papers shown
Title
Shaping Rewards for Reinforcement Learning with Imperfect Demonstrations using Generative Models
Yuchen Wu
Melissa Mozifian
Florian Shkurti
63
21
0
02 Nov 2020
Temporally-Extended ε-Greedy Exploration
Will Dabney
Georg Ostrovski
André Barreto
68
34
0
02 Jun 2020
Reinforcement Learning from Imperfect Demonstrations under Soft Expert Guidance
Mingxuan Jing
Xiaojian Ma
Wenbing Huang
F. Sun
Chao Yang
Bin Fang
Huaping Liu
63
60
0
16 Nov 2019
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations
Daniel S. Brown
Wonjoon Goo
P. Nagarajan
S. Niekum
76
358
0
12 Apr 2019
Soft Actor-Critic Algorithms and Applications
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
...
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
143
2,449
0
13 Dec 2018
Is Q-learning Provably Efficient?
Chi Jin
Zeyuan Allen-Zhu
Sébastien Bubeck
Michael I. Jordan
OffRL
75
812
0
10 Jul 2018
Randomized Prior Functions for Deep Reinforcement Learning
Ian Osband
John Aslanides
Albin Cassirer
UQCV
BDL
76
380
0
08 Jun 2018
Rainbow: Combining Improvements in Deep Reinforcement Learning
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
OffRL
107
2,270
0
06 Oct 2017
Overcoming Exploration in Reinforcement Learning with Demonstrations
Ashvin Nair
Bob McGrew
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
OffRL
99
788
0
28 Sep 2017
Deep Exploration via Randomized Value Functions
Ian Osband
Benjamin Van Roy
Daniel Russo
Zheng Wen
89
307
0
22 Mar 2017
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
179
1,483
0
06 Jun 2016
Dueling Network Architectures for Deep Reinforcement Learning
Ziyu Wang
Tom Schaul
Matteo Hessel
H. V. Hasselt
Marc Lanctot
Nando de Freitas
OffRL
91
3,768
0
20 Nov 2015
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
223
3,797
0
18 Nov 2015
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
129
12,265
0
19 Dec 2013
Stochastic Variational Inference
Matt Hoffman
David M. Blei
Chong Wang
John Paisley
BDL
262
2,627
0
29 Jun 2012