Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.06887
Cited By
A Distributional Perspective on Reinforcement Learning
21 July 2017
Marc G. Bellemare
Will Dabney
Rémi Munos
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Distributional Perspective on Reinforcement Learning"
50 / 257 papers shown
Title
Revisiting Discrete Soft Actor-Critic
Haibin Zhou
Zichuan Lin
Junyou Li
Qiang Fu
Wei Yang
Deheng Ye
51
12
0
21 Sep 2022
MAN: Multi-Action Networks Learning
Keqin Wang
Alison Bartsch
A. Farimani
21
3
0
19 Sep 2022
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Litian Liang
Yaosheng Xu
Stephen Marcus McAleer
Dailin Hu
Alexander Ihler
Pieter Abbeel
Roy Fox
OOD
24
16
0
16 Sep 2022
A Risk-Sensitive Approach to Policy Optimization
Jared Markowitz
Ryan W. Gardner
Ashley J. Llorens
R. Arora
I-J. Wang
OffRL
29
6
0
19 Aug 2022
Reinforcement Learning For Survival, A Clinically Motivated Method For Critically Ill Patients
Thesath Nanayakkara
OOD
OffRL
21
0
0
17 Jul 2022
Offline RL Policies Should be Trained to be Adaptive
Dibya Ghosh
Anurag Ajay
Pulkit Agrawal
Sergey Levine
OffRL
35
45
0
05 Jul 2022
Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?
Xiang Li
Jinghuan Shang
Srijan Das
Michael S. Ryoo
SSL
27
31
0
10 Jun 2022
FishGym: A High-Performance Physics-based Simulation Framework for Underwater Robot Learning
Wenji Liu
Kai-Yi Bai
Xuming He
Shuran Song
Changxi Zheng
Xiaopei Liu
AI4CE
32
12
0
03 Jun 2022
Learning Generalizable Risk-Sensitive Policies to Coordinate in Decentralized Multi-Agent General-Sum Games
Ziyi Liu
Xian Guo
Yongchun Fang
18
0
0
31 May 2022
Critic Sequential Monte Carlo
Vasileios Lioutas
J. Lavington
Justice Sefas
Matthew Niedoba
Yunpeng Liu
Berend Zwartsenberg
Setareh Dabiri
Frank Wood
Adam Scibior
50
7
0
30 May 2022
Sample-Efficient Optimisation with Probabilistic Transformer Surrogates
A. Maraval
Matthieu Zimmer
Antoine Grosnit
Rasul Tutunov
Jun Wang
H. Ammar
30
2
0
27 May 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
41
8
0
20 May 2022
Robust Losses for Learning Value Functions
Andrew Patterson
Victor Liao
Martha White
28
12
0
17 May 2022
Sibyl: Adaptive and Extensible Data Placement in Hybrid Storage Systems Using Online Reinforcement Learning
Gagandeep Singh
Rakesh Nadig
Jisung Park
Rahul Bera
Nastaran Hajinazar
D. Novo
Juan Gómez Luna
S. Stuijk
Henk Corporaal
O. Mutlu
65
33
0
15 May 2022
How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation
Alex X. Lee
Coline Devin
Jost Tobias Springenberg
Yuxiang Zhou
Thomas Lampe
A. Abdolmaleki
Konstantinos Bousmalis
OffRL
OnRL
24
15
0
06 May 2022
Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach
Bobak Shahriari
A. Abdolmaleki
Arunkumar Byravan
A. Friesen
Siqi Liu
Jost Tobias Springenberg
N. Heess
Matthew W. Hoffman
Martin Riedmiller
OffRL
46
27
0
21 Apr 2022
Adaptive Risk-Tendency: Nano Drone Navigation in Cluttered Environments with Distributional Reinforcement Learning
Cheng Liu
E. Kampen
Guido de Croon
34
16
0
28 Mar 2022
Symmetry-Based Representations for Artificial and Biological General Intelligence
I. Higgins
S. Racanière
Danilo Jimenez Rezende
AI4CE
31
44
0
17 Mar 2022
Conditional Measurement Density Estimation in Sequential Monte Carlo via Normalizing Flow
Xiongjie Chen
Yunpeng Li
21
6
0
16 Mar 2022
COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning Attacks
Fan Wu
Linyi Li
Chejian Xu
Huan Zhang
B. Kailkhura
K. Kenthapadi
Ding Zhao
Bo-wen Li
AAML
OffRL
32
34
0
16 Mar 2022
Zipfian environments for Reinforcement Learning
Stephanie C. Y. Chan
Andrew Kyle Lampinen
Pierre Harvey Richemond
Felix Hill
OffRL
15
15
0
15 Mar 2022
Orchestrated Value Mapping for Reinforcement Learning
Mehdi Fatemi
Arash Tavakoli
27
8
0
14 Mar 2022
Distributional Reinforcement Learning for Scheduling of Chemical Production Processes
M. Mowbray
Dongda Zhang
Ehecatl Antonio del Rio Chanona
OffRL
25
6
0
01 Mar 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
36
9
0
23 Feb 2022
Sequential Bayesian experimental designs via reinforcement learning
Hikaru Asano
OffRL
18
0
0
14 Feb 2022
Reinforcement Learning with Heterogeneous Data: Estimation and Inference
Elynn Y. Chen
Rui Song
Michael I. Jordan
OffRL
21
10
0
31 Jan 2022
Mask-based Latent Reconstruction for Reinforcement Learning
Tao Yu
Zhizheng Zhang
Cuiling Lan
Yan Lu
Zhibo Chen
24
44
0
28 Jan 2022
Reinforcement Learning for Personalized Drug Discovery and Design for Complex Diseases: A Systems Pharmacology Perspective
Ryan K. Tan
Yang Liu
Lei Xie
42
2
0
21 Jan 2022
Conservative Distributional Reinforcement Learning with Safety Constraints
Hengrui Zhang
Youfang Lin
Sheng Han
Shuo Wang
Kai Lv
OffRL
21
5
0
18 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
Vincent Mai
Kaustubh Mani
Liam Paull
36
34
0
05 Jan 2022
Value Activation for Bias Alleviation: Generalized-activated Deep Double Deterministic Policy Gradients
Jiafei Lyu
Yu Yang
Jiangpeng Yan
Xiu Li
OffRL
AI4CE
39
5
0
21 Dec 2021
Transformers Can Do Bayesian Inference
Samuel G. Müller
Noah Hollmann
Sebastian Pineda Arango
Josif Grabocka
Frank Hutter
BDL
UQCV
25
141
0
20 Dec 2021
Towards Autonomous Satellite Communications: An AI-based Framework to Address System-level Challenges
J. Luis
Skylar Eiskowitz
Nils Pachler de la Osa
E. Crawley
B. Cameron
20
5
0
11 Dec 2021
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Angelos Filos
Eszter Vértes
Zita Marinho
Gregory Farquhar
Diana Borsa
A. Friesen
Feryal M. P. Behbahani
Tom Schaul
André Barreto
Simon Osindero
44
7
0
08 Dec 2021
Reinforcement Learning-based Switching Controller for a Milliscale Robot in a Constrained Environment
Abbas Tariverdi
Ulysse Côté-Allard
Kim Mathiassen
O. Elle
H. Kalvøy
Ø. Martinsen
J. Tørresen
16
4
0
27 Nov 2021
Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning
Nicolai Dorka
Tim Welschehold
Joschka Boedecker
Wolfram Burgard
OffRL
30
9
0
24 Nov 2021
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
Yanqiu Wu
Xinyue Chen
Che Wang
Yiming Zhang
Keith Ross
OffRL
11
9
0
17 Nov 2021
CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms
Shengyi Huang
Rousslan Fernand Julien Dossa
Chang Ye
Jeff Braga
OffRL
16
0
0
16 Nov 2021
Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning
Yingjie Fei
Zhuoran Yang
Yudong Chen
Zhaoran Wang
41
46
0
06 Nov 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning
Dhruv Shah
Peng Xu
Yao Lu
Ted Xiao
Alexander Toshev
Sergey Levine
Brian Ichter
OffRL
35
41
0
04 Nov 2021
Robust Dynamic Bus Control: A Distributional Multi-agent Reinforcement Learning Approach
Changyin Sun
Lijun Sun
19
6
0
02 Nov 2021
On the Expressivity of Markov Reward
David Abel
Will Dabney
Anna Harutyunyan
Mark K. Ho
Michael L. Littman
Doina Precup
Satinder Singh
29
82
0
01 Nov 2021
Learning to Be Cautious
Montaser Mohammedalamen
Dustin Morrill
Alexander Sieusahai
Yash Satsangi
Michael Bowling
18
3
0
29 Oct 2021
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning
Edoardo Cetin
Oya Celiktutan
OffRL
42
17
0
07 Oct 2021
Deep reinforcement learning for guidewire navigation in coronary artery phantom
Jihoon Kweon
Kyunghwan Kim
Chaehyuk Lee
Hwi Kwon
Jinwoo Park
...
Inwook Back
J. Roh
Y. Moon
Jaesoon Choi
Young-Hak Kim
OnRL
18
33
0
05 Oct 2021
On the Convergence of Projected Alternating Maximization for Equitable and Optimal Transport
Minhui Huang
Shiqian Ma
Lifeng Lai
35
3
0
29 Sep 2021
The
f
f
f
-Divergence Reinforcement Learning Framework
Chen Gong
Qiang He
Yunpeng Bai
Zhouyi Yang
Xiaoyu Chen
Xinwen Hou
Xianjie Zhang
Yu Liu
Guoliang Fan
34
3
0
24 Sep 2021
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience
C. Banerjee
Zhiyong Chen
N. Noman
19
30
0
24 Sep 2021
On Bonus-Based Exploration Methods in the Arcade Learning Environment
Adrien Ali Taïga
W. Fedus
Marlos C. Machado
Aaron Courville
Marc G. Bellemare
18
58
0
22 Sep 2021
Previous
1
2
3
4
5
6
Next