Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1911.01546
Cited By
Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy
5 November 2019
Ramtin Keramati
Christoph Dann
Alex Tamkin
Emma Brunskill
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy"
26 / 26 papers shown
Title
Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL
Zhikun Tao
Gang Xiong
He Fang
Zhen Shen
Yunjun Han
Qing-Shan Jia
OffRL
39
0
0
13 May 2025
Return Capping: Sample-Efficient CVaR Policy Gradient Optimisation
Harry Mead
Clarissa Costen
Bruno Lacerda
Nick Hawes
29
0
0
29 Apr 2025
Beyond CVaR: Leveraging Static Spectral Risk Measures for Enhanced Decision-Making in Distributional Reinforcement Learning
Mehrdad Moghimi
Hyejin Ku
OffRL
48
0
0
03 Jan 2025
Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence
Minheng Xiao
Xian Yu
Lei Ying
47
2
0
23 May 2024
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation
Yu Chen
Xiangcheng Zhang
Siwei Wang
Longbo Huang
47
3
0
28 Feb 2024
Distributional Off-Policy Evaluation for Slate Recommendations
Shreyas Chaudhari
David Arbour
Georgios Theocharous
N. Vlassis
OffRL
46
0
0
27 Aug 2023
Cramer Type Distances for Learning Gaussian Mixture Models by Gradient Descent
Ruichong Zhang
28
0
0
13 Jul 2023
Is Risk-Sensitive Reinforcement Learning Properly Resolved?
Ruiwen Zhou
Minghuan Liu
Kan Ren
Xufang Luo
Weinan Zhang
Dongsheng Li
27
2
0
02 Jul 2023
Robust Route Planning with Distributional Reinforcement Learning in a Stochastic Road Network Environment
Xi Lin
Paul Szenher
John D. Martin
Brendan Englot
31
2
0
19 Apr 2023
Toward Risk-based Optimistic Exploration for Cooperative Multi-Agent Reinforcement Learning
Ji-Yun Oh
Joonkee Kim
Minchan Jeong
Se-Young Yun
38
1
0
03 Mar 2023
Distributional Offline Policy Evaluation with Predictive Error Guarantees
Runzhe Wu
Masatoshi Uehara
Wen Sun
OffRL
40
13
0
19 Feb 2023
Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR
Kaiwen Wang
Nathan Kallus
Wen Sun
112
18
0
07 Feb 2023
Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement Learning
James Queeney
M. Benosman
OOD
OffRL
46
5
0
30 Jan 2023
Bridging Distributional and Risk-sensitive Reinforcement Learning with Provable Regret Bounds
Hao Liang
Zhihui Luo
33
14
0
25 Oct 2022
Regret Bounds for Risk-Sensitive Reinforcement Learning
Osbert Bastani
Y. Ma
E. Shen
Wei Xu
46
18
0
11 Oct 2022
Enforcing Delayed-Impact Fairness Guarantees
Aline Weber
Blossom Metevier
Yuriy Brun
Philip S. Thomas
Bruno Castro da Silva
FaML
35
9
0
24 Aug 2022
Provably Efficient Risk-Sensitive Reinforcement Learning: Iterated CVaR and Worst Path
Yihan Du
Siwei Wang
Longbo Huang
OOD
34
13
0
06 Jun 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
43
8
0
20 May 2022
Risk-aware Stochastic Shortest Path
Tobias Meggendorfer
11
9
0
03 Mar 2022
Conservative Offline Distributional Reinforcement Learning
Yecheng Jason Ma
Dinesh Jayaraman
Osbert Bastani
OffRL
73
79
0
12 Jul 2021
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRL
ELM
39
52
0
26 Apr 2021
Off-Policy Risk Assessment in Contextual Bandits
Audrey Huang
Liu Leqi
Zachary Chase Lipton
Kamyar Azizzadenesheli
OffRL
32
36
0
18 Apr 2021
Lyapunov Barrier Policy Optimization
Harshit S. Sikchi
Wenxuan Zhou
David Held
34
14
0
16 Mar 2021
Risk-Averse Bayes-Adaptive Reinforcement Learning
Marc Rigter
Bruno Lacerda
Nick Hawes
30
43
0
10 Feb 2021
The Potential of the Return Distribution for Exploration in RL
Thomas M. Moerland
Joost Broekens
Catholijn M. Jonker
29
9
0
11 Jun 2018
Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach
Yinlam Chow
Aviv Tamar
Shie Mannor
Marco Pavone
73
314
0
06 Jun 2015
1