ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1911.01546
  4. Cited By
Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy

Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy

5 November 2019
Ramtin Keramati
Christoph Dann
Alex Tamkin
Emma Brunskill
ArXivPDFHTML

Papers citing "Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy"

26 / 26 papers shown
Title
Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL
Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL
Zhikun Tao
Gang Xiong
He Fang
Zhen Shen
Yunjun Han
Qing-Shan Jia
OffRL
39
0
0
13 May 2025
Return Capping: Sample-Efficient CVaR Policy Gradient Optimisation
Return Capping: Sample-Efficient CVaR Policy Gradient Optimisation
Harry Mead
Clarissa Costen
Bruno Lacerda
Nick Hawes
29
0
0
29 Apr 2025
Beyond CVaR: Leveraging Static Spectral Risk Measures for Enhanced Decision-Making in Distributional Reinforcement Learning
Beyond CVaR: Leveraging Static Spectral Risk Measures for Enhanced Decision-Making in Distributional Reinforcement Learning
Mehrdad Moghimi
Hyejin Ku
OffRL
48
0
0
03 Jan 2025
Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence
Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence
Minheng Xiao
Xian Yu
Lei Ying
47
2
0
23 May 2024
Provable Risk-Sensitive Distributional Reinforcement Learning with
  General Function Approximation
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation
Yu Chen
Xiangcheng Zhang
Siwei Wang
Longbo Huang
47
3
0
28 Feb 2024
Distributional Off-Policy Evaluation for Slate Recommendations
Distributional Off-Policy Evaluation for Slate Recommendations
Shreyas Chaudhari
David Arbour
Georgios Theocharous
N. Vlassis
OffRL
46
0
0
27 Aug 2023
Cramer Type Distances for Learning Gaussian Mixture Models by Gradient
  Descent
Cramer Type Distances for Learning Gaussian Mixture Models by Gradient Descent
Ruichong Zhang
28
0
0
13 Jul 2023
Is Risk-Sensitive Reinforcement Learning Properly Resolved?
Is Risk-Sensitive Reinforcement Learning Properly Resolved?
Ruiwen Zhou
Minghuan Liu
Kan Ren
Xufang Luo
Weinan Zhang
Dongsheng Li
27
2
0
02 Jul 2023
Robust Route Planning with Distributional Reinforcement Learning in a
  Stochastic Road Network Environment
Robust Route Planning with Distributional Reinforcement Learning in a Stochastic Road Network Environment
Xi Lin
Paul Szenher
John D. Martin
Brendan Englot
31
2
0
19 Apr 2023
Toward Risk-based Optimistic Exploration for Cooperative Multi-Agent
  Reinforcement Learning
Toward Risk-based Optimistic Exploration for Cooperative Multi-Agent Reinforcement Learning
Ji-Yun Oh
Joonkee Kim
Minchan Jeong
Se-Young Yun
38
1
0
03 Mar 2023
Distributional Offline Policy Evaluation with Predictive Error
  Guarantees
Distributional Offline Policy Evaluation with Predictive Error Guarantees
Runzhe Wu
Masatoshi Uehara
Wen Sun
OffRL
40
13
0
19 Feb 2023
Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR
Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR
Kaiwen Wang
Nathan Kallus
Wen Sun
112
18
0
07 Feb 2023
Risk-Averse Model Uncertainty for Distributionally Robust Safe
  Reinforcement Learning
Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement Learning
James Queeney
M. Benosman
OOD
OffRL
46
5
0
30 Jan 2023
Bridging Distributional and Risk-sensitive Reinforcement Learning with
  Provable Regret Bounds
Bridging Distributional and Risk-sensitive Reinforcement Learning with Provable Regret Bounds
Hao Liang
Zhihui Luo
33
14
0
25 Oct 2022
Regret Bounds for Risk-Sensitive Reinforcement Learning
Regret Bounds for Risk-Sensitive Reinforcement Learning
Osbert Bastani
Y. Ma
E. Shen
Wei Xu
46
18
0
11 Oct 2022
Enforcing Delayed-Impact Fairness Guarantees
Enforcing Delayed-Impact Fairness Guarantees
Aline Weber
Blossom Metevier
Yuriy Brun
Philip S. Thomas
Bruno Castro da Silva
FaML
35
9
0
24 Aug 2022
Provably Efficient Risk-Sensitive Reinforcement Learning: Iterated CVaR
  and Worst Path
Provably Efficient Risk-Sensitive Reinforcement Learning: Iterated CVaR and Worst Path
Yihan Du
Siwei Wang
Longbo Huang
OOD
34
13
0
06 Jun 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still
  Insufficient according to an Off-Policy Measure
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
43
8
0
20 May 2022
Risk-aware Stochastic Shortest Path
Risk-aware Stochastic Shortest Path
Tobias Meggendorfer
11
9
0
03 Mar 2022
Conservative Offline Distributional Reinforcement Learning
Conservative Offline Distributional Reinforcement Learning
Yecheng Jason Ma
Dinesh Jayaraman
Osbert Bastani
OffRL
73
79
0
12 Jul 2021
Universal Off-Policy Evaluation
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRL
ELM
39
52
0
26 Apr 2021
Off-Policy Risk Assessment in Contextual Bandits
Off-Policy Risk Assessment in Contextual Bandits
Audrey Huang
Liu Leqi
Zachary Chase Lipton
Kamyar Azizzadenesheli
OffRL
32
36
0
18 Apr 2021
Lyapunov Barrier Policy Optimization
Lyapunov Barrier Policy Optimization
Harshit S. Sikchi
Wenxuan Zhou
David Held
34
14
0
16 Mar 2021
Risk-Averse Bayes-Adaptive Reinforcement Learning
Risk-Averse Bayes-Adaptive Reinforcement Learning
Marc Rigter
Bruno Lacerda
Nick Hawes
30
43
0
10 Feb 2021
The Potential of the Return Distribution for Exploration in RL
The Potential of the Return Distribution for Exploration in RL
Thomas M. Moerland
Joost Broekens
Catholijn M. Jonker
29
9
0
11 Jun 2018
Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach
Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach
Yinlam Chow
Aviv Tamar
Shie Mannor
Marco Pavone
73
314
0
06 Jun 2015
1