ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.02893
  4. Cited By
Rethinking the Discount Factor in Reinforcement Learning: A Decision
  Theoretic Approach

Rethinking the Discount Factor in Reinforcement Learning: A Decision Theoretic Approach

8 February 2019
Silviu Pitis
    OffRL
ArXivPDFHTML

Papers citing "Rethinking the Discount Factor in Reinforcement Learning: A Decision Theoretic Approach"

11 / 11 papers shown
Title
On shallow planning under partial observability
On shallow planning under partial observability
Randy Lefebvre
Audrey Durand
OffRL
33
0
0
22 Jul 2024
Three Dogmas of Reinforcement Learning
Three Dogmas of Reinforcement Learning
David Abel
Mark K. Ho
A. Harutyunyan
38
5
0
15 Jul 2024
On the Expressivity of Multidimensional Markov Reward
On the Expressivity of Multidimensional Markov Reward
Shuwa Miura
18
4
0
22 Jul 2023
Markov Decision Processes with Time-Varying Geometric Discounting
Markov Decision Processes with Time-Varying Geometric Discounting
Jiarui Gan
Ann-Kathrin Hennes
R. Majumdar
Debmalya Mandal
Goran Radanović
13
1
0
19 Jul 2023
Factors of Influence of the Overestimation Bias of Q-Learning
Factors of Influence of the Overestimation Bias of Q-Learning
Julius Wagenbach
M. Sabatelli
15
1
0
11 Oct 2022
LCRL: Certified Policy Synthesis via Logically-Constrained Reinforcement
  Learning
LCRL: Certified Policy Synthesis via Logically-Constrained Reinforcement Learning
Mohammadhosein Hasanbeig
Daniel Kroening
Alessandro Abate
18
15
0
21 Sep 2022
Utility Theory for Sequential Decision Making
Utility Theory for Sequential Decision Making
Mehran Shakerinava
Siamak Ravanbakhsh
29
7
0
27 Jun 2022
ProActive: Self-Attentive Temporal Point Process Flows for Activity
  Sequences
ProActive: Self-Attentive Temporal Point Process Flows for Activity Sequences
Vinayak Gupta
Srikanta J. Bedathur
AI4TS
22
16
0
10 Jun 2022
On the Expressivity of Markov Reward
On the Expressivity of Markov Reward
David Abel
Will Dabney
A. Harutyunyan
Mark K. Ho
Michael L. Littman
Doina Precup
Satinder Singh
26
82
0
01 Nov 2021
EnTRPO: Trust Region Policy Optimization Method with Entropy
  Regularization
EnTRPO: Trust Region Policy Optimization Method with Entropy Regularization
Sahar Roostaie
M. Ebadzadeh
11
3
0
26 Oct 2021
Hyperbolic Discounting and Learning over Multiple Horizons
Hyperbolic Discounting and Learning over Multiple Horizons
W. Fedus
Carles Gelada
Yoshua Bengio
Marc G. Bellemare
Hugo Larochelle
21
105
0
19 Feb 2019
1