ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.12611
  4. Cited By
Action-Dependent Optimality-Preserving Reward Shaping

Action-Dependent Optimality-Preserving Reward Shaping

19 May 2025
Grant C. Forbes
Jianxun Wang
Leonardo Villalobos-Arias
Arnav Jhala
David L. Roberts
    OffRL
ArXivPDFHTML

Papers citing "Action-Dependent Optimality-Preserving Reward Shaping"

9 / 9 papers shown
Title
Potential-Based Reward Shaping For Intrinsic Motivation
Potential-Based Reward Shaping For Intrinsic Motivation
Grant C. Forbes
Nitish Gupta
Leonardo Villalobos-Arias
Colin M. Potts
Arnav Jhala
David L. Roberts
6
5
0
12 Feb 2024
Beyond Surprise: Improving Exploration Through Surprise Novelty
Beyond Surprise: Improving Exploration Through Surprise Novelty
Hung Le
Kien Do
D. Nguyen
Svetha Venkatesh
48
3
0
09 Aug 2023
Redeeming Intrinsic Rewards via Constrained Optimization
Redeeming Intrinsic Rewards via Constrained Optimization
Eric Chen
Zhang-Wei Hong
Joni Pajarinen
Pulkit Agrawal
OnRL
49
24
0
14 Nov 2022
Never Give Up: Learning Directed Exploration Strategies
Never Give Up: Learning Directed Exploration Strategies
Adria Puigdomenech Badia
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Bilal Piot
...
O. Tieleman
Martín Arjovsky
Alexander Pritzel
Andew Bolt
Charles Blundell
70
297
0
14 Feb 2020
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
444
18,931
0
20 Jul 2017
Curiosity-driven Exploration by Self-supervised Prediction
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
106
2,433
0
15 May 2017
Unifying Count-Based Exploration and Intrinsic Motivation
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
167
1,473
0
06 Jun 2016
The Arcade Learning Environment: An Evaluation Platform for General
  Agents
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
109
3,002
0
19 Jul 2012
Potential-Based Shaping and Q-Value Initialization are Equivalent
Potential-Based Shaping and Q-Value Initialization are Equivalent
Eric Wiewiora
OffRL
58
178
0
26 Jun 2011
1