ResearchTrend.AI
Reinforcement Learning with Exogenous States and Rewards

22 March 2023
George Trimponias
Thomas G. Dietterich
    OffRL

Papers citing "Reinforcement Learning with Exogenous States and Rewards"

9 / 9 papers shown
Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory
Alexander Levine
Peter Stone
Amy Zhang
OffRL
60
0
0
03 Oct 2024
Sample-Efficient Reinforcement Learning in the Presence of Exogenous Information
Yonathan Efroni
Dylan J. Foster
Dipendra Kumar Misra
A. Krishnamurthy
John Langford
OffRL
48
25
0
09 Jun 2022
Did I do that? Blame as a means to identify controlled effects in reinforcement learning
Oriol Corcoll
Youssef Mohamed
Raul Vicente
36
3
0
01 Jun 2021
Never Give Up: Learning Directed Exploration Strategies
Adria Puigdomenech Badia
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Bilal Piot
...
O. Tieleman
Martín Arjovsky
Alexander Pritzel
Andrew Bolt
Charles Blundell
58
294
0
14 Feb 2020
Formal Limitations on the Measurement of Mutual Information
David A. McAllester
K. Stratos
SSL
61
275
0
10 Nov 2018
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
106
2,423
0
15 May 2017
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
186
5,056
0
05 Jun 2016
f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization
Sebastian Nowozin
Botond Cseke
Ryota Tomioka
GAN
104
1,648
0
02 Jun 2016
Pymanopt: A Python Toolbox for Optimization on Manifolds using Automatic Differentiation
James Townsend
Niklas Koep
S. Weichwald
62
246
0
10 Mar 2016