ResearchTrend.AI
An Invitation to Deep Reinforcement Learning

13 December 2023
Bernhard Jaeger
Andreas Geiger
    OffRL, OOD

Papers citing "An Invitation to Deep Reinforcement Learning"

8 / 108 papers shown
Title
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
415
13,333
0
09 Sep 2015
Deep Recurrent Q-Learning for Partially Observable MDPs
Matthew J. Hausknecht
Peter Stone
140
1,686
0
23 Jul 2015
Language Understanding for Text-based Games Using Deep Reinforcement Learning
Karthik Narasimhan
Tejas D. Kulkarni
Regina Barzilay
OffRL
109
362
0
30 Jun 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
166
3,448
0
08 Jun 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
293
6,827
0
19 Feb 2015
The Optimal Reward Baseline for Gradient-Based Reinforcement Learning
Lex Weaver
Nigel Tao
124
249
0
10 Jan 2013
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
126
3,025
0
19 Jul 2012
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
317
3,239
0
02 Nov 2010