TensorFlow Agents: Efficient Batched Reinforcement Learning in TensorFlow

8 September 2017
Danijar Hafner
James Davidson
Vincent Vanhoucke
    OffRL

Papers citing "TensorFlow Agents: Efficient Batched Reinforcement Learning in TensorFlow"

10 / 10 papers shown
Lamarckian Platform: Pushing the Boundaries of Evolutionary Reinforcement Learning towards Asynchronous Commercial Games
Hui Bai
R. Shen
Yue Lin
Bo Xu
Ran Cheng
VLM
36
5
0
21 Sep 2022
Project proposal: A modular reinforcement learning based automated theorem prover
Boris Shminke
23
1
0
06 Sep 2022
Autonomous Reinforcement Learning via Subgoal Curricula
Archit Sharma
Abhishek Gupta
Sergey Levine
Karol Hausman
Chelsea Finn
27
27
0
27 Jul 2021
Graph Policy Gradients for Large Scale Robot Control
Arbaaz Khan
Ekaterina V. Tolstaya
Alejandro Ribeiro
Vijay Kumar
10
93
0
08 Jul 2019
Lyapunov-based Safe Policy Optimization for Continuous Control
Yinlam Chow
Ofir Nachum
Aleksandra Faust
Edgar A. Duénez-Guzmán
Mohammad Ghavamzadeh
33
244
0
28 Jan 2019
Dopamine: A Research Framework for Deep Reinforcement Learning
Pablo Samuel Castro
Subhodeep Moitra
Carles Gelada
Saurabh Kumar
Marc G. Bellemare
OffRL
25
276
0
14 Dec 2018
RLgraph: Modular Computation Graphs for Deep Reinforcement Learning
Michael Schaarschmidt
Sven Mika
Kai Fricke
Eiko Yoneki
OffRL
23
5
0
21 Oct 2018
PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation
Perttu Hämäläinen
Amin Babadi
Xiaoxiao Ma
J. Lehtinen
32
62
0
05 Oct 2018
Vehicle Communication Strategies for Simulated Highway Driving
Cinjon Resnick
I. Kulikov
Kyunghyun Cho
Jason Weston
22
7
0
19 Apr 2018
Emergence of Locomotion Behaviours in Rich Environments
N. Heess
TB Dhruva
S. Sriram
Jay Lemmon
J. Merel
...
Tom Erez
Ziyun Wang
S. M. Ali Eslami
Martin Riedmiller
David Silver
143
928
0
07 Jul 2017