arXiv:1902.01240
PIPPS: Flexible Model-Based Policy Search Robust to the Curse of Chaos
4 February 2019
Paavo Parmas
C. Rasmussen
Jan Peters
Kenji Doya
Papers citing "PIPPS: Flexible Model-Based Policy Search Robust to the Curse of Chaos"
18 / 18 papers shown
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Eliot Xing
Vernon Luk
Jean Oh
97
0
0
16 Dec 2024
Gradient Informed Proximal Policy Optimization
Sanghyun Son
L. Zheng
Ryan Sullivan
Yi-Ling Qiao
Ming-Chyuan Lin
37
7
0
14 Dec 2023
Variance-Reduced Gradient Estimation via Noise-Reuse in Online Evolution Strategies
Oscar Li
James Harrison
Jascha Narain Sohl-Dickstein
Virginia Smith
Luke Metz
56
5
0
21 Apr 2023
Integrated Decision and Control for High-Level Automated Vehicles by Mixed Policy Gradient and Its Experiment Verification
Yang Guan
Liye Tang
Chuanxiao Li
Shengbo Eben Li
Yangang Ren
Junqing Wei
Bo Zhang
Ke Li
23
0
0
19 Oct 2022
Symbolic Learning to Optimize: Towards Interpretability and Scalability
Wenqing Zheng
Tianlong Chen
Ting-Kuei Hu
Zhangyang Wang
50
19
0
13 Mar 2022
Tutorial on amortized optimization
Brandon Amos
OffRL
78
43
0
01 Feb 2022
Reinforcement Learning for Personalized Drug Discovery and Design for Complex Diseases: A Systems Pharmacology Perspective
Ryan K. Tan
Yang Liu
Lei Xie
49
2
0
21 Jan 2022
Unbiased Gradient Estimation in Unrolled Computation Graphs with Persistent Evolution Strategies
Paul Vicol
Luke Metz
Jascha Narain Sohl-Dickstein
27
68
0
27 Dec 2021
Gradients are Not All You Need
Luke Metz
C. Freeman
S. Schoenholz
Tal Kachman
30
93
0
10 Nov 2021
Learning Robust Controllers Via Probabilistic Model-Based Policy Search
V. Charvet
B. S. Jensen
R. Murray-Smith
19
2
0
26 Oct 2021
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators
Anish Agarwal
Abdullah Alomar
Varkey Alumootil
Devavrat Shah
Dennis Shen
Zhi Xu
Cindy Yang
OffRL
18
18
0
13 Feb 2021
OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning
Anurag Ajay
Aviral Kumar
Pulkit Agrawal
Sergey Levine
Ofir Nachum
OffRL
OnRL
39
155
0
26 Oct 2020
Training Stronger Baselines for Learning to Optimize
Tianlong Chen
Weiyi Zhang
Jingyang Zhou
Shiyu Chang
Sijia Liu
Lisa Amini
Zhangyang Wang
OffRL
27
51
0
18 Oct 2020
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
53
823
0
05 Oct 2020
Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning
Sebastian Curi
Felix Berkenkamp
Andreas Krause
33
82
0
15 Jun 2020
Regularizing Model-Based Planning with Energy-Based Models
Rinu Boney
Arno Solin
Alexander Ilin
11
18
0
12 Oct 2019
Monte Carlo Gradient Estimation in Machine Learning
S. Mohamed
Mihaela Rosca
Michael Figurnov
A. Mnih
45
400
0
25 Jun 2019
A survey on policy search algorithms for learning robot controllers in a handful of trials
Konstantinos Chatzilygeroudis
Vassilis Vassiliades
F. Stulp
Sylvain Calinon
Jean-Baptiste Mouret
17
155
0
06 Jul 2018