Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1805.12114
Cited By
Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models
30 May 2018
Kurtland Chua
Roberto Calandra
R. McAllister
Sergey Levine
BDL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
50 / 339 papers shown
Title
Data-driven control of spatiotemporal chaos with reduced-order neural ODE-based models and reinforcement learning
Kevin Zeng
Alec J. Linot
M. Graham
AI4CE
27
28
0
01 May 2022
BATS: Best Action Trajectory Stitching
I. Char
Viraj Mehta
Adam R. Villaflor
John M. Dolan
J. Schneider
OffRL
38
8
0
26 Apr 2022
Safe Reinforcement Learning Using Black-Box Reachability Analysis
Mahmoud Selim
Amr Alanwar
Shreyas Kousik
Grace Gao
Marco Pavone
Karl H. Johansson
34
32
0
15 Apr 2022
Gradient-Based Trajectory Optimization With Learned Dynamics
Bhavya Sukhija
Nathanael Kohler
Miguel Zamora
Simon Zimmermann
Sebastian Curi
Andreas Krause
Stelian Coros
32
9
0
09 Apr 2022
Hybrid LMC: Hybrid Learning and Model-based Control for Wheeled Humanoid Robot via Ensemble Deep Reinforcement Learning
D. Baek
Amartya Purushottam
Joao Ramos
31
9
0
07 Apr 2022
Hybrid Predictive Coding: Inferring, Fast and Slow
Alexander Tschantz
Beren Millidge
A. Seth
Christopher L. Buckley
29
36
0
05 Apr 2022
Revisiting Model-based Value Expansion
Daniel Palenicek
M. Lutter
Jan Peters
32
2
0
28 Mar 2022
Aggressive Quadrotor Flight Using Curiosity-Driven Reinforcement Learning
Q. Sun
Jinbao Fang
Weixing Zheng
Yang Tang
19
27
0
26 Mar 2022
Competency Assessment for Autonomous Agents using Deep Generative Models
Aastha Acharya
Rebecca L. Russell
Nisar R. Ahmed
37
10
0
23 Mar 2022
Investigating Compounding Prediction Errors in Learned Dynamics Models
Nathan Lambert
K. Pister
Roberto Calandra
AI4CE
22
27
0
17 Mar 2022
Hyperbolic Uncertainty Aware Semantic Segmentation
Bike Chen
Wei Peng
Xiaofeng Cao
Juha Roning
UQCV
31
15
0
16 Mar 2022
Temporal Difference Learning for Model Predictive Control
Nicklas Hansen
Xiaolong Wang
H. Su
PINN
MU
41
229
0
09 Mar 2022
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Zhuoran Yang
Zhihong Deng
Animesh Garg
Peng Liu
Zhaoran Wang
OffRL
45
132
0
23 Feb 2022
Inference of Affordances and Active Motor Control in Simulated Agents
Fedor Scholz
Christian Gumbsch
S. Otte
Martin Volker Butz
AI4CE
37
5
0
23 Feb 2022
DeepONet-Grid-UQ: A Trustworthy Deep Operator Framework for Predicting the Power Grid's Post-Fault Trajectories
Christian Moya
Shiqi Zhang
Meng Yue
Guang Lin
35
42
0
15 Feb 2022
Saute RL: Almost Surely Safe Reinforcement Learning Using State Augmentation
Aivar Sootla
Alexander I. Cowen-Rivers
Taher Jafferjee
Ziyan Wang
D. Mguni
Jun Wang
Haitham Bou-Ammar
37
54
0
14 Feb 2022
Deconstructing the Inductive Biases of Hamiltonian Neural Networks
Nate Gruver
Marc Finzi
Samuel Stanton
A. Wilson
AI4CE
31
40
0
10 Feb 2022
Data-Driven Chance Constrained Control using Kernel Distribution Embeddings
Adam J. Thorpe
T. Lew
Meeko Oishi
Marco Pavone
38
21
0
08 Feb 2022
Physical Design using Differentiable Learned Simulators
Kelsey R. Allen
Tatiana López-Guevara
Kimberly L. Stachenfeld
Alvaro Sanchez-Gonzalez
Peter W. Battaglia
Jessica B. Hamrick
Tobias Pfaff
AI4CE
34
42
0
01 Feb 2022
Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning
Haichao Zhang
Wei Xu
Haonan Yu
38
10
0
24 Jan 2022
Reinforcement Learning for Personalized Drug Discovery and Design for Complex Diseases: A Systems Pharmacology Perspective
Ryan K. Tan
Yang Liu
Lei Xie
49
2
0
21 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
38
100
0
11 Jan 2022
Multiagent Model-based Credit Assignment for Continuous Control
Dongge Han
Chris Xiaoxuan Lu
Tomasz P. Michalak
Michael Wooldridge
27
5
0
27 Dec 2021
Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic
Zhihai Wang
Jie Wang
Qi Zhou
Bin Li
Houqiang Li
27
30
0
16 Dec 2021
CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement Learning
Kevin Huang
Sahin Lale
Ugo Rosolia
Yuanyuan Shi
Anima Anandkumar
21
8
0
14 Dec 2021
Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning
Yecheng Jason Ma
Andrew Shen
Osbert Bastani
Dinesh Jayaraman
21
25
0
14 Dec 2021
Calibrated and Sharp Uncertainties in Deep Learning via Density Estimation
Volodymyr Kuleshov
Shachi Deshpande
UQCV
BDL
40
34
0
14 Dec 2021
Safe Autonomous Navigation for Systems with Learned SE(3) Hamiltonian Dynamics
Zhichao Li
T. Duong
Nikolay Atanasov
32
2
0
09 Dec 2021
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Angelos Filos
Eszter Vértes
Zita Marinho
Gregory Farquhar
Diana Borsa
A. Friesen
Feryal M. P. Behbahani
Tom Schaul
André Barreto
Simon Osindero
44
7
0
08 Dec 2021
ED2: Environment Dynamics Decomposition World Models for Continuous Control
Jianye Hao
Yifu Yuan
Cong Wang
Zhen Wang
OffRL
21
1
0
06 Dec 2021
Residual Pathway Priors for Soft Equivariance Constraints
Marc Finzi
Gregory W. Benton
A. Wilson
BDL
UQCV
24
54
0
02 Dec 2021
A Free Lunch from the Noise: Provable and Practical Exploration for Representation Learning
Tongzheng Ren
Tianjun Zhang
Csaba Szepesvári
Bo Dai
37
19
0
22 Nov 2021
ModelLight: Model-Based Meta-Reinforcement Learning for Traffic Signal Control
Xingshuai Huang
Di Wu
M. Jenkin
Benoit Boulet
13
15
0
15 Nov 2021
Look Before You Leap: Safe Model-Based Reinforcement Learning with Human Intervention
Yunkun Xu
Zhen-yu Liu
Guifang Duan
Jiangcheng Zhu
X. Bai
Jianrong Tan
18
9
0
10 Nov 2021
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Jinning Li
Chen Tang
Masayoshi Tomizuka
Wei Zhan
OffRL
24
21
0
09 Nov 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning
Dhruv Shah
Peng Xu
Yao Lu
Ted Xiao
Alexander Toshev
Sergey Levine
Brian Ichter
OffRL
39
41
0
04 Nov 2021
Dynamic Mirror Descent based Model Predictive Control for Accelerating Robot Learning
Utkarsh Aashu Mishra
Soumya R. Samineni
Prakhar Goel
Chandravaran Kunjeti
Himanshu Lodha
Aman Singh
Aditya Sagi
S. Bhatnagar
Shishir Kolathaya
32
3
0
04 Nov 2021
Mastering Atari Games with Limited Data
Weirui Ye
Shao-Wei Liu
Thanard Kurutach
Pieter Abbeel
Yang Gao
VLM
64
227
0
30 Oct 2021
GalilAI: Out-of-Task Distribution Detection using Causal Active Experimentation for Safe Transfer RL
Sumedh Anand Sontakke
Stephen Iota
Zizhao Hu
Arash Mehrjou
Laurent Itti
Bernhard Schölkopf
OODD
22
2
0
29 Oct 2021
Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning
Junsup Kim
Younggyo Seo
Jinwoo Shin
28
58
0
26 Oct 2021
Which Model to Trust: Assessing the Influence of Models on the Performance of Reinforcement Learning Algorithms for Continuous Control Tasks
Giacomo Arcieri
David Wölfle
Eleni Chatzi
OffRL
27
5
0
25 Oct 2021
A Differentiable Newton-Euler Algorithm for Real-World Robotics
M. Lutter
Vallijah Subasri
Joe Watson
Frank Rudzicz
29
7
0
24 Oct 2021
Policy Search using Dynamic Mirror Descent MPC for Model Free Off Policy RL
Aarush Gupta
30
0
0
23 Oct 2021
Improving Hyperparameter Optimization by Planning Ahead
H. Jomaa
Jonas K. Falkner
Lars Schmidt-Thieme
22
0
0
15 Oct 2021
Planning from Pixels in Environments with Combinatorially Hard Search Spaces
Marco Bagatella
Miroslav Olsák
Michal Rolínek
Georg Martius
OffRL
26
6
0
12 Oct 2021
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning
Edoardo Cetin
Oya Celiktutan
OffRL
47
17
0
07 Oct 2021
Evaluating model-based planning and planner amortization for continuous control
Arunkumar Byravan
Leonard Hasenclever
Piotr Trochim
M. Berk Mirza
Alessandro Davide Ialongo
...
Jost Tobias Springenberg
A. Abdolmaleki
N. Heess
J. Merel
Martin Riedmiller
55
17
0
07 Oct 2021
Propagating State Uncertainty Through Trajectory Forecasting
Boris Ivanovic
Yifeng Lin
Shubham Shrivastava
Punarjay Chakravarty
Marco Pavone
83
18
0
07 Oct 2021
Imaginary Hindsight Experience Replay: Curious Model-based Learning for Sparse Reward Tasks
Robert McCarthy
Qiang Wang
S. Redmond
OffRL
32
15
0
05 Oct 2021
Dropout Q-Functions for Doubly Efficient Reinforcement Learning
Takuya Hiraoka
Takahisa Imagawa
Taisei Hashimoto
Takashi Onishi
Yoshimasa Tsuruoka
19
106
0
05 Oct 2021
Previous
1
2
3
4
5
6
7
Next