ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.12114
  4. Cited By
Deep Reinforcement Learning in a Handful of Trials using Probabilistic
  Dynamics Models

Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models

30 May 2018
Kurtland Chua
Roberto Calandra
R. McAllister
Sergey Levine
    BDL
ArXivPDFHTML

Papers citing "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"

50 / 339 papers shown
Title
Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled
  Wireless Networks: A Tutorial
Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial
Amal Feriani
Ekram Hossain
40
237
0
06 Nov 2020
Differentiable Physics Models for Real-world Offline Model-based
  Reinforcement Learning
Differentiable Physics Models for Real-world Offline Model-based Reinforcement Learning
M. Lutter
Johannes Silberbauer
Joe Watson
Jan Peters
OffRL
29
33
0
03 Nov 2020
Generative Temporal Difference Learning for Infinite-Horizon Prediction
Generative Temporal Difference Learning for Infinite-Horizon Prediction
Michael Janner
Igor Mordatch
Sergey Levine
AI4CE
28
34
0
27 Oct 2020
High Acceleration Reinforcement Learning for Real-World Juggling with
  Binary Rewards
High Acceleration Reinforcement Learning for Real-World Juggling with Binary Rewards
Kai Ploeger
M. Lutter
Jan Peters
22
29
0
26 Oct 2020
Action-Conditional Recurrent Kalman Networks For Forward and Inverse
  Dynamics Learning
Action-Conditional Recurrent Kalman Networks For Forward and Inverse Dynamics Learning
Vaisakh Shaj
P. Becker
Le Chen
Harit Pandya
Niels van Duijkeren
C. J. Taylor
Marc Hanheide
Gerhard Neumann
AI4CE
19
14
0
20 Oct 2020
A Differentiable Newton Euler Algorithm for Multi-body Model Learning
A Differentiable Newton Euler Algorithm for Multi-body Model Learning
M. Lutter
Johannes Silberbauer
Joe Watson
Jan Peters
21
11
0
19 Oct 2020
Model-based Policy Optimization with Unsupervised Model Adaptation
Model-based Policy Optimization with Unsupervised Model Adaptation
Jian Shen
Han Zhao
Weinan Zhang
Yong Yu
37
28
0
19 Oct 2020
Improving Sequential Latent Variable Models with Autoregressive Flows
Improving Sequential Latent Variable Models with Autoregressive Flows
Joseph Marino
Lei Chen
Jiawei He
Stephan Mandt
BDL
AI4TS
30
12
0
07 Oct 2020
Mastering Atari with Discrete World Models
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
55
823
0
05 Oct 2020
Heteroscedastic Bayesian Optimisation for Stochastic Model Predictive
  Control
Heteroscedastic Bayesian Optimisation for Stochastic Model Predictive Control
Rel Guzman
Rafael Oliveira
F. Ramos
29
15
0
01 Oct 2020
Continual Model-Based Reinforcement Learning with Hypernetworks
Continual Model-Based Reinforcement Learning with Hypernetworks
Yizhou Huang
Kevin Xie
Homanga Bharadhwaj
Florian Shkurti
CLL
27
46
0
25 Sep 2020
Nonholonomic Yaw Control of an Underactuated Flying Robot with
  Model-based Reinforcement Learning
Nonholonomic Yaw Control of an Underactuated Flying Robot with Model-based Reinforcement Learning
Nathan Lambert
Craig B. Schindler
Daniel S. Drew
K. Pister
22
6
0
02 Sep 2020
Document-editing Assistants and Model-based Reinforcement Learning as a
  Path to Conversational AI
Document-editing Assistants and Model-based Reinforcement Learning as a Path to Conversational AI
Katya Kudashkina
P. Pilarski
R. Sutton
KELM
25
6
0
27 Aug 2020
Learning Off-Policy with Online Planning
Learning Off-Policy with Online Planning
Harshit S. Sikchi
Wenxuan Zhou
David Held
OffRL
39
46
0
23 Aug 2020
Beyond Point Estimate: Inferring Ensemble Prediction Variation from
  Neuron Activation Strength in Recommender Systems
Beyond Point Estimate: Inferring Ensemble Prediction Variation from Neuron Activation Strength in Recommender Systems
Zhe Chen
Yuyan Wang
Dong Lin
D. Cheng
Lichan Hong
Ed H. Chi
Claire Cui
31
16
0
17 Aug 2020
Sample-efficient Cross-Entropy Method for Real-time Planning
Sample-efficient Cross-Entropy Method for Real-time Planning
Cristina Pinneri
Shambhuraj Sawant
Sebastian Blaes
Jan Achterhold
Joerg Stueckler
Michal Rolínek
Georg Martius
37
98
0
14 Aug 2020
SafePILCO: a software tool for safe and data-efficient policy synthesis
SafePILCO: a software tool for safe and data-efficient policy synthesis
Kyriakos Polymenakos
Nikitas Rontsis
Alessandro Abate
Stephen J. Roberts
32
6
0
07 Aug 2020
Towards General and Autonomous Learning of Core Skills: A Case Study in
  Locomotion
Towards General and Autonomous Learning of Core Skills: A Case Study in Locomotion
Roland Hafner
Tim Hertweck
Philipp Kloppner
Michael Bloesch
Michael Neunert
Markus Wulfmeier
S. Tunyasuvunakool
N. Heess
Martin Riedmiller
22
19
0
06 Aug 2020
An Iterative LQR Controller for Off-Road and On-Road Vehicles using a
  Neural Network Dynamics Model
An Iterative LQR Controller for Off-Road and On-Road Vehicles using a Neural Network Dynamics Model
Akhil Nagariya
Srikanth Saripalli
27
28
0
28 Jul 2020
Predictive Information Accelerates Learning in RL
Predictive Information Accelerates Learning in RL
Kuang-Huei Lee
Ian S. Fischer
Anthony Z. Liu
Yijie Guo
Honglak Lee
John F. Canny
S. Guadarrama
23
73
0
24 Jul 2020
Control as Hybrid Inference
Control as Hybrid Inference
Alexander Tschantz
Beren Millidge
A. Seth
Christopher L. Buckley
21
9
0
11 Jul 2020
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep
  Reinforcement Learning
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
Kimin Lee
Michael Laskin
A. Srinivas
Pieter Abbeel
OffRL
25
199
0
09 Jul 2020
Selective Dyna-style Planning Under Limited Model Capacity
Selective Dyna-style Planning Under Limited Model Capacity
Zaheer Abbas
Samuel Sokota
Erin J. Talvitie
Martha White
45
32
0
05 Jul 2020
Information Theoretic Regret Bounds for Online Nonlinear Control
Information Theoretic Regret Bounds for Online Nonlinear Control
Sham Kakade
A. Krishnamurthy
Kendall Lowrey
Motoya Ohnishi
Wen Sun
38
117
0
22 Jun 2020
Task-Agnostic Online Reinforcement Learning with an Infinite Mixture of
  Gaussian Processes
Task-Agnostic Online Reinforcement Learning with an Infinite Mixture of Gaussian Processes
Mengdi Xu
Wenhao Ding
Jiacheng Zhu
Zuxin Liu
Baiming Chen
Ding Zhao
CLL
OffRL
30
34
0
19 Jun 2020
Learning Invariant Representations for Reinforcement Learning without
  Reconstruction
Learning Invariant Representations for Reinforcement Learning without Reconstruction
Amy Zhang
R. McAllister
Roberto Calandra
Y. Gal
Sergey Levine
OOD
SSL
60
464
0
18 Jun 2020
Model Embedding Model-Based Reinforcement Learning
Model Embedding Model-Based Reinforcement Learning
Xiao Tan
Chao Qu
Junwu Xiong
James Y. Zhang
OffRL
27
0
0
16 Jun 2020
Model-based Adversarial Meta-Reinforcement Learning
Model-based Adversarial Meta-Reinforcement Learning
Zichuan Lin
G. Thomas
Guangwen Yang
Tengyu Ma
OOD
29
52
0
16 Jun 2020
Efficient Model-Based Reinforcement Learning through Optimistic Policy
  Search and Planning
Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning
Sebastian Curi
Felix Berkenkamp
Andreas Krause
38
82
0
15 Jun 2020
Maximum Entropy Model Rollouts: Fast Model Based Policy Optimization
  without Compounding Errors
Maximum Entropy Model Rollouts: Fast Model Based Policy Optimization without Compounding Errors
Chi Zhang
S. Kuppannagari
Viktor Prasanna
22
4
0
08 Jun 2020
Wat zei je? Detecting Out-of-Distribution Translations with Variational
  Transformers
Wat zei je? Detecting Out-of-Distribution Translations with Variational Transformers
Tim Z. Xiao
Aidan Gomez
Y. Gal
UQLM
17
33
0
08 Jun 2020
PlanGAN: Model-based Planning With Sparse Rewards and Multiple Goals
PlanGAN: Model-based Planning With Sparse Rewards and Multiple Goals
Henry Charlesworth
Giovanni Montana
OffRL
29
24
0
01 Jun 2020
Model-Augmented Actor-Critic: Backpropagating through Paths
Model-Augmented Actor-Critic: Backpropagating through Paths
I. Clavera
Yao Fu
Pieter Abbeel
44
87
0
16 May 2020
Context-aware Dynamics Model for Generalization in Model-Based
  Reinforcement Learning
Context-aware Dynamics Model for Generalization in Model-Based Reinforcement Learning
Kimin Lee
Younggyo Seo
Seunghyun Lee
Honglak Lee
Jinwoo Shin
45
125
0
14 May 2020
Planning to Explore via Self-Supervised World Models
Planning to Explore via Self-Supervised World Models
Ramanan Sekar
Oleh Rybkin
Kostas Daniilidis
Pieter Abbeel
Danijar Hafner
Deepak Pathak
SSL
33
401
0
12 May 2020
Delay-Aware Model-Based Reinforcement Learning for Continuous Control
Delay-Aware Model-Based Reinforcement Learning for Continuous Control
Baiming Chen
Mengdi Xu
Liang-Sheng Li
Ding Zhao
OffRL
45
63
0
11 May 2020
Reinforcement Learning with Augmented Data
Reinforcement Learning with Augmented Data
Michael Laskin
Kimin Lee
Adam Stooke
Lerrel Pinto
Pieter Abbeel
A. Srinivas
OffRL
20
648
0
30 Apr 2020
Structured Mechanical Models for Robot Learning and Control
Structured Mechanical Models for Robot Learning and Control
Jayesh K. Gupta
Kunal Menda
Zachary Manchester
Mykel J. Kochenderfer
DRL
26
34
0
21 Apr 2020
Model-Predictive Control via Cross-Entropy and Gradient-Based
  Optimization
Model-Predictive Control via Cross-Entropy and Gradient-Based Optimization
Homanga Bharadhwaj
Kevin Xie
Florian Shkurti
24
49
0
19 Apr 2020
State-Only Imitation Learning for Dexterous Manipulation
State-Only Imitation Learning for Dexterous Manipulation
Ilija Radosavovic
Xiaolong Wang
Lerrel Pinto
Jitendra Malik
OffRL
24
122
0
07 Apr 2020
An empirical investigation of the challenges of real-world reinforcement
  learning
An empirical investigation of the challenges of real-world reinforcement learning
Gabriel Dulac-Arnold
Nir Levine
D. Mankowitz
Jerry Li
Cosmin Paduraru
Sven Gowal
Todd Hester
OffRL
36
121
0
24 Mar 2020
Learning to Fly via Deep Model-Based Reinforcement Learning
Learning to Fly via Deep Model-Based Reinforcement Learning
Philip Becker-Ehmck
Maximilian Karl
Jan Peters
Patrick van der Smagt
SSL
44
37
0
19 Mar 2020
Fast Online Adaptation in Robotics through Meta-Learning Embeddings of
  Simulated Priors
Fast Online Adaptation in Robotics through Meta-Learning Embeddings of Simulated Priors
Rituraj Kaushik
Timothée Anne
Jean-Baptiste Mouret
32
52
0
10 Mar 2020
PlaNet of the Bayesians: Reconsidering and Improving Deep Planning
  Network by Incorporating Bayesian Inference
PlaNet of the Bayesians: Reconsidering and Improving Deep Planning Network by Incorporating Bayesian Inference
Masashi Okada
Norio Kosaka
T. Taniguchi
13
43
0
01 Mar 2020
Reinforcement Learning through Active Inference
Reinforcement Learning through Active Inference
Alexander Tschantz
Beren Millidge
A. Seth
Christopher L. Buckley
AI4CE
36
69
0
28 Feb 2020
Generalizing Convolutional Neural Networks for Equivariance to Lie
  Groups on Arbitrary Continuous Data
Generalizing Convolutional Neural Networks for Equivariance to Lie Groups on Arbitrary Continuous Data
Marc Finzi
Samuel Stanton
Pavel Izmailov
A. Wilson
38
318
0
25 Feb 2020
Learning to Walk in the Real World with Minimal Human Effort
Learning to Walk in the Real World with Minimal Human Effort
Sehoon Ha
P. Xu
Zhenyu Tan
Sergey Levine
Jie Tan
31
169
0
20 Feb 2020
Pitfalls of In-Domain Uncertainty Estimation and Ensembling in Deep
  Learning
Pitfalls of In-Domain Uncertainty Estimation and Ensembling in Deep Learning
Arsenii Ashukha
Alexander Lyzhov
Dmitry Molchanov
Dmitry Vetrov
UQCV
FedML
38
314
0
15 Feb 2020
Ready Policy One: World Building Through Active Learning
Ready Policy One: World Building Through Active Learning
Philip J. Ball
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
OffRL
32
49
0
07 Feb 2020
PAC Confidence Sets for Deep Neural Networks via Calibrated Prediction
PAC Confidence Sets for Deep Neural Networks via Calibrated Prediction
Sangdon Park
Osbert Bastani
Nikolai Matni
Insup Lee
UQCV
152
68
0
31 Dec 2019
Previous
1234567
Next