ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.12114
  4. Cited By
Deep Reinforcement Learning in a Handful of Trials using Probabilistic
  Dynamics Models

Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models

30 May 2018
Kurtland Chua
Roberto Calandra
R. McAllister
Sergey Levine
    BDL
ArXivPDFHTML

Papers citing "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"

50 / 339 papers shown
Title
Bridging State and History Representations: Understanding
  Self-Predictive RL
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS
AI4CE
31
22
0
17 Jan 2024
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot
  Learning
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning
Rafael Rafailov
Kyle Hatch
Victor Kolev
John D. Martin
Mariano Phielipp
Chelsea Finn
OffRL
OnRL
29
10
0
06 Jan 2024
Multi-Agent Probabilistic Ensembles with Trajectory Sampling for
  Connected Autonomous Vehicles
Multi-Agent Probabilistic Ensembles with Trajectory Sampling for Connected Autonomous Vehicles
Ruoqi Wen
Jiahao Huang
Rongpeng Li
Guoru Ding
Zhifeng Zhao
42
1
0
21 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
40
8
0
15 Dec 2023
A Tractable Inference Perspective of Offline RL
A Tractable Inference Perspective of Offline RL
Xuejie Liu
Hoang Trung-Dung
Guy Van den Broeck
Yitao Liang
OffRL
36
1
0
31 Oct 2023
Variational Curriculum Reinforcement Learning for Unsupervised Discovery
  of Skills
Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills
Seongun Kim
Kyowoon Lee
Jaesik Choi
SSL
DRL
43
7
0
30 Oct 2023
Uncertainty-aware transfer across tasks using hybrid model-based
  successor feature reinforcement learning
Uncertainty-aware transfer across tasks using hybrid model-based successor feature reinforcement learning
Parvin Malekzadeh
Ming Hou
Konstantinos N. Plataniotis
53
1
0
16 Oct 2023
Pay Attention to How You Drive: Safe and Adaptive Model-Based
  Reinforcement Learning for Off-Road Driving
Pay Attention to How You Drive: Safe and Adaptive Model-Based Reinforcement Learning for Off-Road Driving
Sean J. Wang
Honghao Zhu
Aaron M. Johnson
34
6
0
12 Oct 2023
Machine Learning Meets Advanced Robotic Manipulation
Machine Learning Meets Advanced Robotic Manipulation
Saeid Nahavandi
R. Alizadehsani
D. Nahavandi
Chee Peng Lim
Kevin Kelly
Fernando Bello
29
17
0
22 Sep 2023
Practical Probabilistic Model-based Deep Reinforcement Learning by
  Integrating Dropout Uncertainty and Trajectory Sampling
Practical Probabilistic Model-based Deep Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory Sampling
Wenjun Huang
Yunduan Cui
Huiyun Li
Xin Wu
MU
27
0
0
20 Sep 2023
Contrastive Initial State Buffer for Reinforcement Learning
Contrastive Initial State Buffer for Reinforcement Learning
Nico Messikommer
Yunlong Song
Davide Scaramuzza
OffRL
49
9
0
18 Sep 2023
Signal Temporal Logic Neural Predictive Control
Signal Temporal Logic Neural Predictive Control
Yue Meng
Chuchu Fan
31
15
0
10 Sep 2023
Distributionally Robust Model-based Reinforcement Learning with Large
  State Spaces
Distributionally Robust Model-based Reinforcement Learning with Large State Spaces
Shyam Sundhar Ramesh
Pier Giuseppe Sessa
Yifan Hu
Andreas Krause
Ilija Bogunovic
OOD
47
10
0
05 Sep 2023
Structured World Models from Human Videos
Structured World Models from Human Videos
Russell Mendonca
Shikhar Bahl
Deepak Pathak
LM&Ro
54
88
0
21 Aug 2023
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
Hanqing Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
42
28
0
14 Aug 2023
Uncertainty Quantification for Image-based Traffic Prediction across
  Cities
Uncertainty Quantification for Image-based Traffic Prediction across Cities
Alexander Timans
Nina Wiedemann
Nishant Kumar
Ye Hong
Martin Raubal
23
1
0
11 Aug 2023
Probabilistic Constrained Reinforcement Learning with Formal
  Interpretability
Probabilistic Constrained Reinforcement Learning with Formal Interpretability
Yanran Wang
Qiuchen Qian
David E. Boyle
24
4
0
13 Jul 2023
Actor-Critic Model Predictive Control
Actor-Critic Model Predictive Control
Angel Romero
Yunlong Song
Davide Scaramuzza
52
36
0
16 Jun 2023
Simplified Temporal Consistency Reinforcement Learning
Simplified Temporal Consistency Reinforcement Learning
Yi Zhao
Wenshuai Zhao
Rinu Boney
Arno Solin
Joni Pajarinen
OffRL
35
13
0
15 Jun 2023
Optimal Exploration for Model-Based RL in Nonlinear Systems
Optimal Exploration for Model-Based RL in Nonlinear Systems
Andrew Wagenmaker
Guanya Shi
Kevin G. Jamieson
41
15
0
15 Jun 2023
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
Qian Lin
Bo Tang
Zifan Wu
Chao Yu
Shangqin Mao
Qianlong Xie
Xingxing Wang
Dong Wang
OffRL
44
11
0
01 Jun 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control
  via Sample Multiple Reuse
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
38
9
0
29 May 2023
On the Value of Myopic Behavior in Policy Reuse
On the Value of Myopic Behavior in Policy Reuse
Kang Xu
Chenjia Bai
Shuang Qiu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
41
1
0
28 May 2023
Adaptive PD Control using Deep Reinforcement Learning for Local-Remote
  Teleoperation with Stochastic Time Delays
Adaptive PD Control using Deep Reinforcement Learning for Local-Remote Teleoperation with Stochastic Time Delays
Lucy McCutcheon
Saber Fallah
38
0
0
26 May 2023
Learning Interpretable Models of Aircraft Handling Behaviour by
  Reinforcement Learning from Human Feedback
Learning Interpretable Models of Aircraft Handling Behaviour by Reinforcement Learning from Human Feedback
Tom Bewley
J. Lawry
Arthur G. Richards
32
1
0
26 May 2023
Pedestrian Trajectory Forecasting Using Deep Ensembles Under Sensing
  Uncertainty
Pedestrian Trajectory Forecasting Using Deep Ensembles Under Sensing Uncertainty
Anshul Nayak
A. Eskandarian
Zachary R. Doerzaph
P. Ghorai
40
4
0
26 May 2023
C-MCTS: Safe Planning with Monte Carlo Tree Search
C-MCTS: Safe Planning with Monte Carlo Tree Search
Dinesh Parthasarathy
G. Kontes
Axel Plinge
Christopher Mutschler
45
3
0
25 May 2023
Optimal Control of Nonlinear Systems with Unknown Dynamics
Optimal Control of Nonlinear Systems with Unknown Dynamics
Wenjian Hao
Paulo C. Heredia
Shaoshuai Mou
42
1
0
24 May 2023
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning
  via Transition Occupancy Matching
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching
Yecheng Jason Ma
K. Sivakumar
Jason Yan
Osbert Bastani
Dinesh Jayaraman
OffRL
MU
37
6
0
22 May 2023
Get Back Here: Robust Imitation by Return-to-Distribution Planning
Get Back Here: Robust Imitation by Return-to-Distribution Planning
Geoffrey Cideron
B. Tabanpour
Sebastian Curi
Sertan Girgin
Léonard Hussenot
Gabriel Dulac-Arnold
M. Geist
Olivier Pietquin
Robert Dadashi
OOD
84
2
0
02 May 2023
Learning Terrain-Aware Kinodynamic Model for Autonomous Off-Road Rally
  Driving With Model Predictive Path Integral Control
Learning Terrain-Aware Kinodynamic Model for Autonomous Off-Road Rally Driving With Model Predictive Path Integral Control
Ho-Woon Lee
Taekyung Kim
Jungwi Mun
Wonsuk Lee
40
16
0
01 May 2023
Policy Resilience to Environment Poisoning Attacks on Reinforcement
  Learning
Policy Resilience to Environment Poisoning Attacks on Reinforcement Learning
Hang Xu
Xinghua Qu
Zinovi Rabinovich
37
1
0
24 Apr 2023
Convex Optimization-based Policy Adaptation to Compensate for
  Distributional Shifts
Convex Optimization-based Policy Adaptation to Compensate for Distributional Shifts
Navid Hashemi
Justin Ruths
Jyotirmoy V. Deshmukh
31
0
0
05 Apr 2023
Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting
Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting
Nicolai Dorka
Tim Welschehold
Wolfram Burgard
21
3
0
17 Mar 2023
Beware of Instantaneous Dependence in Reinforcement Learning
Beware of Instantaneous Dependence in Reinforcement Learning
Zhengmao Zhu
Yu-Ren Liu
Hong Tian
Yang Yu
Kun Zhang
OffRL
41
1
0
09 Mar 2023
Environment Transformer and Policy Optimization for Model-Based Offline
  Reinforcement Learning
Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning
Pengqin Wang
Meixin Zhu
Shaojie Shen
OffRL
38
1
0
07 Mar 2023
A Neurosymbolic Approach to the Verification of Temporal Logic
  Properties of Learning enabled Control Systems
A Neurosymbolic Approach to the Verification of Temporal Logic Properties of Learning enabled Control Systems
Navid Hashemi
Bardh Hoxha
Tomoya Yamaguchi
Danil Prokhorov
Geogios Fainekos
Jyotirmoy Deshmukh
35
8
0
07 Mar 2023
Hallucinated Adversarial Control for Conservative Offline Policy
  Evaluation
Hallucinated Adversarial Control for Conservative Offline Policy Evaluation
Jonas Rothfuss
Bhavya Sukhija
Tobias Birchler
Parnian Kassraie
Andreas Krause
OffRL
31
10
0
02 Mar 2023
The Virtues of Laziness in Model-based RL: A Unified Objective and
  Algorithms
The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms
Anirudh Vemula
Yuda Song
Aarti Singh
J. Andrew Bagnell
Sanjiban Choudhury
OffRL
46
13
0
01 Mar 2023
Model-Based Uncertainty in Value Functions
Model-Based Uncertainty in Value Functions
Carlos E. Luis
A. Bottero
Julia Vinogradska
Felix Berkenkamp
Jan Peters
41
14
0
24 Feb 2023
Neural Optimal Control using Learned System Dynamics
Neural Optimal Control using Learned System Dynamics
Selim Engin
Volkan Isler
24
3
0
20 Feb 2023
Learning to Forecast Aleatoric and Epistemic Uncertainties over Long
  Horizon Trajectories
Learning to Forecast Aleatoric and Epistemic Uncertainties over Long Horizon Trajectories
Aastha Acharya
Rebecca L. Russell
Nisar R. Ahmed
34
6
0
17 Feb 2023
Learning a model is paramount for sample efficiency in reinforcement
  learning control of PDEs
Learning a model is paramount for sample efficiency in reinforcement learning control of PDEs
Stefan Werner
Sebastian Peitz
46
9
0
14 Feb 2023
Predictable MDP Abstraction for Unsupervised Model-Based RL
Predictable MDP Abstraction for Unsupervised Model-Based RL
Seohong Park
Sergey Levine
29
9
0
08 Feb 2023
Is Model Ensemble Necessary? Model-based RL via a Single Model with
  Lipschitz Regularized Value Function
Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function
Ruijie Zheng
Xiyao Wang
Huazhe Xu
Furong Huang
53
14
0
02 Feb 2023
PAC-Bayesian Soft Actor-Critic Learning
PAC-Bayesian Soft Actor-Critic Learning
Bahareh Tasdighi
Abdullah Akgul
Manuel Haussmann
Kenny Kazimirzak Brink
M. Kandemir
41
3
0
30 Jan 2023
Risk-Averse Model Uncertainty for Distributionally Robust Safe
  Reinforcement Learning
Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement Learning
James Queeney
M. Benosman
OOD
OffRL
46
5
0
30 Jan 2023
On Approximating the Dynamic Response of Synchronous Generators via
  Operator Learning: A Step Towards Building Deep Operator-based Power Grid
  Simulators
On Approximating the Dynamic Response of Synchronous Generators via Operator Learning: A Step Towards Building Deep Operator-based Power Grid Simulators
Christian Moya
Guang Lin
Tianqiao Zhao
Meng Yue
37
8
0
29 Jan 2023
STEERING: Stein Information Directed Exploration for Model-Based
  Reinforcement Learning
STEERING: Stein Information Directed Exploration for Model-Based Reinforcement Learning
Souradip Chakraborty
Amrit Singh Bedi
Alec Koppel
Mengdi Wang
Furong Huang
Dinesh Manocha
24
8
0
28 Jan 2023
Risk-aware Vehicle Motion Planning Using Bayesian LSTM-Based Model
  Predictive Control
Risk-aware Vehicle Motion Planning Using Bayesian LSTM-Based Model Predictive Control
Yufei Huang
Mohsen Jafari
26
5
0
15 Jan 2023
Previous
1234567
Next