arXiv: 1805.12114
Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models

30 May 2018
Kurtland Chua
Roberto Calandra
R. McAllister
Sergey Levine
    BDL

Papers citing "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"

50 / 336 papers shown
A Survey of Robotic Navigation and Manipulation with Physics Simulators in the Era of Embodied AI
Lik Hang Kenny Wong
Xueyang Kang
Kaixin Bai
Jianwei Zhang
63
0
0
01 May 2025
Uncertainty-aware Latent Safety Filters for Avoiding Out-of-Distribution Failures
Junwon Seo
Kensuke Nakamura
Andrea V. Bajcsy
56
0
0
01 May 2025
Learned Perceptive Forward Dynamics Model for Safe and Platform-aware Robotic Navigation
Pascal Roth
Jonas Frey
Cesar Cadena
Marco Hutter
41
0
0
27 Apr 2025
Action Flow Matching for Continual Robot Learning
Alejandro Murillo-Gonzalez
Lantao Liu
CLL
47
0
0
25 Apr 2025
Look Before Leap: Look-Ahead Planning with Uncertainty in Reinforcement Learning
Yongshuai Liu
Xin Liu
101
1
0
26 Mar 2025
AdaWorld: Learning Adaptable World Models with Latent Actions
Shenyuan Gao
Siyuan Zhou
Yilun Du
Jun Zhang
Chuang Gan
VGen
73
4
0
24 Mar 2025
Neural Lyapunov Function Approximation with Self-Supervised Reinforcement Learning
Luc McCutcheon
Bahman Gharesifard
Saber Fallah
58
0
0
19 Mar 2025
Towards Better Sample Efficiency in Multi-Agent Reinforcement Learning via Exploration
Amir Baghi
Jens Sjölund
Joakim Bergdahl
Linus Gisslén
Alessandro Sestini
60
0
0
17 Mar 2025
Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach
Steeven Janny
Hervé Poirier
L. Antsfeld
G. Bono
G. Monaci
Boris Chidlovskii
Francesco Giuliari
Alessio Del Bue
Christian Wolf
LM&Ro
63
0
0
11 Mar 2025
Plan2Align: Predictive Planning Based Test-Time Preference Alignment in Paragraph-Level Machine Translation
Kuang-Da Wang
Teng-Ruei Chen
Yu-Heng Hung
Shuoyang Ding
Yueh-Hua Wu
Yu-Chun Wang
Chao-Han Huck Yang
Wen-Chih Peng
Ping-Chun Hsieh
79
0
0
28 Feb 2025
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic
Stefano Viel
Luca Viano
V. Cevher
95
0
0
27 Feb 2025
Zero-shot Model-based Reinforcement Learning using Large Language Models
Abdelhakim Benechehab
Youssef Attia El Hili
Ambroise Odonnat
Oussama Zekri
Albert Thomas
Giuseppe Paolo
Maurizio Filippone
I. Redko
Balázs Kégl
OffRL
75
1
0
17 Feb 2025
HopCast: Calibration of Autoregressive Dynamics Models
Muhammad Bilal Shahid
Cody H. Fleming
UQCV
55
0
0
27 Jan 2025
ABPT: Amended Backpropagation through Time with Partially Differentiable Rewards
Fanxing Li
Fangyu Sun
Tianbao Zhang
Danping Zou
41
0
0
24 Jan 2025
Boosting MCTS with Free Energy Minimization
Mawaba Pascal Dao
Adrian Peter
86
0
0
22 Jan 2025
Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps
Linfeng Zhao
Lawson L. S. Wong
87
1
0
16 Dec 2024
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Eliot Xing
Vernon Luk
Jean Oh
99
0
0
16 Dec 2024
Remote Manipulation of Multiple Objects with Airflow Field Using Model-Based Learning Control
Artur Kopitca
Shahriar Haeri
Quan Zhou
73
0
0
04 Dec 2024
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
H. Bui
Enrique Mallada
Anqi Liu
219
0
0
08 Nov 2024
Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning
Marvin Alles
Philip Becker-Ehmck
Patrick van der Smagt
Maximilian Karl
OffRL
47
1
0
07 Nov 2024
Prioritized Generative Replay
Renhao Wang
Kevin Frans
Pieter Abbeel
Sergey Levine
Alexei A. Efros
OnRL
DiffM
119
2
0
23 Oct 2024
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning
Yarden As
Bhavya Sukhija
Lenart Treven
Carmelo Sferrazza
Stelian Coros
Andreas Krause
38
1
0
12 Oct 2024
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
49
2
0
11 Oct 2024
Zero-Shot Offline Imitation Learning via Optimal Transport
Thomas Rupf
Marco Bagatella
Nico Gürtler
Jonas Frey
Georg Martius
OffRL
246
0
0
11 Oct 2024
Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning
Claire Chen
Shuze Liu
Shangtong Zhang
OffRL
198
1
0
08 Oct 2024
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Jasmine Bayrooti
Carl Henrik Ek
Amanda Prorok
50
0
0
07 Oct 2024
Doubly Optimal Policy Evaluation for Reinforcement Learning
Shuze Liu
Claire Chen
Shangtong Zhang
OffRL
43
2
0
03 Oct 2024
Uncertainty-aware Reward Model: Teaching Reward Models to Know What is Unknown
Xingzhou Lou
Dong Yan
Wei Shen
Yuzi Yan
Jian Xie
Junge Zhang
58
22
0
01 Oct 2024
LiRA: Light-Robust Adversary for Model-based Reinforcement Learning in Real World
Taisuke Kobayashi
71
2
0
29 Sep 2024
Learning to Refine Input Constrained Control Barrier Functions via Uncertainty-Aware Online Parameter Adaptation
Taekyung Kim
Robin Inho Kee
Dimitra Panagou
58
7
0
22 Sep 2024
SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning
Amogh Joshi
Adarsh Kosta
Kaushik Roy
OffRL
55
2
0
16 Sep 2024
Quantifying Aleatoric and Epistemic Dynamics Uncertainty via Local Conformal Calibration
Luís Marques
Dmitry Berenson
40
0
0
12 Sep 2024
Traffic expertise meets residual RL: Knowledge-informed model-based residual reinforcement learning for CAV trajectory control
Zihao Sheng
Zilin Huang
Sikai Chen
41
9
0
30 Aug 2024
PAIL: Performance based Adversarial Imitation Learning Engine for Carbon Neutral Optimization
Yuyang Ye
Lu-An Tang
Haoyu Wang
Runlong Yu
Wenchao Yu
Erhu He
Haifeng Chen
Hui Xiong
27
0
0
12 Jul 2024
SE(3)-Hyena Operator for Scalable Equivariant Learning
Artem Moskalev
Mangal Prakash
Rui Liao
Tommaso Mansi
57
2
0
01 Jul 2024
Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning
Bradley Burega
John D. Martin
Luke Kapeluck
Michael Bowling
42
0
0
27 Jun 2024
Shedding Light on Large Generative Networks: Estimating Epistemic Uncertainty in Diffusion Models
Lucas Berry
Axel Brando
David Meger
37
6
0
05 Jun 2024
NeoRL: Efficient Exploration for Nonepisodic RL
Bhavya Sukhija
Lenart Treven
Florian Dörfler
Stelian Coros
Andreas Krause
OffRL
41
0
0
03 Jun 2024
Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption
Bernd Frauenknecht
Artur Eisele
Devdutt Subhasish
Friedrich Solowjow
Sebastian Trimpe
54
5
0
29 May 2024
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation
Chengxing Jia
Pengyuan Wang
Ziniu Li
Yi-Chen Li
Zhilong Zhang
Nan Tang
Yang Yu
OffRL
42
1
0
27 May 2024
State-Constrained Offline Reinforcement Learning
Charles A. Hepburn
Yue Jin
Giovanni Montana
OffRL
49
0
0
23 May 2024
Adaptive Teaching in Heterogeneous Agents: Balancing Surprise in Sparse Reward Scenarios
Emma Clark
Kanghyun Ryu
Negar Mehr
18
1
0
23 May 2024
RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes
Kyle Stachowicz
Sergey Levine
22
6
0
07 May 2024
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin
P. D'Oro
Evgenii Nikishin
Rameswar Panda
55
1
0
07 May 2024
Continual Model-based Reinforcement Learning for Data Efficient Wireless Network Optimisation
Cengis Hasan
Alexandros Agapitos
David Lynch
Alberto Castagna
Giorgio Cruciata
Hao Wang
Aleksandar Milenovic
51
0
0
30 Apr 2024
Full Shot Predictions for the DIII-D Tokamak via Deep Recurrent Networks
I. Char
Youngseog Chung
J. Abbate
E. Kolemen
Jeff Schneider
51
5
0
18 Apr 2024
Model-based Reinforcement Learning for Parameterized Action Spaces
Renhao Zhang
Haotian Fu
Yilin Miao
George Konidaris
36
3
0
03 Apr 2024
Active Exploration in Bayesian Model-based Reinforcement Learning for Robot Manipulation
Carlos Plou
Ana C. Murillo
Ruben Martinez-Cantin
OffRL
45
0
0
02 Apr 2024
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
Anya Sims
Cong Lu
Yee Whye Teh
OffRL
41
3
0
19 Feb 2024
Port-Hamiltonian Neural ODE Networks on Lie Groups For Robot Dynamics Learning and Control
T. Duong
Abdullah Altawaitan
Jason Stanley
Nikolay Atanasov
54
10
0
17 Jan 2024