Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models

30 May 2018

Papers citing "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"

50 / 336 papers shown

Title
A Survey of Robotic Navigation and Manipulation with Physics Simulators in the Era of Embodied AI Lik Hang Kenny Wong Xueyang Kang Kaixin Bai Jianwei Zhang 63 0 0 01 May 2025
Uncertainty-aware Latent Safety Filters for Avoiding Out-of-Distribution Failures Junwon Seo Kensuke Nakamura Andrea V. Bajcsy 56 0 0 01 May 2025
Learned Perceptive Forward Dynamics Model for Safe and Platform-aware Robotic Navigation Pascal Roth Jonas Frey Cesar Cadena Marco Hutter 41 0 0 27 Apr 2025
Action Flow Matching for Continual Robot Learning Alejandro Murillo-Gonzalez Lantao Liu CLL 47 0 0 25 Apr 2025
Look Before Leap: Look-Ahead Planning with Uncertainty in Reinforcement Learning Yongshuai Liu Xin Liu 101 1 0 26 Mar 2025
AdaWorld: Learning Adaptable World Models with Latent Actions Shenyuan Gao Siyuan Zhou Yilun Du Jun Zhang Chuang Gan VGen 73 4 0 24 Mar 2025
Neural Lyapunov Function Approximation with Self-Supervised Reinforcement Learning Luc McCutcheon Bahman Gharesifard Saber Fallah 58 0 0 19 Mar 2025
Towards Better Sample Efficiency in Multi-Agent Reinforcement Learning via Exploration Amir Baghi Jens Sjölund Joakim Bergdahl Linus Gisslén Alessandro Sestini 60 0 0 17 Mar 2025
Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach Steeven Janny Hervé Poirier L. Antsfeld G. Bono G. Monaci Boris Chidlovskii Francesco Giuliari Alessio Del Bue Christian Wolf LM&Ro 63 0 0 11 Mar 2025
Plan2Align: Predictive Planning Based Test-Time Preference Alignment in Paragraph-Level Machine Translation Kuang-Da Wang Teng-Ruei Chen Yu-Heng Hung Shuoyang Ding Yueh-Hua Wu Yu-Chun Wang Chao-Han Huck Yang Wen-Chih Peng Ping-Chun Hsieh 79 0 0 28 Feb 2025
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic Stefano Viel Luca Viano V. Cevher 95 0 0 27 Feb 2025
Zero-shot Model-based Reinforcement Learning using Large Language Models Abdelhakim Benechehab Youssef Attia El Hili Ambroise Odonnat Oussama Zekri Albert Thomas Giuseppe Paolo Maurizio Filippone I. Redko Balázs Kégl OffRL 75 1 0 17 Feb 2025
HopCast: Calibration of Autoregressive Dynamics Models Muhammad Bilal Shahid Cody H. Fleming UQCV 55 0 0 27 Jan 2025
ABPT: Amended Backpropagation through Time with Partially Differentiable Rewards Fanxing Li Fangyu Sun Tianbao Zhang Danping Zou 41 0 0 24 Jan 2025
Boosting MCTS with Free Energy Minimization Mawaba Pascal Dao Adrian Peter 86 0 0 22 Jan 2025
Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps Linfeng Zhao Lawson L. S. Wong 87 1 0 16 Dec 2024
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation Eliot Xing Vernon Luk Jean Oh 99 0 0 16 Dec 2024
Remote Manipulation of Multiple Objects with Airflow Field Using Model-Based Learning Control Artur Kopitca Shahriar Haeri Quan Zhou 73 0 0 04 Dec 2024
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits H. Bui Enrique Mallada Anqi Liu 219 0 0 08 Nov 2024
Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning Marvin Alles Philip Becker-Ehmck Patrick van der Smagt Maximilian Karl OffRL 47 1 0 07 Nov 2024
Prioritized Generative Replay Renhao Wang Kevin Frans Pieter Abbeel Sergey Levine Alexei A. Efros OnRL DiffM 119 2 0 23 Oct 2024
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning Yarden As Bhavya Sukhija Lenart Treven Carmelo Sferrazza Stelian Coros Andreas Krause 38 1 0 12 Oct 2024
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL C. Voelcker Marcel Hussing Eric Eaton Amir-massoud Farahmand Igor Gilitschenski 49 2 0 11 Oct 2024
Zero-Shot Offline Imitation Learning via Optimal Transport Thomas Rupf Marco Bagatella Nico Gürtler Jonas Frey Georg Martius OffRL 246 0 0 11 Oct 2024
Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning Claire Chen Shuze Liu Shangtong Zhang OffRL 198 1 0 08 Oct 2024
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling Jasmine Bayrooti Carl Henrik Ek Amanda Prorok 50 0 0 07 Oct 2024
Doubly Optimal Policy Evaluation for Reinforcement Learning Shuze Liu Claire Chen Shangtong Zhang OffRL 43 2 0 03 Oct 2024
Uncertainty-aware Reward Model: Teaching Reward Models to Know What is Unknown Xingzhou Lou Dong Yan Wei Shen Yuzi Yan Jian Xie Junge Zhang 58 22 0 01 Oct 2024
LiRA: Light-Robust Adversary for Model-based Reinforcement Learning in Real World Taisuke Kobayashi 71 2 0 29 Sep 2024
Learning to Refine Input Constrained Control Barrier Functions via Uncertainty-Aware Online Parameter Adaptation Taekyung Kim Robin Inho Kee Dimitra Panagou 58 7 0 22 Sep 2024
SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning Amogh Joshi Adarsh Kosta Kaushik Roy OffRL 55 2 0 16 Sep 2024
Quantifying Aleatoric and Epistemic Dynamics Uncertainty via Local Conformal Calibration Luís Marques Dmitry Berenson 40 0 0 12 Sep 2024
Traffic expertise meets residual RL: Knowledge-informed model-based residual reinforcement learning for CAV trajectory control Zihao Sheng Zilin Huang Sikai Chen 41 9 0 30 Aug 2024
PAIL: Performance based Adversarial Imitation Learning Engine for Carbon Neutral Optimization Yuyang Ye Lu-An Tang Haoyu Wang Runlong Yu Wenchao Yu Erhu He Haifeng Chen Hui Xiong 27 0 0 12 Jul 2024
SE(3)-Hyena Operator for Scalable Equivariant Learning Artem Moskalev Mangal Prakash Rui Liao Tommaso Mansi 57 2 0 01 Jul 2024
Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning Bradley Burega John D. Martin Luke Kapeluck Michael Bowling 42 0 0 27 Jun 2024
Shedding Light on Large Generative Networks: Estimating Epistemic Uncertainty in Diffusion Models Lucas Berry Axel Brando David Meger 37 6 0 05 Jun 2024
NeoRL: Efficient Exploration for Nonepisodic RL Bhavya Sukhija Lenart Treven Florian Dorfler Stelian Coros Andreas Krause OffRL 41 0 0 03 Jun 2024
Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption Bernd Frauenknecht Artur Eisele Devdutt Subhasish Friedrich Solowjow Sebastian Trimpe 54 5 0 29 May 2024
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation Chengxing Jia Pengyuan Wang Ziniu Li Yi-Chen Li Zhilong Zhang Nan Tang Yang Yu OffRL 42 1 0 27 May 2024
State-Constrained Offline Reinforcement Learning Charles A. Hepburn Yue Jin Giovanni Montana OffRL 49 0 0 23 May 2024
Adaptive Teaching in Heterogeneous Agents: Balancing Surprise in Sparse Reward Scenarios Emma Clark Kanghyun Ryu Negar Mehr 18 1 0 23 May 2024
RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes Kyle Stachowicz Sergey Levine 22 6 0 07 May 2024
The Curse of Diversity in Ensemble-Based Exploration Zhixuan Lin P. DÓro Evgenii Nikishin Rameswar Panda 55 1 0 07 May 2024
Continual Model-based Reinforcement Learning for Data Efficient Wireless Network Optimisation Cengis Hasan Alexandros Agapitos David Lynch Alberto Castagna Giorgio Cruciata Hao Wang Aleksandar Milenovic 51 0 0 30 Apr 2024
Full Shot Predictions for the DIII-D Tokamak via Deep Recurrent Networks I. Char Youngseog Chung J. Abbate E. Kolemen Jeff Schneider 51 5 0 18 Apr 2024
Model-based Reinforcement Learning for Parameterized Action Spaces Renhao Zhang Haotian Fu Yilin Miao George Konidaris 36 3 0 03 Apr 2024
Active Exploration in Bayesian Model-based Reinforcement Learning for Robot Manipulation Carlos Plou Ana C. Murillo Ruben Martinez-Cantin OffRL 45 0 0 02 Apr 2024
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning Anya Sims Cong Lu Yee Whye Teh OffRL 41 3 0 19 Feb 2024
Port-Hamiltonian Neural ODE Networks on Lie Groups For Robot Dynamics Learning and Control T. Duong Abdullah Altawaitan Jason Stanley Nikolay Atanasov 54 10 0 17 Jan 2024