Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution

Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution

29 September 2020

Marius-Constantin Dinu

Matthias Dorfer

Johannes Brandstetter

Jose A. Arjona-Medina

Sepp Hochreiter

Papers citing "Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution"

9 / 9 papers shown

Title
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks Thomas Schmied Thomas Adler Vihang Patil M. Beck Korbinian Poppel Johannes Brandstetter G. Klambauer Razvan Pascanu Sepp Hochreiter 75 5 0 21 Feb 2025
SALMON: Self-Alignment with Instructable Reward Models Zhiqing Sun Songlin Yang Hongxin Zhang Qinhong Zhou Zhenfang Chen David D. Cox Yiming Yang Chuang Gan ALM SyDa 38 35 0 09 Oct 2023
Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents Zihao Wang Shaofei Cai Guanzhou Chen Guy Van den Broeck Xiaojian Ma Yitao Liang LM&Ro LLMAG 60 318 0 03 Feb 2023
Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making using Language Guided World Modelling Kolby Nottingham Prithviraj Ammanabrolu Alane Suhr Yejin Choi Hannaneh Hajishirzi Sameer Singh Roy Fox LLMAG LM&Ro 44 77 0 28 Jan 2023
Towards Artificial Virtuous Agents: Games, Dilemmas and Machine Learning Ajay Vishwanath E. Bøhn Ole-Christoffer Granmo Charl Maree C. Omlin AI4CE 35 5 0 30 Aug 2022
GriddlyJS: A Web IDE for Reinforcement Learning C. Bamford Minqi Jiang Mikayel Samvelyan Tim Rocktaschel OnRL 38 4 0 13 Jul 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning C. Steinparz Thomas Schmied Fabian Paischer Marius-Constantin Dinu Vihang Patil Angela Bitto-Nemling Hamid Eghbalzadeh Sepp Hochreiter CLL 24 11 0 12 Jul 2022
A Globally Convergent Evolutionary Strategy for Stochastic Constrained Optimization with Applications to Reinforcement Learning Youssef Diouane Aurelien Lucchi Vihang Patil 24 3 0 21 Feb 2022
CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP Andreas Fürst Elisabeth Rumetshofer Johannes Lehner Viet-Hung Tran Fei Tang ... David P. Kreil Michael K Kopp G. Klambauer Angela Bitto-Nemling Sepp Hochreiter VLM CLIP 207 102 0 21 Oct 2021