Skew-Fit: State-Covering Self-Supervised Reinforcement Learning

8 March 2019

Papers citing "Skew-Fit: State-Covering Self-Supervised Reinforcement Learning"

50 / 64 papers shown

Title
Imagine, Verify, Execute: Memory-Guided Agentic Exploration with Vision-Language Models Seungjae Lee Daniel Ekpo Haowen Liu Furong Huang Abhinav Shrivastava Jia-Bin Huang LM&Ro 40 0 0 12 May 2025
Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics Taowen Wang Dongfang Liu James Liang Wenhao Yang Qifan Wang Cheng Han Jiebo Luo Ruixiang Tang Ruixiang Tang AAML 82 3 0 18 Nov 2024
Grounding Video Models to Actions through Goal Conditioned Exploration Yunhao Luo Yilun Du LM&Ro VGen 85 1 0 11 Nov 2024
Goal Exploration via Adaptive Skill Distribution for Goal-Conditioned Reinforcement Learning Lisheng Wu Ke Chen 29 0 0 19 Apr 2024
Entity-Centric Reinforcement Learning for Object Manipulation from Pixels Dan Haramati Tal Daniel Aviv Tamar LM&Ro OffRL OCL 40 10 0 01 Apr 2024
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming Hany Hamed Subin Kim Dongyeong Kim Jaesik Yoon Sungjin Ahn 47 4 0 29 Feb 2024
Adaptive Mobile Manipulation for Articulated Objects In the Open World Haoyu Xiong Russell Mendonca Kenneth Shaw Deepak Pathak 34 38 0 25 Jan 2024
A Definition of Open-Ended Learning Problems for Goal-Conditioned Agents Olivier Sigaud Gianluca Baldassarre Cédric Colas Stéphane Doncieux Richard J. Duro Pierre-Yves Oudeyer Nicolas Perrin-Gilbert V. Santucci AI4CE 29 10 0 01 Nov 2023
Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills Seongun Kim Kyowoon Lee Jaesik Choi SSL DRL 41 7 0 30 Oct 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction Seohong Park Oleh Rybkin Sergey Levine OffRL 33 34 0 13 Oct 2023
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control Anthony Brohan Noah Brown Justice Carbajal Yevgen Chebotar Xi Chen ... Ted Xiao Peng Xu Sichun Xu Tianhe Yu Brianna Zitkovich LM&Ro LRM 30 1,100 0 28 Jul 2023
Visual Affordance Prediction for Guiding Robot Exploration Homanga Bharadhwaj Abhi Gupta Shubham Tulsiani 44 12 0 28 May 2023
Augmenting Autotelic Agents with Large Language Models Cédric Colas Laetitia Teodorescu Pierre-Yves Oudeyer Xingdi Yuan Marc-Alexandre Côté LLMAG LM&Ro 28 22 0 21 May 2023
Affordances from Human Videos as a Versatile Representation for Robotics Shikhar Bahl Russell Mendonca Lili Chen Unnat Jain Deepak Pathak 44 164 0 17 Apr 2023
Efficient Quality-Diversity Optimization through Diverse Quality Species Ryan Wickman Bibek Poudel Taylor Michael Villarreal Xiaofei Zhang Weizi Li 36 6 0 14 Apr 2023
Improved Sample Complexity for Reward-free Reinforcement Learning under Low-rank MDPs Yuan Cheng Ruiquan Huang J. Yang Yitao Liang OffRL 41 8 0 20 Mar 2023
ALAN: Autonomously Exploring Robotic Agents in the Real World Russell Mendonca Shikhar Bahl Deepak Pathak LM&Ro 36 20 0 13 Feb 2023
Layered State Discovery for Incremental Autonomous Exploration Liyu Chen Andrea Tirinzoni A. Lazaric Matteo Pirotta 34 0 0 07 Feb 2023
Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping Lina Mezghani Sainbayar Sukhbaatar Piotr Bojanowski A. Lazaric Alahari Karteek OffRL 44 18 0 05 Jan 2023
Learning Robotic Navigation from Experience: Principles, Methods, and Recent Results Sergey Levine Dhruv Shah SSL 37 21 0 13 Dec 2022
CACTI: A Framework for Scalable Multi-Task Multi-Scene Visual Imitation Learning Zhao Mandi Homanga Bharadhwaj Vincent Moens Shuran Song Aravind Rajeswaran Vikash Kumar LM&Ro 28 70 0 12 Dec 2022
First Go, then Post-Explore: the Benefits of Post-Exploration in Intrinsic Motivation Zhao Yang Thomas M. Moerland Mike Preuss Aske Plaat 30 2 0 06 Dec 2022
Goal Exploration Augmentation via Pre-trained Skills for Sparse-Reward Long-Horizon Goal-Conditioned Reinforcement Learning Lisheng Wu Ke Chen 31 3 0 28 Oct 2022
Learning on the Job: Self-Rewarding Offline-to-Online Finetuning for Industrial Insertion of Novel Connectors from Vision Ashvin Nair Brian Zhu Gokul Narayanan Eugen Solowjow Sergey Levine OffRL OnRL 28 14 0 27 Oct 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey A. Aubret L. Matignon S. Hassas 34 35 0 19 Sep 2022
Cell-Free Latent Go-Explore Quentin Gallouedec Emmanuel Dellandrea 14 1 0 31 Aug 2022
Spectral Decomposition Representation for Reinforcement Learning Tongzheng Ren Tianjun Zhang Lisa Lee Joseph E. Gonzalez Dale Schuurmans Bo Dai OffRL 40 27 0 19 Aug 2022
Human-to-Robot Imitation in the Wild Shikhar Bahl Abhi Gupta Deepak Pathak 30 165 0 19 Jul 2022
Walk the Random Walk: Learning to Discover and Reach Goals Without Supervision Lina Mezghani Sainbayar Sukhbaatar Piotr Bojanowski Alahari Karteek 31 4 0 23 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction Z. Guo S. Thakoor Miruna Pislar Bernardo Avila-Pires Florent Altché ... Yunhao Tang Michal Valko Rémi Munos M. G. Azar Bilal Piot 22 68 0 16 Jun 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning Benjamin Eysenbach Tianjun Zhang Ruslan Salakhutdinov Sergey Levine SSL OffRL 28 139 0 15 Jun 2022
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on Exploration and Performance Jakob J. Hollenstein Sayantan Auddy Matteo Saveriano Erwan Renaudo J. Piater 38 17 0 08 Jun 2022
Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in Latent Space Kuan Fang Patrick Yin Ashvin Nair Sergey Levine OffRL 58 29 0 17 May 2022
When to Go, and When to Explore: The Benefit of Post-Exploration in Intrinsic Motivation Zhao Yang Thomas M. Moerland Mike Preuss Aske Plaat 21 1 0 29 Mar 2022
Evolving Curricula with Regret-Based Environment Design Jack Parker-Holder Minqi Jiang Michael Dennis Mikayel Samvelyan Jakob N. Foerster Edward Grefenstette Tim Rocktaschel 31 117 0 02 Mar 2022
Open-Ended Reinforcement Learning with Neural Reward Functions Robert Meier Asier Mujika 37 7 0 16 Feb 2022
Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning Haichao Zhang Wei-ping Xu Haonan Yu 38 10 0 24 Jan 2022
Goal-Conditioned Reinforcement Learning: Problems and Solutions Minghuan Liu Menghui Zhu Weinan Zhang 33 132 0 20 Jan 2022
Physical Derivatives: Computing policy gradients by physical forward-propagation Arash Mehrjou Ashkan Soleymani Stefan Bauer Bernhard Schölkopf 38 0 0 15 Jan 2022
Autonomous Reinforcement Learning: Formalism and Benchmarking Archit Sharma Kelvin Xu Nikhil Sardana Abhishek Gupta Karol Hausman Sergey Levine Chelsea Finn OffRL 41 26 0 17 Dec 2021
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment Tung M. Luu Chang D. Yoo 15 8 0 28 Oct 2021
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching Pierre-Alexandre Kamienny Jean Tarbouriech Sylvain Lamprier A. Lazaric Ludovic Denoyer SSL 40 18 0 27 Oct 2021
Learning Domain Invariant Representations in Goal-conditioned Block MDPs Beining Han Chongyi Zheng Harris Chan Keiran Paster Michael Ruogu Zhang Jimmy Ba OOD AI4CE 20 13 0 27 Oct 2021
C-Planning: An Automatic Curriculum for Learning Goal-Reaching Tasks Tianjun Zhang Benjamin Eysenbach Ruslan Salakhutdinov Sergey Levine Joseph E. Gonzalez OffRL 37 16 0 22 Oct 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain Jianye Hao Tianpei Yang Hongyao Tang Chenjia Bai Jinyi Liu Zhaopeng Meng Peng Liu Zhen Wang OffRL 36 92 0 14 Sep 2021
Self-supervised Reinforcement Learning with Independently Controllable Subgoals Andrii Zadaianchuk Georg Martius Fanny Yang SSL 64 16 0 09 Sep 2021
Open-Ended Learning Leads to Generally Capable Agents Open-Ended Learning Team Adam Stooke Anuj Mahajan Catarina Barros Charlie Deck ... Nicolas Porcel Roberta Raileanu Steph Hughes-Fitt Valentin Dalibard Wojciech M. Czarnecki 40 181 0 27 Jul 2021
LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of Sparse Reward Iterative Tasks Albert Wilcox Ashwin Balakrishna Brijen Thananjeyan Joseph E. Gonzalez Ken Goldberg 29 11 0 10 Jul 2021
Rapid Exploration for Open-World Navigation with Latent Goal Models Dhruv Shah Benjamin Eysenbach G. Kahn Nicholas Rhinehart Sergey Levine 26 70 0 12 Apr 2021
TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL Clément Romac Rémy Portelas Katja Hofmann Pierre-Yves Oudeyer 27 21 0 17 Mar 2021