ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.07749
  4. Cited By
Actionable Models: Unsupervised Offline Reinforcement Learning of
  Robotic Skills

Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills

15 April 2021
Yevgen Chebotar
Karol Hausman
Yao Lu
Ted Xiao
Dmitry Kalashnikov
Jacob Varley
A. Irpan
Benjamin Eysenbach
Ryan Julian
Chelsea Finn
Sergey Levine
    SSL
    OffRL
ArXivPDFHTML

Papers citing "Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills"

50 / 106 papers shown
Title
NeoRL-2: Near Real-World Benchmarks for Offline Reinforcement Learning with Extended Realistic Scenarios
NeoRL-2: Near Real-World Benchmarks for Offline Reinforcement Learning with Extended Realistic Scenarios
Songyi Gao
Zuolin Tu
Rong-Jun Qin
Yi-Hao Sun
Xiong-Hui Chen
Yang Yu
OffRL
40
0
0
25 Mar 2025
Evaluation-Time Policy Switching for Offline Reinforcement Learning
Evaluation-Time Policy Switching for Offline Reinforcement Learning
Natinael Solomon Neggatu
Jeremie Houssineau
Giovanni Montana
OffRL
OnRL
70
0
0
15 Mar 2025
Preference-Based Multi-Agent Reinforcement Learning: Data Coverage and Algorithmic Techniques
Preference-Based Multi-Agent Reinforcement Learning: Data Coverage and Algorithmic Techniques
Natalia Zhang
X. Wang
Qiwen Cui
Runlong Zhou
Sham Kakade
Simon S. Du
OffRL
48
0
0
10 Jan 2025
Learning Versatile Skills with Curriculum Masking
Learning Versatile Skills with Curriculum Masking
Yao Tang
Zhihui Xie
Zichuan Lin
Deheng Ye
Shuai Li
OffRL
33
0
0
23 Oct 2024
Whole-Body Control Through Narrow Gaps From Pixels To Action
Whole-Body Control Through Narrow Gaps From Pixels To Action
Tianyue Wu
Yeke Chen
Tianyang Chen
Guangyu Zhao
Fei Gao
48
4
0
02 Sep 2024
Unsupervised-to-Online Reinforcement Learning
Unsupervised-to-Online Reinforcement Learning
Junsu Kim
Seohong Park
Sergey Levine
OnRL
53
3
0
27 Aug 2024
Offline Policy Learning via Skill-step Abstraction for Long-horizon
  Goal-Conditioned Tasks
Offline Policy Learning via Skill-step Abstraction for Long-horizon Goal-Conditioned Tasks
Donghoon Kim
Minjong Yoo
Honguk Woo
OffRL
19
0
0
21 Aug 2024
How to Solve Contextual Goal-Oriented Problems with Offline Datasets?
How to Solve Contextual Goal-Oriented Problems with Offline Datasets?
Ying Fan
Jingling Li
Adith Swaminathan
Aditya Modi
Ching-An Cheng
OffRL
72
0
0
14 Aug 2024
Autonomous Improvement of Instruction Following Skills via Foundation
  Models
Autonomous Improvement of Instruction Following Skills via Foundation Models
Zhiyuan Zhou
P. Atreya
Abraham Lee
Homer Walke
Oier Mees
Sergey Levine
32
11
0
30 Jul 2024
WayEx: Waypoint Exploration using a Single Demonstration
WayEx: Waypoint Exploration using a Single Demonstration
Mara Levy
Nirat Saini
Abhinav Shrivastava
55
1
0
22 Jul 2024
To Err is Robotic: Rapid Value-Based Trial-and-Error during Deployment
To Err is Robotic: Rapid Value-Based Trial-and-Error during Deployment
Maximilian Du
Alexander Khazatsky
Tobias Gerstenberg
Chelsea Finn
49
0
0
22 Jun 2024
Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive
  Data Sharing
Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing
Xinbo Zhao
Yingxue Zhang
Xin Zhang
Yu Yang
Yiqun Xie
Yanhua Li
Jun Luo
OffRL
42
2
0
20 Jun 2024
Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL
Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL
Qi Lv
Xiang Deng
Gongwei Chen
Michael Yu Wang
Liqiang Nie
75
7
0
08 Jun 2024
Robot Air Hockey: A Manipulation Testbed for Robot Learning with
  Reinforcement Learning
Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning
Caleb Chuck
Carl Qi
M. Munje
Shuozhe Li
Max Rudolph
...
Kavan Mehta
Anthony Wang
Peter Stone
Amy Zhang
S. Niekum
40
4
0
06 May 2024
Offline Goal-Conditioned Reinforcement Learning for Safety-Critical
  Tasks with Recovery Policy
Offline Goal-Conditioned Reinforcement Learning for Safety-Critical Tasks with Recovery Policy
Chenyang Cao
Zichen Yan
Renhao Lu
Junbo Tan
Xueqian Wang
OffRL
36
2
0
04 Mar 2024
Unsupervised Zero-Shot Reinforcement Learning via Functional Reward
  Encodings
Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings
Kevin Frans
Seohong Park
Pieter Abbeel
Sergey Levine
OffRL
45
10
0
27 Feb 2024
Foundation Policies with Hilbert Representations
Foundation Policies with Hilbert Representations
Seohong Park
Tobias Kreiman
Sergey Levine
SSL
OffRL
50
19
0
23 Feb 2024
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
Anya Sims
Cong Lu
Yee Whye Teh
OffRL
35
3
0
19 Feb 2024
Stitching Sub-Trajectories with Conditional Diffusion Model for
  Goal-Conditioned Offline RL
Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL
Sungyoon Kim
Yunseon Choi
Daiki E. Matsunaga
Kee-Eung Kim
OffRL
40
6
0
11 Feb 2024
Offline Actor-Critic Reinforcement Learning Scales to Large Models
Offline Actor-Critic Reinforcement Learning Scales to Large Models
Jost Tobias Springenberg
A. Abdolmaleki
Jingwei Zhang
Oliver Groth
Michael Bloesch
...
Sarah Bechtle
Steven Kapturowski
Roland Hafner
N. Heess
Martin Riedmiller
OffRL
LRM
27
12
0
08 Feb 2024
Deep autoregressive density nets vs neural ensembles for model-based
  offline reinforcement learning
Deep autoregressive density nets vs neural ensembles for model-based offline reinforcement learning
Abdelhakim Benechehab
Albert Thomas
Balázs Kégl
OffRL
32
2
0
05 Feb 2024
Mastering Stacking of Diverse Shapes with Large-Scale Iterative
  Reinforcement Learning on Real Robots
Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots
Thomas Lampe
A. Abdolmaleki
Sarah Bechtle
Sandy H. Huang
Jost Tobias Springenberg
...
Markus Wulfmeier
Jingwei Zhang
Francesco Nori
N. Heess
Martin Riedmiller
OffRL
32
9
0
18 Dec 2023
ReRoGCRL: Representation-based Robustness in Goal-Conditioned
  Reinforcement Learning
ReRoGCRL: Representation-based Robustness in Goal-Conditioned Reinforcement Learning
Xiangyu Yin
Sihao Wu
Jiaxu Liu
Meng Fang
Xingyu Zhao
Xiaowei Huang
Wenjie Ruan
AAML
35
5
0
12 Dec 2023
Decoupling Meta-Reinforcement Learning with Gaussian Task Contexts and
  Skills
Decoupling Meta-Reinforcement Learning with Gaussian Task Contexts and Skills
Hongcai He
Anjie Zhu
Shuang Liang
Feiyu Chen
Jie Shao
OffRL
40
4
0
11 Dec 2023
Backward Learning for Goal-Conditioned Policies
Backward Learning for Goal-Conditioned Policies
Marc Höftmann
Jan Robine
Stefan Harmeling
29
1
0
08 Dec 2023
Goal-conditioned Offline Planning from Curious Exploration
Goal-conditioned Offline Planning from Curious Exploration
Marco Bagatella
Georg Martius
OffRL
18
1
0
28 Nov 2023
SMORE: Score Models for Offline Goal-Conditioned Reinforcement Learning
SMORE: Score Models for Offline Goal-Conditioned Reinforcement Learning
Harshit S. Sikchi
Rohan Chitnis
Ahmed Touati
A. Geramifard
Amy Zhang
S. Niekum
OffRL
31
6
0
03 Nov 2023
GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with
  Learned Models
GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models
Mianchu Wang
Rui Yang
Xi Chen
Hao Sun
Meng Fang
Giovanni Montana
OffRL
30
9
0
30 Oct 2023
TAIL: Task-specific Adapters for Imitation Learning with Large
  Pretrained Models
TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models
Zuxin Liu
Jesse Zhang
Kavosh Asadi
Yao Liu
Ding Zhao
Shoham Sabach
Rasool Fakoor
ALM
AI4CE
21
25
0
09 Oct 2023
Pre-Training and Fine-Tuning Generative Flow Networks
Pre-Training and Fine-Tuning Generative Flow Networks
Ling Pan
Moksh Jain
Kanika Madan
Yoshua Bengio
47
13
0
05 Oct 2023
Learning to Reach Goals via Diffusion
Learning to Reach Goals via Diffusion
V. Jain
Siamak Ravanbakhsh
DiffM
OffRL
43
3
0
04 Oct 2023
Prompt, Plan, Perform: LLM-based Humanoid Control via Quantized
  Imitation Learning
Prompt, Plan, Perform: LLM-based Humanoid Control via Quantized Imitation Learning
Jingkai Sun
Qiang Zhang
Yiqun Duan
Xiaoyang Jiang
Chong Cheng
Renjing Xu
LM&Ro
50
23
0
20 Sep 2023
Q-Transformer: Scalable Offline Reinforcement Learning via
  Autoregressive Q-Functions
Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions
Yevgen Chebotar
Q. Vuong
A. Irpan
Karol Hausman
F. Xia
...
Brianna Zitkovich
Tomas Jackson
Kanishka Rao
Chelsea Finn
Sergey Levine
OffRL
129
81
0
18 Sep 2023
BridgeData V2: A Dataset for Robot Learning at Scale
BridgeData V2: A Dataset for Robot Learning at Scale
Homer Walke
Kevin Black
Abraham Lee
Moo Jin Kim
Maximilian Du
...
Andre Wang He
Vivek Myers
Kuan Fang
Chelsea Finn
Sergey Levine
32
206
0
24 Aug 2023
General Purpose Artificial Intelligence Systems (GPAIS): Properties,
  Definition, Taxonomy, Societal Implications and Responsible Governance
General Purpose Artificial Intelligence Systems (GPAIS): Properties, Definition, Taxonomy, Societal Implications and Responsible Governance
I. Triguero
Daniel Molina
Javier Poyatos
Javier Del Ser
Francisco Herrera
AI4TS
AI4MH
34
5
0
26 Jul 2023
HIQL: Offline Goal-Conditioned RL with Latent States as Actions
HIQL: Offline Goal-Conditioned RL with Latent States as Actions
Seohong Park
Dibya Ghosh
Benjamin Eysenbach
Sergey Levine
OffRL
30
44
0
22 Jul 2023
Robotic Manipulation Datasets for Offline Compositional Reinforcement
  Learning
Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning
Marcel Hussing
Jorge Armando Mendez Mendez
Anisha Singrodia
Cassandra Kent
Eric Eaton
OffRL
31
5
0
13 Jul 2023
ChiPFormer: Transferable Chip Placement via Offline Decision Transformer
ChiPFormer: Transferable Chip Placement via Offline Decision Transformer
Yao Lai
Jinxin Liu
Zhentao Tang
Bin Wang
Jianye Hao
Ping Luo
OffRL
21
41
0
26 Jun 2023
Deep Predictive Learning: Motion Learning Concept inspired by Cognitive
  Robotics
Deep Predictive Learning: Motion Learning Concept inspired by Cognitive Robotics
Kanata Suzuki
Hiroshi Ito
Tatsuro Yamada
Kei Kase
Tetsuya Ogata
24
12
0
26 Jun 2023
Design from Policies: Conservative Test-Time Adaptation for Offline
  Policy Optimization
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization
Jinxin Liu
Hongyin Zhang
Zifeng Zhuang
Yachen Kang
Donglin Wang
Bin Wang
OffRL
42
8
0
26 Jun 2023
Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement
  Learning
Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement Learning
Jinxin Liu
Ziqi Zhang
Zhenyu Wei
Zifeng Zhuang
Yachen Kang
Sibo Gai
Donglin Wang
OffRL
20
16
0
22 Jun 2023
SPRINT: Scalable Policy Pre-Training via Language Instruction Relabeling
SPRINT: Scalable Policy Pre-Training via Language Instruction Relabeling
Jesse Zhang
Karl Pertsch
Jiahui Zhang
Joseph J. Lim
LM&Ro
36
17
0
20 Jun 2023
Learning with a Mole: Transferable latent spatial representations for
  navigation without reconstruction
Learning with a Mole: Transferable latent spatial representations for navigation without reconstruction
G. Bono
L. Antsfeld
Assem Sadek
G. Monaci
Christian Wolf
SSL
32
5
0
06 Jun 2023
Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from
  Offline Data
Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from Offline Data
Chongyi Zheng
Benjamin Eysenbach
Homer Walke
Patrick Yin
Kuan Fang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
34
4
0
06 Jun 2023
SACSoN: Scalable Autonomous Control for Social Navigation
SACSoN: Scalable Autonomous Control for Social Navigation
Noriaki Hirose
Dhruv Shah
A. Sridhar
Sergey Levine
25
29
0
02 Jun 2023
What is Essential for Unseen Goal Generalization of Offline
  Goal-conditioned RL?
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?
Rui Yang
Yong Lin
Xiaoteng Ma
Haotian Hu
Chongjie Zhang
Tong Zhang
OffRL
21
22
0
30 May 2023
Learning Video-Conditioned Policies for Unseen Manipulation Tasks
Learning Video-Conditioned Policies for Unseen Manipulation Tasks
Elliot Chane-Sane
Cordelia Schmid
Ivan Laptev
27
18
0
10 May 2023
Get Back Here: Robust Imitation by Return-to-Distribution Planning
Get Back Here: Robust Imitation by Return-to-Distribution Planning
Geoffrey Cideron
B. Tabanpour
Sebastian Curi
Sertan Girgin
Léonard Hussenot
Gabriel Dulac-Arnold
M. Geist
Olivier Pietquin
Robert Dadashi
OOD
84
2
0
02 May 2023
Distance Weighted Supervised Learning for Offline Interaction Data
Distance Weighted Supervised Learning for Offline Interaction Data
Joey Hejna
Jensen Gao
Dorsa Sadigh
OffRL
36
12
0
26 Apr 2023
CASOG: Conservative Actor-critic with SmOoth Gradient for Skill Learning
  in Robot-Assisted Intervention
CASOG: Conservative Actor-critic with SmOoth Gradient for Skill Learning in Robot-Assisted Intervention
Hao Li
Xiao-Hu Zhou
Xiaoliang Xie
Shiqi Liu
Zhen-Qiu Feng
Z. Hou
OffRL
16
11
0
19 Apr 2023
123
Next