ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2102.02915
  4. Cited By
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've
  Learned

How to Train Your Robot with Deep Reinforcement Learning; Lessons We've Learned

4 February 2021
Julian Ibarz
Jie Tan
Chelsea Finn
Mrinal Kalakrishnan
P. Pastor
Sergey Levine
    OffRL
ArXivPDFHTML

Papers citing "How to Train Your Robot with Deep Reinforcement Learning; Lessons We've Learned"

50 / 211 papers shown
Title
OpenBot-Fleet: A System for Collective Learning with Real Robots
OpenBot-Fleet: A System for Collective Learning with Real Robots
Matthias M¨uller
Samarth Brahmbhatt
Ankur Deka
Quentin Leboutet
David Hafner
V. Koltun
55
0
0
13 May 2024
Contextual Affordances for Safe Exploration in Robotic Scenarios
Contextual Affordances for Safe Exploration in Robotic Scenarios
William Z. Ye
Eduardo B. Sandoval
Pamela Carreno-Medrano
Francisco Cru
40
0
0
10 May 2024
Learning Robot Soccer from Egocentric Vision with Deep Reinforcement
  Learning
Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning
Dhruva Tirumala
Markus Wulfmeier
Ben Moran
Sandy Huang
Jan Humplik
...
Kushal Patel
Marlon Gwira
Francesco Nori
Martin Riedmiller
N. Heess
45
13
0
03 May 2024
On the Utility of External Agent Intention Predictor for Human-AI
  Coordination
On the Utility of External Agent Intention Predictor for Human-AI Coordination
Chenxu Wang
Zilong Chen
Angelo Cangelosi
Huaping Liu
44
1
0
03 May 2024
SwarmRL: Building the Future of Smart Active Systems
SwarmRL: Building the Future of Smart Active Systems
S. Tovey
Christoph Lohrmann
Tobias Merkt
David Zimmer
Konstantin Nikolaou
Simon Koppenhoefer
Anna Bushmakina
Jonas Scheunemann
Christian Holm
AI4CE
50
2
0
25 Apr 2024
RUMOR: Reinforcement learning for Understanding a Model of the Real World for Navigation in Dynamic Environments
RUMOR: Reinforcement learning for Understanding a Model of the Real World for Navigation in Dynamic Environments
Diego Martínez Baselga
L. Riazuelo
Luis Montano
100
1
0
25 Apr 2024
Safe Reinforcement Learning on the Constraint Manifold: Theory and
  Applications
Safe Reinforcement Learning on the Constraint Manifold: Theory and Applications
Puze Liu
Haitham Bou-Ammar
Jan Peters
Davide Tateo
56
9
0
13 Apr 2024
Is Exploration All You Need? Effective Exploration Characteristics for
  Transfer in Reinforcement Learning
Is Exploration All You Need? Effective Exploration Characteristics for Transfer in Reinforcement Learning
Jonathan C. Balloch
Rishav Bhagat
Geigh Zollicoffer
Ruoran Jia
Julia Kim
Mark O. Riedl
OffRL
47
1
0
02 Apr 2024
Active Exploration in Bayesian Model-based Reinforcement Learning for
  Robot Manipulation
Active Exploration in Bayesian Model-based Reinforcement Learning for Robot Manipulation
Carlos Plou
Ana C. Murillo
Ruben Martinez-Cantin
OffRL
45
0
0
02 Apr 2024
Extremum-Seeking Action Selection for Accelerating Policy Optimization
Extremum-Seeking Action Selection for Accelerating Policy Optimization
Ya-Chien Chang
Sicun Gao
66
0
0
02 Apr 2024
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept,
  Taxonomy, and Methods
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods
Yuji Cao
Huan Zhao
Yuheng Cheng
Ting Shu
Guolong Liu
Gaoqi Liang
Junhua Zhao
Yun Li
LLMAG
KELM
OffRL
LM&Ro
67
54
0
30 Mar 2024
Robust Model Based Reinforcement Learning Using $\mathcal{L}_1$ Adaptive
  Control
Robust Model Based Reinforcement Learning Using L1\mathcal{L}_1L1​ Adaptive Control
Minjun Sung
Sambhu H. Karumanchi
Aditya Gahlawat
N. Hovakimyan
69
1
0
21 Mar 2024
A Roadmap Towards Automated and Regulated Robotic Systems
A Roadmap Towards Automated and Regulated Robotic Systems
Yihao Liu
Mehran Armand
60
2
0
21 Mar 2024
Generalising Multi-Agent Cooperation through Task-Agnostic Communication
Generalising Multi-Agent Cooperation through Task-Agnostic Communication
Dulhan Jayalath
Steven D. Morad
Amanda Prorok
45
0
0
11 Mar 2024
Robustifying a Policy in Multi-Agent RL with Diverse Cooperative
  Behaviors and Adversarial Style Sampling for Assistive Tasks
Robustifying a Policy in Multi-Agent RL with Diverse Cooperative Behaviors and Adversarial Style Sampling for Assistive Tasks
Takayuki Osa
Tatsuya Harada
63
2
0
01 Mar 2024
Advancing Investment Frontiers: Industry-grade Deep Reinforcement
  Learning for Portfolio Optimization
Advancing Investment Frontiers: Industry-grade Deep Reinforcement Learning for Portfolio Optimization
Philip Ndikum
Serge Ndikum
67
1
0
27 Feb 2024
FLD: Fourier Latent Dynamics for Structured Motion Representation and
  Learning
FLD: Fourier Latent Dynamics for Structured Motion Representation and Learning
Chenhao Li
Elijah Stanger-Jones
Steve Heim
Sangbae Kim
53
10
0
21 Feb 2024
Beyond Worst-case Attacks: Robust RL with Adaptive Defense via
  Non-dominated Policies
Beyond Worst-case Attacks: Robust RL with Adaptive Defense via Non-dominated Policies
Xiangyu Liu
Chenghao Deng
Yanchao Sun
Yongyuan Liang
Furong Huang
AAML
51
6
0
20 Feb 2024
Analyzing Adversarial Inputs in Deep Reinforcement Learning
Analyzing Adversarial Inputs in Deep Reinforcement Learning
Davide Corsi
Guy Amir
Guy Katz
Alessandro Farinelli
AAML
41
7
0
07 Feb 2024
Towards Optimal Adversarial Robust Q-learning with Bellman
  Infinity-error
Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error
Haoran Li
Zicheng Zhang
Wang Luo
Congying Han
Yudong Hu
Tiande Guo
Shichen Liao
AAML
62
2
0
03 Feb 2024
Optimal Potential Shaping on SE(3) via Neural ODEs on Lie Groups
Optimal Potential Shaping on SE(3) via Neural ODEs on Lie Groups
Yannik P. Wotte
Federico Califano
Stefano Stramigioli
AI4CE
37
1
0
25 Jan 2024
Concept: Dynamic Risk Assessment for AI-Controlled Robotic Systems
Concept: Dynamic Risk Assessment for AI-Controlled Robotic Systems
Philipp Grimmeisen
Friedrich Sautter
Andrey Morozov
18
1
0
25 Jan 2024
A Safe Reinforcement Learning Algorithm for Supervisory Control of Power
  Plants
A Safe Reinforcement Learning Algorithm for Supervisory Control of Power Plants
Yixuan Sun
Sami Khairy
Richard B. Vilim
Rui Hu
Akshay J. Dave
57
2
0
23 Jan 2024
Port-Hamiltonian Neural ODE Networks on Lie Groups For Robot Dynamics
  Learning and Control
Port-Hamiltonian Neural ODE Networks on Lie Groups For Robot Dynamics Learning and Control
T. Duong
Abdullah Altawaitan
Jason Stanley
Nikolay Atanasov
54
10
0
17 Jan 2024
BET: Explaining Deep Reinforcement Learning through The Error-Prone
  Decisions
BET: Explaining Deep Reinforcement Learning through The Error-Prone Decisions
Xiao Liu
Jie Zhao
Wubing Chen
Mao Tan
Yongxin Su
OffRL
FAtt
38
0
0
14 Jan 2024
General-purpose foundation models for increased autonomy in
  robot-assisted surgery
General-purpose foundation models for increased autonomy in robot-assisted surgery
Samuel Schmidgall
Ji Woong Kim
Alan Kuntz
A. Ghazi
Axel Krieger
MedIm
64
10
0
01 Jan 2024
Efficient Reinforcement Learning via Decoupling Exploration and
  Utilization
Efficient Reinforcement Learning via Decoupling Exploration and Utilization
Jingpu Yang
Helin Wang
Qirui Zhao
Zhecheng Shi
Zirui Song
Miao Fang
31
0
0
26 Dec 2023
Human-AI Collaboration in Real-World Complex Environment with
  Reinforcement Learning
Human-AI Collaboration in Real-World Complex Environment with Reinforcement Learning
Md Saiful Islam
Srijita Das
S. Gottipati
William Duguay
Clodéric Mars
Jalal Arabneydi
Antoine Fagette
Matthew J. Guzdial
Matthew E. Taylor
43
1
0
23 Dec 2023
GraspLDM: Generative 6-DoF Grasp Synthesis using Latent Diffusion Models
GraspLDM: Generative 6-DoF Grasp Synthesis using Latent Diffusion Models
K. R. Barad
Andrej Orsula
Antoine Richard
Jan Dentler
Miguel Olivares-Mendez
Carol Martinez
34
16
0
18 Dec 2023
Modifying RL Policies with Imagined Actions: How Predictable Policies
  Can Enable Users to Perform Novel Tasks
Modifying RL Policies with Imagined Actions: How Predictable Policies Can Enable Users to Perform Novel Tasks
Isaac S. Sheidlower
Reuben M. Aronson
E. Short
OffRL
93
1
0
10 Dec 2023
A Multifidelity Sim-to-Real Pipeline for Verifiable and Compositional
  Reinforcement Learning
A Multifidelity Sim-to-Real Pipeline for Verifiable and Compositional Reinforcement Learning
Cyrus Neary
Christian Ellis
Aryaman Singh Samyal
Craig T. Lennon
Ufuk Topcu
OffRL
271
0
0
02 Dec 2023
Program Machine Policy: Addressing Long-Horizon Tasks by Integrating
  Program Synthesis and State Machines
Program Machine Policy: Addressing Long-Horizon Tasks by Integrating Program Synthesis and State Machines
Yu-An Lin
Chen-Tao Lee
Guanhui. Liu
Pu-Jen Cheng
Shao-Hua Sun
36
0
0
27 Nov 2023
Offline Skill Generalization via Task and Motion Planning
Offline Skill Generalization via Task and Motion Planning
Shin Watanabe
Geir Horn
J. Tørresen
K. Ellefsen
OffRL
50
0
0
24 Nov 2023
Learning to Control under Uncertainty with Data-Based Iterative Linear
  Quadratic Regulator
Learning to Control under Uncertainty with Data-Based Iterative Linear Quadratic Regulator
Ran A. Wang
Raman Goyal
S. Chakravorty
23
1
0
08 Nov 2023
Accelerating Reinforcement Learning of Robotic Manipulations via
  Feedback from Large Language Models
Accelerating Reinforcement Learning of Robotic Manipulations via Feedback from Large Language Models
Kun-Mo Chu
Xufeng Zhao
C. Weber
Mengdi Li
Stefan Wermter
LLMAG
LM&Ro
54
14
0
04 Nov 2023
Grow Your Limits: Continuous Improvement with Real-World RL for Robotic
  Locomotion
Grow Your Limits: Continuous Improvement with Real-World RL for Robotic Locomotion
Laura M. Smith
Yunhao Cao
Sergey Levine
OffRL
38
19
0
26 Oct 2023
Robot Skill Generalization via Keypoint Integrated Soft Actor-Critic
  Gaussian Mixture Models
Robot Skill Generalization via Keypoint Integrated Soft Actor-Critic Gaussian Mixture Models
Iman Nematollahi
Kirill Yankov
Wolfram Burgard
Tim Welschehold
44
0
0
23 Oct 2023
Examining the simulation-to-reality gap of a wheel loader digging in
  deformable terrain
Examining the simulation-to-reality gap of a wheel loader digging in deformable terrain
Kojiro Aoshima
Martin Servin
AI4CE
81
4
0
09 Oct 2023
Domain Randomization for Sim2real Transfer of Automatically Generated
  Grasping Datasets
Domain Randomization for Sim2real Transfer of Automatically Generated Grasping Datasets
J. Huber
François Hélénon
Hippolyte Watrelot
F. B. Amar
Stéphane Doncieux
35
12
0
06 Oct 2023
Compositional Servoing by Recombining Demonstrations
Compositional Servoing by Recombining Demonstrations
Max Argus
Abhijeet Nayak
Martin Buchner
Silvio Galesso
Abhinav Valada
Thomas Brox
41
0
0
06 Oct 2023
In-Hand Re-grasp Manipulation with Passive Dynamic Actions via Imitation
  Learning
In-Hand Re-grasp Manipulation with Passive Dynamic Actions via Imitation Learning
Dehao Wei
Guokang Sun
Zeyu Ren
Shuang Li
Zhufeng Shao
Xiang Li
Nikos Tsagarakis
Shaohua Ma
17
1
0
27 Sep 2023
Maximum diffusion reinforcement learning
Maximum diffusion reinforcement learning
Thomas A. Berrueta
Allison Pinosky
Todd Murphey
AI4CE
DiffM
45
5
0
26 Sep 2023
Effective Multi-Agent Deep Reinforcement Learning Control with Relative
  Entropy Regularization
Effective Multi-Agent Deep Reinforcement Learning Control with Relative Entropy Regularization
Chenyang Miao
Yunduan Cui
Huiyun Li
Xin Wu
55
5
0
26 Sep 2023
Machine Learning Meets Advanced Robotic Manipulation
Machine Learning Meets Advanced Robotic Manipulation
Saeid Nahavandi
R. Alizadehsani
D. Nahavandi
Chee Peng Lim
Kevin Kelly
Fernando Bello
38
17
0
22 Sep 2023
Machine Learning-Driven Burrowing with a Snake-Like Robot
Machine Learning-Driven Burrowing with a Snake-Like Robot
Sean Even
Holden Gordon
Hoeseok Yang
Yasemin Ozkan-Aydin
39
2
0
19 Sep 2023
Contrastive Initial State Buffer for Reinforcement Learning
Contrastive Initial State Buffer for Reinforcement Learning
Nico Messikommer
Yunlong Song
Davide Scaramuzza
OffRL
49
9
0
18 Sep 2023
Learning Visual Tracking and Reaching with Deep Reinforcement Learning
  on a UR10e Robotic Arm
Learning Visual Tracking and Reaching with Deep Reinforcement Learning on a UR10e Robotic Arm
C. Bellinger
Laurence Lamarche-Cliche
37
0
0
28 Aug 2023
Intentionally-underestimated Value Function at Terminal State for
  Temporal-difference Learning with Mis-designed Reward
Intentionally-underestimated Value Function at Terminal State for Temporal-difference Learning with Mis-designed Reward
Taisuke Kobayashi
43
3
0
24 Aug 2023
Learning the Plasticity: Plasticity-Driven Learning Framework in Spiking
  Neural Networks
Learning the Plasticity: Plasticity-Driven Learning Framework in Spiking Neural Networks
Guobin Shen
Dongcheng Zhao
Yiting Dong
Yang Li
Feifei Zhao
Yi Zeng
AI4CE
38
0
0
23 Aug 2023
Value-Informed Skill Chaining for Policy Learning of Long-Horizon Tasks
  with Surgical Robot
Value-Informed Skill Chaining for Policy Learning of Long-Horizon Tasks with Surgical Robot
Tao Huang
Kai-xiang Chen
Wang Wei
Jianan Li
Yonghao Long
Qi Dou
OffRL
39
6
0
31 Jul 2023
Previous
12345
Next