Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2202.07152
Cited By
L2C2: Locally Lipschitz Continuous Constraint towards Stable and Smooth Reinforcement Learning
15 February 2022
Taisuke Kobayashi
Re-assign community
ArXiv
PDF
HTML
Papers citing
"L2C2: Locally Lipschitz Continuous Constraint towards Stable and Smooth Reinforcement Learning"
13 / 13 papers shown
Title
Deep Reinforcement Learning for Day-to-day Dynamic Tolling in Tradable Credit Schemes
Xiaoyi Wu
Ravi Seshadri
Filipe Rodrigues
Carlos Lima Azevedo
26
0
0
10 Apr 2025
Weber-Fechner Law in Temporal Difference learning derived from Control as Inference
Keiichiro Takahashi
Taisuke Kobayashi
Tomoya Yamanokuchi
Takamitsu Matsubara
31
0
0
31 Dec 2024
Benchmarking Smoothness and Reducing High-Frequency Oscillations in Continuous Control Policies
Guilherme Christmann
Ying-Sheng Luo
Hanjaya Mandala
Wei-Chao Chen
21
0
0
22 Oct 2024
HG2P: Hippocampus-inspired High-reward Graph and Model-Free Q-Gradient Penalty for Path Planning and Motion Control
Haoran Wang
Yaoru Sun
Zeshen Tang
Haibo Shi
Chenyuan Jiao
27
0
0
12 Oct 2024
LiRA: Light-Robust Adversary for Model-based Reinforcement Learning in Real World
Taisuke Kobayashi
71
2
0
29 Sep 2024
Gradient-based Regularization for Action Smoothness in Robotic Control with Reinforcement Learning
I. Lee
Hoang-Giang Cao
Cong-Tinh Dao
Yu-Cheng Chen
I-Chen Wu
25
0
0
05 Jul 2024
Explicit Lipschitz Value Estimation Enhances Policy Robustness Against Perturbation
Xulin Chen
Ruipeng Liu
Garret E. Katz
42
0
0
22 Apr 2024
Revisiting Experience Replayable Conditions
Taisuke Kobayashi
24
3
0
15 Feb 2024
Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint
Taisuke Kobayashi
29
3
0
08 Mar 2023
Reward Bonuses with Gain Scheduling Inspired by Iterative Deepening Search
Taisuke Kobayashi
9
1
0
21 Dec 2022
Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning
Nikita Rudin
David Hoeller
Philipp Reist
Marco Hutter
115
545
0
24 Sep 2021
Robust Reinforcement Learning on State Observations with Learned Optimal Adversary
Huan Zhang
Hongge Chen
Duane S. Boning
Cho-Jui Hsieh
64
162
0
21 Jan 2021
t-Soft Update of Target Network for Deep Reinforcement Learning
Taisuke Kobayashi
Wendyam Eric Lionel Ilboudo
79
50
0
25 Aug 2020
1