Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.07574
Cited By
Gap-Dependent Bounds for Q-Learning using Reference-Advantage Decomposition
10 October 2024
Zhong Zheng
Haochen Zhang
Lingzhou Xue
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Gap-Dependent Bounds for Q-Learning using Reference-Advantage Decomposition"
1 / 1 papers shown
Title
In-Trajectory Inverse Reinforcement Learning: Learn Incrementally Before An Ongoing Trajectory Terminates
Shicheng Liu
Minghui Zhu
54
0
0
21 Oct 2024
1