Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.14833
Cited By
Learning to Control Autonomous Fleets from Observation via Offline Reinforcement Learning
28 February 2023
Carolin Schmidt
Daniele Gammelli
Francisco Câmara Pereira
Filipe Rodrigues
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Learning to Control Autonomous Fleets from Observation via Offline Reinforcement Learning"
12 / 12 papers shown
Title
Graph Reinforcement Learning for Network Control via Bi-Level Optimization
Daniele Gammelli
James Harrison
Kaidi Yang
Marco Pavone
Filipe Rodrigues
Francisco Câmara Pereira
AI4CE
63
6
0
16 May 2023
Spatio-temporal Incentives Optimization for Ride-hailing Services with Offline Deep Reinforcement Learning
Yanqiu Wu
Qingyang Li
Zhiwei Qin
OffRL
39
3
0
06 Nov 2022
Reinforcement Learning in the Wild: Scalable RL Dispatching Algorithm Deployed in Ridehailing Marketplace
S. S. Eshkevari
Xiaocheng Tang
Zhiwei Qin
Jinhan Mei
Cheng Zhang
Qianying Meng
Jia Xu
31
23
0
10 Feb 2022
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Michael Janner
Qiyang Li
Sergey Levine
OffRL
95
667
0
03 Jun 2021
Value Function is All You Need: A Unified Learning Framework for Ride Hailing Platforms
Xiaocheng Tang
Fan Zhang
Zhiwei Qin
Yansheng Wang
D. Shi
Bingchen Song
Yongxin Tong
Hongtu Zhu
Jieping Ye
OffRL
42
48
0
18 May 2021
Reinforcement Learning for Ridesharing: An Extended Survey
Zhiwei Qin
Hongtu Zhu
Jieping Ye
72
85
0
03 May 2021
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning
Rongjun Qin
Songyi Gao
Xingyuan Zhang
Zhen Xu
Shengkai Huang
Zewen Li
Weinan Zhang
Yang Yu
OffRL
177
81
0
01 Feb 2021
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRL
OnRL
99
1,780
0
08 Jun 2020
MOReL : Model-Based Offline Reinforcement Learning
Rahul Kidambi
Aravind Rajeswaran
Praneeth Netrapalli
Thorsten Joachims
OffRL
70
662
0
12 May 2020
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
GP
OffRL
179
1,338
0
15 Apr 2020
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRL
OnRL
79
1,044
0
03 Jun 2019
Offline A/B testing for Recommender Systems
Alexandre Gilotte
Clément Calauzènes
Thomas Nedelec
A. Abraham
Simon Dollé
OffRL
62
220
0
22 Jan 2018
1