Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.19648
Cited By
v1
v2 (latest)
Human-compatible driving partners through data-regularized self-play reinforcement learning
28 March 2024
Daphne Cornelisse
Eugene Vinitsky
Re-assign community
ArXiv (abs)
PDF
HTML
Github (32★)
Papers citing
"Human-compatible driving partners through data-regularized self-play reinforcement learning"
23 / 23 papers shown
Title
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
98
9
0
02 Aug 2024
Learning Realistic Traffic Agents in Closed-loop
Chris Zhang
James Tu
Lunjun Zhang
Kelvin Wong
Simon Suo
R. Urtasun
77
21
0
02 Nov 2023
Language Conditioned Traffic Generation
Shuhan Tan
Boris Ivanovic
Xinshuo Weng
Marco Pavone
Philipp Kraehenbuehl
77
57
0
16 Jul 2023
Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
Cheng Chi
Zhenjia Xu
S. Feng
Eric A. Cousineau
Yilun Du
Benjamin Burchfiel
Russ Tedrake
Shuran Song
347
1,189
0
07 Mar 2023
imitation: Clean Imitation Learning Implementations
Adam Gleave
Mohammad Taufeeque
Juan Rocamonde
Erik Jenner
Steven H. Wang
Sam Toyer
M. Ernestus
Nora Belrose
Scott Emmons
Stuart J. Russell
MLAU
130
32
0
22 Nov 2022
BITS: Bi-level Imitation for Traffic Simulation
Danfei Xu
Yuxiao Chen
Boris Ivanovic
Marco Pavone
89
84
0
26 Aug 2022
Wayformer: Motion Forecasting via Simple & Efficient Attention Networks
Nigamaa Nayakanti
Rami Al-Rfou
Aurick Zhou
Kratarth Goel
Khaled S. Refaat
Benjamin Sapp
AI4TS
118
256
0
12 Jul 2022
Symphony: Learning Realistic and Diverse Agents for Autonomous Driving Simulation
Maximilian Igl
Daewoo Kim
Alex Kuefler
Paul Mougin
Punit Shah
K. Shiarlis
Drago Anguelov
Mark Palatucci
Brandyn White
Shimon Whiteson
86
67
0
06 May 2022
Modeling Strong and Human-Like Gameplay with KL-Regularized Search
Athul Paul Jacob
David J. Wu
Gabriele Farina
Adam Lerer
Hengyuan Hu
A. Bakhtin
Jacob Andreas
Noam Brown
52
54
0
14 Dec 2021
No-Press Diplomacy from Scratch
A. Bakhtin
David J. Wu
Adam Lerer
Noam Brown
162
44
0
06 Oct 2021
MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning
Quanyi Li
Zhenghao Peng
Lan Feng
Qihang Zhang
Zhenghai Xue
Bolei Zhou
98
254
0
26 Sep 2021
Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset
Scott Ettinger
Shuyang Cheng
Benjamin Caine
Chenxi Liu
Hang Zhao
...
Jiquan Ngiam
Vijay Vasudevan
Alexander McCauley
Jonathon Shlens
Drago Anguelov
192
561
0
20 Apr 2021
TrafficSim: Learning to Simulate Realistic Multi-Agent Behaviors
Simon Suo
S. Regalado
Sergio Casas
R. Urtasun
183
230
0
17 Jan 2021
"Other-Play" for Zero-Shot Coordination
Hengyuan Hu
Adam Lerer
A. Peysakhovich
Jakob N. Foerster
VLM
OffRL
173
222
0
06 Mar 2020
The Hanabi Challenge: A New Frontier for AI Research
Nolan Bard
Jakob N. Foerster
A. Chandar
Neil Burch
Marc Lanctot
...
Iain Dunning
Shibl Mourad
Hugo Larochelle
Marc G. Bellemare
Michael Bowling
LLMAG
69
353
0
01 Feb 2019
CIRL: Controllable Imitative Reinforcement Learning for Vision-based Self-driving
Xiaodan Liang
Tairui Wang
Luona Yang
Eric Xing
58
269
0
10 Jul 2018
CARLA: An Open Urban Driving Simulator
Alexey Dosovitskiy
G. Ros
Felipe Codevilla
Antonio M. López
V. Koltun
VLM
137
5,199
0
10 Nov 2017
Learning Robust Rewards with Adversarial Inverse Reinforcement Learning
Justin Fu
Katie Z Luo
Sergey Levine
129
757
0
30 Oct 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
523
19,237
0
20 Jul 2017
Virtual to Real Reinforcement Learning for Autonomous Driving
Xinlei Pan
Yurong You
Ziyan Wang
Cewu Lu
OffRL
69
336
0
13 Apr 2017
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
156
3,119
0
10 Jun 2016
End to End Learning for Self-Driving Cars
Mariusz Bojarski
D. Testa
Daniel Dworakowski
Bernhard Firner
B. Flepp
...
Urs Muller
Jiakai Zhang
Xin Zhang
Jake Zhao
Karol Zieba
SSL
100
4,175
0
25 Apr 2016
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
231
3,232
0
02 Nov 2010
1