2409.18343
Cited By
Improving Agent Behaviors with RL Fine-tuning for Autonomous Driving
26 September 2024
Zhenghao Peng
Wenjie Luo
Yiren Lu
Tianyi Shen
Cole Gulino
Ari Seff
Justin Fu
Papers citing
"Improving Agent Behaviors with RL Fine-tuning for Autonomous Driving"
22 / 22 papers shown
Title
Learning Realistic Traffic Agents in Closed-loop
Chris Zhang
James Tu
Lunjun Zhang
Kelvin Wong
Simon Suo
R. Urtasun
77
21
0
02 Nov 2023
ScenarioNet: Open-Source Platform for Large-Scale Traffic Scenario Simulation and Modeling
Quanyi Li
Zhenghao Peng
Lan Feng
Zhizheng Liu
Chenda Duan
Wen-An Mo
Bolei Zhou
64
48
0
21 Jun 2023
TrafficBots: Towards World Models for Autonomous Driving Simulation and Motion Prediction
Zhejun Zhang
Alexander Liniger
Dengxin Dai
Feng Yu
Luc Van Gool
115
43
0
07 Mar 2023
A Review of Off-Policy Evaluation in Reinforcement Learning
Masatoshi Uehara
C. Shi
Nathan Kallus
OffRL
92
76
0
13 Dec 2022
MolE: a molecular foundation model for drug discovery
Oscar Méndez-Lucio
C. Nicolaou
Berton Earnshaw
65
10
0
03 Nov 2022
Guided Conditional Diffusion for Controllable Traffic Simulation
Ziyuan Zhong
Davis Rempe
Danfei Xu
Yuxiao Chen
Sushant Veer
Tong Che
Baishakhi Ray
Marco Pavone
68
155
0
31 Oct 2022
TrafficGen: Learning to Generate Diverse and Realistic Traffic Scenarios
Lan Feng
Quanyi Li
Zhenghao Peng
Shuhan Tan
Bolei Zhou
80
88
0
12 Oct 2022
Motion Transformer with Global Intention Localization and Local Movement Refinement
Shaoshuai Shi
Li Jiang
Dengxin Dai
Bernt Schiele
94
239
0
27 Sep 2022
Wayformer: Motion Forecasting via Simple & Efficient Attention Networks
Nigamaa Nayakanti
Rami Al-Rfou
Aurick Zhou
Kratarth Goel
Khaled S. Refaat
Benjamin Sapp
AI4TS
118
256
0
12 Jul 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
410
6,897
0
13 Apr 2022
TrajGen: Generating Realistic and Diverse Trajectories with Reactive and Feasible Agent Behaviors for Autonomous Driving
Qichao Zhang
Yinfeng Gao
Yikang Zhang
Youtian Guo
Dawei Ding
Yunpeng Wang
Peng Sun
Dongbin Zhao
81
35
0
31 Mar 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
880
13,148
0
04 Mar 2022
MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning
Quanyi Li
Zhenghao Peng
Lan Feng
Qihang Zhang
Zhenghai Xue
Bolei Zhou
98
254
0
26 Sep 2021
PredictionNet: Real-Time Joint Probabilistic Traffic Prediction for Planning, Control, and Simulation
A. Kamenev
Lirui Wang
Ollin Boer Bohan
Ishwar Kulkarni
Bilal Kartal
Artem Molchanov
Stan Birchfield
David Nistér
Nikolai Smolyanskiy
95
40
0
23 Sep 2021
SimNet: Learning Reactive Self-driving Simulations from Real-world Observations
Luca Bergamini
Yawei Ye
Oliver Scheel
Long Chen
Chih Hu
Luca Del Pero
B. Osinski
Hugo Grimmett
Peter Ondruska
59
103
0
26 May 2021
TrafficSim: Learning to Simulate Realistic Multi-Agent Behaviors
Simon Suo
S. Regalado
Sergio Casas
R. Urtasun
183
230
0
17 Jan 2021
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving
Ming Zhou
Jun Luo
Julian Villela
Yaodong Yang
David Rusu
...
H. Ammar
Hongbo Zhang
Wulong Liu
Jianye Hao
Jun Wang
175
195
0
19 Oct 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
826
42,332
0
28 May 2020
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Jiasen Lu
Dhruv Batra
Devi Parikh
Stefan Lee
SSL
VLM
231
3,693
0
06 Aug 2019
CARLA: An Open Urban Driving Simulator
Alexey Dosovitskiy
G. Ros
Felipe Codevilla
Antonio M. López
V. Koltun
VLM
137
5,199
0
10 Nov 2017
End-to-end Driving via Conditional Imitation Learning
Felipe Codevilla
Matthias Muller
Antonio M. López
V. Koltun
Alexey Dosovitskiy
131
1,066
0
06 Oct 2017
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
231
3,232
0
02 Nov 2010