ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2409.18343
  4. Cited By
Improving Agent Behaviors with RL Fine-tuning for Autonomous Driving

Improving Agent Behaviors with RL Fine-tuning for Autonomous Driving

26 September 2024
Zhenghao Peng
Wenjie Luo
Yiren Lu
Tianyi Shen
Cole Gulino
Ari Seff
Justin Fu
ArXiv (abs)PDFHTML

Papers citing "Improving Agent Behaviors with RL Fine-tuning for Autonomous Driving"

22 / 22 papers shown
Title
Learning Realistic Traffic Agents in Closed-loop
Learning Realistic Traffic Agents in Closed-loop
Chris Zhang
James Tu
Lunjun Zhang
Kelvin Wong
Simon Suo
R. Urtasun
80
21
0
02 Nov 2023
ScenarioNet: Open-Source Platform for Large-Scale Traffic Scenario
  Simulation and Modeling
ScenarioNet: Open-Source Platform for Large-Scale Traffic Scenario Simulation and Modeling
Quanyi Li
Zhenghao Peng
Lan Feng
Zhizheng Liu
Chenda Duan
Wen-An Mo
Bolei Zhou
64
48
0
21 Jun 2023
TrafficBots: Towards World Models for Autonomous Driving Simulation and
  Motion Prediction
TrafficBots: Towards World Models for Autonomous Driving Simulation and Motion Prediction
Zhejun Zhang
Alexander Liniger
Dengxin Dai
Feng Yu
Luc Van Gool
115
43
0
07 Mar 2023
A Review of Off-Policy Evaluation in Reinforcement Learning
A Review of Off-Policy Evaluation in Reinforcement Learning
Masatoshi Uehara
C. Shi
Nathan Kallus
OffRL
92
76
0
13 Dec 2022
MolE: a molecular foundation model for drug discovery
MolE: a molecular foundation model for drug discovery
Oscar Méndez-Lucio
C. Nicolaou
Berton Earnshaw
68
10
0
03 Nov 2022
Guided Conditional Diffusion for Controllable Traffic Simulation
Guided Conditional Diffusion for Controllable Traffic Simulation
Ziyuan Zhong
Davis Rempe
Danfei Xu
Yuxiao Chen
Sushant Veer
Tong Che
Baishakhi Ray
Marco Pavone
68
155
0
31 Oct 2022
TrafficGen: Learning to Generate Diverse and Realistic Traffic Scenarios
TrafficGen: Learning to Generate Diverse and Realistic Traffic Scenarios
Lan Feng
Quanyi Li
Zhenghao Peng
Shuhan Tan
Bolei Zhou
80
88
0
12 Oct 2022
Motion Transformer with Global Intention Localization and Local Movement
  Refinement
Motion Transformer with Global Intention Localization and Local Movement Refinement
Shaoshuai Shi
Li Jiang
Dengxin Dai
Bernt Schiele
94
239
0
27 Sep 2022
Wayformer: Motion Forecasting via Simple & Efficient Attention Networks
Wayformer: Motion Forecasting via Simple & Efficient Attention Networks
Nigamaa Nayakanti
Rami Al-Rfou
Aurick Zhou
Kratarth Goel
Khaled S. Refaat
Benjamin Sapp
AI4TS
118
256
0
12 Jul 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLMDiffM
410
6,897
0
13 Apr 2022
TrajGen: Generating Realistic and Diverse Trajectories with Reactive and
  Feasible Agent Behaviors for Autonomous Driving
TrajGen: Generating Realistic and Diverse Trajectories with Reactive and Feasible Agent Behaviors for Autonomous Driving
Qichao Zhang
Yinfeng Gao
Yikang Zhang
Youtian Guo
Dawei Ding
Yunpeng Wang
Peng Sun
Dongbin Zhao
81
35
0
31 Mar 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLMALM
880
13,148
0
04 Mar 2022
MetaDrive: Composing Diverse Driving Scenarios for Generalizable
  Reinforcement Learning
MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning
Quanyi Li
Zhenghao Peng
Lan Feng
Qihang Zhang
Zhenghai Xue
Bolei Zhou
98
254
0
26 Sep 2021
PredictionNet: Real-Time Joint Probabilistic Traffic Prediction for
  Planning, Control, and Simulation
PredictionNet: Real-Time Joint Probabilistic Traffic Prediction for Planning, Control, and Simulation
A. Kamenev
Lirui Wang
Ollin Boer Bohan
Ishwar Kulkarni
Bilal Kartal
Artem Molchanov
Stan Birchfield
David Nistér
Nikolai Smolyanskiy
95
40
0
23 Sep 2021
SimNet: Learning Reactive Self-driving Simulations from Real-world
  Observations
SimNet: Learning Reactive Self-driving Simulations from Real-world Observations
Luca Bergamini
Yawei Ye
Oliver Scheel
Long Chen
Chih Hu
Luca Del Pero
B. Osinski
Hugo Grimmett
Peter Ondruska
59
103
0
26 May 2021
TrafficSim: Learning to Simulate Realistic Multi-Agent Behaviors
TrafficSim: Learning to Simulate Realistic Multi-Agent Behaviors
Simon Suo
S. Regalado
Sergio Casas
R. Urtasun
183
230
0
17 Jan 2021
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for
  Autonomous Driving
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving
Ming Zhou
Jun Luo
Julian Villela
Yaodong Yang
David Rusu
...
H. Ammar
Hongbo Zhang
Wulong Liu
Jianye Hao
Jun Wang
175
195
0
19 Oct 2020
Language Models are Few-Shot Learners
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
835
42,332
0
28 May 2020
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for
  Vision-and-Language Tasks
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Jiasen Lu
Dhruv Batra
Devi Parikh
Stefan Lee
SSLVLM
234
3,693
0
06 Aug 2019
CARLA: An Open Urban Driving Simulator
CARLA: An Open Urban Driving Simulator
Alexey Dosovitskiy
G. Ros
Felipe Codevilla
Antonio M. López
V. Koltun
VLM
137
5,199
0
10 Nov 2017
End-to-end Driving via Conditional Imitation Learning
End-to-end Driving via Conditional Imitation Learning
Felipe Codevilla
Matthias Muller
Antonio M. López
V. Koltun
Alexey Dosovitskiy
131
1,066
0
06 Oct 2017
A Reduction of Imitation Learning and Structured Prediction to No-Regret
  Online Learning
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
231
3,232
0
02 Nov 2010
1