Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving

11 October 2016
Shai Shalev-Shwartz
Shaked Shammah
Amnon Shashua

Papers citing "Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving"

17 / 17 papers shown
Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL
Zhikun Tao
Gang Xiong
He Fang
Zhen Shen
Yunjun Han
Qing-Shan Jia
OffRL
89
0
0
13 May 2025
MARFT: Multi-Agent Reinforcement Fine-Tuning
Junwei Liao
Muning Wen
Jun Wang
Weinan Zhang
OffRL
72
1
0
21 Apr 2025
Autonomous Algorithm for Training Autonomous Vehicles with Minimal Human Intervention
Sang-Hyun Lee
Daehyeok Kwon
Seung-Woo Seo
97
1
0
17 Jan 2025
Multi-Agent Quantum Reinforcement Learning using Evolutionary Optimization
Michael Kolle
Felix Topp
Thomy Phan
Philipp Altmann
Jonas Nusslein
Claudia Linnhoff-Popien
AI4CE
74
5
0
03 Jan 2025
A Multi-Agent Approach for REST API Testing with Semantic Graphs and LLM-Driven Inputs
Myeongsoo Kim
Tyler Stennett
Saurabh Sinha
Alessandro Orso
57
6
0
11 Nov 2024
Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning
Lanqing Li
Hai Zhang
Xinyu Zhang
Shatong Zhu
Junqiao Zhao
Pheng-Ann Heng
OffRL
65
8
0
04 Feb 2024
Networked Communication for Decentralised Agents in Mean-Field Games
Patrick Benjamin
Alessandro Abate
FedML
75
2
0
05 Jun 2023
Global Convergence of Policy Gradient for Linear-Quadratic Mean-Field Control/Game in Continuous Time
Weichen Wang
Jiequn Han
Zhuoran Yang
Zhaoran Wang
54
27
0
16 Aug 2020
Reinforcement Learning with Uncertainty Estimation for Tactical Decision-Making in Intersections
C. Hoel
Tommy Tram
J. Sjöberg
43
30
0
17 Jun 2020
Algorithmic decision-making in AVs: Understanding ethical and technical concerns for smart cities
H. S. M. Lim
Araz Taeihagh
41
83
0
29 Oct 2019
End to End Learning for Self-Driving Cars
Mariusz Bojarski
D. Testa
Daniel Dworakowski
Bernhard Firner
B. Flepp
...
Urs Muller
Jiakai Zhang
Xin Zhang
Jake Zhao
Karol Zieba
SSL
46
4,153
0
25 Apr 2016
A Deep Hierarchical Approach to Lifelong Learning in Minecraft
Chen Tessler
Shahar Givony
Tom Zahavy
D. Mankowitz
Shie Mannor
CLL
110
378
0
25 Apr 2016
SDCA without Duality, Regularization, and Individual Convexity
Shai Shalev-Shwartz
29
104
0
04 Feb 2016
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
38
3,368
0
08 Jun 2015
Safe Policy Search for Lifelong Reinforcement Learning with Sublinear Regret
Haitham Bou-Ammar
Rasul Tutunov
Eric Eaton
OffRL
CLL
48
64
0
21 May 2015
Stochastic Gradient Descent, Weighted Sampling, and the Randomized Kaczmarz algorithm
Deanna Needell
Nathan Srebro
Rachel A. Ward
94
551
0
21 Oct 2013
Stochastic Dual Coordinate Ascent Methods for Regularized Loss Minimization
Shai Shalev-Shwartz
Tong Zhang
104
1,031
0
10 Sep 2012