Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2003.03900
Cited By
FormulaZero: Distributionally Robust Online Adaptation via Offline Population Synthesis
9 March 2020
Aman Sinha
Matthew O'Kelly
Hongrui Zheng
Rahul Mangharam
John C. Duchi
Russ Tedrake
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"FormulaZero: Distributionally Robust Online Adaptation via Offline Population Synthesis"
29 / 29 papers shown
Title
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
63
1,805
0
13 Dec 2019
Efficient Black-box Assessment of Autonomous Vehicle Safety
J. Norden
Matthew O'Kelly
Aman Sinha
35
66
0
08 Dec 2019
Jointly Learnable Behavior and Trajectory Planning for Self-Driving Vehicles
Abbas Sadat
Mengye Ren
A. Pokrovsky
Yen-Chen Lin
Ersin Yumer
R. Urtasun
37
93
0
10 Oct 2019
Adversarial Policies: Attacking Deep Reinforcement Learning
Adam Gleave
Michael Dennis
Cody Wild
Neel Kant
Sergey Levine
Stuart J. Russell
AAML
31
350
0
25 May 2019
Online Vehicle Trajectory Prediction using Policy Anticipation Network and Optimization-based Context Reasoning
Wenchao Ding
Shaojie Shen
15
52
0
03 Mar 2019
Online Control with Adversarial Disturbances
Naman Agarwal
Brian Bullins
Elad Hazan
Sham Kakade
Karan Singh
15
236
0
23 Feb 2019
Distributionally Robust Reinforcement Learning
E. Smirnova
Elvis Dohmatob
Jérémie Mary
OffRL
29
59
0
23 Feb 2019
AlphaStar: An Evolutionary Computation Perspective
Kai Arulkumaran
Antoine Cully
Julian Togelius
21
183
0
05 Feb 2019
Human-level performance in first-person multiplayer games with population-based deep reinforcement learning
Max Jaderberg
Wojciech M. Czarnecki
Iain Dunning
Luke Marris
Guy Lever
...
Joel Z Leibo
David Silver
Demis Hassabis
Koray Kavukcuoglu
T. Graepel
OffRL
54
717
0
03 Jul 2018
Probabilistic Model-Agnostic Meta-Learning
Chelsea Finn
Kelvin Xu
Sergey Levine
BDL
236
666
0
07 Jun 2018
Regret Bounds for Robust Adaptive Control of the Linear Quadratic Regulator
Sarah Dean
Horia Mania
Nikolai Matni
Benjamin Recht
Stephen Tu
18
283
0
23 May 2018
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
122
5,121
0
26 Feb 2018
A Non-Cooperative Game Approach to Autonomous Racing
Alexander Liniger
John Lygeros
36
120
0
11 Dec 2017
Parallel WaveNet: Fast High-Fidelity Speech Synthesis
Aaron van den Oord
Yazhe Li
Igor Babuschkin
Karen Simonyan
Oriol Vinyals
...
Alex Graves
Helen King
T. Walters
Dan Belov
Demis Hassabis
83
857
0
28 Nov 2017
Population Based Training of Neural Networks
Max Jaderberg
Valentin Dalibard
Simon Osindero
Wojciech M. Czarnecki
Jeff Donahue
...
Tim Green
Iain Dunning
Karen Simonyan
Chrisantha Fernando
Koray Kavukcuoglu
32
736
0
27 Nov 2017
Budget-Constrained Multi-Armed Bandits with Multiple Plays
Datong P. Zhou
Claire Tomlin
22
59
0
16 Nov 2017
Certifying Some Distributional Robustness with Principled Adversarial Training
Aman Sinha
Hongseok Namkoong
Riccardo Volpi
John C. Duchi
OOD
69
858
0
29 Oct 2017
Emergent Complexity via Multi-Agent Competition
Trapit Bansal
J. Pachocki
Szymon Sidor
Ilya Sutskever
Igor Mordatch
37
384
0
10 Oct 2017
Autonomous Racing with AutoRally Vehicles and Differential Games
Grady Williams
Brian Goldfain
P. Drews
James M. Rehg
Evangelos A. Theodorou
18
22
0
14 Jul 2017
Masked Autoregressive Flow for Density Estimation
George Papamakarios
Theo Pavlakou
Iain Murray
73
1,340
0
19 May 2017
CDDT: Fast Approximate 2D Ray Casting for Accelerated Localization
Corey H. Walsh
S. Karaman
22
32
0
02 May 2017
Robust Adversarial Reinforcement Learning
Lerrel Pinto
James Davidson
Rahul Sukthankar
Abhinav Gupta
OOD
67
848
0
08 Mar 2017
Opponent Modeling in Deep Reinforcement Learning
He He
Jordan L. Boyd-Graber
Kevin Kwok
Hal Daumé III
BDL
51
324
0
18 Sep 2016
Improving Variational Inference with Inverse Autoregressive Flow
Diederik P. Kingma
Tim Salimans
Rafal Jozefowicz
Xi Chen
Ilya Sutskever
Max Welling
BDL
DRL
72
1,805
0
15 Jun 2016
Variational Inference with Normalizing Flows
Danilo Jimenez Rezende
S. Mohamed
DRL
BDL
203
4,143
0
21 May 2015
Illuminating search spaces by mapping elites
Jean-Baptiste Mouret
Jeff Clune
42
728
0
20 Apr 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
262
149,474
0
22 Dec 2014
Determinantal point processes for machine learning
Alex Kulesza
B. Taskar
208
1,130
0
25 Jul 2012
Sampling-based Algorithms for Optimal Motion Planning
S. Karaman
Emilio Frazzoli
56
4,660
0
05 May 2011
1