2210.05639
Cited By
Versions: v1, v2 (latest)
Discovered Policy Optimisation
11 October 2022
Chris Xiaoxuan Lu
J. Kuba
Alistair Letcher
Luke Metz
Christian Schroeder de Witt
Jakob N. Foerster
OffRL
Papers citing "Discovered Policy Optimisation"
13 / 63 papers shown
Title
ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages
Andrew Jesson
Chris Xiaoxuan Lu
Gunshi Gupta
Angelos Filos
Jakob N. Foerster
Y. Gal
OffRL
77
8
0
02 Jun 2023
Hyperparameters in Reinforcement Learning and How To Tune Them
Theresa Eimer
Marius Lindauer
Roberta Raileanu
OffRL
219
44
0
02 Jun 2023
Discovering Attention-Based Genetic Algorithms via Meta-Black-Box Optimization
R. T. Lange
Tom Schaul
Yutian Chen
Chris Xiaoxuan Lu
Tom Zahavy
Valentin Dalibard
Sebastian Flennerhag
114
36
0
08 Apr 2023
Pgx: Hardware-Accelerated Parallel Game Simulators for Reinforcement Learning
Sotetsu Koyamada
Shinri Okano
Soichiro Nishimori
Y. Murata
Keigo Habara
Haruka Kita
Shin Ishii
120
26
0
29 Mar 2023
Arbitrary Order Meta-Learning with Simple Population-Based Evolution
Chris Xiaoxuan Lu
Sebastian Towers
Jakob N. Foerster
74
5
0
16 Mar 2023
Structured State Space Models for In-Context Reinforcement Learning
Chris Xiaoxuan Lu
Yannick Schroecker
Albert Gu
Emilio Parisotto
Jakob N. Foerster
Satinder Singh
Feryal M. P. Behbahani
AI4TS
180
99
0
07 Mar 2023
Learning to Optimize for Reinforcement Learning
Qingfeng Lan
Rupam Mahmood
Shuicheng Yan
Zhongwen Xu
OffRL
115
7
0
03 Feb 2023
A Novel Framework for Policy Mirror Descent with General Parameterization and Linear Convergence
Carlo Alfano
Rui Yuan
Patrick Rebeschini
145
15
0
30 Jan 2023
evosax: JAX-based Evolution Strategies
R. T. Lange
102
57
0
08 Dec 2022
Discovering Evolution Strategies via Meta-Black-Box Optimization
R. T. Lange
Tom Schaul
Yutian Chen
Tom Zahavy
Valentin Dalibard
Chris Xiaoxuan Lu
Satinder Singh
Sebastian Flennerhag
116
49
0
21 Nov 2022
Illusory Attacks: Information-Theoretic Detectability Matters in Adversarial Attacks
Tim Franzmeyer
Stephen McAleer
João F. Henriques
Jakob N. Foerster
Philip Torr
Adel Bibi
Christian Schroeder de Witt
AAML
78
8
0
20 Jul 2022
Evolving Pareto-Optimal Actor-Critic Algorithms for Generalizability and Stability
Juan Jose Garau-Luis
Yingjie Miao
John D. Co-Reyes
Aaron T Parisi
Jie Tan
Esteban Real
Aleksandra Faust
88
0
0
08 Apr 2022
Mava: a research library for distributed multi-agent reinforcement learning in JAX
Arnu Pretorius
Kale-ab Tessera
St John Grimbly
Kevin Eloff
Lawrence Francis
Claude Formanek
Andries P. Smit
Alexandre Laterre
119
13
0
03 Jul 2021