1703.03864
Cited By
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
10 March 2017
Tim Salimans
Jonathan Ho
Xi Chen
Szymon Sidor
Ilya Sutskever
Papers citing
"Evolution Strategies as a Scalable Alternative to Reinforcement Learning"
50 / 407 papers shown
Title
Black box meta-learning intrinsic rewards for sparse-reward environments
Octavio Pappalardo
Rodrigo Ramele
Juan Miguel Santos
OffRL
48
0
0
31 Jul 2024
Online Pseudo-Zeroth-Order Training of Neuromorphic Spiking Neural Networks
Mingqing Xiao
Qingyan Meng
Zongpeng Zhang
D.K. He
Zhouchen Lin
45
0
0
17 Jul 2024
Can Learned Optimization Make Reinforcement Learning Less Difficult?
Alexander David Goldie
Chris Xiaoxuan Lu
Matthew Jackson
Shimon Whiteson
Jakob N. Foerster
44
3
0
09 Jul 2024
Augmented Bayesian Policy Search
Mahdi Kallel
Debabrota Basu
R. Akrour
Carlo D'Eramo
47
2
0
05 Jul 2024
Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs
Enshu Liu
Junyi Zhu
Zinan Lin
Xuefei Ning
Matthew B. Blaschko
Shengen Yan
Guohao Dai
Huazhong Yang
Yu Wang
MoE
62
5
0
01 Jul 2024
MLAAN: Scaling Supervised Local Learning with Multilaminar Leap Augmented Auxiliary Network
Yuming Zhang
Shouxin Zhang
Peizhe Wang
Feiyu Zhu
Dongzhi Guan
Junhao Su
Jiabin Liu
Changpeng Cai
33
2
0
24 Jun 2024
Behaviour Distillation
Andrei Lupu
Chris Xiaoxuan Lu
Jarek Liesen
R. T. Lange
Jakob Foerster
DD
49
4
0
21 Jun 2024
Resource-efficient Medical Image Analysis with Self-adapting Forward-Forward Networks
Johanna P. Müller
Bernhard Kainz
MedIm
48
1
0
20 Jun 2024
Knowledge Fusion By Evolving Weights of Language Models
Guodong Du
Jing Li
Hanting Liu
Runhua Jiang
Shuyang Yu
Yifei Guo
S. Goh
Ho-Kin Tang
MoMe
46
8
0
18 Jun 2024
EvIL: Evolution Strategies for Generalisable Imitation Learning
Silvia Sapora
Gokul Swamy
Chris Xiaoxuan Lu
Yee Whye Teh
Jakob Nicolaus Foerster
42
6
0
15 Jun 2024
AlphaZeroES: Direct score maximization outperforms planning loss minimization
Carlos Martin
Tuomas Sandholm
28
0
0
12 Jun 2024
EGAN: Evolutional GAN for Ransomware Evasion
Daniel Commey
Benjamin Appiah
B. K. Frimpong
Isaac Osei
Ebenezer N. A. Hammond
Garth V. Crosby
AAML
GAN
37
0
0
20 May 2024
Comparisons Are All You Need for Optimizing Smooth Functions
Chenyi Zhang
Tongyang Li
AAML
37
1
0
19 May 2024
Growing Artificial Neural Networks for Control: the Role of Neuronal Diversity
Eleni Nisioti
Erwan Plantec
Milton L. Montero
J. Pedersen
Sebastian Risi
29
1
0
14 May 2024
Genetic Drift Regularization: on preventing Actor Injection from breaking Evolution Strategies
Paul Templier
Emmanuel Rachelson
Antoine Cully
Dennis G. Wilson
29
0
0
07 May 2024
Quality with Just Enough Diversity in Evolutionary Policy Search
Paul Templier
Luca Grillotti
Emmanuel Rachelson
Dennis G. Wilson
Antoine Cully
35
1
0
07 May 2024
Dynamic Anisotropic Smoothing for Noisy Derivative-Free Optimization
S. Reifenstein
T. Leleu
Yoshihisa Yamamoto
48
1
0
02 May 2024
Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning
Chengqian Gao
William de Vazelhes
Hualin Zhang
Bin Gu
Zhiqiang Xu
54
0
0
02 May 2024
Evolutionary Reinforcement Learning via Cooperative Coevolution
Chengpeng Hu
Jialin Liu
Xinghu Yao
24
0
0
23 Apr 2024
Learning to Cut via Hierarchical Sequence/Set Model for Efficient Mixed-Integer Programming
Jie Wang
Zhihai Wang
Xijun Li
Yufei Kuang
Zhihao Shi
Fangzhou Zhu
Mingxuan Yuan
Jianguo Zeng
Yongdong Zhang
Feng Wu
56
7
0
19 Apr 2024
Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective
Victor-Alexandru Darvariu
Stephen Hailes
Mirco Musolesi
AI4CE
50
6
0
09 Apr 2024
Structurally Flexible Neural Networks: Evolving the Building Blocks for General Agents
J. Pedersen
Erwan Plantec
Eleni Nisioti
Milton L. Montero
Sebastian Risi
55
1
0
06 Apr 2024
Embodied AI with Two Arms: Zero-shot Learning, Safety and Modularity
Jacob Varley
Sumeet Singh
Deepali Jain
Krzysztof Choromanski
Andy Zeng
Somnath Basu Roy Chowdhury
Kumar Avinava Dubey
Vikas Sindhwani
LM&Ro
34
14
0
04 Apr 2024
Discrete Natural Evolution Strategies
Ahmad Ayaz Amin
25
0
0
30 Mar 2024
Learning Traffic Signal Control via Genetic Programming
Xiao-Cheng Liao
Yi Mei
Mengjie Zhang
40
6
0
26 Mar 2024
Forward Learning for Gradient-based Black-box Saliency Map Generation
Zeliang Zhang
Mingqian Feng
Jinyang Jiang
Rongyi Zhu
Yijie Peng
Chenliang Xu
FAtt
34
2
0
22 Mar 2024
Searching Search Spaces: Meta-evolving a Geometric Encoding for Neural Networks
Tarek Kunze
Paul Templier
Dennis G. Wilson
35
0
0
20 Mar 2024
Federated reinforcement learning for robot motion planning with zero-shot generalization
Zhenyuan Yuan
Siyuan Xu
Minghui Zhu
FedML
40
1
0
20 Mar 2024
Approximated Likelihood Ratio: A Forward-Only and Parallel Framework for Boosting Neural Network Training
Zeliang Zhang
Jinyang Jiang
Zhuo Liu
Susan Liang
Yijie Peng
Chenliang Xu
37
0
0
18 Mar 2024
Single- and Multi-Agent Private Active Sensing: A Deep Neuroevolution Approach
George Stamatelis
Angelos-Nikolaos Kanatas
Ioannis Asprogerakas
G. C. Alexandropoulos
26
1
0
15 Mar 2024
Design and Control Co-Optimization for Automated Design Iteration of Dexterous Anthropomorphic Soft Robotic Hands
Pragna Mannam
Xingyu Liu
Ding Zhao
Jean Oh
N. S. Pollard
43
0
0
15 Mar 2024
SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning
Nicholas Zolman
Urban Fasel
J. Nathan Kutz
Steven L. Brunton
AI4CE
30
11
0
14 Mar 2024
Derivative-Free Optimization for Low-Rank Adaptation in Large Language Models
Feihu Jin
Yin Liu
Ying Tan
35
3
0
04 Mar 2024
Guided Sketch-Based Program Induction by Search Gradients
Ahmad Ayaz Amin
19
0
0
10 Feb 2024
Discovering Temporally-Aware Reinforcement Learning Algorithms
Matthew Jackson
Chris Xiaoxuan Lu
Louis Kirsch
R. T. Lange
Shimon Whiteson
Jakob N. Foerster
27
18
0
08 Feb 2024
Learning mirror maps in policy mirror descent
Carlo Alfano
Sebastian Towers
Silvia Sapora
Chris Xiaoxuan Lu
Patrick Rebeschini
32
0
0
07 Feb 2024
Moco: A Learnable Meta Optimizer for Combinatorial Optimization
Tim Dernedde
Daniela Thyssens
Soren Dittrich
Maximilian Stubbemann
Lars Schmidt-Thieme
59
5
0
07 Feb 2024
Abstracted Trajectory Visualization for Explainability in Reinforcement Learning
Yoshiki Takagi
Roderick S. Tabalba
Nurit Kirshenbaum
Jason Leigh
19
0
0
05 Feb 2024
Stochastic Two Points Method for Deep Model Zeroth-order Optimization
Yijiang Pang
Jiayu Zhou
24
0
0
02 Feb 2024
Learning to Stop Cut Generation for Efficient Mixed-Integer Linear Programming
Haotian Ling
Zhihai Wang
Jie Wang
47
5
0
31 Jan 2024
How to Forget Clients in Federated Online Learning to Rank?
Shuyi Wang
Bing Liu
Guido Zuccon
17
7
0
24 Jan 2024
Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey on Hybrid Algorithms
Pengyi Li
Jianye Hao
Hongyao Tang
Xian Fu
Yan Zheng
Ke Tang
39
9
0
22 Jan 2024
Open the Black Box: Step-based Policy Updates for Temporally-Correlated Episodic Reinforcement Learning
Ge Li
Hongyi Zhou
Dominik Roth
Serge Thilges
Fabian Otto
Rudolf Lioutikov
Gerhard Neumann
OffRL
27
7
0
21 Jan 2024
When Large Language Models Meet Evolutionary Algorithms: Potential Enhancements and Challenges
Wang Chao
Jiaxuan Zhao
Licheng Jiao
Lingling Li
Fang Liu
Shuyuan Yang
75
13
0
19 Jan 2024
Identifying Policy Gradient Subspaces
Jan Schneider-Barnes
Pierre Schumacher
Simon Guist
Tianyu Cui
Daniel Haeufle
Bernhard Schölkopf
Le Chen
49
5
0
12 Jan 2024
Convolutional Channel-wise Competitive Learning for the Forward-Forward Algorithm
A. Papachristodoulou
C. Kyrkou
S. Timotheou
T. Theocharides
32
8
0
19 Dec 2023
Scaling Opponent Shaping to High Dimensional Games
Akbir Khan
Timon Willi
Newton Kwan
Andrea Tacchetti
Chris Xiaoxuan Lu
Edward Grefenstette
Tim Rocktäschel
Jakob N. Foerster
38
10
0
19 Dec 2023
Leading the Pack: N-player Opponent Shaping
Alexandra Souly
Timon Willi
Akbir Khan
Robert Kirk
Chris Xiaoxuan Lu
Edward Grefenstette
Tim Rocktäschel
48
3
0
19 Dec 2023
MToP: A MATLAB Optimization Platform for Evolutionary Multitasking
Yanchi Li
Wenyin Gong
Feifei Ming
Tingyu Zhang
Shuijia Li
Qiong Gu
34
4
0
13 Dec 2023
Beyond Expected Return: Accounting for Policy Reproducibility when Evaluating Reinforcement Learning Algorithms
Manon Flageat
Bryan Lim
Antoine Cully
OffRL
25
3
0
12 Dec 2023