ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1703.03864
  4. Cited By
Evolution Strategies as a Scalable Alternative to Reinforcement Learning

Evolution Strategies as a Scalable Alternative to Reinforcement Learning

10 March 2017
Tim Salimans
Jonathan Ho
Xi Chen
Szymon Sidor
Ilya Sutskever
ArXivPDFHTML

Papers citing "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"

50 / 407 papers shown
Title
Benchmarking MOEAs for solving continuous multi-objective RL problems
Benchmarking MOEAs for solving continuous multi-objective RL problems
Carlos Hernández
Roberto Santana
4
0
0
19 May 2025
Efficient training for large-scale optical neural network using an evolutionary strategy and attention pruning
Efficient training for large-scale optical neural network using an evolutionary strategy and attention pruning
Zhiwei Yang
Zeyang Fan
Yihang Lai
Qi Chen
Tian Zhang
Jian Dai
Kun Xu
7
0
0
19 May 2025
GATES: Cost-aware Dynamic Workflow Scheduling via Graph Attention Networks and Evolution Strategy
GATES: Cost-aware Dynamic Workflow Scheduling via Graph Attention Networks and Evolution Strategy
Ya Shen
Gang Chen
Hui Ma
Mengjie Zhang
14
0
0
18 May 2025
Scalability of Reinforcement Learning Methods for Dispatching in Semiconductor Frontend Fabs: A Comparison of Open-Source Models with Real Industry Datasets
Scalability of Reinforcement Learning Methods for Dispatching in Semiconductor Frontend Fabs: A Comparison of Open-Source Models with Real Industry Datasets
Patrick Stöckermann
Henning Südfeld
Alessandro Immordino
Thomas Altenmüller
Marc Wegmann
Martin Gebser
Konstantin Schekotihin
Georg Seidel
Chew Wye Chan
Fei Fei Zhang
OffRL
17
0
0
16 May 2025
Embodied Intelligence: The Key to Unblocking Generalized Artificial Intelligence
Embodied Intelligence: The Key to Unblocking Generalized Artificial Intelligence
Jinhao Jiang
Changlin Chen
Shile Feng
Wanru Geng
Zesheng Zhou
Ni Wang
Shuai Li
Feng-Qi Cui
Erbao Dong
AI4CE
33
0
0
11 May 2025
Giving Simulated Cells a Voice: Evolving Prompt-to-Intervention Models for Cellular Control
Giving Simulated Cells a Voice: Evolving Prompt-to-Intervention Models for Cellular Control
Nam H. Le
Patrick Erikson
Yanbo Zhang
Michael Levin
Josh C. Bongard
LM&Ro
31
0
0
05 May 2025
Model Tensor Planning
Model Tensor Planning
An T. Le
K. Nguyen
Minh Nhat Vu
João Carvalho
Jan Peters
35
0
0
02 May 2025
Learning Heterogeneous Performance-Fairness Trade-offs in Federated Learning
Learning Heterogeneous Performance-Fairness Trade-offs in Federated Learning
Rongguang Ye
Ming Tang
FedML
53
0
0
30 Apr 2025
Evolutionary Policy Optimization
Evolutionary Policy Optimization
Zelal Su "Lain" Mustafaoglu
Keshav Pingali
Risto Miikkulainen
36
0
0
17 Apr 2025
Min-Max Optimisation for Nonconvex-Nonconcave Functions Using a Random Zeroth-Order Extragradient Algorithm
Min-Max Optimisation for Nonconvex-Nonconcave Functions Using a Random Zeroth-Order Extragradient Algorithm
Amir Ali Farzin
Yuen-Man Pun
Philipp Braun
Antoine Lesage-Landry
Youssef Diouane
Iman Shames
53
1
0
10 Apr 2025
IPA-CHILDES & G2P+: Feature-Rich Resources for Cross-Lingual Phonology and Phonemic Language Modeling
IPA-CHILDES & G2P+: Feature-Rich Resources for Cross-Lingual Phonology and Phonemic Language Modeling
Zébulon Goriely
P. Buttery
26
1
0
03 Apr 2025
Forward Learning with Differential Privacy
Forward Learning with Differential Privacy
Mingqian Feng
Zeliang Zhang
Jinyang Jiang
Yijie Peng
Chenliang Xu
47
0
0
01 Apr 2025
Evolutionary Policy Optimization
Evolutionary Policy Optimization
Jianren Wang
Yifan Su
Abhinav Gupta
Deepak Pathak
50
0
0
24 Mar 2025
Parental Guidance: Efficient Lifelong Learning through Evolutionary Distillation
Parental Guidance: Efficient Lifelong Learning through Evolutionary Distillation
Octi Zhang
Quanquan Peng
Rosario Scalise
Bryon Boots
50
0
0
24 Mar 2025
Bio-Inspired Plastic Neural Networks for Zero-Shot Out-of-Distribution Generalization in Complex Animal-Inspired Robots
Bio-Inspired Plastic Neural Networks for Zero-Shot Out-of-Distribution Generalization in Complex Animal-Inspired Robots
Binggwong Leung
Worasuchad Haomachai
J. Pedersen
S. Risi
Poramate Manoonpong
OODD
76
0
0
16 Mar 2025
Optimizing Gene-Based Testing for Antibiotic Resistance Prediction
Optimizing Gene-Based Testing for Antibiotic Resistance Prediction
David Hagerman
Anna Johnning
Roman Naeem
Fredrik Kahl
Erik Kristiansson
Lennart Svensson
69
0
0
24 Feb 2025
Robotic Table Tennis: A Case Study into a High Speed Learning System
Robotic Table Tennis: A Case Study into a High Speed Learning System
David B. DÁmbrosio
Jonathan Abelian
Saminda Abeyruwan
Michael Ahn
Alex Bewley
...
Vikas Sindhwani
Avi Singh
Vincent Vanhoucke
Grace Vesom
Peng Xu
60
13
0
20 Feb 2025
Utilizing Novelty-based Evolution Strategies to Train Transformers in Reinforcement Learning
Matyáš Lorenc
OffRL
75
0
0
10 Feb 2025
Coreset-Based Task Selection for Sample-Efficient Meta-Reinforcement Learning
Coreset-Based Task Selection for Sample-Efficient Meta-Reinforcement Learning
Donglin Zhan
Leonardo F. Toso
James Anderson
101
1
0
04 Feb 2025
Evolving Hard Maximum Cut Instances for Quantum Approximate Optimization Algorithms
Evolving Hard Maximum Cut Instances for Quantum Approximate Optimization Algorithms
Shuaiqun Pan
Yash J. Patel
Aneta Neumann
Frank Neumann
Thomas Bäck
Hao Wang
43
0
0
30 Jan 2025
A Genetic Algorithm-Based Approach for Automated Optimization of Kolmogorov-Arnold Networks in Classification Tasks
A Genetic Algorithm-Based Approach for Automated Optimization of Kolmogorov-Arnold Networks in Classification Tasks
Quan Long
Bin Wang
Bing Xue
Mengjie Zhang
53
0
0
29 Jan 2025
Towards General-Purpose Model-Free Reinforcement Learning
Scott Fujimoto
P. DÓro
Amy Zhang
Yuandong Tian
Michael Rabbat
OffRL
44
3
0
28 Jan 2025
Upside Down Reinforcement Learning with Policy Generators
Upside Down Reinforcement Learning with Policy Generators
Jacopo Di Ventura
Dylan R. Ashley
Vincent Herrmann
Francesco Faccio
Jürgen Schmidhuber
36
0
0
27 Jan 2025
Utilizing Evolution Strategies to Train Transformers in Reinforcement Learning
Utilizing Evolution Strategies to Train Transformers in Reinforcement Learning
Matyáš Lorenc
52
1
0
23 Jan 2025
Evolution and The Knightian Blindspot of Machine Learning
Evolution and The Knightian Blindspot of Machine Learning
Joel Lehman
Elliot Meyerson
Tarek El-Gaaly
Kenneth O. Stanley
Tarin Ziyaee
96
2
0
22 Jan 2025
Solving Infinite-Player Games with Player-to-Strategy Networks
Solving Infinite-Player Games with Player-to-Strategy Networks
Carlos Martin
T. Sandholm
64
0
0
17 Jan 2025
EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning
EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning
Siddharth Aravindan
Dixant Mittal
Wee Sun Lee
BDL
79
0
0
17 Jan 2025
Pareto Set Learning for Multi-Objective Reinforcement Learning
Pareto Set Learning for Multi-Objective Reinforcement Learning
Erlong Liu
Yu-Chang Wu
Xiaobin Huang
Chengrui Gao
Ren-Jian Wang
Ke Xue
Chao Qian
OffRL
47
2
0
12 Jan 2025
CoMAL: Collaborative Multi-Agent Large Language Models for Mixed-Autonomy Traffic
CoMAL: Collaborative Multi-Agent Large Language Models for Mixed-Autonomy Traffic
Huaiyuan Yao
Longchao Da
Vishnu Nandam
Justin Turnau
Zhiwei Liu
Linsey Pang
Hua Wei
LLMAG
65
4
0
10 Jan 2025
Cost-Aware Dynamic Cloud Workflow Scheduling using Self-Attention and Evolutionary Reinforcement Learning
Cost-Aware Dynamic Cloud Workflow Scheduling using Self-Attention and Evolutionary Reinforcement Learning
Ya Shen
Gang Chen
Hui Ma
Mengjie Zhang
40
1
0
31 Dec 2024
GraCo -- A Graph Composer for Integrated Circuits
GraCo -- A Graph Composer for Integrated Circuits
Stefan Uhlich
Andrea Bonetti
Arun Venkitaraman
Ali Momeni
Ryoga Matsuo
Chia-Yu Hsieh
Eisaku Ohbuchi
Lorenzo Servadei
GNN
95
0
0
21 Nov 2024
Acceleration for Deep Reinforcement Learning using Parallel and
  Distributed Computing: A Survey
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
27
2
0
08 Nov 2024
Toward Automated Algorithm Design: A Survey and Practical Guide to Meta-Black-Box-Optimization
Toward Automated Algorithm Design: A Survey and Practical Guide to Meta-Black-Box-Optimization
Zeyuan Ma
Hongshu Guo
Yue-jiao Gong
Jun Zhang
Kay Chen Tan
128
2
0
01 Nov 2024
Offline Behavior Distillation
Offline Behavior Distillation
Shiye Lei
Sen Zhang
Dacheng Tao
OffRL
41
0
0
30 Oct 2024
Sharpness-Aware Black-Box Optimization
Sharpness-Aware Black-Box Optimization
Feiyang Ye
Yueming Lyu
Xuehao Wang
Masashi Sugiyama
Yu-Jie Zhang
Ivor W. Tsang
AAML
47
0
0
16 Oct 2024
Evolutionary Retrofitting
Evolutionary Retrofitting
Mathurin Videau
M. Zameshina
Alessandro Leite
Laurent Najman
Marc Schoenauer
O. Teytaud
41
0
0
15 Oct 2024
Stein Variational Evolution Strategies
Stein Variational Evolution Strategies
Cornelius V. Braun
Robert T. Lange
Marc Toussaint
31
0
0
14 Oct 2024
Reinforcement Learning in Hyperbolic Spaces: Models and Experiments
Reinforcement Learning in Hyperbolic Spaces: Models and Experiments
V. Jaćimović
Zinaid Kapić
Aladin Crnkić
37
0
0
12 Oct 2024
TOP-ERL: Transformer-based Off-Policy Episodic Reinforcement Learning
TOP-ERL: Transformer-based Off-Policy Episodic Reinforcement Learning
Ge Li
Dong Tian
Hongyi Zhou
Xinkai Jiang
Rudolf Lioutikov
Gerhard Neumann
OffRL
241
3
0
12 Oct 2024
Neural Circuit Architectural Priors for Quadruped Locomotion
Neural Circuit Architectural Priors for Quadruped Locomotion
Nikhil X. Bhattasali
Venkatesh Pattabiraman
Lerrel Pinto
Grace W. Lindsay
33
2
0
09 Oct 2024
FLOPS: Forward Learning with OPtimal Sampling
FLOPS: Forward Learning with OPtimal Sampling
Tao Ren
Zishi Zhang
Jinyang Jiang
Guanghao Li
Zeliang Zhang
Mingqian Feng
Yijie Peng
40
1
0
08 Oct 2024
Diffusion Models are Evolutionary Algorithms
Diffusion Models are Evolutionary Algorithms
Yanbo Zhang
Benedikt Hartl
Hananel Hazan
Michael Levin
26
4
0
03 Oct 2024
Zeroth-Order Policy Gradient for Reinforcement Learning from Human Feedback without Reward Inference
Zeroth-Order Policy Gradient for Reinforcement Learning from Human Feedback without Reward Inference
Qining Zhang
Lei Ying
OffRL
37
2
0
25 Sep 2024
An Adaptive Re-evaluation Method for Evolution Strategy under Additive Noise
An Adaptive Re-evaluation Method for Evolution Strategy under Additive Noise
Catalin-Viorel Dinu
Yash J. Patel
X. Bonet-Monroig
Hao Wang
33
0
0
25 Sep 2024
Compatible Gradient Approximations for Actor-Critic Algorithms
Compatible Gradient Approximations for Actor-Critic Algorithms
Baturay Saglam
Dionysis Kalogerias
37
0
0
02 Sep 2024
Learning Randomized Algorithms with Transformers
Learning Randomized Algorithms with Transformers
J. Oswald
Seijin Kobayashi
Yassir Akram
Angelika Steger
AAML
44
0
0
20 Aug 2024
Narrowing the Focus: Learned Optimizers for Pretrained Models
Narrowing the Focus: Learned Optimizers for Pretrained Models
Gus Kristiansen
Mark Sandler
A. Zhmoginov
Nolan Miller
Anirudh Goyal
Jihwan Lee
Max Vladymyrov
39
1
0
17 Aug 2024
Joint-perturbation simultaneous pseudo-gradient
Joint-perturbation simultaneous pseudo-gradient
Carlos Martin
Tuomas Sandholm
41
2
0
17 Aug 2024
Learning to Explore for Stochastic Gradient MCMC
Learning to Explore for Stochastic Gradient MCMC
Seunghyun Kim
Seohyeon Jung
Seonghyeon Kim
Juho Lee
BDL
48
1
0
17 Aug 2024
Impacts of Darwinian Evolution on Pre-trained Deep Neural Networks
Impacts of Darwinian Evolution on Pre-trained Deep Neural Networks
Guodong Du
Runhua Jiang
Senqiao Yang
HaoYang Li
Wei Chen
Keren Li
S. Goh
Ho-Kin Tang
39
3
0
10 Aug 2024
123456789
Next