ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2007.08794
  4. Cited By
Discovering Reinforcement Learning Algorithms

Discovering Reinforcement Learning Algorithms

17 July 2020
Junhyuk Oh
Matteo Hessel
Wojciech M. Czarnecki
Zhongwen Xu
H. V. Hasselt
Satinder Singh
David Silver
ArXivPDFHTML

Papers citing "Discovering Reinforcement Learning Algorithms"

35 / 35 papers shown
Title
Automated Hybrid Reward Scheduling via Large Language Models for Robotic Skill Learning
Automated Hybrid Reward Scheduling via Large Language Models for Robotic Skill Learning
Changxin Huang
Junyang Liang
Yanbin Chang
Jingzhao Xu
Jianqiang Li
34
0
0
05 May 2025
Scalable Meta-Learning via Mixed-Mode Differentiation
Scalable Meta-Learning via Mixed-Mode Differentiation
Iurii Kemaev
Dan A Calian
Luisa M Zintgraf
Gregory Farquhar
H. V. Hasselt
57
0
0
01 May 2025
Discovering Quality-Diversity Algorithms via Meta-Black-Box Optimization
Discovering Quality-Diversity Algorithms via Meta-Black-Box Optimization
Maxence Faldor
Robert Tjarko Lange
Antoine Cully
81
0
0
04 Feb 2025
Can Learned Optimization Make Reinforcement Learning Less Difficult?
Can Learned Optimization Make Reinforcement Learning Less Difficult?
Alexander David Goldie
Chris Xiaoxuan Lu
Matthew Jackson
Shimon Whiteson
Jakob N. Foerster
46
3
0
09 Jul 2024
Discovering Preference Optimization Algorithms with and for Large
  Language Models
Discovering Preference Optimization Algorithms with and for Large Language Models
Chris Xiaoxuan Lu
Samuel Holt
Claudio Fanconi
Alex J. Chan
Jakob Foerster
M. Schaar
R. T. Lange
OffRL
40
16
0
12 Jun 2024
Searching Search Spaces: Meta-evolving a Geometric Encoding for Neural
  Networks
Searching Search Spaces: Meta-evolving a Geometric Encoding for Neural Networks
Tarek Kunze
Paul Templier
Dennis G. Wilson
35
0
0
20 Mar 2024
Adaptive Feature Fusion: Enhancing Generalization in Deep Learning
  Models
Adaptive Feature Fusion: Enhancing Generalization in Deep Learning Models
Neelesh Mungoli
28
23
0
04 Apr 2023
Learning to Optimize for Reinforcement Learning
Learning to Optimize for Reinforcement Learning
Qingfeng Lan
Rupam Mahmood
Shuicheng Yan
Zhongwen Xu
OffRL
36
6
0
03 Feb 2023
Importance Weighted Actor-Critic for Optimal Conservative Offline
  Reinforcement Learning
Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning
Hanlin Zhu
Paria Rashidinejad
Jiantao Jiao
OffRL
45
15
0
30 Jan 2023
A Survey of Meta-Reinforcement Learning
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
42
124
0
19 Jan 2023
POMRL: No-Regret Learning-to-Plan with Increasing Horizons
POMRL: No-Regret Learning-to-Plan with Increasing Horizons
Khimya Khetarpal
Claire Vernade
Brendan O'Donoghue
Satinder Singh
Tom Zahavy
OffRL
31
0
0
30 Dec 2022
General-Purpose In-Context Learning by Meta-Learning Transformers
General-Purpose In-Context Learning by Meta-Learning Transformers
Louis Kirsch
James Harrison
Jascha Narain Sohl-Dickstein
Luke Metz
46
72
0
08 Dec 2022
Discovering Evolution Strategies via Meta-Black-Box Optimization
Discovering Evolution Strategies via Meta-Black-Box Optimization
R. T. Lange
Tom Schaul
Yutian Chen
Tom Zahavy
Valenti Dallibard
Chris Xiaoxuan Lu
Satinder Singh
Sebastian Flennerhag
49
47
0
21 Nov 2022
Reward Shaping Using Convolutional Neural Network
Reward Shaping Using Convolutional Neural Network
Hani Sami
Hadi Otrok
Jamal Bentahar
Azzam Mourad
Ernesto Damiani
32
3
0
30 Oct 2022
Auxiliary task discovery through generate-and-test
Auxiliary task discovery through generate-and-test
Banafsheh Rafiee
Sina Ghiassian
Jun Jin
R. Sutton
Jun Luo
Adam White
21
0
0
25 Oct 2022
An Investigation of the Bias-Variance Tradeoff in Meta-Gradients
An Investigation of the Bias-Variance Tradeoff in Meta-Gradients
Risto Vuorio
Jacob Beck
Shimon Whiteson
Jakob N. Foerster
Gregory Farquhar
33
8
0
22 Sep 2022
Meta-Gradients in Non-Stationary Environments
Meta-Gradients in Non-Stationary Environments
Jelena Luketina
Sebastian Flennerhag
Yannick Schroecker
David Abel
Tom Zahavy
Satinder Singh
31
10
0
13 Sep 2022
Bayesian Generational Population-Based Training
Bayesian Generational Population-Based Training
Xingchen Wan
Cong Lu
Jack Parker-Holder
Philip J. Ball
Vu-Linh Nguyen
Binxin Ru
Michael A. Osborne
OffRL
31
15
0
19 Jul 2022
Transformers are Meta-Reinforcement Learners
Transformers are Meta-Reinforcement Learners
Luckeciano C. Melo
OffRL
43
50
0
14 Jun 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
Adaptive Incentive Design with Multi-Agent Meta-Gradient Reinforcement
  Learning
Adaptive Incentive Design with Multi-Agent Meta-Gradient Reinforcement Learning
Jiachen Yang
Ethan Wang
Rakshit S. Trivedi
T. Zhao
H. Zha
32
20
0
20 Dec 2021
Biased Gradient Estimate with Drastic Variance Reduction for Meta
  Reinforcement Learning
Biased Gradient Estimate with Drastic Variance Reduction for Meta Reinforcement Learning
Yunhao Tang
27
7
0
14 Dec 2021
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement
  Learning
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning
Edoardo Cetin
Oya Celiktutan
OffRL
47
17
0
07 Oct 2021
Introducing Symmetries to Black Box Meta Reinforcement Learning
Introducing Symmetries to Black Box Meta Reinforcement Learning
Louis Kirsch
Sebastian Flennerhag
Hado van Hasselt
A. Friesen
Junhyuk Oh
Yutian Chen
22
30
0
22 Sep 2021
Few-shot Quality-Diversity Optimization
Few-shot Quality-Diversity Optimization
Achkan Salehi
Alexandre Coninx
Stéphane Doncieux
21
14
0
14 Sep 2021
Bootstrapped Meta-Learning
Bootstrapped Meta-Learning
Sebastian Flennerhag
Yannick Schroecker
Tom Zahavy
Hado van Hasselt
David Silver
Satinder Singh
38
59
0
09 Sep 2021
Evolving Decomposed Plasticity Rules for Information-Bottlenecked
  Meta-Learning
Evolving Decomposed Plasticity Rules for Information-Bottlenecked Meta-Learning
Fan Wang
Hao Tian
Haoyi Xiong
Hua Wu
Jie Fu
Yang Cao
Yu Kang
Haifeng Wang
AI4CE
15
3
0
08 Sep 2021
Evaluating the progress of Deep Reinforcement Learning in the real
  world: aligning domain-agnostic and domain-specific research
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
25
6
0
07 Jul 2021
Evolving Reinforcement Learning Algorithms
Evolving Reinforcement Learning Algorithms
John D. Co-Reyes
Yingjie Miao
Daiyi Peng
Esteban Real
Sergey Levine
Quoc V. Le
Honglak Lee
Aleksandra Faust
46
73
0
08 Jan 2021
Meta Learning Backpropagation And Improving It
Meta Learning Backpropagation And Improving It
Louis Kirsch
Jürgen Schmidhuber
61
56
0
29 Dec 2020
Policy Gradient RL Algorithms as Directed Acyclic Graphs
Policy Gradient RL Algorithms as Directed Acyclic Graphs
J. Luis
26
0
0
14 Dec 2020
Forethought and Hindsight in Credit Assignment
Forethought and Hindsight in Credit Assignment
Veronica Chelu
Doina Precup
H. V. Hasselt
22
25
0
26 Oct 2020
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Zhongwen Xu
H. V. Hasselt
Matteo Hessel
Junhyuk Oh
Satinder Singh
David Silver
27
77
0
16 Jul 2020
Finding online neural update rules by learning to remember
Finding online neural update rules by learning to remember
Karol Gregor
CLL
39
6
0
06 Mar 2020
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
496
11,727
0
09 Mar 2017
1