ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2202.02790
  4. Cited By
Learning Synthetic Environments and Reward Networks for Reinforcement
  Learning

Learning Synthetic Environments and Reward Networks for Reinforcement Learning

6 February 2022
Fabio Ferreira
Thomas Nierhoff
Andreas Saelinger
Frank Hutter
ArXiv (abs)PDFHTML

Papers citing "Learning Synthetic Environments and Reward Networks for Reinforcement Learning"

18 / 18 papers shown
Title
Emergent Complexity and Zero-shot Transfer via Unsupervised Environment
  Design
Emergent Complexity and Zero-shot Transfer via Unsupervised Environment Design
Michael Dennis
Natasha Jaques
Eugene Vinitsky
Alexandre M. Bayen
Stuart J. Russell
Andrew Critch
Sergey Levine
83
236
0
03 Dec 2020
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping
Yujing Hu
Weixun Wang
Hangtian Jia
Yixiang Wang
Yingfeng Chen
Jianye Hao
Feng Wu
Changjie Fan
OffRL
90
178
0
05 Nov 2020
Generative Teaching Networks: Accelerating Neural Architecture Search by
  Learning to Generate Synthetic Training Data
Generative Teaching Networks: Accelerating Neural Architecture Search by Learning to Generate Synthetic Training Data
F. Such
Aditya Rawal
Joel Lehman
Kenneth O. Stanley
Jeff Clune
DD
70
157
0
17 Dec 2019
What Can Learned Intrinsic Rewards Capture?
What Can Learned Intrinsic Rewards Capture?
Zeyu Zheng
Junhyuk Oh
Matteo Hessel
Zhongwen Xu
M. Kroiss
H. V. Hasselt
David Silver
Satinder Singh
66
78
0
11 Dec 2019
Leveraging Procedural Generation to Benchmark Reinforcement Learning
Leveraging Procedural Generation to Benchmark Reinforcement Learning
K. Cobbe
Christopher Hesse
Jacob Hilton
John Schulman
79
557
0
03 Dec 2019
Understanding and correcting pathologies in the training of learned
  optimizers
Understanding and correcting pathologies in the training of learned optimizers
Luke Metz
Niru Maheswaranathan
Jeremy Nixon
C. Freeman
Jascha Narain Sohl-Dickstein
ODL
79
147
0
24 Oct 2018
BOHB: Robust and Efficient Hyperparameter Optimization at Scale
BOHB: Robust and Efficient Hyperparameter Optimization at Scale
Stefan Falkner
Aaron Klein
Frank Hutter
BDL
208
1,102
0
04 Jul 2018
Human-level performance in first-person multiplayer games with
  population-based deep reinforcement learning
Human-level performance in first-person multiplayer games with population-based deep reinforcement learning
Max Jaderberg
Wojciech M. Czarnecki
Iain Dunning
Luke Marris
Guy Lever
...
Joel Z Leibo
David Silver
Demis Hassabis
Koray Kavukcuoglu
T. Graepel
OffRL
112
727
0
03 Jul 2018
Evolving Mario Levels in the Latent Space of a Deep Convolutional
  Generative Adversarial Network
Evolving Mario Levels in the Latent Space of a Deep Convolutional Generative Adversarial Network
Vanessa Volz
Jacob Schrum
Jialin Liu
Simon Lucas
Adam M. Smith
S. Risi
GAN
109
233
0
02 May 2018
Addressing Function Approximation Error in Actor-Critic Methods
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
187
5,212
0
26 Feb 2018
Curiosity-driven Exploration by Self-supervised Prediction
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRMSSL
113
2,449
0
15 May 2017
Domain Randomization for Transferring Deep Neural Networks from
  Simulation to the Real World
Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World
Joshua Tobin
Rachel Fong
Alex Ray
Jonas Schneider
Wojciech Zaremba
Pieter Abbeel
259
2,972
0
20 Mar 2017
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Tim Salimans
Jonathan Ho
Xi Chen
Szymon Sidor
Ilya Sutskever
115
1,541
0
10 Mar 2017
Fairness in Reinforcement Learning
Fairness in Reinforcement Learning
S. Jabbari
Matthew Joseph
Michael Kearns
Jamie Morgenstern
Aaron Roth
FaML
75
203
0
09 Nov 2016
Categorical Reparameterization with Gumbel-Softmax
Categorical Reparameterization with Gumbel-Softmax
Eric Jang
S. Gu
Ben Poole
BDL
346
5,379
0
03 Nov 2016
Unifying Count-Based Exploration and Intrinsic Motivation
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
179
1,483
0
06 Jun 2016
Dueling Network Architectures for Deep Reinforcement Learning
Dueling Network Architectures for Deep Reinforcement Learning
Ziyun Wang
Tom Schaul
Matteo Hessel
H. V. Hasselt
Marc Lanctot
Nando de Freitas
OffRL
91
3,768
0
20 Nov 2015
Deep Reinforcement Learning with Double Q-learning
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
170
7,662
0
22 Sep 2015
1