ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1804.06893
  4. Cited By
A Study on Overfitting in Deep Reinforcement Learning

A Study on Overfitting in Deep Reinforcement Learning

18 April 2018
Chiyuan Zhang
Oriol Vinyals
Rémi Munos
Samy Bengio
    OffRL
    OnRL
ArXivPDFHTML

Papers citing "A Study on Overfitting in Deep Reinforcement Learning"

35 / 35 papers shown
Title
PPO-BR: Dual-Signal Entropy-Reward Adaptation for Trust Region Policy Optimization
PPO-BR: Dual-Signal Entropy-Reward Adaptation for Trust Region Policy Optimization
Ben Rahman
22
0
0
23 May 2025
Task Aware Dreamer for Task Generalization in Reinforcement Learning
Task Aware Dreamer for Task Generalization in Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Hang Su
Songming Liu
Dong Yan
Jun Zhu
117
3
0
17 Feb 2025
Prioritized Generative Replay
Prioritized Generative Replay
Renhao Wang
Kevin Frans
Pieter Abbeel
Sergey Levine
Alexei A. Efros
OnRL
DiffM
127
3
0
23 Oct 2024
MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning
MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning
Suning Huang
Zheyu Zhang
Tianhai Liang
Yihan Xu
Zhehao Kou
Chenhao Lu
Guowei Xu
Zhengrong Xue
Huazhe Xu
MoE
68
3
0
19 Oct 2024
Disentangling Recognition and Decision Regrets in Image-Based Reinforcement Learning
Disentangling Recognition and Decision Regrets in Image-Based Reinforcement Learning
Alihan Hüyük
A. R. Koblitz
Atefeh Mohajeri
M. Andrews
OffRL
47
0
0
19 Sep 2024
The Overcooked Generalisation Challenge
The Overcooked Generalisation Challenge
Constantin Ruhdorfer
Matteo Bortoletto
Anna Penzkofer
Andreas Bulling
58
4
0
25 Jun 2024
Invariant Causal Prediction for Block MDPs
Invariant Causal Prediction for Block MDPs
Amy Zhang
Clare Lyle
Shagun Sodhani
Angelos Filos
Marta Z. Kwiatkowska
Joelle Pineau
Y. Gal
Doina Precup
OffRL
AI4CE
OOD
60
139
0
12 Mar 2020
Mastering Chess and Shogi by Self-Play with a General Reinforcement
  Learning Algorithm
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
David Silver
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
Matthew Lai
...
D. Kumaran
T. Graepel
Timothy Lillicrap
Karen Simonyan
Demis Hassabis
79
1,755
0
05 Dec 2017
Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games?
Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games?
M. Raghu
A. Irpan
Jacob Andreas
Robert D. Kleinberg
Quoc V. Le
Jon M. Kleinberg
30
28
0
07 Nov 2017
Fisher-Rao Metric, Geometry, and Complexity of Neural Networks
Fisher-Rao Metric, Geometry, and Complexity of Neural Networks
Tengyuan Liang
T. Poggio
Alexander Rakhlin
J. Stokes
42
225
0
05 Nov 2017
A Bayesian Perspective on Generalization and Stochastic Gradient Descent
A Bayesian Perspective on Generalization and Stochastic Gradient Descent
Samuel L. Smith
Quoc V. Le
BDL
35
247
0
17 Oct 2017
High-dimensional dynamics of generalization error in neural networks
High-dimensional dynamics of generalization error in neural networks
Madhu S. Advani
Andrew M. Saxe
AI4CE
106
467
0
10 Oct 2017
Deep Reinforcement Learning that Matters
Deep Reinforcement Learning that Matters
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
OffRL
87
1,940
0
19 Sep 2017
Revisiting the Arcade Learning Environment: Evaluation Protocols and
  Open Problems for General Agents
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Marlos C. Machado
Marc G. Bellemare
Erik Talvitie
J. Veness
Matthew J. Hausknecht
Michael Bowling
51
549
0
18 Sep 2017
StarCraft II: A New Challenge for Reinforcement Learning
StarCraft II: A New Challenge for Reinforcement Learning
Oriol Vinyals
T. Ewalds
Sergey Bartunov
Petko Georgiev
A. Vezhnevets
...
Anthony Brunasso
David Lawrence
Anders Ekermo
J. Repp
Rodney Tsing
42
868
0
16 Aug 2017
Exploring Generalization in Deep Learning
Exploring Generalization in Deep Learning
Behnam Neyshabur
Srinadh Bhojanapalli
David A. McAllester
Nathan Srebro
FAtt
121
1,245
0
27 Jun 2017
Spectrally-normalized margin bounds for neural networks
Spectrally-normalized margin bounds for neural networks
Peter L. Bartlett
Dylan J. Foster
Matus Telgarsky
ODL
95
1,208
0
26 Jun 2017
Towards Understanding the Invertibility of Convolutional Neural Networks
Towards Understanding the Invertibility of Convolutional Neural Networks
A. Gilbert
Yi Zhang
Kibok Lee
Y. Zhang
Honglak Lee
31
64
0
24 May 2017
Computing Nonvacuous Generalization Bounds for Deep (Stochastic) Neural
  Networks with Many More Parameters than Training Data
Computing Nonvacuous Generalization Bounds for Deep (Stochastic) Neural Networks with Many More Parameters than Training Data
Gintare Karolina Dziugaite
Daniel M. Roy
61
808
0
31 Mar 2017
Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement
  Learning
Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning
Christoph Dann
Tor Lattimore
Emma Brunskill
41
307
0
22 Mar 2017
Minimax Regret Bounds for Reinforcement Learning
Minimax Regret Bounds for Reinforcement Learning
M. G. Azar
Ian Osband
Rémi Munos
45
768
0
16 Mar 2017
Adversarial Attacks on Neural Network Policies
Adversarial Attacks on Neural Network Policies
Sandy Huang
Nicolas Papernot
Ian Goodfellow
Yan Duan
Pieter Abbeel
MLAU
AAML
53
832
0
08 Feb 2017
Understanding deep learning requires rethinking generalization
Understanding deep learning requires rethinking generalization
Chiyuan Zhang
Samy Bengio
Moritz Hardt
Benjamin Recht
Oriol Vinyals
HAI
203
4,612
0
10 Nov 2016
Contextual Decision Processes with Low Bellman Rank are PAC-Learnable
Contextual Decision Processes with Low Bellman Rank are PAC-Learnable
Nan Jiang
A. Krishnamurthy
Alekh Agarwal
John Langford
Robert Schapire
45
417
0
29 Oct 2016
Generalization Properties and Implicit Regularization for Multiple
  Passes SGM
Generalization Properties and Implicit Regularization for Multiple Passes SGM
Junhong Lin
Raffaello Camoriano
Lorenzo Rosasco
44
70
0
26 May 2016
Asynchronous Methods for Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
138
8,805
0
04 Feb 2016
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
786
192,638
0
10 Dec 2015
Continuous control with deep reinforcement learning
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
113
13,174
0
09 Sep 2015
Train faster, generalize better: Stability of stochastic gradient
  descent
Train faster, generalize better: Stability of stochastic gradient descent
Moritz Hardt
Benjamin Recht
Y. Singer
78
1,234
0
03 Sep 2015
Massively Parallel Methods for Deep Reinforcement Learning
Massively Parallel Methods for Deep Reinforcement Learning
Arun Nair
Praveen Srinivasan
Sam Blackwell
Cagdas Alcicek
Rory Fearon
...
Stig Petersen
Shane Legg
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
OffRL
AI4CE
GNN
51
504
0
15 Jul 2015
Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits
Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits
Alekh Agarwal
Daniel J. Hsu
Satyen Kale
John Langford
Lihong Li
Robert Schapire
OffRL
71
504
0
04 Feb 2014
Optimal Demand Response Using Device Based Reinforcement Learning
Optimal Demand Response Using Device Based Reinforcement Learning
Zheng Wen
D. OÑeill
H. Maei
OffRL
42
233
0
08 Jan 2014
Playing Atari with Deep Reinforcement Learning
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
47
12,163
0
19 Dec 2013
Signal Recovery from Pooling Representations
Signal Recovery from Pooling Representations
Joan Bruna
Arthur Szlam
Yann LeCun
47
96
0
16 Nov 2013
The Arcade Learning Environment: An Evaluation Platform for General
  Agents
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
42
2,992
0
19 Jul 2012
1