ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.00123
  4. Cited By
Generalization and Regularization in DQN

Generalization and Regularization in DQN

29 September 2018
Jesse Farebrother
Marlos C. Machado
Michael Bowling
ArXivPDFHTML

Papers citing "Generalization and Regularization in DQN"

50 / 54 papers shown
Title
Bidirectional Distillation: A Mixed-Play Framework for Multi-Agent Generalizable Behaviors
Bidirectional Distillation: A Mixed-Play Framework for Multi-Agent Generalizable Behaviors
Lang Feng
Jiahao Lin
Dong Xing
Li Zhang
De Ma
Gang Pan
31
0
0
16 May 2025
Understanding the theoretical properties of projected Bellman equation, linear Q-learning, and approximate value iteration
Understanding the theoretical properties of projected Bellman equation, linear Q-learning, and approximate value iteration
Han-Dong Lim
Donghwan Lee
21
0
0
15 Apr 2025
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hojoon Lee
Youngdo Lee
Takuma Seno
Donghu Kim
Peter Stone
Jaegul Choo
63
1
0
24 Feb 2025
BlendRL: A Framework for Merging Symbolic and Neural Policy Learning
BlendRL: A Framework for Merging Symbolic and Neural Policy Learning
Hikaru Shindo
Quentin Delfosse
Devendra Singh Dhami
Kristian Kersting
43
3
0
15 Oct 2024
SigmaRL: A Sample-Efficient and Generalizable Multi-Agent Reinforcement Learning Framework for Motion Planning
SigmaRL: A Sample-Efficient and Generalizable Multi-Agent Reinforcement Learning Framework for Motion Planning
Jianye Xu
Pan Hu
Bassam Alrifaee
44
5
0
14 Aug 2024
Towards Generalizable Reinforcement Learning via Causality-Guided Self-Adaptive Representations
Yupei Yang
Erdun Gao
Fan Feng
Xinyue Wang
Shikui Tu
Lei Xu
CML
OOD
TTA
40
1
0
30 Jul 2024
Evaluation of Reinforcement Learning for Autonomous Penetration Testing
  using A3C, Q-learning and DQN
Evaluation of Reinforcement Learning for Autonomous Penetration Testing using A3C, Q-learning and DQN
Norman Becker
Daniel Reti
Evridiki V. Ntagiou
M. Wallum
Hans D. Schotten
37
1
0
22 Jul 2024
The Overcooked Generalisation Challenge
The Overcooked Generalisation Challenge
Constantin Ruhdorfer
Matteo Bortoletto
Anna Penzkofer
Andreas Bulling
48
4
0
25 Jun 2024
Massively Multiagent Minigames for Training Generalist Agents
Massively Multiagent Minigames for Training Generalist Agents
Kyoung Whan Choe
Ryan Sullivan
Joseph Suárez
AI4CE
34
0
0
07 Jun 2024
Intervention-Assisted Policy Gradient Methods for Online Stochastic
  Queuing Network Optimization: Technical Report
Intervention-Assisted Policy Gradient Methods for Online Stochastic Queuing Network Optimization: Technical Report
Jerrod Wigmore
B. Shrader
E. Modiano
OffRL
32
1
0
05 Apr 2024
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Marcel Hussing
C. Voelcker
Igor Gilitschenski
Amir-massoud Farahmand
Eric Eaton
42
3
0
09 Mar 2024
What Makes Pre-Trained Visual Representations Successful for Robust
  Manipulation?
What Makes Pre-Trained Visual Representations Successful for Robust Manipulation?
Kaylee Burns
Zach Witzel
Jubayer Ibn Hamid
Tianhe Yu
Chelsea Finn
Karol Hausman
OOD
SSL
32
23
0
03 Nov 2023
RAPID: Enabling Fast Online Policy Learning in Dynamic Public Cloud
  Environments
RAPID: Enabling Fast Online Policy Learning in Dynamic Public Cloud Environments
Drew Penney
Bin Li
Lizhong Chen
J. Sydir
Anna Drewek-Ossowicka
R. Illikkal
Charlie Tai
R. Iyer
Andrew J. Herdrich
31
1
0
10 Apr 2023
Unified Emulation-Simulation Training Environment for Autonomous Cyber
  Agents
Unified Emulation-Simulation Training Environment for Autonomous Cyber Agents
Li Li
Jean-Pierre S. El Rami
Adrian Taylor
James Hailing Rao
T. Kunz
13
4
0
03 Apr 2023
Policy Evaluation in Decentralized POMDPs with Belief Sharing
Policy Evaluation in Decentralized POMDPs with Belief Sharing
Mert Kayaalp
Fatima Ghadieh
Ali H. Sayed
19
2
0
08 Feb 2023
Offline Q-Learning on Diverse Multi-Task Data Both Scales And
  Generalizes
Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes
Aviral Kumar
Rishabh Agarwal
Xinyang Geng
George Tucker
Sergey Levine
OffRL
44
48
0
28 Nov 2022
Melting Pot 2.0
Melting Pot 2.0
J. Agapiou
A. Vezhnevets
Edgar A. Duénez-Guzmán
Jayd Matyas
Yiran Mao
...
Sukhdeep Singh
Julia Haas
Igor Mordatch
D. Mobbs
Joel Z Leibo
45
31
0
24 Nov 2022
Probing Transfer in Deep Reinforcement Learning without Task Engineering
Probing Transfer in Deep Reinforcement Learning without Task Engineering
Andrei A. Rusu
Sebastian Flennerhag
Dushyant Rao
Razvan Pascanu
R. Hadsell
34
6
0
22 Oct 2022
Bridging the Gap Between Target Networks and Functional Regularization
Alexandre Piché
Valentin Thomas
Joseph Marino
Rafael Pardiñas
Gian Maria Marconi
C. Pal
Mohammad Emtiyaz Khan
14
1
0
21 Oct 2022
Scaling Laws for Reward Model Overoptimization
Scaling Laws for Reward Model Overoptimization
Leo Gao
John Schulman
Jacob Hilton
ALM
41
489
0
19 Oct 2022
On the Power of Pre-training for Generalization in RL: Provable Benefits
  and Hardness
On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness
Haotian Ye
Xiaoyu Chen
Liwei Wang
S. Du
OffRL
37
6
0
19 Oct 2022
A Comprehensive Survey of Data Augmentation in Visual Reinforcement
  Learning
A Comprehensive Survey of Data Augmentation in Visual Reinforcement Learning
Guozheng Ma
Zhen Wang
Zhecheng Yuan
Xueqian Wang
Bo Yuan
Dacheng Tao
OffRL
43
27
0
10 Oct 2022
Learn the Time to Learn: Replay Scheduling in Continual Learning
Learn the Time to Learn: Replay Scheduling in Continual Learning
Marcus Klasson
Hedvig Kjellström
Chen Zhang
CLL
35
9
0
18 Sep 2022
Look where you look! Saliency-guided Q-networks for generalization in
  visual Reinforcement Learning
Look where you look! Saliency-guided Q-networks for generalization in visual Reinforcement Learning
David Bertoin
Adil Zouitine
Mehdi Zouitine
Emmanuel Rachelson
36
30
0
16 Sep 2022
Bootstrap State Representation using Style Transfer for Better
  Generalization in Deep Reinforcement Learning
Bootstrap State Representation using Style Transfer for Better Generalization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
OffRL
34
4
0
15 Jul 2022
GriddlyJS: A Web IDE for Reinforcement Learning
GriddlyJS: A Web IDE for Reinforcement Learning
C. Bamford
Minqi Jiang
Mikayel Samvelyan
Tim Rocktaschel
OnRL
38
4
0
13 Jul 2022
Efficient Scheduling of Data Augmentation for Deep Reinforcement
  Learning
Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning
Byungchan Ko
Jungseul Ok
OnRL
27
5
0
01 Jun 2022
Chain of Thought Imitation with Procedure Cloning
Chain of Thought Imitation with Procedure Cloning
Mengjiao Yang
Dale Schuurmans
Pieter Abbeel
Ofir Nachum
OffRL
35
30
0
22 May 2022
Learning Task-relevant Representations for Generalization via
  Characteristic Functions of Reward Sequence Distributions
Learning Task-relevant Representations for Generalization via Characteristic Functions of Reward Sequence Distributions
Rui Yang
Jie Wang
Zijie Geng
Mingxuan Ye
Shuiwang Ji
Bin Li
Fengli Wu
OOD
31
20
0
20 May 2022
Local Feature Swapping for Generalization in Reinforcement Learning
Local Feature Swapping for Generalization in Reinforcement Learning
David Bertoin
Emmanuel Rachelson
OOD
23
14
0
13 Apr 2022
Consistent Dropout for Policy Gradient Reinforcement Learning
Consistent Dropout for Policy Gradient Reinforcement Learning
Matthew J. Hausknecht
Nolan Wagener
OffRL
24
10
0
23 Feb 2022
CORA: Benchmarks, Baselines, and Metrics as a Platform for Continual
  Reinforcement Learning Agents
CORA: Benchmarks, Baselines, and Metrics as a Platform for Continual Reinforcement Learning Agents
Sam Powers
Eliot Xing
Eric Kolve
Roozbeh Mottaghi
Abhinav Gupta
OffRL
36
38
0
19 Oct 2021
On The Transferability of Deep-Q Networks
On The Transferability of Deep-Q Networks
M. Sabatelli
Pierre Geurts
37
2
0
06 Oct 2021
Robust Predictable Control
Robust Predictable Control
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
OffRL
29
44
0
07 Sep 2021
Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit
  Partial Observability
Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
Dibya Ghosh
Jad Rahme
Aviral Kumar
Amy Zhang
Ryan P. Adams
Sergey Levine
OffRL
278
110
0
13 Jul 2021
Improve Agents without Retraining: Parallel Tree Search with Off-Policy
  Correction
Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Assaf Hallak
Gal Dalal
Steven Dalton
I. Frosio
Shie Mannor
Gal Chechik
OffRL
OnRL
35
9
0
04 Jul 2021
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under
  Data Augmentation
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation
Nicklas Hansen
H. Su
Xiaolong Wang
OffRL
28
134
0
01 Jul 2021
SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual
  Policies
SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies
Linxi Fan
Guanzhi Wang
De-An Huang
Zhiding Yu
Li Fei-Fei
Yuke Zhu
Anima Anandkumar
OffRL
30
63
0
17 Jun 2021
Safety Enhancement for Deep Reinforcement Learning in Autonomous
  Separation Assurance
Safety Enhancement for Deep Reinforcement Learning in Autonomous Separation Assurance
Wei Guo
Marc Brittain
Peng Wei
31
18
0
05 May 2021
Domain Generalization with MixStyle
Domain Generalization with MixStyle
Kaiyang Zhou
Yongxin Yang
Yu Qiao
Tao Xiang
71
744
0
05 Apr 2021
One Solution is Not All You Need: Few-Shot Extrapolation via Structured
  MaxEnt RL
One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL
Saurabh Kumar
Aviral Kumar
Sergey Levine
Chelsea Finn
OffRL
16
90
0
27 Oct 2020
Emergent Social Learning via Multi-agent Reinforcement Learning
Emergent Social Learning via Multi-agent Reinforcement Learning
Kamal Ndousse
Douglas Eck
Sergey Levine
Natasha Jaques
8
41
0
01 Oct 2020
Discount Factor as a Regularizer in Reinforcement Learning
Discount Factor as a Regularizer in Reinforcement Learning
Ron Amit
Ron Meir
K. Ciosek
OffRL
22
71
0
04 Jul 2020
Group Equivariant Deep Reinforcement Learning
Group Equivariant Deep Reinforcement Learning
Arnab Kumar Mondal
Pratheeksha Nair
Kaleem Siddiqi
17
31
0
01 Jul 2020
Automatic Data Augmentation for Generalization in Deep Reinforcement
  Learning
Automatic Data Augmentation for Generalization in Deep Reinforcement Learning
Roberta Raileanu
M. Goldstein
Denis Yarats
Ilya Kostrikov
Rob Fergus
OffRL
22
109
0
23 Jun 2020
Deep Reinforcement and InfoMax Learning
Deep Reinforcement and InfoMax Learning
Bogdan Mazoure
Rémi Tachet des Combes
T. Doan
Philip Bachman
R. Devon Hjelm
AI4CE
25
108
0
12 Jun 2020
F2A2: Flexible Fully-decentralized Approximate Actor-critic for
  Cooperative Multi-agent Reinforcement Learning
F2A2: Flexible Fully-decentralized Approximate Actor-critic for Cooperative Multi-agent Reinforcement Learning
Wenhao Li
Bo Jin
Xiangfeng Wang
Junchi Yan
H. Zha
25
21
0
17 Apr 2020
Leveraging Procedural Generation to Benchmark Reinforcement Learning
Leveraging Procedural Generation to Benchmark Reinforcement Learning
K. Cobbe
Christopher Hesse
Jacob Hilton
John Schulman
45
541
0
03 Dec 2019
Robust Visual Domain Randomization for Reinforcement Learning
Robust Visual Domain Randomization for Reinforcement Learning
Reda Bahi Slaoui
W. Clements
Jakob N. Foerster
Sébastien Toth
OOD
8
12
0
23 Oct 2019
Generalizing from a few environments in safety-critical reinforcement
  learning
Generalizing from a few environments in safety-critical reinforcement learning
Zachary Kenton
Angelos Filos
Owain Evans
Y. Gal
8
16
0
02 Jul 2019
12
Next