1810.00123
Cited By
Generalization and Regularization in DQN
29 September 2018
Jesse Farebrother
Marlos C. Machado
Michael Bowling
Papers citing "Generalization and Regularization in DQN"
50 / 54 papers shown
Title
Bidirectional Distillation: A Mixed-Play Framework for Multi-Agent Generalizable Behaviors
Lang Feng
Jiahao Lin
Dong Xing
Li Zhang
De Ma
Gang Pan
31
0
0
16 May 2025
Understanding the theoretical properties of projected Bellman equation, linear Q-learning, and approximate value iteration
Han-Dong Lim
Donghwan Lee
21
0
0
15 Apr 2025
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hojoon Lee
Youngdo Lee
Takuma Seno
Donghu Kim
Peter Stone
Jaegul Choo
63
1
0
24 Feb 2025
BlendRL: A Framework for Merging Symbolic and Neural Policy Learning
Hikaru Shindo
Quentin Delfosse
Devendra Singh Dhami
Kristian Kersting
43
3
0
15 Oct 2024
SigmaRL: A Sample-Efficient and Generalizable Multi-Agent Reinforcement Learning Framework for Motion Planning
Jianye Xu
Pan Hu
Bassam Alrifaee
44
5
0
14 Aug 2024
Towards Generalizable Reinforcement Learning via Causality-Guided Self-Adaptive Representations
Yupei Yang
Erdun Gao
Fan Feng
Xinyue Wang
Shikui Tu
Lei Xu
CML
OOD
TTA
40
1
0
30 Jul 2024
Evaluation of Reinforcement Learning for Autonomous Penetration Testing using A3C, Q-learning and DQN
Norman Becker
Daniel Reti
Evridiki V. Ntagiou
M. Wallum
Hans D. Schotten
37
1
0
22 Jul 2024
The Overcooked Generalisation Challenge
Constantin Ruhdorfer
Matteo Bortoletto
Anna Penzkofer
Andreas Bulling
48
4
0
25 Jun 2024
Massively Multiagent Minigames for Training Generalist Agents
Kyoung Whan Choe
Ryan Sullivan
Joseph Suárez
AI4CE
34
0
0
07 Jun 2024
Intervention-Assisted Policy Gradient Methods for Online Stochastic Queuing Network Optimization: Technical Report
Jerrod Wigmore
B. Shrader
E. Modiano
OffRL
32
1
0
05 Apr 2024
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Marcel Hussing
C. Voelcker
Igor Gilitschenski
Amir-massoud Farahmand
Eric Eaton
42
3
0
09 Mar 2024
What Makes Pre-Trained Visual Representations Successful for Robust Manipulation?
Kaylee Burns
Zach Witzel
Jubayer Ibn Hamid
Tianhe Yu
Chelsea Finn
Karol Hausman
OOD
SSL
32
23
0
03 Nov 2023
RAPID: Enabling Fast Online Policy Learning in Dynamic Public Cloud Environments
Drew Penney
Bin Li
Lizhong Chen
J. Sydir
Anna Drewek-Ossowicka
R. Illikkal
Charlie Tai
R. Iyer
Andrew J. Herdrich
31
1
0
10 Apr 2023
Unified Emulation-Simulation Training Environment for Autonomous Cyber Agents
Li Li
Jean-Pierre S. El Rami
Adrian Taylor
James Hailing Rao
T. Kunz
13
4
0
03 Apr 2023
Policy Evaluation in Decentralized POMDPs with Belief Sharing
Mert Kayaalp
Fatima Ghadieh
Ali H. Sayed
19
2
0
08 Feb 2023
Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes
Aviral Kumar
Rishabh Agarwal
Xinyang Geng
George Tucker
Sergey Levine
OffRL
44
48
0
28 Nov 2022
Melting Pot 2.0
J. Agapiou
A. Vezhnevets
Edgar A. Duéñez-Guzmán
Jayd Matyas
Yiran Mao
...
Sukhdeep Singh
Julia Haas
Igor Mordatch
D. Mobbs
Joel Z Leibo
45
31
0
24 Nov 2022
Probing Transfer in Deep Reinforcement Learning without Task Engineering
Andrei A. Rusu
Sebastian Flennerhag
Dushyant Rao
Razvan Pascanu
R. Hadsell
34
6
0
22 Oct 2022
Bridging the Gap Between Target Networks and Functional Regularization
Alexandre Piché
Valentin Thomas
Joseph Marino
Rafael Pardiñas
Gian Maria Marconi
C. Pal
Mohammad Emtiyaz Khan
14
1
0
21 Oct 2022
Scaling Laws for Reward Model Overoptimization
Leo Gao
John Schulman
Jacob Hilton
ALM
41
489
0
19 Oct 2022
On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness
Haotian Ye
Xiaoyu Chen
Liwei Wang
S. Du
OffRL
37
6
0
19 Oct 2022
A Comprehensive Survey of Data Augmentation in Visual Reinforcement Learning
Guozheng Ma
Zhen Wang
Zhecheng Yuan
Xueqian Wang
Bo Yuan
Dacheng Tao
OffRL
43
27
0
10 Oct 2022
Learn the Time to Learn: Replay Scheduling in Continual Learning
Marcus Klasson
Hedvig Kjellström
Chen Zhang
CLL
35
9
0
18 Sep 2022
Look where you look! Saliency-guided Q-networks for generalization in visual Reinforcement Learning
David Bertoin
Adil Zouitine
Mehdi Zouitine
Emmanuel Rachelson
36
30
0
16 Sep 2022
Bootstrap State Representation using Style Transfer for Better Generalization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
OffRL
34
4
0
15 Jul 2022
GriddlyJS: A Web IDE for Reinforcement Learning
C. Bamford
Minqi Jiang
Mikayel Samvelyan
Tim Rocktäschel
OnRL
38
4
0
13 Jul 2022
Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning
Byungchan Ko
Jungseul Ok
OnRL
27
5
0
01 Jun 2022
Chain of Thought Imitation with Procedure Cloning
Mengjiao Yang
Dale Schuurmans
Pieter Abbeel
Ofir Nachum
OffRL
35
30
0
22 May 2022
Learning Task-relevant Representations for Generalization via Characteristic Functions of Reward Sequence Distributions
Rui Yang
Jie Wang
Zijie Geng
Mingxuan Ye
Shuiwang Ji
Bin Li
Fengli Wu
OOD
31
20
0
20 May 2022
Local Feature Swapping for Generalization in Reinforcement Learning
David Bertoin
Emmanuel Rachelson
OOD
23
14
0
13 Apr 2022
Consistent Dropout for Policy Gradient Reinforcement Learning
Matthew J. Hausknecht
Nolan Wagener
OffRL
24
10
0
23 Feb 2022
CORA: Benchmarks, Baselines, and Metrics as a Platform for Continual Reinforcement Learning Agents
Sam Powers
Eliot Xing
Eric Kolve
Roozbeh Mottaghi
Abhinav Gupta
OffRL
36
38
0
19 Oct 2021
On The Transferability of Deep-Q Networks
M. Sabatelli
Pierre Geurts
37
2
0
06 Oct 2021
Robust Predictable Control
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
OffRL
29
44
0
07 Sep 2021
Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
Dibya Ghosh
Jad Rahme
Aviral Kumar
Amy Zhang
Ryan P. Adams
Sergey Levine
OffRL
278
110
0
13 Jul 2021
Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Assaf Hallak
Gal Dalal
Steven Dalton
I. Frosio
Shie Mannor
Gal Chechik
OffRL
OnRL
35
9
0
04 Jul 2021
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation
Nicklas Hansen
H. Su
Xiaolong Wang
OffRL
28
134
0
01 Jul 2021
SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies
Linxi Fan
Guanzhi Wang
De-An Huang
Zhiding Yu
Li Fei-Fei
Yuke Zhu
Anima Anandkumar
OffRL
30
63
0
17 Jun 2021
Safety Enhancement for Deep Reinforcement Learning in Autonomous Separation Assurance
Wei Guo
Marc Brittain
Peng Wei
31
18
0
05 May 2021
Domain Generalization with MixStyle
Kaiyang Zhou
Yongxin Yang
Yu Qiao
Tao Xiang
71
744
0
05 Apr 2021
One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL
Saurabh Kumar
Aviral Kumar
Sergey Levine
Chelsea Finn
OffRL
16
90
0
27 Oct 2020
Emergent Social Learning via Multi-agent Reinforcement Learning
Kamal Ndousse
Douglas Eck
Sergey Levine
Natasha Jaques
8
41
0
01 Oct 2020
Discount Factor as a Regularizer in Reinforcement Learning
Ron Amit
Ron Meir
K. Ciosek
OffRL
22
71
0
04 Jul 2020
Group Equivariant Deep Reinforcement Learning
Arnab Kumar Mondal
Pratheeksha Nair
Kaleem Siddiqi
17
31
0
01 Jul 2020
Automatic Data Augmentation for Generalization in Deep Reinforcement Learning
Roberta Raileanu
M. Goldstein
Denis Yarats
Ilya Kostrikov
Rob Fergus
OffRL
22
109
0
23 Jun 2020
Deep Reinforcement and InfoMax Learning
Bogdan Mazoure
Rémi Tachet des Combes
T. Doan
Philip Bachman
R. Devon Hjelm
AI4CE
25
108
0
12 Jun 2020
F2A2: Flexible Fully-decentralized Approximate Actor-critic for Cooperative Multi-agent Reinforcement Learning
Wenhao Li
Bo Jin
Xiangfeng Wang
Junchi Yan
H. Zha
25
21
0
17 Apr 2020
Leveraging Procedural Generation to Benchmark Reinforcement Learning
K. Cobbe
Christopher Hesse
Jacob Hilton
John Schulman
45
541
0
03 Dec 2019
Robust Visual Domain Randomization for Reinforcement Learning
Reda Bahi Slaoui
W. Clements
Jakob N. Foerster
Sébastien Toth
OOD
8
12
0
23 Oct 2019
Generalizing from a few environments in safety-critical reinforcement learning
Zachary Kenton
Angelos Filos
Owain Evans
Y. Gal
8
16
0
02 Jul 2019