ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.02341
  4. Cited By
Quantifying Generalization in Reinforcement Learning

Quantifying Generalization in Reinforcement Learning

6 December 2018
K. Cobbe
Oleg Klimov
Christopher Hesse
Taehoon Kim
John Schulman
    OffRL
ArXivPDFHTML

Papers citing "Quantifying Generalization in Reinforcement Learning"

50 / 397 papers shown
Title
Scaling Laws for Reward Model Overoptimization
Scaling Laws for Reward Model Overoptimization
Leo Gao
John Schulman
Jacob Hilton
ALM
41
481
0
19 Oct 2022
CLUTR: Curriculum Learning via Unsupervised Task Representation Learning
CLUTR: Curriculum Learning via Unsupervised Task Representation Learning
Abdus Salam Azad
Izzeddin Gur
Jasper Emhoff
Nathaniel Alexis
Aleksandra Faust
Pieter Abbeel
Ion Stoica
SSL
29
12
0
19 Oct 2022
Rethinking Value Function Learning for Generalization in Reinforcement
  Learning
Rethinking Value Function Learning for Generalization in Reinforcement Learning
Seungyong Moon
JunYeong Lee
Hyun Oh Song
OOD
OffRL
21
16
0
18 Oct 2022
WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments
WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments
Xi Chen
Tianyuan Shi
Qing Zhao
Yuchen Sun
Yunfei Gao
Xiangjun Wang
33
2
0
14 Oct 2022
Bootstrap Advantage Estimation for Policy Optimization in Reinforcement
  Learning
Bootstrap Advantage Estimation for Policy Optimization in Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
OffRL
11
0
0
13 Oct 2022
Exploration via Elliptical Episodic Bonuses
Exploration via Elliptical Episodic Bonuses
Mikael Henaff
Roberta Raileanu
Minqi Jiang
Tim Rocktaschel
OffRL
35
40
0
11 Oct 2022
Benchmarking Reinforcement Learning Techniques for Autonomous Navigation
Benchmarking Reinforcement Learning Techniques for Autonomous Navigation
Zifan Xu
Bo Liu
Xuesu Xiao
Anirudh Nair
Peter Stone
36
42
0
10 Oct 2022
A Comprehensive Survey of Data Augmentation in Visual Reinforcement
  Learning
A Comprehensive Survey of Data Augmentation in Visual Reinforcement Learning
Guozheng Ma
Zhen Wang
Zhecheng Yuan
Xueqian Wang
Bo Yuan
Dacheng Tao
OffRL
38
26
0
10 Oct 2022
Vision+X: A Survey on Multimodal Learning in the Light of Data
Vision+X: A Survey on Multimodal Learning in the Light of Data
Ye Zhu
Yuehua Wu
N. Sebe
Yan Yan
35
16
0
05 Oct 2022
Neural Distillation as a State Representation Bottleneck in
  Reinforcement Learning
Neural Distillation as a State Representation Bottleneck in Reinforcement Learning
Valentin Guillet
D. Wilson
Carlos Aguilar-Melchor
Emmanuel Rachelson
16
1
0
05 Oct 2022
Goal Misgeneralization: Why Correct Specifications Aren't Enough For
  Correct Goals
Goal Misgeneralization: Why Correct Specifications Aren't Enough For Correct Goals
Rohin Shah
Vikrant Varma
Ramana Kumar
Mary Phuong
Victoria Krakovna
J. Uesato
Zachary Kenton
40
68
0
04 Oct 2022
Hyperbolic Deep Reinforcement Learning
Hyperbolic Deep Reinforcement Learning
Edoardo Cetin
B. Chamberlain
Michael M. Bronstein
Jonathan J. Hunt
43
21
0
04 Oct 2022
Generalization in Deep Reinforcement Learning for Robotic Navigation by
  Reward Shaping
Generalization in Deep Reinforcement Learning for Robotic Navigation by Reward Shaping
Victor R. F. Miranda
A. A. Neto
G. Freitas
L. Mozelli
35
18
0
28 Sep 2022
Online Policy Optimization for Robust MDP
Online Policy Optimization for Robust MDP
Jing Dong
Jingwei Li
Baoxiang Wang
J.N. Zhang
OffRL
34
12
0
28 Sep 2022
Measuring Interventional Robustness in Reinforcement Learning
Measuring Interventional Robustness in Reinforcement Learning
Katherine Avery
Jack Kenney
Pracheta Amaranath
Erica Cai
David D. Jensen
21
0
0
19 Sep 2022
Learn the Time to Learn: Replay Scheduling in Continual Learning
Learn the Time to Learn: Replay Scheduling in Continual Learning
Marcus Klasson
Hedvig Kjellström
Chen Zhang
CLL
35
9
0
18 Sep 2022
Look where you look! Saliency-guided Q-networks for generalization in
  visual Reinforcement Learning
Look where you look! Saliency-guided Q-networks for generalization in visual Reinforcement Learning
David Bertoin
Adil Zouitine
Mehdi Zouitine
Emmanuel Rachelson
36
30
0
16 Sep 2022
Style-Agnostic Reinforcement Learning
Style-Agnostic Reinforcement Learning
Juyong Lee
Seokjun Ahn
Jaesik Park
25
4
0
31 Aug 2022
Bayesian Generational Population-Based Training
Bayesian Generational Population-Based Training
Xingchen Wan
Cong Lu
Jack Parker-Holder
Philip J. Ball
Vu-Linh Nguyen
Binxin Ru
Michael A. Osborne
OffRL
31
15
0
19 Jul 2022
Bootstrap State Representation using Style Transfer for Better
  Generalization in Deep Reinforcement Learning
Bootstrap State Representation using Style Transfer for Better Generalization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
OffRL
34
4
0
15 Jul 2022
Temporal Disentanglement of Representations for Improved Generalisation
  in Reinforcement Learning
Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning
Mhairi Dunion
Trevor A. McInroe
K. Luck
Josiah P. Hanna
Stefano V. Albrecht
OOD
DRL
26
18
0
12 Jul 2022
Test-Time Adaptation via Self-Training with Nearest Neighbor Information
Test-Time Adaptation via Self-Training with Nearest Neighbor Information
M-U Jang
Sae-Young Chung
Hye Won Chung
OOD
TTA
41
56
0
08 Jul 2022
Towards Understanding How Machines Can Learn Causal Overhypotheses
Towards Understanding How Machines Can Learn Causal Overhypotheses
Eliza Kosoy
David M. Chan
Adrian Liu
Jasmine Collins
Bryanna Kaufmann
Sandy Han Huang
Jessica B. Hamrick
John F. Canny
Nan Rosemary Ke
Alison Gopnik
CML
AI4CE
28
18
0
16 Jun 2022
On the Generalization and Adaption Performance of Causal Models
On the Generalization and Adaption Performance of Causal Models
Nino Scherrer
Anirudh Goyal
Stefan Bauer
Yoshua Bengio
Nan Rosemary Ke
CML
OOD
BDL
TTA
31
8
0
09 Jun 2022
Balancing Profit, Risk, and Sustainability for Portfolio Management
Balancing Profit, Risk, and Sustainability for Portfolio Management
Charl Maree
C. Omlin
19
9
0
06 Jun 2022
Learning Dynamics and Generalization in Reinforcement Learning
Learning Dynamics and Generalization in Reinforcement Learning
Clare Lyle
Mark Rowland
Will Dabney
Marta Z. Kwiatkowska
Y. Gal
OOD
OffRL
30
12
0
05 Jun 2022
Disentangling Epistemic and Aleatoric Uncertainty in Reinforcement
  Learning
Disentangling Epistemic and Aleatoric Uncertainty in Reinforcement Learning
Bertrand Charpentier
Ransalu Senanayake
Mykel Kochenderfer
Stephan Günnemann
PER
UD
50
24
0
03 Jun 2022
Efficient Scheduling of Data Augmentation for Deep Reinforcement
  Learning
Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning
Byungchan Ko
Jungseul Ok
OnRL
27
5
0
01 Jun 2022
Human-AI Shared Control via Policy Dissection
Human-AI Shared Control via Policy Dissection
Quanyi Li
Zhenghao Peng
Haibin Wu
Lan Feng
Bolei Zhou
18
13
0
31 May 2022
Chain of Thought Imitation with Procedure Cloning
Chain of Thought Imitation with Procedure Cloning
Mengjiao Yang
Dale Schuurmans
Pieter Abbeel
Ofir Nachum
OffRL
35
30
0
22 May 2022
A Verification Framework for Certifying Learning-Based Safety-Critical
  Aviation Systems
A Verification Framework for Certifying Learning-Based Safety-Critical Aviation Systems
Ali Baheri
Hao Ren
B. Johnson
Pouria Razzaghi
Peng Wei
21
5
0
09 May 2022
Procedural Content Generation using Neuroevolution and Novelty Search
  for Diverse Video Game Levels
Procedural Content Generation using Neuroevolution and Novelty Search for Diverse Video Game Levels
Michael Beukman
C. Cleghorn
Steven D. James
31
15
0
14 Apr 2022
Improving generalization to new environments and removing catastrophic
  forgetting in Reinforcement Learning by using an eco-system of agents
Improving generalization to new environments and removing catastrophic forgetting in Reinforcement Learning by using an eco-system of agents
Olivier Moulin
Vincent François-Lavet
Paul Elbers
Mark Hoogendoorn
CLL
17
0
0
13 Apr 2022
Local Feature Swapping for Generalization in Reinforcement Learning
Local Feature Swapping for Generalization in Reinforcement Learning
David Bertoin
Emmanuel Rachelson
OOD
23
14
0
13 Apr 2022
Evolving Pareto-Optimal Actor-Critic Algorithms for Generalizability and
  Stability
Evolving Pareto-Optimal Actor-Critic Algorithms for Generalizability and Stability
Juan Jose Garau-Luis
Yingjie Miao
John D. Co-Reyes
Aaron T Parisi
Jie Tan
Esteban Real
Aleksandra Faust
31
0
0
08 Apr 2022
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive
  Transformer
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer
Songwei Ge
Thomas Hayes
Harry Yang
Xiaoyue Yin
Guan Pang
David Jacobs
Jia-Bin Huang
Devi Parikh
ViT
56
215
0
07 Apr 2022
Model Based Meta Learning of Critics for Policy Gradients
Model Based Meta Learning of Critics for Policy Gradients
Sarah Bechtle
Ludovic Righetti
Franziska Meier
OffRL
22
0
0
05 Apr 2022
Investigating the Properties of Neural Network Representations in
  Reinforcement Learning
Investigating the Properties of Neural Network Representations in Reinforcement Learning
Han Wang
Erfan Miahi
Martha White
Marlos C. Machado
Zaheer Abbas
Raksha Kumaraswamy
Vincent Liu
Adam White
22
26
0
30 Mar 2022
Dynamic Noises of Multi-Agent Environments Can Improve Generalization:
  Agent-based Models meets Reinforcement Learning
Dynamic Noises of Multi-Agent Environments Can Improve Generalization: Agent-based Models meets Reinforcement Learning
Mohamed Akrout
Amal Feriani
Bob McLeod
AI4CE
15
0
0
26 Mar 2022
SURF: Semi-supervised Reward Learning with Data Augmentation for
  Feedback-efficient Preference-based Reinforcement Learning
SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning
Jongjin Park
Younggyo Seo
Jinwoo Shin
Honglak Lee
Pieter Abbeel
Kimin Lee
17
82
0
18 Mar 2022
Symmetry-Based Representations for Artificial and Biological General
  Intelligence
Symmetry-Based Representations for Artificial and Biological General Intelligence
I. Higgins
S. Racanière
Danilo Jimenez Rezende
AI4CE
31
44
0
17 Mar 2022
Zipfian environments for Reinforcement Learning
Zipfian environments for Reinforcement Learning
Stephanie C. Y. Chan
Andrew Kyle Lampinen
Pierre Harvey Richemond
Felix Hill
OffRL
15
15
0
15 Mar 2022
Consistent Dropout for Policy Gradient Reinforcement Learning
Consistent Dropout for Policy Gradient Reinforcement Learning
Matthew J. Hausknecht
Nolan Wagener
OffRL
24
10
0
23 Feb 2022
Inference of Affordances and Active Motor Control in Simulated Agents
Inference of Affordances and Active Motor Control in Simulated Agents
Fedor Scholz
Christian Gumbsch
S. Otte
Martin Volker Butz
AI4CE
32
5
0
23 Feb 2022
Learning Causal Overhypotheses through Exploration in Children and
  Computational Models
Learning Causal Overhypotheses through Exploration in Children and Computational Models
Eliza Kosoy
Adrian Liu
Jasmine Collins
David M. Chan
Jessica B. Hamrick
Nan Rosemary Ke
Sandy H Huang
Bryanna Kaufmann
John F. Canny
Alison Gopnik
CML
22
9
0
21 Feb 2022
Don't Touch What Matters: Task-Aware Lipschitz Data Augmentation for
  Visual Reinforcement Learning
Don't Touch What Matters: Task-Aware Lipschitz Data Augmentation for Visual Reinforcement Learning
Zhecheng Yuan
Guozheng Ma
Yao Mu
Bo Xia
Bo Yuan
Xueqian Wang
Ping Luo
Huazhe Xu
33
28
0
21 Feb 2022
Plasticity and evolvability under environmental variability: the joint
  role of fitness-based selection and niche-limited competition
Plasticity and evolvability under environmental variability: the joint role of fitness-based selection and niche-limited competition
Eleni Nisioti
Clément Moulin-Frier
30
2
0
17 Feb 2022
User-Oriented Robust Reinforcement Learning
User-Oriented Robust Reinforcement Learning
Haoyi You
Beichen Yu
Haiming Jin
Zhaoxing Yang
Jiahui Sun
OffRL
32
0
0
15 Feb 2022
Learning to Solve Routing Problems via Distributionally Robust
  Optimization
Learning to Solve Routing Problems via Distributionally Robust Optimization
Yuan Jiang
Yaoxin Wu
Zhiguang Cao
Jie Zhang
OOD
31
37
0
15 Feb 2022
L2C2: Locally Lipschitz Continuous Constraint towards Stable and Smooth
  Reinforcement Learning
L2C2: Locally Lipschitz Continuous Constraint towards Stable and Smooth Reinforcement Learning
Taisuke Kobayashi
26
15
0
15 Feb 2022
Previous
12345678
Next