ResearchTrend.AI
Quantifying Generalization in Reinforcement Learning


6 December 2018
K. Cobbe
Oleg Klimov
Christopher Hesse
Taehoon Kim
John Schulman
    OffRL

Papers citing "Quantifying Generalization in Reinforcement Learning"

50 / 397 papers shown
Adversarial Style Transfer for Robust Policy Optimization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
29
4
0
29 Aug 2023
Go Beyond Imagination: Maximizing Episodic Reachability with World Models
Yao Fu
Run Peng
Honglak Lee
24
1
0
25 Aug 2023
JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning
Kaiwen Wang
Junxiong Wang
Yueying Li
Nathan Kallus
Immanuel Trummer
Wen Sun
GP
50
2
0
21 Jul 2023
Measuring and Mitigating Interference in Reinforcement Learning
Vincent Liu
Han Wang
Ruo Yu Tao
Khurram Javed
Adam White
Martha White
27
5
0
10 Jul 2023
Concept Extrapolation: A Conceptual Primer
Matija Franklin
Rebecca Gorman
Hal Ashton
Stuart Armstrong
18
1
0
19 Jun 2023
Online Prototype Alignment for Few-shot Policy Transfer
Qi Yi
Rui Zhang
Shaohui Peng
Jiaming Guo
Yunkai Gao
...
Xingui Hu
Zidong Du
Xishan Zhang
Qi Guo
Yunji Chen
OffRL
27
4
0
12 Jun 2023
The Role of Diverse Replay for Generalisation in Reinforcement Learning
Max Weltevrede
M. Spaan
Wendelin Böhmer
OffRL
18
1
0
09 Jun 2023
On the Importance of Exploration for Generalization in Reinforcement Learning
Yiding Jiang
J. Zico Kolter
Roberta Raileanu
UQCV
OffRL
32
20
0
08 Jun 2023
Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning
Jifeng Hu
Yan Sun
Sili Huang
Siyuan Guo
Hechang Chen
Li Shen
Lichao Sun
Yi-Ju Chang
Dacheng Tao
DiffM
OffRL
43
13
0
08 Jun 2023
A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs
Mikael Henaff
Minqi Jiang
Roberta Raileanu
46
13
0
05 Jun 2023
Explore to Generalize in Zero-Shot RL
E. Zisselman
Itai Lavie
Daniel Soudry
Aviv Tamar
28
15
0
05 Jun 2023
Normalization Enhances Generalization in Visual Reinforcement Learning
Lu Li
Jiafei Lyu
Guozheng Ma
Zilin Wang
Zhen Yang
Xiu Li
Zhiheng Li
OOD
25
8
0
01 Jun 2023
Solving Robust MDPs through No-Regret Dynamics
E. Guha
33
0
0
30 May 2023
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?
Rui Yang
Yong Lin
Xiaoteng Ma
Haotian Hu
Chongjie Zhang
Tong Zhang
OffRL
29
22
0
30 May 2023
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
Haoran He
Chenjia Bai
Kang Xu
Zhuoran Yang
Weinan Zhang
Dong Wang
Bingyan Zhao
Xuelong Li
DiffM
OffRL
38
90
0
29 May 2023
Self-Supervised Reinforcement Learning that Transfers using Random Features
Boyuan Chen
Chuning Zhu
Pulkit Agrawal
Kaipeng Zhang
Abhishek Gupta
OffRL
SSL
30
6
0
26 May 2023
Towards Generalizable Reinforcement Learning for Trade Execution
Chuheng Zhang
Yitong Duan
Xiaoyu Chen
Jianyu Chen
Jian Li
L. Zhao
OffRL
17
7
0
12 May 2023
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity
Jinghao Xin
Jinwoo Kim
Zehan Li
Ning Li
OffRL
28
3
0
07 May 2023
Simple Noisy Environment Augmentation for Reinforcement Learning
Raad Khraishi
Ramin Okhrati
OffRL
18
1
0
04 May 2023
Get Back Here: Robust Imitation by Return-to-Distribution Planning
Geoffrey Cideron
B. Tabanpour
Sebastian Curi
Sertan Girgin
Léonard Hussenot
Gabriel Dulac-Arnold
M. Geist
Olivier Pietquin
Robert Dadashi
OOD
84
2
0
02 May 2023
Adversarial Policy Optimization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
AAML
27
0
0
27 Apr 2023
Can Agents Run Relay Race with Strangers? Generalization of RL to Out-of-Distribution Trajectories
Li-Cheng Lan
Huan Zhang
Cho-Jui Hsieh
OODD
26
9
0
26 Apr 2023
LASER: A Neuro-Symbolic Framework for Learning Spatial-Temporal Scene Graphs with Weak Supervision
Jiani Huang
Ziyang Li
Mayur Naik
Ser-Nam Lim
37
3
0
15 Apr 2023
Habits and goals in synergy: a variational Bayesian framework for behavior
Dongqi Han
Kenji Doya
Dongsheng Li
Jun Tani
BDL
28
220
0
11 Apr 2023
Scallop: A Language for Neurosymbolic Programming
Ziyang Li
Jiani Huang
Mayur Naik
ReLM
LRM
NAI
24
30
0
10 Apr 2023
On-line reinforcement learning for optimization of real-life energy trading strategy
Lukasz Lepak
Paweł Wawrzyński
34
0
0
28 Mar 2023
CelebV-Text: A Large-Scale Facial Text-Video Dataset
Jianhui Yu
Hao Zhu
Liming Jiang
Chen Change Loy
Weidong (Tom) Cai
Wayne Wu
30
56
0
26 Mar 2023
A State Augmentation based approach to Reinforcement Learning from Human Preferences
Mudit Verma
Subbarao Kambhampati
33
2
0
17 Feb 2023
Policy Evaluation in Decentralized POMDPs with Belief Sharing
Mert Kayaalp
Fatima Ghadieh
Ali H. Sayed
19
2
0
08 Feb 2023
Hierarchically Composing Level Generators for the Creation of Complex Structures
Michael Beukman
Manuel A. Fokam
Marcel Kruger
Guy Axelrod
Muhammad Umair Nasir
Branden Ingram
Benjamin Rosman
Steven D. James
40
9
0
03 Feb 2023
Partitioning Distributed Compute Jobs with Reinforcement Learning and Graph Neural Networks
Christopher W. F. Parsonson
Zacharaya Shabka
Alessandro Ottino
G. Zervas
34
0
0
31 Jan 2023
Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs
Guanhui Liu
En-Pei Hu
Pu-Jen Cheng
Hung-yi Lee
Shao-Hua Sun
74
18
0
30 Jan 2023
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
OffRL
29
8
0
26 Jan 2023
Generalization through Diversity: Improving Unsupervised Environment Design
Wenjun Li
Pradeep Varakantham
Dexun Li
33
7
0
19 Jan 2023
Human-Timescale Adaptation in an Open-Ended Task Space
Adaptive Agent Team
Jakob Bauer
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
...
Jakub Sygnowski
K. Tuyls
Sarah York
Alexander Zacherl
Lei Zhang
LM&Ro
OffRL
AI4CE
LRM
38
109
0
18 Jan 2023
Learning Generalizable Representations for Reinforcement Learning via Adaptive Meta-learner of Behavioral Similarities
Jianda Chen
Sinno Jialin Pan
SSL
21
6
0
26 Dec 2022
Decoding surface codes with deep reinforcement learning and probabilistic policy reuse
E. S. Matekole
Esther Ye
Ramya Iyer
Samuel Yen-Chi Chen
29
2
0
22 Dec 2022
Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning
Zhecheng Yuan
Zhengrong Xue
Bo Yuan
Xueqian Wang
Yi Wu
Yang Gao
Huazhe Xu
SSL
OffRL
46
70
0
17 Dec 2022
Policy Adaptation from Foundation Model Feedback
Yuying Ge
Annabella Macaluso
Erran L. Li
Ping Luo
Xiaolong Wang
LM&Ro
27
12
0
14 Dec 2022
Improving generalization in reinforcement learning through forked agents
Olivier Moulin
Vincent François-Lavet
Mark Hoogendoorn
AI4CE
28
0
0
13 Dec 2022
Monocular Camera and Single-Beam Sonar-Based Underwater Collision-Free Navigation with Domain Randomization
Pengzhi Yang
Haowen Liu
Monika Roznere
Alberto Quattrini Li
22
9
0
08 Dec 2022
Enhanced method for reinforcement learning based dynamic obstacle avoidance by assessment of collision risk
Fabian Hart
Ostap Okhrin
6
12
0
08 Dec 2022
Melting Pot 2.0
J. Agapiou
A. Vezhnevets
Edgar A. Duéñez-Guzmán
Jayd Matyas
Yiran Mao
...
Sukhdeep Singh
Julia Haas
Igor Mordatch
D. Mobbs
Joel Z Leibo
40
31
0
24 Nov 2022
Make-A-Story: Visual Memory Conditioned Consistent Story Generation
Tanzila Rahman
Hsin-Ying Lee
Jian Ren
Sergey Tulyakov
Shweta Mahajan
Leonid Sigal
DiffM
19
68
0
23 Nov 2022
Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation
Tsu-jui Fu
Licheng Yu
Ning Zhang
Cheng-Yang Fu
Jong-Chyi Su
William Yang Wang
Sean Bell
VGen
61
37
0
23 Nov 2022
General Intelligence Requires Rethinking Exploration
Minqi Jiang
Tim Rocktäschel
Edward Grefenstette
LRM
29
18
0
15 Nov 2022
A taxonomic system for failure cause analysis of open source AI incidents
Nikiforos Pittaras
Sean McGregor
21
9
0
14 Nov 2022
Deep Reinforcement Learning with Vector Quantized Encoding
Liang Zhang
Justin Lieffers
A. Pyarelal
OffRL
18
2
0
12 Nov 2022
ADLight: A Universal Approach of Traffic Signal Control with Augmented Data Using Reinforcement Learning
Maonan Wang
Yutong Xu
Xincheng Xiong
Yuheng Kan
Chengcheng Xu
Man-On Pun
23
7
0
24 Oct 2022
Probing Transfer in Deep Reinforcement Learning without Task Engineering
Andrei A. Rusu
Sebastian Flennerhag
Dushyant Rao
Razvan Pascanu
R. Hadsell
34
6
0
22 Oct 2022