ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.02341
  4. Cited By
Quantifying Generalization in Reinforcement Learning

Quantifying Generalization in Reinforcement Learning

6 December 2018
K. Cobbe
Oleg Klimov
Christopher Hesse
Taehoon Kim
John Schulman
    OffRL
ArXivPDFHTML

Papers citing "Quantifying Generalization in Reinforcement Learning"

50 / 397 papers shown
Title
Efficient Embedding of Semantic Similarity in Control Policies via
  Entangled Bisimulation
Efficient Embedding of Semantic Similarity in Control Policies via Entangled Bisimulation
Martín Bertrán
Walter A. Talbott
Nitish Srivastava
J. Susskind
45
3
0
28 Jan 2022
Look Closer: Bridging Egocentric and Third-Person Views with
  Transformers for Robotic Manipulation
Look Closer: Bridging Egocentric and Third-Person Views with Transformers for Robotic Manipulation
Rishabh Jangir
Nicklas Hansen
Sambaran Ghosal
Mohit Jain
Xiaolong Wang
32
65
0
19 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
Cooperation for Scalable Supervision of Autonomy in Mixed Traffic
Cooperation for Scalable Supervision of Autonomy in Mixed Traffic
Cameron Hickert
Sirui Li
Cathy Wu
22
5
0
14 Dec 2021
Learning Generalizable Behavior via Visual Rewrite Rules
Learning Generalizable Behavior via Visual Rewrite Rules
Yiheng Xie
Mingxuan Li
Shangqun Yu
Michael Littman
DRL
14
1
0
09 Dec 2021
Hyper-parameter optimization based on soft actor critic and hierarchical
  mixture regularization
Hyper-parameter optimization based on soft actor critic and hierarchical mixture regularization
Chaoyue Liu
Yulai Zhang
19
0
0
08 Dec 2021
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Charles Packer
Pieter Abbeel
Joseph E. Gonzalez
OffRL
23
18
0
02 Dec 2021
Meta Arcade: A Configurable Environment Suite for Meta-Learning
Meta Arcade: A Configurable Environment Suite for Meta-Learning
Edward W. Staley
C. Ashcraft
Ben Stoler
Jared Markowitz
Gautam K. Vallabha
Christopher R. Ratto
Kapil D. Katyal
19
6
0
01 Dec 2021
Improving Zero-shot Generalization in Offline Reinforcement Learning
  using Generalized Similarity Functions
Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions
Bogdan Mazoure
Ilya Kostrikov
Ofir Nachum
Jonathan Tompson
OffRL
51
21
0
29 Nov 2021
A Survey of Zero-shot Generalisation in Deep Reinforcement Learning
A Survey of Zero-shot Generalisation in Deep Reinforcement Learning
Robert Kirk
Amy Zhang
Edward Grefenstette
Tim Rocktaschel
OffRL
17
157
0
18 Nov 2021
Learning Provably Robust Motion Planners Using Funnel Libraries
Learning Provably Robust Motion Planners Using Funnel Libraries
Alim Gurgen
Anirudha Majumdar
Sushant Veer
OOD
25
2
0
16 Nov 2021
B-Pref: Benchmarking Preference-Based Reinforcement Learning
B-Pref: Benchmarking Preference-Based Reinforcement Learning
Kimin Lee
Laura M. Smith
Anca Dragan
Pieter Abbeel
OffRL
40
93
0
04 Nov 2021
Procedural Generalization by Planning with Self-Supervised World Models
Procedural Generalization by Planning with Self-Supervised World Models
Ankesh Anand
Jacob Walker
Yazhe Li
Eszter Vértes
Julian Schrittwieser
Sherjil Ozair
T. Weber
Jessica B. Hamrick
31
30
0
02 Nov 2021
URLB: Unsupervised Reinforcement Learning Benchmark
URLB: Unsupervised Reinforcement Learning Benchmark
Michael Laskin
Denis Yarats
Hao Liu
Kimin Lee
Albert Zhan
Kevin Lu
Catherine Cang
Lerrel Pinto
Pieter Abbeel
SSL
OffRL
35
133
0
28 Oct 2021
A Versatile and Efficient Reinforcement Learning Framework for
  Autonomous Driving
A Versatile and Efficient Reinforcement Learning Framework for Autonomous Driving
Guan-Bo Wang
Haoyi Niu
Desheng Zhu
Jianming Hu
Xianyuan Zhan
Guyue Zhou
OffRL
24
2
0
22 Oct 2021
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs
Tianwei Ni
Benjamin Eysenbach
Ruslan Salakhutdinov
26
103
0
11 Oct 2021
Replay-Guided Adversarial Environment Design
Replay-Guided Adversarial Environment Design
Minqi Jiang
Michael Dennis
Jack Parker-Holder
Jakob N. Foerster
Edward Grefenstette
Tim Rocktaschel
129
95
0
06 Oct 2021
Unsolved Problems in ML Safety
Unsolved Problems in ML Safety
Dan Hendrycks
Nicholas Carlini
John Schulman
Jacob Steinhardt
186
275
0
28 Sep 2021
MetaDrive: Composing Diverse Driving Scenarios for Generalizable
  Reinforcement Learning
MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning
Quanyi Li
Zhenghao Peng
Lan Feng
Qihang Zhang
Zhenghai Xue
Bolei Zhou
41
232
0
26 Sep 2021
Generalization in Text-based Games via Hierarchical Reinforcement
  Learning
Generalization in Text-based Games via Hierarchical Reinforcement Learning
Yunqiu Xu
Meng Fang
Ling Chen
Yali Du
Chengqi Zhang
AI4CE
40
20
0
21 Sep 2021
CompilerGym: Robust, Performant Compiler Optimization Environments for
  AI Research
CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research
Chris Cummins
Bram Wasti
Jiadong Guo
Brandon Cui
Jason Ansel
...
Jia-Wei Liu
O. Teytaud
Benoit Steiner
Yuandong Tian
Hugh Leather
31
68
0
17 Sep 2021
Reinforcement Learning on Encrypted Data
Reinforcement Learning on Encrypted Data
Alberto Jesu
Victor-Alexandru Darvariu
Alessandro Staffolani
R. Montanari
Mirco Musolesi
OffRL
21
1
0
16 Sep 2021
Robust Predictable Control
Robust Predictable Control
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
OffRL
29
44
0
07 Sep 2021
A review of mobile robot motion planning methods: from classical motion
  planning workflows to reinforcement learning-based architectures
A review of mobile robot motion planning methods: from classical motion planning workflows to reinforcement learning-based architectures
Changyin Sun
Zicheng He
Chunwei Song
Changyin Sun
38
54
0
31 Aug 2021
Active Reinforcement Learning over MDPs
Qi Yang
Peng Yang
K. Tang
41
0
0
05 Aug 2021
Risk Conditioned Neural Motion Planning
Risk Conditioned Neural Motion Planning
Xin Huang
Meng Feng
A. Jasour
Guy Rosman
B. Williams
27
6
0
04 Aug 2021
How to Certify Machine Learning Based Safety-critical Systems? A
  Systematic Literature Review
How to Certify Machine Learning Based Safety-critical Systems? A Systematic Literature Review
Florian Tambon
Gabriel Laberge
Le An
Amin Nikanjam
Paulina Stevia Nouwou Mindom
Y. Pequignot
Foutse Khomh
G. Antoniol
E. Merlo
François Laviolette
37
66
0
26 Jul 2021
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting
  Pot
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot
Joel Z Leibo
Edgar A. Duénez-Guzmán
A. Vezhnevets
J. Agapiou
P. Sunehag
Raphael Köster
Jayd Matyas
Charlie Beattie
Igor Mordatch
T. Graepel
OffRL
58
103
0
14 Jul 2021
Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit
  Partial Observability
Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
Dibya Ghosh
Jad Rahme
Aviral Kumar
Amy Zhang
Ryan P. Adams
Sergey Levine
OffRL
278
109
0
13 Jul 2021
The Role of Pretrained Representations for the OOD Generalization of
  Reinforcement Learning Agents
The Role of Pretrained Representations for the OOD Generalization of Reinforcement Learning Agents
Andrea Dittadi
Frederik Trauble
M. Wuthrich
Felix Widmaier
Peter V. Gehler
Ole Winther
Francesco Locatello
Olivier Bachem
Bernhard Schölkopf
Stefan Bauer
OOD
41
16
0
12 Jul 2021
MixStyle Neural Networks for Domain Generalization and Adaptation
MixStyle Neural Networks for Domain Generalization and Adaptation
Kaiyang Zhou
Yongxin Yang
Yu Qiao
Tao Xiang
OOD
TTA
31
76
0
05 Jul 2021
Sample Efficient Reinforcement Learning via Model-Ensemble Exploration
  and Exploitation
Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation
Yaowen Yao
Li Xiao
Zhicheng An
Wanpeng Zhang
Dijun Luo
68
20
0
05 Jul 2021
Systematic Evaluation of Causal Discovery in Visual Model Based
  Reinforcement Learning
Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning
Nan Rosemary Ke
Aniket Didolkar
Sarthak Mittal
Anirudh Goyal
Guillaume Lajoie
Stefan Bauer
Danilo Jimenez Rezende
Yoshua Bengio
Michael C. Mozer
C. Pal
CML
29
54
0
02 Jul 2021
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under
  Data Augmentation
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation
Nicklas Hansen
H. Su
Xiaolong Wang
OffRL
26
134
0
01 Jul 2021
Tuning Mixed Input Hyperparameters on the Fly for Efficient Population
  Based AutoRL
Tuning Mixed Input Hyperparameters on the Fly for Efficient Population Based AutoRL
Jack Parker-Holder
Vu Nguyen
Shaan Desai
Stephen J. Roberts
43
16
0
30 Jun 2021
Generalization of Reinforcement Learning with Policy-Aware Adversarial
  Data Augmentation
Generalization of Reinforcement Learning with Policy-Aware Adversarial Data Augmentation
Hanping Zhang
Yuhong Guo
30
23
0
29 Jun 2021
Scenic4RL: Programmatic Modeling and Generation of Reinforcement
  Learning Environments
Scenic4RL: Programmatic Modeling and Generation of Reinforcement Learning Environments
Abdus Salam Azad
Edward Kim
M. Wu
Kimin Lee
Ion Stoica
Pieter Abbeel
S. Seshia
20
7
0
18 Jun 2021
SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual
  Policies
SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies
Linxi Fan
Guanzhi Wang
De-An Huang
Zhiding Yu
Li Fei-Fei
Yuke Zhu
Anima Anandkumar
OffRL
30
63
0
17 Jun 2021
Vision-Language Navigation with Random Environmental Mixup
Vision-Language Navigation with Random Environmental Mixup
Chong Liu
Fengda Zhu
Xiaojun Chang
Xiaodan Liang
Zongyuan Ge
Yi-Dong Shen
LM&Ro
56
86
0
15 Jun 2021
Sample Efficient Reinforcement Learning In Continuous State Spaces: A
  Perspective Beyond Linearity
Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity
Dhruv Malik
Aldo Pacchiano
Vishwak Srinivasan
Yuanzhi Li
12
6
0
15 Jun 2021
Automatic Risk Adaptation in Distributional Reinforcement Learning
Automatic Risk Adaptation in Distributional Reinforcement Learning
Frederik Schubert
Theresa Eimer
Bodo Rosenhahn
Marius Lindauer
20
8
0
11 Jun 2021
Brittle AI, Causal Confusion, and Bad Mental Models: Challenges and
  Successes in the XAI Program
Brittle AI, Causal Confusion, and Bad Mental Models: Challenges and Successes in the XAI Program
Jeff Druce
J. Niehaus
Vanessa Moody
David D. Jensen
Michael L. Littman
15
15
0
10 Jun 2021
Towards robust and domain agnostic reinforcement learning competitions
Towards robust and domain agnostic reinforcement learning competitions
William H. Guss
Stephanie Milani
Nicholay Topin
Brandon Houghton
Sharada Mohanty
...
Lu Liu
Daichi Nishio
Toi Tsuneda
Karolis Ramanauskas
Gabija Juceviciute
OOD
27
2
0
07 Jun 2021
Differentiable Architecture Search for Reinforcement Learning
Differentiable Architecture Search for Reinforcement Learning
Yingjie Miao
Xingyou Song
John D. Co-Reyes
Daiyi Peng
Summer Yue
E. Brevdo
Aleksandra Faust
20
4
0
04 Jun 2021
Towards Deeper Deep Reinforcement Learning with Spectral Normalization
Towards Deeper Deep Reinforcement Learning with Spectral Normalization
Johan Bjorck
Carla P. Gomes
Kilian Q. Weinberger
19
23
0
02 Jun 2021
Improving Generalization in Meta-RL with Imaginary Tasks from Latent
  Dynamics Mixture
Improving Generalization in Meta-RL with Imaginary Tasks from Latent Dynamics Mixture
Suyoung Lee
Sae-Young Chung
OffRL
AI4CE
18
16
0
28 May 2021
Spectral Normalisation for Deep Reinforcement Learning: an Optimisation
  Perspective
Spectral Normalisation for Deep Reinforcement Learning: an Optimisation Perspective
Florin Gogianu
Tudor Berariu
Mihaela Rosca
Claudia Clopath
L. Buşoniu
Razvan Pascanu
24
53
0
11 May 2021
Safety Enhancement for Deep Reinforcement Learning in Autonomous
  Separation Assurance
Safety Enhancement for Deep Reinforcement Learning in Autonomous Separation Assurance
Wei Guo
Marc Brittain
Peng Wei
29
18
0
05 May 2021
Joint Attention for Multi-Agent Coordination and Social Learning
Joint Attention for Multi-Agent Coordination and Social Learning
Dennis Lee
Natasha Jaques
Chase Kew
Jiaxing Wu
Douglas Eck
Dale Schuurmans
Aleksandra Faust
30
8
0
15 Apr 2021
Level Generation for Angry Birds with Sequential VAE and Latent Variable
  Evolution
Level Generation for Angry Birds with Sequential VAE and Latent Variable Evolution
Takumi Tanabe
Kazuto Fukuchi
Jun Sakuma
Youhei Akimoto
24
11
0
13 Apr 2021
Previous
12345678
Next