ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1912.01588
  4. Cited By
Leveraging Procedural Generation to Benchmark Reinforcement Learning

Leveraging Procedural Generation to Benchmark Reinforcement Learning

3 December 2019
K. Cobbe
Christopher Hesse
Jacob Hilton
John Schulman
ArXivPDFHTML

Papers citing "Leveraging Procedural Generation to Benchmark Reinforcement Learning"

50 / 286 papers shown
Title
Constant-Memory Strategies in Stochastic Games: Best Responses and Equilibria
Constant-Memory Strategies in Stochastic Games: Best Responses and Equilibria
Fengming Zhu
Fangzhen Lin
29
0
0
11 May 2025
CLAM: Continuous Latent Action Models for Robot Learning from Unlabeled Demonstrations
CLAM: Continuous Latent Action Models for Robot Learning from Unlabeled Demonstrations
Anthony Liang
Pavel Czempin
Matthew Hong
Yutai Zhou
Erdem Biyik
Stephen Tu
47
0
0
08 May 2025
Benchmarking Vision, Language, & Action Models in Procedurally Generated, Open Ended Action Environments
Benchmarking Vision, Language, & Action Models in Procedurally Generated, Open Ended Action Environments
Pranav Guruprasad
Yangyue Wang
Sudipta Chowdhury
Harshvardhan Sikka
LM&Ro
VLM
177
0
0
08 May 2025
Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning
Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning
Mingqi Yuan
Qi Wang
Guozheng Ma
Bo-wen Li
Xin Jin
Yunbo Wang
Xiaokang Yang
Wenjun Zeng
D. Tao
OffRL
AI4CE
33
0
0
24 Apr 2025
Surrogate Fitness Metrics for Interpretable Reinforcement Learning
Surrogate Fitness Metrics for Interpretable Reinforcement Learning
Philipp Altmann
Céline Davignon
Maximilian Zorn
Fabian Ritz
Claudia Linnhoff-Popien
Thomas Gabor
29
0
0
20 Apr 2025
Exploration and Adaptation in Non-Stationary Tasks with Diffusion Policies
Exploration and Adaptation in Non-Stationary Tasks with Diffusion Policies
Gunbir Singh Baveja
40
0
0
31 Mar 2025
AdaWorld: Learning Adaptable World Models with Latent Actions
AdaWorld: Learning Adaptable World Models with Latent Actions
Shenyuan Gao
Siyuan Zhou
Yilun Du
Jun Zhang
Chuang Gan
VGen
62
4
0
24 Mar 2025
Infinite Mobility: Scalable High-Fidelity Synthesis of Articulated Objects via Procedural Generation
Infinite Mobility: Scalable High-Fidelity Synthesis of Articulated Objects via Procedural Generation
Xinyu Lian
Zichao Yu
Ruiming Liang
Yitong Wang
Li Ray Luo
...
Qihong Tang
Xudong Xu
Zhaoyang Lyu
Bo Dai
Jiangmiao Pang
58
0
0
17 Mar 2025
Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning
Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning
Samuel Garcin
Trevor A. McInroe
Pablo Samuel Castro
Prakash Panangaden
Christopher G. Lucas
David Abel
Stefano V. Albrecht
56
0
0
08 Mar 2025
Impoola: The Power of Average Pooling for Image-Based Deep Reinforcement Learning
Raphael Trumpp
Ansgar Schäfftlein
Mirco Theile
Marco Caccamo
39
0
0
07 Mar 2025
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
Thomas Schmied
Thomas Adler
Vihang Patil
M. Beck
Korbinian Poppel
Johannes Brandstetter
G. Klambauer
Razvan Pascanu
Sepp Hochreiter
75
5
0
21 Feb 2025
Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning
Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning
Bryan L. M. de Oliveira
Murilo L. da Luz
Bruno Brandão
Luana G. B. Martins
Telma W. de L. Soares
Luckeciano C. Melo
OffRL
70
1
0
17 Feb 2025
Episodic Novelty Through Temporal Distance
Y. Jiang
Qihan Liu
Yiqin Yang
Xiaoteng Ma
Dianyu Zhong
...
Jun Yang
Bin Liang
Bo Xu
Chongjie Zhang
Qianchuan Zhao
OffRL
40
0
0
28 Jan 2025
State Combinatorial Generalization In Decision Making With Conditional Diffusion Models
State Combinatorial Generalization In Decision Making With Conditional Diffusion Models
Xintong Duan
Yutong He
Fahim Tajwar
Wen-Tse Chen
Ruslan Salakhutdinov
Jeff Schneider
OffRL
AI4CE
101
0
0
22 Jan 2025
Environment Descriptions for Usability and Generalisation in
  Reinforcement Learning
Environment Descriptions for Usability and Generalisation in Reinforcement Learning
Dennis J. N. J. Soemers
Spyridon Samothrakis
Kurt Driessens
M. Winands
OffRL
85
1
0
22 Dec 2024
Enabling Realtime Reinforcement Learning at Scale with Staggered
  Asynchronous Inference
Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference
Matthew D Riemer
G. Subbaraj
Glen Berseth
Irina Rish
OffRL
79
1
0
18 Dec 2024
Sample-efficient Unsupervised Policy Cloning from Ensemble Self-supervised Labeled Videos
Sample-efficient Unsupervised Policy Cloning from Ensemble Self-supervised Labeled Videos
Xin Liu
Yaran Chen
Haoran Li
SSL
94
0
0
14 Dec 2024
AMaze: An intuitive benchmark generator for fast prototyping of
  generalizable agents
AMaze: An intuitive benchmark generator for fast prototyping of generalizable agents
Kevin Godin-Dubois
Karine Miras
Anna V. Kononova
71
0
0
20 Nov 2024
BetterBench: Assessing AI Benchmarks, Uncovering Issues, and
  Establishing Best Practices
BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
Anka Reuel
Amelia F. Hardy
Chandler Smith
Max Lamparth
Malcolm Hardy
Mykel J. Kochenderfer
ELM
81
17
0
20 Nov 2024
AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers
Jake Grigsby
Justin Sasek
Samyak Parajuli
Daniel Adebi
Amy Zhang
Yuke Zhu
OffRL
26
3
0
17 Nov 2024
GSL-PCD: Improving Generalist-Specialist Learning with Point Cloud
  Feature-based Task Partitioning
GSL-PCD: Improving Generalist-Specialist Learning with Point Cloud Feature-based Task Partitioning
Xiu Yuan
41
0
0
11 Nov 2024
Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting
  Diversity
Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting Diversity
Robby Costales
Stefanos Nikolaidis
AI4CE
31
0
0
07 Nov 2024
Beyond The Rainbow: High Performance Deep Reinforcement Learning On A
  Desktop PC
Beyond The Rainbow: High Performance Deep Reinforcement Learning On A Desktop PC
Tyler Clark
Mark Towers
Christine Evers
Jonathon Hare
OffRL
38
0
0
06 Nov 2024
Hierarchical Orchestra of Policies
Hierarchical Orchestra of Policies
Thomas P Cannon
Özgür Simsek
CLL
36
0
0
05 Nov 2024
Mechanistic Interpretability of Reinforcement Learning Agents
Mechanistic Interpretability of Reinforcement Learning Agents
Tristan Trim
Triston Grayston
AI4CE
27
0
0
30 Oct 2024
Permutation Invariant Learning with High-Dimensional Particle Filters
Permutation Invariant Learning with High-Dimensional Particle Filters
Akhilan Boopathy
Aneesh Muppidi
Peggy Yang
Abhiram Iyer
William Yue
Ila R Fiete
SSL
38
0
0
30 Oct 2024
Getting By Goal Misgeneralization With a Little Help From a Mentor
Getting By Goal Misgeneralization With a Little Help From a Mentor
Tu Trinh
Mohamad H. Danesh
Nguyen X. Khanh
Benjamin Plaut
39
1
0
28 Oct 2024
IntersectionZoo: Eco-driving for Benchmarking Multi-Agent Contextual
  Reinforcement Learning
IntersectionZoo: Eco-driving for Benchmarking Multi-Agent Contextual Reinforcement Learning
Vindula Jayawardana
Baptiste Freydt
Ao Qu
Cameron Hickert
Zhongxia Yan
Cathy Wu
51
1
0
19 Oct 2024
Latent Action Pretraining from Videos
Latent Action Pretraining from Videos
Seonghyeon Ye
Joel Jang
Byeongguk Jeon
Sejune Joo
Jianwei Yang
...
Kimin Lee
J. Gao
Luke Zettlemoyer
Dieter Fox
Minjoon Seo
35
28
0
15 Oct 2024
Improving Generalization on the ProcGen Benchmark with Simple
  Architectural Changes and Scale
Improving Generalization on the ProcGen Benchmark with Simple Architectural Changes and Scale
Andrew Jesson
Yiding Jiang
OffRL
34
1
0
13 Oct 2024
Can we hop in general? A discussion of benchmark selection and design
  using the Hopper environment
Can we hop in general? A discussion of benchmark selection and design using the Hopper environment
C. Voelcker
Marcel Hussing
Eric Eaton
OffRL
31
3
0
11 Oct 2024
Masked Generative Priors Improve World Models Sequence Modelling Capabilities
Masked Generative Priors Improve World Models Sequence Modelling Capabilities
Cristian Meo
Mircea Lica
Zarif Ikram
Akihiro Nakano
Vedant Shah
Aniket Didolkar
Dianbo Liu
Anirudh Goyal
Justin Dauwels
OffRL
90
0
0
10 Oct 2024
Retrieval-Augmented Decision Transformer: External Memory for In-context
  RL
Retrieval-Augmented Decision Transformer: External Memory for In-context RL
Thomas Schmied
Fabian Paischer
Vihang Patil
M. Hofmarcher
Razvan Pascanu
Sepp Hochreiter
OffRL
44
6
0
09 Oct 2024
DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback
DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback
Zaid Khan
Elias Stengel-Eskin
Jaemin Cho
Joey Tianyi Zhou
VGen
46
1
0
08 Oct 2024
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Ghada Sokar
J. Obando-Ceron
Rameswar Panda
Hugo Larochelle
Pablo Samuel Castro
MoE
142
2
0
02 Oct 2024
AVID: Adapting Video Diffusion Models to World Models
AVID: Adapting Video Diffusion Models to World Models
Marc Rigter
Tarun Gupta
Agrin Hilmkil
Chao Ma
VGen
19
4
0
01 Oct 2024
Verti-Selector: Automatic Curriculum Learning for Wheeled Mobility on
  Vertically Challenging Terrain
Verti-Selector: Automatic Curriculum Learning for Wheeled Mobility on Vertically Challenging Terrain
Tong Xu
Chenhui Pan
Xuesu Xiao
29
2
0
26 Sep 2024
Exploring Semantic Clustering in Deep Reinforcement Learning for Video
  Games
Exploring Semantic Clustering in Deep Reinforcement Learning for Video Games
Liang Zhang
Justin Lieffers
A. Pyarelal
24
0
0
25 Sep 2024
Representing Positional Information in Generative World Models for
  Object Manipulation
Representing Positional Information in Generative World Models for Object Manipulation
Stefano Ferraro
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Sai Rajeswar
LM&Ro
OCL
43
0
0
18 Sep 2024
What makes math problems hard for reinforcement learning: a case study
What makes math problems hard for reinforcement learning: a case study
Ali Shehper
A. Medina-Mardones
Lucas Fagan
Angus Gruen
Piotr Kucharski
Sergei Gukov
Piotr Kucharski
Zhenghan Wang
Sergei Gukov
32
3
0
27 Aug 2024
Domain Adaptation for Offline Reinforcement Learning with Limited Samples
Domain Adaptation for Offline Reinforcement Learning with Limited Samples
Weiqin Chen
Sandipan Mishra
Santiago Paternain
OffRL
46
2
0
22 Aug 2024
D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning
D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning
Rafael Rafailov
Kyle Hatch
Anikait Singh
Laura Smith
Aviral Kumar
...
Victor Kolev
Philip J. Ball
Jiajun Wu
Chelsea Finn
Sergey Levine
OffRL
34
3
0
15 Aug 2024
Model-Based Transfer Learning for Contextual Reinforcement Learning
Model-Based Transfer Learning for Contextual Reinforcement Learning
Jung-Hoon Cho
Vindula Jayawardana
Sirui Li
Cathy Wu
OffRL
50
0
0
08 Aug 2024
KnowPC: Knowledge-Driven Programmatic Reinforcement Learning for
  Zero-shot Coordination
KnowPC: Knowledge-Driven Programmatic Reinforcement Learning for Zero-shot Coordination
Yin Gu
Qi Liu
Zhi Li
Kai Zhang
31
0
0
08 Aug 2024
Dataset Distillation for Offline Reinforcement Learning
Dataset Distillation for Offline Reinforcement Learning
Jonathan Light
Yuanzhe Liu
Ziniu Hu
DD
35
2
0
29 Jul 2024
Maximum Entropy On-Policy Actor-Critic via Entropy Advantage Estimation
Maximum Entropy On-Policy Actor-Critic via Entropy Advantage Estimation
Jean Seong Bjorn Choe
Jong-Kook Kim
46
2
0
25 Jul 2024
Domain Adaptation of Visual Policies with a Single Demonstration
Domain Adaptation of Visual Policies with a Single Demonstration
Weiyao Wang
Gregory D. Hager
38
0
0
23 Jul 2024
Proximal Policy Distillation
Proximal Policy Distillation
Giacomo Spigler
OffRL
28
1
0
21 Jul 2024
Hyp2Nav: Hyperbolic Planning and Curiosity for Crowd Navigation
Hyp2Nav: Hyperbolic Planning and Curiosity for Crowd Navigation
Guido Maria DÁmely di Melendugno
Alessandro Flaborea
Pascal Mettes
Fabio Galasso
46
0
0
18 Jul 2024
Structural Design Through Reinforcement Learning
Structural Design Through Reinforcement Learning
Thomas Rochefort-Beaudoin
Aurelian Vadean
Niels Aage
S. Achiche
AI4CE
29
0
0
10 Jul 2024
123456
Next