Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.17032
Cited By
Gymnasium: A Standard Interface for Reinforcement Learning Environments
24 July 2024
Mark Towers
Ariel Kwiatkowski
Jordan Terry
John U. Balis
Gianluca De Cola
Tristan Deleu
Manuel Goulão
Andreas Kallinteris
Markus Krimmel
KG Arjun
Rodrigo Perez-Vicente
Andrea Pierré
Sander Schulhoff
Jun Jet Tai
Hannah Tan
Omar G. Younis
AuLLM
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Gymnasium: A Standard Interface for Reinforcement Learning Environments"
50 / 97 papers shown
Title
Augmenting Online RL with Offline Data is All You Need: A Unified Hybrid RL Algorithm Design and Analysis
Ruiquan Huang
Donghao Li
Chengshuai Shi
Cong Shen
Jing Yang
OffRL
2
0
0
19 May 2025
Multi-CALF: A Policy Combination Approach with Statistical Guarantees
Georgiy Malaniya
Anton Bolychev
Grigory Yaremenko
Anastasia Krasnaya
Pavel Osinenko
7
0
0
18 May 2025
Of Mice and Machines: A Comparison of Learning Between Real World Mice and RL Agents
Shuo Han
German Espinosa
Junda Huang
D. Dombeck
Malcolm A. MacIver
Bradly C. Stadie
2
0
0
18 May 2025
Bench-NPIN: Benchmarking Non-prehensile Interactive Navigation
Ninghan Zhong
Steven Caro
Avraiem Iskandar
Megnath Ramesh
Stephen L. Smith
6
0
0
17 May 2025
SAINT: Attention-Based Modeling of Sub-Action Dependencies in Multi-Action Policies
Matthew Landers
Taylor W. Killian
Thomas Hartvigsen
Afsaneh Doryab
7
0
0
17 May 2025
Counterfactual Behavior Cloning: Offline Imitation Learning from Imperfect Human Demonstrations
Shahabedin Sagheb
Dylan P. Losey
OffRL
22
0
0
16 May 2025
Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions
Kehan Long
Jorge Cortés
Nikolay Atanasov
12
0
0
16 May 2025
An agentic system with reinforcement-learned subsystem improvements for parsing form-like documents
Ayesha Amjad
Saurav Sthapit
Tahir Qasim Syed
2
0
0
16 May 2025
Exploration by Random Distribution Distillation
Zhirui Fang
Kai Yang
Jian Tao
Jiafei Lyu
Lusong Li
Li Shen
Xiu Li
12
0
0
16 May 2025
Meta-World+: An Improved, Standardized, RL Benchmark
Reginald McLean
Evangelos Chatzaroulas
Luc McCutcheon
Frank Röder
Tianhe Yu
...
Ryan Julian
Jordan Terry
Isaac Woungang
Nariman Farsad
Pablo Samuel Castro
OffRL
14
0
0
16 May 2025
Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLM
Thang Duong
Minglai Yang
Chicheng Zhang
OffRL
19
0
0
16 May 2025
General Dynamic Goal Recognition
Osher Elhadad
Reuth Mirsky
AI4CE
26
0
0
14 May 2025
Neural Multivariate Regression: Qualitative Insights from the Unconstrained Feature Model
George Andriopoulos
Soyuj Jung Basnet
Juan Guevara
Li Guo
Keith Ross
30
0
0
14 May 2025
Continual Reinforcement Learning via Autoencoder-Driven Task and New Environment Recognition
Zeki Doruk Erden
Donia Gasmi
Boi Faltings
CLL
31
0
0
13 May 2025
Zero-Shot Sim-to-Real Reinforcement Learning for Fruit Harvesting
Emlyn Williams
Athanasios Polydoros
OffRL
34
0
0
13 May 2025
A Two-Timescale Primal-Dual Framework for Reinforcement Learning via Online Dual Variable Guidance
Axel Friedrich Wolter
Tobias Sutter
OffRL
37
0
0
07 May 2025
Flow Models for Unbounded and Geometry-Aware Distributional Reinforcement Learning
Simo Alami C.
Rim Kaddah
Jesse Read
Marie-Paule Cani
51
0
0
07 May 2025
Aerodynamic and structural airfoil shape optimisation via Transfer Learning-enhanced Deep Reinforcement Learning
David Ramos
Lucas Lacasa
E. Valero
G. Rubio
AI4CE
27
0
0
05 May 2025
Model Tensor Planning
An T. Le
K. Nguyen
Minh Nhat Vu
João Carvalho
Jan Peters
35
0
0
02 May 2025
Deformable Cargo Transport in Microgravity with Astrobee
Daniel Morton
Rika Antonova
Brian Coltin
Marco Pavone
Jeannette Bohg
OT
53
0
0
02 May 2025
Neuroevolution of Self-Attention Over Proto-Objects
Rafael C. Pinto
Anderson R. Tavares
OCL
176
0
0
30 Apr 2025
Cognitive maps are generative programs
Marta Kryven
Cole Wyeth
Aidan Curtis
Kevin Ellis
46
0
0
29 Apr 2025
Return Capping: Sample-Efficient CVaR Policy Gradient Optimisation
Harry Mead
Clarissa Costen
Bruno Lacerda
Nick Hawes
24
0
0
29 Apr 2025
Rulebook: bringing co-routines to reinforcement learning environments
Massimo Fioravanti
Samuele Pasini
Giovanni Agosta
33
0
0
28 Apr 2025
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
Haoran Geng
Feishi Wang
Songlin Wei
Y. Li
Bangjun Wang
...
Hao Dong
Siyuan Huang
Yue Wang
Jitendra Malik
Pieter Abbeel
85
4
0
26 Apr 2025
CaRL: Learning Scalable Planning Policies with Simple Rewards
Bernhard Jaeger
D. Dauner
Jens Beißwenger
Simon Gerstenecker
Kashyap Chitta
Andreas Geiger
60
0
0
24 Apr 2025
HERB: Human-augmented Efficient Reinforcement learning for Bin-packing
Gojko Perovic
Nuno Ferreira Duarte
Atabak Dehban
Gonçalo Teixeira
Egidio Falotico
J. Santos-Victor
OffRL
33
0
0
23 Apr 2025
JEPA for RL: Investigating Joint-Embedding Predictive Architectures for Reinforcement Learning
Tristan Kenneweg
Philip Kenneweg
Barbara Hammer
AI4TS
34
0
0
23 Apr 2025
Zero-shot Sim-to-Real Transfer for Reinforcement Learning-based Visual Servoing of Soft Continuum Arms
Hsin-Jung Yang
Mahsa Khosravi
Benjamin Walt
Girish Krishnan
Soumik Sarkar
15
0
0
23 Apr 2025
Quantum-Enhanced Reinforcement Learning for Power Grid Security Assessment
Benjamin M. Peter
Mert Korkali
29
0
0
19 Apr 2025
DoomArena: A framework for Testing AI Agents Against Evolving Security Threats
Léo Boisvert
Mihir Bansal
Chandra Kiran Reddy Evuru
Gabriel Huang
Abhay Puri
...
Quentin Cappart
Jason Stanley
Alexandre Lacoste
Alexandre Drouin
Krishnamurthy Dvijotham
35
0
0
18 Apr 2025
A Graph-Based Reinforcement Learning Approach with Frontier Potential Based Reward for Safe Cluttered Environment Exploration
Gabriele Calzolari
Vidya Sumathy
Christoforos Kanellakis
G. Nikolakopoulos
189
0
0
16 Apr 2025
Control of Rayleigh-Bénard Convection: Effectiveness of Reinforcement Learning in the Turbulent Regime
Thorben Markmann
Michiel Straat
Sebastian Peitz
Barbara Hammer
AI4CE
38
0
0
16 Apr 2025
Co-optimizing Physical Reconfiguration Parameters and Controllers for an Origami-inspired Reconfigurable Manipulator
Zhengzhang Chen
L. Chen
Hao Zhang
Jun Zhao
21
0
0
14 Apr 2025
Adaptive Sensor Steering Strategy Using Deep Reinforcement Learning for Dynamic Data Acquisition in Digital Twins
Collins O. Ogbodo
Timothy J. Rogers
Mattia Dal Borgo
D. Wagg
36
0
0
14 Apr 2025
IPA-CHILDES & G2P+: Feature-Rich Resources for Cross-Lingual Phonology and Phonemic Language Modeling
Zébulon Goriely
P. Buttery
23
0
0
03 Apr 2025
Nuclear Microreactor Control with Deep Reinforcement Learning
Leo Tunkle
Kamal Abdulraheem
Linyu Lin
M. Radaideh
36
0
0
31 Mar 2025
An Organizationally-Oriented Approach to Enhancing Explainability and Control in Multi-Agent Reinforcement Learning
Julien Soulé
Jean-Paul Jamont
Michel Occello
Louis-Marie Traonouez
Paul Théron
45
0
0
30 Mar 2025
debug-gym: A Text-Based Environment for Interactive Debugging
Xingdi Yuan
Morgane M Moss
Charbel El Feghali
Chinmay Singh
Darya Moldavskaya
...
Lucas Caccia
Matheus Pereira
Minseon Kim
Alessandro Sordoni
Marc-Alexandre Côté
LLMAG
76
2
0
27 Mar 2025
Graph-Enhanced Model-Free Reinforcement Learning Agents for Efficient Power Grid Topological Control
Eloy Anguiano Batanero
Ángela Fernández
Álvaro Barbero
72
0
0
26 Mar 2025
KEA: Keeping Exploration Alive by Proactively Coordinating Exploration Strategies
Shih-Min Yang
Martin Magnusson
J. A. Stork
Todor Stoyanov
49
0
0
23 Mar 2025
Nonparametric Bellman Mappings for Value Iteration in Distributed Reinforcement Learning
Yuki Akiyama
Konstantinos Slavakis
34
0
0
20 Mar 2025
Towards Better Sample Efficiency in Multi-Agent Reinforcement Learning via Exploration
Amir Baghi
Jens Sjölund
Joakim Bergdahl
Linus Gisslén
Alessandro Sestini
58
0
0
17 Mar 2025
Low-pass sampling in Model Predictive Path Integral Control
Piotr Kicki
44
0
0
13 Mar 2025
Taking Notes Brings Focus? Towards Multi-Turn Multimodal Dialogue Learning
Jiazheng Liu
Sipeng Zheng
Börje F. Karlsson
Zongqing Lu
34
0
0
10 Mar 2025
Multi-Fidelity Policy Gradient Algorithms
Xinjie Liu
Cyrus Neary
Kushagra Gupta
Christian Ellis
Ufuk Topcu
David Fridovich-Keil
OffRL
197
0
0
07 Mar 2025
Guidelines for Applying RL and MARL in Cybersecurity Applications
V. Mavroudis
Gregory Palmer
Sara Farmer
Kez Smithson Whitehead
David Foster
Adam Price
Ian Miles
Alberto Caron
Stephen Pasteris
AAML
49
0
0
06 Mar 2025
Seldonian Reinforcement Learning for Ad Hoc Teamwork
Edoardo Zorzi
A. Castellini
Leonidas Bakopoulos
Georgios Chalkiadakis
Alessandro Farinelli
OffRL
57
0
0
05 Mar 2025
Closing the Intent-to-Behavior Gap via Fulfillment Priority Logic
B. Mabsout
Abdelrahman AbdelGawad
R. Mancuso
43
1
0
04 Mar 2025
Stone Soup Multi-Target Tracking Feature Extraction For Autonomous Search And Track In Deep Reinforcement Learning Environment
Jan-Hendrik Ewers
Joe Gibbs
David Anderson
68
0
0
03 Mar 2025
1
2
Next