ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Communities
  3. ...

Neighbor communities

0 / 0 papers shown
Title
Top Contributors
Name# Papers# Citations
Social Events
DateLocationEvent
  1. Home
  2. Communities
  3. OnRL

Online Reinforcement Learning

OnRL
More data

Online Reinforcement Learning involves learning policies through continuous interaction with the environment, adapting to changes in real-time.

Neighbor communities

51015

Featured Papers

0 / 0 papers shown
Title

All papers

50 / 431 papers shown
Title
Bootstrap Off-policy with World Model
Bootstrap Off-policy with World Model
Guojian Zhan
Likun Wang
Xiangteng Zhang
Jiaxin Gao
Masayoshi Tomizuka
Shengbo Eben Li
OffRLOnRL
104
0
0
01 Nov 2025
Fill in the Blanks: Accelerating Q-Learning with a Handful of Demonstrations in Sparse Reward Settings
Fill in the Blanks: Accelerating Q-Learning with a Handful of Demonstrations in Sparse Reward Settings
Seyed Mahdi Basiri Azad
Joschka Boedecker
OffRLOnRL
68
0
0
28 Oct 2025
Guardian: Decoupling Exploration from Safety in Reinforcement Learning
Guardian: Decoupling Exploration from Safety in Reinforcement Learning
Kaitong Cai
Jusheng Zhang
Jing Yang
Keze Wang
OffRLOnRL
84
0
0
26 Oct 2025
RLBoost: Harvesting Preemptible Resources for Cost-Efficient Reinforcement Learning on LLMs
RLBoost: Harvesting Preemptible Resources for Cost-Efficient Reinforcement Learning on LLMs
Yongji Wu
Xueshen Liu
Haizhong Zheng
Juncheng Gu
Beidi Chen
Z. Morley Mao
Arvind Krishnamurthy
Eric Liang
OffRLOnRL
40
0
0
22 Oct 2025
Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1
Human-Agent Collaborative Paper-to-Page Crafting for Under 0.10.10.1
Qianli Ma
Siyu Wang
Yilin Chen
Yinhao Tang
Yixiang Yang
Chang Guo
Bingjie Gao
Zhening Xing
Yanan Sun
Zhipeng Zhang
OnRL
36
0
0
22 Oct 2025
Adversarial Fine-tuning in Offline-to-Online Reinforcement Learning for Robust Robot Control
Adversarial Fine-tuning in Offline-to-Online Reinforcement Learning for Robust Robot Control
Shingo Ayabe
Hiroshi Kera
K. Kawamoto
AAMLOffRLOnRL
55
0
0
15 Oct 2025
Missing Data Multiple Imputation for Tabular Q-Learning in Online RL
Missing Data Multiple Imputation for Tabular Q-Learning in Online RL
Kyla Chasalow
Skyler Wu
Susan Murphy
OffRLOnRL
24
0
0
12 Oct 2025
RePro: Training Language Models to Faithfully Recycle the Web for Pretraining
RePro: Training Language Models to Faithfully Recycle the Web for Pretraining
Zichun Yu
Chenyan Xiong
OnRL
52
0
0
12 Oct 2025
PyCFRL: A Python library for counterfactually fair offline reinforcement learning via sequential data preprocessing
Jianhan Zhang
Jitao Wang
C. Shi
John D. Piette
Donglin Zeng
Zhenke Wu
OffRLFaMLOnRL
52
0
0
08 Oct 2025
Online Matching via Reinforcement Learning: An Expert Policy Orchestration Strategy
Online Matching via Reinforcement Learning: An Expert Policy Orchestration Strategy
Chiara Mignacco
Matthieu Jonckheere
Gilles Stoltz
OffRLOnRL
44
0
0
07 Oct 2025
HOFLON: Hybrid Offline Learning and Online Optimization for Process Start-Up and Grade-Transition Control
HOFLON: Hybrid Offline Learning and Online Optimization for Process Start-Up and Grade-Transition Control
Alex Durkin
Jasper Stolte
Mehmet Mercangöz
OffRLOnRL
59
0
0
04 Oct 2025
Adaptive Reinforcement Learning for Dynamic Configuration Allocation in Pre-Production Testing
Adaptive Reinforcement Learning for Dynamic Configuration Allocation in Pre-Production Testing
Yu Zhu
OffRLOnRL
52
0
0
02 Oct 2025
The Three Regimes of Offline-to-Online Reinforcement Learning
The Three Regimes of Offline-to-Online Reinforcement Learning
Lu Li
Tianwei Ni
Yihao Sun
Pierre-Luc Bacon
OffRLOnRL
44
0
0
01 Oct 2025
Integrating Offline Pre-Training with Online Fine-Tuning: A Reinforcement Learning Approach for Robot Social Navigation
Integrating Offline Pre-Training with Online Fine-Tuning: A Reinforcement Learning Approach for Robot Social Navigation
Run Su
Hao Fu
Shuai Zhou
Yingao Fu
OffRLOnRL
16
0
0
01 Oct 2025
Fine-tuning Behavioral Cloning Policies with Preference-Based Reinforcement Learning
Fine-tuning Behavioral Cloning Policies with Preference-Based Reinforcement Learning
Maël Macuglia
Paul Friedrich
Giorgia Ramponi
OffRLOnRL
30
0
0
30 Sep 2025
Robust Policy Expansion for Offline-to-Online RL under Diverse Data Corruption
Robust Policy Expansion for Offline-to-Online RL under Diverse Data Corruption
Longxiang He
Deheng Ye
Junbo Tan
Xueqian Wang
Li Shen
OnRL
67
0
0
29 Sep 2025
LRPO: Enhancing Blind Face Restoration through Online Reinforcement Learning
LRPO: Enhancing Blind Face Restoration through Online Reinforcement Learning
Bin Wu
Yahui Liu
Chi Zhang
Yao-Min Zhao
Wei Wang
CVBMOffRLCLLOnRL
44
0
0
27 Sep 2025
Adaptive Policy Backbone via Shared Network
Adaptive Policy Backbone via Shared Network
Bumgeun Park
Donghwan Lee
OffRLOnRL
52
0
0
26 Sep 2025
Advancing Metallic Surface Defect Detection via Anomaly-Guided Pretraining on a Large Industrial Dataset
Advancing Metallic Surface Defect Detection via Anomaly-Guided Pretraining on a Large Industrial Dataset
Chuni Liu
Hongjie Li
Jiaqi Du
Yangyang Hou
Qian Sun
Lei Jin
Ke Xu
OnRLAI4CE
58
0
0
23 Sep 2025
Ratatouille: Imitation Learning Ingredients for Real-world Social Robot Navigation
Ratatouille: Imitation Learning Ingredients for Real-world Social Robot Navigation
James R. Han
Mithun Vanniasinghe
Hshmat Sahak
Nicholas Rhinehart
Timothy D. Barfoot
OffRLOnRL
56
0
0
21 Sep 2025
Reconnecting Citizens to Politics via Blockchain - Starting the Debate
Reconnecting Citizens to Politics via Blockchain - Starting the Debate
Uwe Serdült
OnRL
0
2
0
18 Sep 2025
UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning
UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning
Zhengxi Lu
Jiabo Ye
Fei Tang
Yongliang Shen
Haiyang Xu
...
Weiming Lu
Ming Yan
Fei Huang
Jun Xiao
Yueting Zhuang
OffRLOnRL
121
1
0
15 Sep 2025
Generalizing Beyond Suboptimality: Offline Reinforcement Learning Learns Effective Scheduling through Random Data
Generalizing Beyond Suboptimality: Offline Reinforcement Learning Learns Effective Scheduling through Random Data
Jesse van Remmerden
Zaharah Bukhsh
Yingqian Zhang
OffRLOnRL
60
0
0
12 Sep 2025
AWorld: Orchestrating the Training Recipe for Agentic AI
AWorld: Orchestrating the Training Recipe for Agentic AI
Chengyue Yu
Siyuan Lu
Chenyi Zhuang
Dong Wang
Qintong Wu
...
Aohui Xue
Y. Wang
Jinjie Gu
David Tsai
Tao Lin
OnRL
84
4
0
28 Aug 2025
Robot Trains Robot: Automatic Real-World Policy Adaptation and Learning for Humanoids
Robot Trains Robot: Automatic Real-World Policy Adaptation and Learning for Humanoids
Kaizhe Hu
Haochen Shi
Yao He
Weizhuo Wang
Changliu Liu
Shuran Song
OnRL
108
0
0
17 Aug 2025
Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data
Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data
Ahmet H. Güzel
Ilija Bogunovic
Jack Parker-Holder
OffRLOnRL
72
0
0
17 Aug 2025
Value Function Initialization for Knowledge Transfer and Jump-start in Deep Reinforcement Learning
Value Function Initialization for Knowledge Transfer and Jump-start in Deep Reinforcement Learning
Soumia Mehimeh
OffRLOnRL
46
0
0
12 Aug 2025
Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation
Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation
Xiao Huang
Xu Liu
Enze Zhang
T. Yu
Shuai Li
OffRLOnRL
60
0
0
09 Aug 2025
Safe Deployment of Offline Reinforcement Learning via Input Convex Action Correction
Safe Deployment of Offline Reinforcement Learning via Input Convex Action Correction
Alex Durkin
Jasper Stolte
Matthew Jones
Raghuraman Pitchumani
Bei Li
Christian Michler
Mehmet Mercangöz
OffRLOnRL
67
1
0
30 Jul 2025
Online Pre-Training for Offline-to-Online Reinforcement Learning
Online Pre-Training for Offline-to-Online Reinforcement Learning
Yongjae Shin
Jeonghye Kim
Whiyoung Jung
Sunghoon Hong
Deunsol Yoon
...
Geonhyeong Kim
Jongseong Chae
Youngchul Sung
Kanghoon Lee
Woohyung Lim
OffRLOnRL
31
0
0
11 Jul 2025
Reinforcement Learning with Action Chunking
Reinforcement Learning with Action Chunking
Qiyang Li
Zhiyuan Zhou
Sergey Levine
OffRLOnRL
82
12
0
10 Jul 2025
MobileGUI-RL: Advancing Mobile GUI Agent through Reinforcement Learning in Online Environment
MobileGUI-RL: Advancing Mobile GUI Agent through Reinforcement Learning in Online Environment
Yucheng Shi
Wenhao Yu
Zaitang Li
Yonglin Wang
Hongming Zhang
Ninghao Liu
Haitao Mi
Dong Yu
OffRLOnRL
58
9
0
08 Jul 2025
Accelerated Online Reinforcement Learning using Auxiliary Start State Distributions
Accelerated Online Reinforcement Learning using Auxiliary Start State Distributions
Aman Mehra
Alexandre Capone
Jeff Schneider
OffRLOnRL
40
0
0
07 Jul 2025
SimLauncher: Launching Sample-Efficient Real-world Robotic Reinforcement Learning via Simulation Pre-training
SimLauncher: Launching Sample-Efficient Real-world Robotic Reinforcement Learning via Simulation Pre-training
Mingdong Wu
Lehong Wu
Yizhuo Wu
Weiyao Huang
Hongwei Fan
...
Jinzhou Li
Jiahe Ying
Long Yang
Yuanpei Chen
Hao Dong
OnRL
38
0
0
06 Jul 2025
PNAct: Crafting Backdoor Attacks in Safe Reinforcement Learning
PNAct: Crafting Backdoor Attacks in Safe Reinforcement LearningInternational Joint Conference on Artificial Intelligence (IJCAI), 2025
Weiran Guo
Guanjun Liu
Ziyuan Zhou
Ling Wang
AAMLOffRLOnRL
36
0
0
01 Jul 2025
Reliability-Adjusted Prioritized Experience Replay
Reliability-Adjusted Prioritized Experience Replay
Leonard S. Pleiss
Tobias Sutter
Maximilian Schiffer
OnRL
36
0
0
23 Jun 2025
QPPG: Quantum-Preconditioned Policy Gradient for Link Adaptation in Rayleigh Fading Channels
QPPG: Quantum-Preconditioned Policy Gradient for Link Adaptation in Rayleigh Fading Channels
Oluwaseyi Giwa
Muhammad Ahmed Mohsin
Folarin Jubril Adesola
Muhammad Ali Jamshed
OnRL
90
0
0
18 Jun 2025
The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning
The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning
Jiashun Liu
J. Obando-Ceron
Pablo Samuel Castro
Aaron Courville
L. Pan
OffRLOnRL
47
2
0
16 Jun 2025
When Forgetting Triggers Backdoors: A Clean Unlearning Attack
When Forgetting Triggers Backdoors: A Clean Unlearning Attack
Marco Arazzi
Antonino Nocera
Vinod Puthuvath
AAMLMUOnRL
116
1
0
14 Jun 2025
Visual Pre-Training on Unlabeled Images using Reinforcement Learning
Visual Pre-Training on Unlabeled Images using Reinforcement Learning
Dibya Ghosh
Sergey Levine
SSLOffRLOnRLVLM
42
0
0
13 Jun 2025
Reusing Trajectories in Policy Gradients Enables Fast Convergence
Reusing Trajectories in Policy Gradients Enables Fast Convergence
Alessandro Montenegro
Federico Mansutti
Marco Mussi
Matteo Papini
Alberto Maria Metelli
OnRL
155
0
0
06 Jun 2025
Recycling the Web: A Method to Enhance Pre-training Data Quality and Quantity for Language Models
Recycling the Web: A Method to Enhance Pre-training Data Quality and Quantity for Language Models
Thao Nguyen
Yang Li
O. Yu. Golovneva
Luke Zettlemoyer
Sewoong Oh
Ludwig Schmidt
Xian Li
OnRL
260
9
0
05 Jun 2025
Adapting Offline Reinforcement Learning with Online Delays
Adapting Offline Reinforcement Learning with Online Delays
S. Zhan
Qingyuan Wu
Frank Yang
Xiangyu Shi
Chao-Wei Huang
Qi Zhu
OffRLOnRL
34
0
0
30 May 2025
SOReL and TOReL: Two Methods for Fully Offline Reinforcement Learning
SOReL and TOReL: Two Methods for Fully Offline Reinforcement Learning
Mattie Fellows
Clarisse Wibault
Uljad Berdica
Johannes Forkel
Jakob Foerster
Michael A. Osborne
OffRLOnRL
167
0
0
28 May 2025
ProSpero: Active Learning for Robust Protein Design Beyond Wild-Type Neighborhoods
ProSpero: Active Learning for Robust Protein Design Beyond Wild-Type Neighborhoods
Michal Kmicikiewicz
Vincent Fortuin
Ewa Szczurek
OnRL
152
0
0
28 May 2025
A Provable Approach for End-to-End Safe Reinforcement Learning
A Provable Approach for End-to-End Safe Reinforcement Learning
Akifumi Wachi
Kohei Miyaguchi
Takumi Tanabe
Rei Sato
Youhei Akimoto
OffRLOnRL
24
1
0
28 May 2025
Pre-training for Recommendation Unlearning
Pre-training for Recommendation UnlearningAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Guoxuan Chen
Lianghao Xia
Chao Huang
MUOnRL
128
1
0
28 May 2025
Designing Pin-pression Gripper and Learning its Dexterous Grasping with Online In-hand Adjustment
Designing Pin-pression Gripper and Learning its Dexterous Grasping with Online In-hand AdjustmentACM Transactions on Graphics (TOG), 2025
Hewen Xiao
Xiuping Liu
Hang Zhao
Jian Liu
K. Xu
OnRL
306
0
0
25 May 2025
The Cell Must Go On: Agar.io for Continual Reinforcement Learning
The Cell Must Go On: Agar.io for Continual Reinforcement Learning
Mohamed A. Mohamed
Kateryna Nekhomiazh
Vedant Vyas
Marcos M. Jose
Andrew Patterson
Marlos C. Machado
CLLOnRL
22
0
0
23 May 2025
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning
Zhepei Wei
Wenlin Yao
Yao Liu
Weizhi Zhang
Qin Lu
...
Puyang Xu
Chao Zhang
Bing Yin
Hyokun Yun
Lihong Li
OffRLCLLOnRLLRM
239
43
0
22 May 2025
Loading #Papers per Month with "OnRL"
Past speakers
Name (-)
Top Contributors
Name (-)
Top Organizations at ResearchTrend.AI
Name (-)
Social Events
DateLocationEvent
No social events available