ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.09241
  4. Cited By
Boosting Offline Reinforcement Learning via Data Rebalancing

Boosting Offline Reinforcement Learning via Data Rebalancing

17 October 2022
Yang Yue
Bingyi Kang
Xiao Ma
Zhongwen Xu
Gao Huang
Shuicheng Yan
    OffRL
ArXivPDFHTML

Papers citing "Boosting Offline Reinforcement Learning via Data Rebalancing"

18 / 18 papers shown
Title
Fewer May Be Better: Enhancing Offline Reinforcement Learning with Reduced Dataset
Fewer May Be Better: Enhancing Offline Reinforcement Learning with Reduced Dataset
Yiqin Yang
Quanwei Wang
Chenghao Li
Hao Hu
Chengjie Wu
...
Dianyu Zhong
Ziyou Zhang
Qianchuan Zhao
Chongjie Zhang
Xu Bo
OffRL
49
0
0
26 Feb 2025
Solving Continual Offline RL through Selective Weights Activation on
  Aligned Spaces
Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces
Jifeng Hu
Sili Huang
Li Shen
Zhejian Yang
Shengchao Hu
Shisong Tang
H. Chen
Yi-Ju Chang
Dacheng Tao
Lichao Sun
OffRL
39
0
0
21 Oct 2024
Exploring Text-to-Motion Generation with Human Preference
Exploring Text-to-Motion Generation with Human Preference
Jenny Sheng
Matthieu Lin
Andrew Zhao
Kevin Pruvost
Yu-Hui Wen
Yangguang Li
Gao Huang
Yong-Jin Liu
VGen
39
1
0
15 Apr 2024
A2PO: Towards Effective Offline Reinforcement Learning from an
  Advantage-aware Perspective
A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective
Yunpeng Qing
Shunyu Liu
Jingyuan Cong
Kaixuan Chen
Yihe Zhou
Mingli Song
OffRL
37
1
0
12 Mar 2024
Trajectory-wise Iterative Reinforcement Learning Framework for
  Auto-bidding
Trajectory-wise Iterative Reinforcement Learning Framework for Auto-bidding
Haoming Li
Yusen Huo
Shuai Dou
Zhenzhe Zheng
Zhilin Zhang
Chuan Yu
Jian Xu
Fan Wu
OffRL
26
3
0
23 Feb 2024
Contrastive Diffuser: Planning Towards High Return States via
  Contrastive Learning
Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning
Yixiang Shan
Zhengbang Zhu
Ting Long
Qifan Liang
Yi-Ju Chang
Weinan Zhang
Liang Yin
OffRL
42
1
0
05 Feb 2024
Towards Efficient Exact Optimization of Language Model Alignment
Towards Efficient Exact Optimization of Language Model Alignment
Haozhe Ji
Cheng Lu
Yilin Niu
Pei Ke
Hongning Wang
Jun Zhu
Jie Tang
Minlie Huang
58
11
0
01 Feb 2024
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online
  Reinforcement Learning
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning
Shenzhi Wang
Qisen Yang
Jiawei Gao
Matthieu Lin
Hao Chen
Liwei Wu
Ning Jia
Shiji Song
Gao Huang
OffRL
32
13
0
27 Oct 2023
Score Regularized Policy Optimization through Diffusion Behavior
Score Regularized Policy Optimization through Diffusion Behavior
Huayu Chen
Cheng Lu
Zhengyi Wang
Hang Su
Jun Zhu
28
20
0
11 Oct 2023
Understanding, Predicting and Better Resolving Q-Value Divergence in
  Offline-RL
Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL
Yang Yue
Rui Lu
Bingyi Kang
Shiji Song
Gao Huang
OffRL
35
16
0
06 Oct 2023
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning
Jinyi Liu
Y. Ma
Jianye Hao
Yujing Hu
Yan Zheng
Tangjie Lv
Changjie Fan
OffRL
44
2
0
27 Jun 2023
Decoupled Prioritized Resampling for Offline RL
Decoupled Prioritized Resampling for Offline RL
Yang Yue
Bingyi Kang
Xiao Ma
Qisen Yang
Gao Huang
S. Song
Shuicheng Yan
OffRL
27
0
0
08 Jun 2023
Boosting Offline Reinforcement Learning with Action Preference Query
Boosting Offline Reinforcement Learning with Action Preference Query
Qisen Yang
Shenzhi Wang
Matthieu Lin
S. Song
Gao Huang
OffRL
13
9
0
06 Jun 2023
Efficient Diffusion Policies for Offline Reinforcement Learning
Efficient Diffusion Policies for Offline Reinforcement Learning
Bingyi Kang
Xiao Ma
Chao Du
Tianyu Pang
Shuicheng Yan
OffRL
34
62
0
31 May 2023
Federated Ensemble-Directed Offline Reinforcement Learning
Federated Ensemble-Directed Offline Reinforcement Learning
Desik Rengarajan
N. Ragothaman
D. Kalathil
S. Shakkottai
OffRL
32
1
0
04 May 2023
Using Offline Data to Speed-up Reinforcement Learning in Procedurally
  Generated Environments
Using Offline Data to Speed-up Reinforcement Learning in Procedurally Generated Environments
Alain Andres
Lukas Schafer
Esther Villar-Rodriguez
Stefano V. Albrecht
Javier Del Ser
OffRL
OnRL
31
2
0
18 Apr 2023
Model-based trajectory stitching for improved behavioural cloning and
  its applications
Model-based trajectory stitching for improved behavioural cloning and its applications
Charles A. Hepburn
Giovanni Montana
OffRL
18
5
0
08 Dec 2022
Offline Reinforcement Learning with Implicit Q-Learning
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
214
843
0
12 Oct 2021
1