ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.04411
  4. Cited By
Understanding, Predicting and Better Resolving Q-Value Divergence in
  Offline-RL

Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL

6 October 2023
Yang Yue
Rui Lu
Bingyi Kang
Shiji Song
Gao Huang
    OffRL
ArXivPDFHTML

Papers citing "Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL"

11 / 11 papers shown
Title
Offline-to-online Reinforcement Learning for Image-based Grasping with Scarce Demonstrations
Offline-to-online Reinforcement Learning for Image-based Grasping with Scarce Demonstrations
Bryan Chan
Anson Leung
James Bergstra
OffRL
OnRL
54
0
0
19 Oct 2024
Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
Huanqian Wang
Yang Yue
Rui Lu
Jingxin Shi
Andrew Zhao
Shenzhi Wang
Shiji Song
Gao Huang
LM&Ro
KELM
49
6
0
11 Jul 2024
Simplifying Deep Temporal Difference Learning
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
59
14
0
05 Jul 2024
Offline Reinforcement Learning via High-Fidelity Generative Behavior
  Modeling
Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling
Huayu Chen
Cheng Lu
Chengyang Ying
Hang Su
Jun Zhu
DiffM
OffRL
103
105
0
29 Sep 2022
Learning to Weight Samples for Dynamic Early-exiting Networks
Learning to Weight Samples for Dynamic Early-exiting Networks
Yizeng Han
Yifan Pu
Zihang Lai
Chaofei Wang
S. Song
Junfen Cao
Wenhui Huang
Chao Deng
Gao Huang
56
54
0
17 Sep 2022
Value-Consistent Representation Learning for Data-Efficient
  Reinforcement Learning
Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning
Yang Yue
Bingyi Kang
Zhongwen Xu
Gao Huang
Shuicheng Yan
OffRL
32
13
0
25 Jun 2022
Provable General Function Class Representation Learning in Multitask
  Bandits and MDPs
Provable General Function Class Representation Learning in Multitask Bandits and MDPs
Rui Lu
Andrew Zhao
S. Du
Gao Huang
OffRL
27
10
0
31 May 2022
Offline Reinforcement Learning with Implicit Q-Learning
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
214
839
0
12 Oct 2021
Uncertainty-Based Offline Reinforcement Learning with Diversified
  Q-Ensemble
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble
Gaon An
Seungyong Moon
Jang-Hyun Kim
Hyun Oh Song
OffRL
97
262
0
04 Oct 2021
COMBO: Conservative Offline Model-Based Policy Optimization
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
219
413
0
16 Feb 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on
  Open Problems
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
337
1,955
0
04 May 2020
1