Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL

9 February 2022
Rui Yang
Yiming Lu
Wenzhe Li
Hao Sun
Meng Fang
Yali Du
Xiu Li
Lei Han
Chongjie Zhang
    OffRL

Papers citing "Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL"

19 / 19 papers shown
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning
Hao Sun
M. Schaar
94
14
0
28 Jan 2025
OGBench: Benchmarking Offline Goal-Conditioned RL
Seohong Park
Kevin Frans
Benjamin Eysenbach
Sergey Levine
OffRL
50
8
0
26 Oct 2024
Zero-Shot Offline Imitation Learning via Optimal Transport
Thomas Rupf
Marco Bagatella
Nico Gürtler
Jonas Frey
Georg Martius
OffRL
133
0
0
11 Oct 2024
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
Shenao Zhang
Zhihan Liu
Boyi Liu
Yuhang Zhang
Yingxiang Yang
Y. Liu
Liyu Chen
Tao Sun
Ziyi Wang
98
3
0
10 Oct 2024
Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making
Vivek Myers
Chongyi Zheng
Anca Dragan
Sergey Levine
Benjamin Eysenbach
OffRL
45
7
0
24 Jun 2024
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs
Rui Yang
Ruomeng Ding
Yong Lin
Huan Zhang
Tong Zhang
44
43
0
14 Jun 2024
Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets
Haoran He
C. Chang
Huazhe Xu
Ling Pan
89
6
0
03 Jun 2024
Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL
Sungyoon Kim
Yunseon Choi
Daiki E. Matsunaga
Kee-Eung Kim
OffRL
43
6
0
11 Feb 2024
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Hao Sun
OffRL
34
21
0
09 Oct 2023
Distance Weighted Supervised Learning for Offline Interaction Data
Joey Hejna
Jensen Gao
Dorsa Sadigh
OffRL
36
12
0
26 Apr 2023
Goal-Conditioned Imitation Learning using Score-based Diffusion Policies
Moritz Reuss
M. Li
Xiaogang Jia
Rudolf Lioutikov
DiffM
36
156
0
05 Apr 2023
Swapped goal-conditioned offline reinforcement learning
Wenyan Yang
Huiling Wang
Dingding Cai
Joni Pajarinen
Joni-Kristian Kämäräinen
OffRL
OnRL
30
1
0
17 Feb 2023
Provably Efficient Offline Goal-Conditioned Reinforcement Learning with General Function Approximation and Single-Policy Concentrability
Hanlin Zhu
Amy Zhang
OffRL
18
2
0
07 Feb 2023
Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping
Lina Mezghani
Sainbayar Sukhbaatar
Piotr Bojanowski
A. Lazaric
Karteek Alahari
OffRL
41
18
0
05 Jan 2023
Learning Robotic Navigation from Experience: Principles, Methods, and Recent Results
Sergey Levine
Dhruv Shah
SSL
34
21
0
13 Dec 2022
From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data
Zichen Jeff Cui
Yibin Wang
Nur Muhammad (Mahi) Shafiullah
Lerrel Pinto
LM&Ro
VGen
OffRL
27
89
0
18 Oct 2022
A Policy-Guided Imitation Approach for Offline Reinforcement Learning
Haoran Xu
Li Jiang
Jianxiong Li
Xianyuan Zhan
OffRL
26
61
0
15 Oct 2022
What is Flagged in Uncertainty Quantification? Latent Density Models for Uncertainty Categorization
Hao Sun
B. V. Breugel
Jonathan Crabbé
Nabeel Seedat
M. Schaar
24
4
0
11 Jul 2022
Offline Reinforcement Learning with Reverse Model-based Imagination
Jianhao Wang
Wenzhe Li
Haozhe Jiang
Guangxiang Zhu
Siyuan Li
Chongjie Zhang
OffRL
101
59
0
01 Oct 2021