For SALE: State-Action Representation Learning for Deep Reinforcement Learning

4 June 2023
Scott Fujimoto
Wei-Di Chang
Edward James Smith
S. Gu
Doina Precup
David Meger
    OffRL

Papers citing "For SALE: State-Action Representation Learning for Deep Reinforcement Learning"

31 / 31 papers shown
Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning
Guozheng Ma
Lu Li
Zilin Wang
Li Shen
Pierre-Luc Bacon
Dacheng Tao
OffRL
27
0
0
20 Jun 2025
Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity
Samin Yeasar Arnob
Scott Fujimoto
Doina Precup
OffRL
47
0
0
20 Jun 2025
MOBODY: Model Based Off-Dynamics Offline Reinforcement Learning
Yihong Guo
Yu Yang
Pan Xu
Anqi Liu
OffRL
47
0
0
10 Jun 2025
Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners
Michal Nauman
Marek Cygan
Carmelo Sferrazza
Aviral Kumar
Pieter Abbeel
OffRL
96
0
0
29 May 2025
Universal Value-Function Uncertainties
Moritz A. Zanger
Max Weltevrede
Yaniv Oren
Pascal R. van der Vaart
Caroline Horsch
Wendelin Bohmer
M. Spaan
OffRL
76
0
0
27 May 2025
Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning
Dongsu Lee
Minhae Kwon
OffRL
93
0
0
19 May 2025
Multi-Goal Dexterous Hand Manipulation using Probabilistic Model-based Reinforcement Learning
Yingzhuo Jiang
Wenjun Huang
Rongdun Lin
Chenyang Miao
Tianfu Sun
Yunduan Cui
67
0
0
30 Apr 2025
Action Tokenizer Matters in In-Context Imitation Learning
An Vuong
M. Vu
Dong An
Ian Reid
121
1
0
03 Mar 2025
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hojoon Lee
Youngdo Lee
Takuma Seno
Donghu Kim
Peter Stone
Jaegul Choo
181
4
0
21 Feb 2025
Towards General-Purpose Model-Free Reinforcement Learning
Scott Fujimoto
P. D'Oro
Amy Zhang
Yuandong Tian
Michael Rabbat
OffRL
99
6
0
28 Jan 2025
SR-Reward: Taking The Path More Traveled
Seyed Mahdi Basiri Azad
Zahra Padar
Gabriel Kalweit
Joschka Boedecker
OffRL
179
0
0
04 Jan 2025
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
A. Jain
Harley Wiltzer
Jesse Farebrother
Irina Rish
Glen Berseth
Sanjiban Choudhury
141
2
0
11 Nov 2024
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning
Hojoon Lee
Dongyoon Hwang
Donghu Kim
Hyunseung Kim
Jun Jet Tai
K. Subramanian
Peter R. Wurman
Jaegul Choo
Peter Stone
Takuma Seno
OffRL
189
17
0
13 Oct 2024
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
135
5
0
11 Oct 2024
Neuroplastic Expansion in Deep Reinforcement Learning
Jiashun Liu
J. Obando-Ceron
Rameswar Panda
L. Pan
121
6
0
10 Oct 2024
Multi-State-Action Tokenisation in Decision Transformers for Multi-Discrete Action Spaces
Perusha Moodley
Pramod S. Kaushik
Dhillu Thambi
Mark Trovinger
Praveen Paruchuri
Xia Hong
Benjamin Rosman
143
0
0
01 Jul 2024
Diffusion Spectral Representation for Reinforcement Learning
Dmitry Shribak
Chen-Xiao Gao
Yitong Li
Chenjun Xiao
Bo Dai
DiffM
111
5
0
23 Jun 2024
iQRL -- Implicitly Quantized Representations for Sample-efficient Reinforcement Learning
Aidan Scannell
Kalle Kujanpää
Yi Zhao
Mohammadreza Nakhaei
Dieter Büchler
Joni Pajarinen
SSL
147
5
0
04 Jun 2024
Value Improved Actor Critic Algorithms
Yaniv Oren
Moritz A. Zanger
Pascal R. van der Vaart
M. Spaan
Wendelin Bohmer
OffRL
91
0
0
03 Jun 2024
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate
Fan Luo
Zuolin Tu
Zefang Huang
Yang Yu
OffRL
84
1
0
24 May 2024
Cross-Domain Policy Adaptation by Capturing Representation Mismatch
Jiafei Lyu
Chenjia Bai
Jingwen Yang
Zongqing Lu
Xiu Li
90
12
0
24 May 2024
Feasibility Consistent Representation Learning for Safe Reinforcement Learning
Zhepeng Cen
Yi-Fan Yao
Zuxin Liu
Ding Zhao
OffRL
91
3
0
20 May 2024
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline
Wenjia Meng
Qian Zheng
Long Yang
Yilong Yin
Gang Pan
OffRL
84
0
0
04 May 2024
Markov flow policy -- deep MC
Nitsan Soffair
Gilad Katz
58
0
0
01 May 2024
Simple Ingredients for Offline Reinforcement Learning
Edoardo Cetin
Andrea Tirinzoni
Matteo Pirotta
A. Lazaric
Yann Ollivier
Ahmed Touati
OffRL
106
2
0
19 Mar 2024
Conservative DDPG -- Pessimistic RL without Ensemble
Nitsan Soffair
Shie Mannor
OffRL
47
0
0
08 Mar 2024
SQT -- std $Q$-target
Nitsan Soffair
Dotan Di Castro
Orly Avner
Shie Mannor
OffRL
56
0
0
03 Feb 2024
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS
AI4CE
92
29
0
17 Jan 2024
Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a High Replay Ratio and Regularization
Takuya Hiraoka
OffRL
84
1
0
10 Dec 2023
Generalizable Imitation Learning Through Pre-Trained Representations
Wei-Di Chang
F. Hogan
David Meger
Gregory Dudek
108
1
0
15 Nov 2023
Zero-Shot Reinforcement Learning from Low Quality Data
Scott Jeen
Tom Bewley
Jonathan M. Cullen
OffRL
OnRL
90
4
0
26 Sep 2023