ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.12628
  4. Cited By
Recurrent Off-policy Baselines for Memory-based Continuous Control

Recurrent Off-policy Baselines for Memory-based Continuous Control

25 October 2021
Zhihan Yang
Hai V. Nguyen
    CLL
    OffRL
ArXivPDFHTML

Papers citing "Recurrent Off-policy Baselines for Memory-based Continuous Control"

16 / 16 papers shown
Title
AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers
Jake Grigsby
Justin Sasek
Samyak Parajuli
Daniel Adebi
Amy Zhang
Yuke Zhu
OffRL
26
3
0
17 Nov 2024
Domains as Objectives: Domain-Uncertainty-Aware Policy Optimization
  through Explicit Multi-Domain Convex Coverage Set Learning
Domains as Objectives: Domain-Uncertainty-Aware Policy Optimization through Explicit Multi-Domain Convex Coverage Set Learning
Wendyam Eric Lionel Ilboudo
Taisuke Kobayashi
Takamitsu Matsubara
30
0
0
07 Oct 2024
Equivariant Reinforcement Learning under Partial Observability
Equivariant Reinforcement Learning under Partial Observability
Hai Nguyen
Andrea Baisero
David M. Klee
Dian Wang
Robert Platt
Christopher Amato
42
14
0
26 Aug 2024
Residual Learning and Context Encoding for Adaptive Offline-to-Online
  Reinforcement Learning
Residual Learning and Context Encoding for Adaptive Offline-to-Online Reinforcement Learning
Mohammadreza Nakhaei
Aidan Scannell
Joni Pajarinen
OffRL
49
1
0
12 Jun 2024
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific
  Learning Rate
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate
Fan Luo
Zuolin Tu
Zefang Huang
Yang Yu
OffRL
36
0
0
24 May 2024
Intrinsic Rewards for Exploration without Harm from Observational Noise:
  A Simulation Study Based on the Free Energy Principle
Intrinsic Rewards for Exploration without Harm from Observational Noise: A Simulation Study Based on the Free Energy Principle
Theodore Jerome Tinker
Kenji Doya
Jun Tani
29
0
0
13 May 2024
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Jake Grigsby
Linxi Fan
Yuke Zhu
OffRL
LM&Ro
38
10
0
15 Oct 2023
Reinforcement Learning with Fast and Forgetful Memory
Reinforcement Learning with Fast and Forgetful Memory
Steven D. Morad
Ryan Kortvelesy
Stephan Liwicki
Amanda Prorok
OffRL
24
4
0
06 Oct 2023
PID-Inspired Inductive Biases for Deep Reinforcement Learning in
  Partially Observable Control Tasks
PID-Inspired Inductive Biases for Deep Reinforcement Learning in Partially Observable Control Tasks
I. Char
J. Schneider
26
4
0
12 Jul 2023
When Do Transformers Shine in RL? Decoupling Memory from Credit
  Assignment
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment
Tianwei Ni
Michel Ma
Benjamin Eysenbach
Pierre-Luc Bacon
OffRL
26
34
0
07 Jul 2023
Seq2Seq Imitation Learning for Tactile Feedback-based Manipulation
Seq2Seq Imitation Learning for Tactile Feedback-based Manipulation
Wenyan Yang
A. Angleraud
R. Pieters
Joni Pajarinen
Joni-Kristian Kämäräinen
32
6
0
05 Mar 2023
POPGym: Benchmarking Partially Observable Reinforcement Learning
POPGym: Benchmarking Partially Observable Reinforcement Learning
Steven D. Morad
Ryan Kortvelesy
Matteo Bettini
Stephan Liwicki
Amanda Prorok
OffRL
19
37
0
03 Mar 2023
Deep Transformer Q-Networks for Partially Observable Reinforcement
  Learning
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
Kevin Esslinger
Robert W. Platt
Chris Amato
OffRL
32
35
0
02 Jun 2022
Hierarchical Reinforcement Learning under Mixed Observability
Hierarchical Reinforcement Learning under Mixed Observability
Hai V. Nguyen
Zhihan Yang
Andrea Baisero
Xiao Ma
Robert W. Platt
Chris Amato
22
4
0
02 Apr 2022
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs
Tianwei Ni
Benjamin Eysenbach
Ruslan Salakhutdinov
15
103
0
11 Oct 2021
Belief-Grounded Networks for Accelerated Robot Learning under Partial
  Observability
Belief-Grounded Networks for Accelerated Robot Learning under Partial Observability
Hai V. Nguyen
Brett Daley
Xinchao Song
Chris Amato
Robert W. Platt
48
14
0
19 Oct 2020
1