ResearchTrend.AI

Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables

19 March 2019
Kate Rakelly
Aurick Zhou
Deirdre Quillen
Chelsea Finn
Sergey Levine
    OffRL
arXiv: 1903.08254

Papers citing "Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables"

50 / 123 papers shown
Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments
Yun Qu
Luu Anh Tuan
Yixiu Mao
Yiqin Lv
Xiangyang Ji
TTA
90
0
0
27 Apr 2025
Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision
Shilin Zhang
Zican Hu
Wenhao Wu
Xinyi Xie
Jianxiang Tang
Chunlin Chen
Daoyi Dong
Yu Cheng
Zhenhong Sun
Zhi Wang
OffRL
145
0
0
21 Apr 2025
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications
Siyuan Mu
Sen Lin
MoE
141
2
0
10 Mar 2025
Yes, Q-learning Helps Offline In-Context RL
Denis Tarasov
Alexander Nikulin
Ilya Zisman
Albina Klepach
Andrei Polubarov
Nikita Lyubaykin
Alexander Derevyagin
Igor Kiselev
Vladislav Kurenkov
OffRL
OnRL
181
0
0
24 Feb 2025
Task Aware Dreamer for Task Generalization in Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Hang Su
Songming Liu
Dong Yan
Jun Zhu
72
3
0
17 Feb 2025
TEA: Trajectory Encoding Augmentation for Robust and Transferable Policies in Offline Reinforcement Learning
Batıkan Bora Ormancı
Phillip Swazinna
Steffen Udluft
Thomas Runkler
OffRL
71
0
0
28 Jan 2025
State Combinatorial Generalization In Decision Making With Conditional Diffusion Models
Xintong Duan
Yutong He
Fahim Tajwar
Wen-Tse Chen
Ruslan Salakhutdinov
Jeff Schneider
OffRL
AI4CE
101
0
0
22 Jan 2025
A Tensor Low-Rank Approximation for Value Functions in Multi-Task Reinforcement Learning
Sergio Rozada
Santiago Paternain
J. Bazerque
Antonio G. Marques
69
0
0
17 Jan 2025
Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting Diversity
Robby Costales
Stefanos Nikolaidis
AI4CE
31
0
0
07 Nov 2024
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement
Zhi Wang
Li Zhang
Wenhao Wu
Yuanheng Zhu
Dongbin Zhao
C. L. Philip Chen
OffRL
39
6
0
15 Oct 2024
Context-Based Meta Reinforcement Learning for Robust and Adaptable Peg-in-Hole Assembly Tasks
Ahmed Shokry
Walid Gomaa
Tobias Zaenker
Murad Dawood
Shady A. Maged
Mohammed I. Awad
Maren Bennewitz
OffRL
39
0
0
24 Sep 2024
OCCAM: Online Continuous Controller Adaptation with Meta-Learned Models
Hersh Sanghvi
Spencer Folk
Camillo J Taylor
45
3
0
25 Jun 2024
Residual Learning and Context Encoding for Adaptive Offline-to-Online Reinforcement Learning
Mohammadreza Nakhaei
Aidan Scannell
Joni Pajarinen
OffRL
49
1
0
12 Jun 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert Nowak
77
2
0
07 Jun 2024
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate
Fan Luo
Zuolin Tu
Zefang Huang
Yang Yu
OffRL
36
0
0
24 May 2024
Multi Task Inverse Reinforcement Learning for Common Sense Reward
Neta Glazer
Aviv Navon
Aviv Shamsian
Ethan Fetaya
27
0
0
17 Feb 2024
Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
Xinyu Zhang
Wenjie Qiu
Yi-Chen Li
Lei Yuan
Chengxing Jia
Zongzhang Zhang
Yang Yu
OffRL
35
1
0
17 Feb 2024
Hierarchical Transformers are Efficient Meta-Reinforcement Learners
Gresa Shala
André Biedenkapp
Josif Grabocka
OffRL
37
4
0
09 Feb 2024
Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning
Lanqing Li
Hai Zhang
Xinyu Zhang
Shatong Zhu
Junqiao Zhao
Pheng-Ann Heng
OffRL
43
7
0
04 Feb 2024
Zero-Shot Reinforcement Learning via Function Encoders
Tyler Ingebrand
Amy Zhang
Ufuk Topcu
OffRL
43
2
0
30 Jan 2024
On Task-Relevant Loss Functions in Meta-Reinforcement Learning and Online LQR
Jaeuk Shin
Giho Kim
Howon Lee
Joonho Han
Insoon Yang
OffRL
31
1
0
09 Dec 2023
Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining
Licong Lin
Yu Bai
Song Mei
OffRL
32
43
0
12 Oct 2023
Amortized Network Intervention to Steer the Excitatory Point Processes
Zitao Song
Wendi Ren
Sourav Garg
21
1
0
06 Oct 2023
Recurrent Hypernetworks are Surprisingly Strong in Meta-RL
Jacob Beck
Risto Vuorio
Zheng Xiong
Shimon Whiteson
43
9
0
26 Sep 2023
Foundational Policy Acquisition via Multitask Learning for Motor Skill Generation
Satoshi Yamamori
Jun Morimoto
26
0
0
31 Aug 2023
IOB: Integrating Optimization Transfer and Behavior Transfer for Multi-Policy Reuse
Siyuan Li
Haoyang Li
Jin Zhang
Zhen Wang
Peng Liu
Chongjie Zhang
OffRL
24
1
0
14 Aug 2023
Transformers in Reinforcement Learning: A Survey
Pranav Agarwal
A. Rahman
P. St-Charles
Simon J. D. Prince
Samira Ebrahimi Kahou
OffRL
24
18
0
12 Jul 2023
SPRINT: Scalable Policy Pre-Training via Language Instruction Relabeling
Jesse Zhang
Karl Pertsch
Jiahui Zhang
Joseph J. Lim
LM&Ro
36
17
0
20 Jun 2023
Offline Meta Reinforcement Learning with In-Distribution Online Adaptation
Jianhao Wang
Jin Zhang
Haozhe Jiang
Junyu Zhang
Liwei Wang
Chongjie Zhang
OffRL
26
9
0
31 May 2023
On the Value of Myopic Behavior in Policy Reuse
Kang Xu
Chenjia Bai
Shuang Qiu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
29
1
0
28 May 2023
Inverse Dynamics Pretraining Learns Good Representations for Multitask Imitation
David Brandfonbrener
Ofir Nachum
Joan Bruna
AI4CE
26
21
0
26 May 2023
Meta-Reinforcement Learning via Exploratory Task Clustering
Zhendong Chu
Hongning Wang
OffRL
21
5
0
15 Feb 2023
Learning How to Infer Partial MDPs for In-Context Adaptation and Exploration
Chentian Jiang
Nan Rosemary Ke
Hado van Hasselt
16
3
0
08 Feb 2023
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Pouya Hamadanian
Arash Nasr-Esfahany
Malte Schwarzkopf
Siddartha Sen
MohammadIman Alizadeh
CLL
OffRL
50
0
0
04 Feb 2023
Train Hard, Fight Easy: Robust Meta Reinforcement Learning
Ido Greenberg
Shie Mannor
Gal Chechik
E. Meirom
OffRL
OOD
21
6
0
26 Jan 2023
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
37
122
0
19 Jan 2023
Human-Timescale Adaptation in an Open-Ended Task Space
Adaptive Agent Team
Jakob Bauer
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
...
Jakub Sygnowski
K. Tuyls
Sarah York
Alexander Zacherl
Lei Zhang
LM&Ro
OffRL
AI4CE
LRM
38
108
0
18 Jan 2023
Cognitive Level-$k$ Meta-Learning for Safe and Pedestrian-Aware Autonomous Driving
Haozhe Lei
Quanyan Zhu
20
0
0
17 Dec 2022
CACTI: A Framework for Scalable Multi-Task Multi-Scene Visual Imitation Learning
Zhao Mandi
Homanga Bharadhwaj
Vincent Moens
Shuran Song
Aravind Rajeswaran
Vikash Kumar
LM&Ro
28
68
0
12 Dec 2022
Multi-Task Imitation Learning for Linear Dynamical Systems
Thomas T. Zhang
Katie Kang
Bruce D. Lee
Claire Tomlin
Sergey Levine
Stephen Tu
Nikolai Matni
38
23
0
01 Dec 2022
Hypernetworks for Zero-shot Transfer in Reinforcement Learning
S. Rezaei-Shoshtari
Charlotte Morissette
F. Hogan
Gregory Dudek
D. Meger
OffRL
17
14
0
28 Nov 2022
Multi-Environment Pretraining Enables Transfer to Action Limited Datasets
David Venuto
Sherry Yang
Pieter Abbeel
Doina Precup
Igor Mordatch
Ofir Nachum
OffRL
25
5
0
23 Nov 2022
Giving Feedback on Interactive Student Programs with Meta-Exploration
E. Liu
Moritz Stephan
Allen Nie
Chris Piech
Emma Brunskill
Chelsea Finn
AI4Ed
30
8
0
16 Nov 2022
Contextual Transformer for Offline Meta Reinforcement Learning
Runji Lin
Ye Li
Xidong Feng
Zhaowei Zhang
Xian Hong Wu Fung
Haifeng Zhang
Jun Wang
Yali Du
Yaodong Yang
OffRL
23
6
0
15 Nov 2022
Uncertainty-based Meta-Reinforcement Learning for Robust Radar Tracking
Julius Ott
Lorenzo Servadei
Gianfranco Mauro
Thomas Stadelmayer
Avik Santra
Robert Wille
OOD
UQCV
39
3
0
26 Oct 2022
Model-based Lifelong Reinforcement Learning with Bayesian Exploration
Haotian Fu
Shangqun Yu
Michael Littman
George Konidaris
BDL
OffRL
16
12
0
20 Oct 2022
Hypernetworks in Meta-Reinforcement Learning
Jacob Beck
Matthew Jackson
Risto Vuorio
Shimon Whiteson
OffRL
27
30
0
20 Oct 2022
Oracles & Followers: Stackelberg Equilibria in Deep Multi-Agent Reinforcement Learning
M. Gerstgrasser
David C. Parkes
OffRL
26
19
0
19 Oct 2022
Causal Inference for De-biasing Motion Estimation from Robotic Observational Data
Junhong Xu
Kai-Li Yin
Jason M. Gregory
Lantao Liu
CML
21
3
0
17 Oct 2022
Neurosymbolic Motion and Task Planning for Linear Temporal Logic Tasks
Xiaowu Sun
Yasser Shoukry
48
11
0
11 Oct 2022