Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2308.07351
Cited By
IOB: Integrating Optimization Transfer and Behavior Transfer for Multi-Policy Reuse
14 August 2023
Siyuan Li
Haoyang Li
Jin Zhang
Zhen Wang
Peng Liu
Chongjie Zhang
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"IOB: Integrating Optimization Transfer and Behavior Transfer for Multi-Policy Reuse"
18 / 18 papers shown
Title
CUP: Critic-Guided Policy Reuse
Jin Zhang
Siyuan Li
Chongjie Zhang
81
8
0
15 Oct 2022
Continual World: A Robotic Benchmark For Continual Reinforcement Learning
Maciej Wołczyk
Michal Zajkac
Razvan Pascanu
Lukasz Kuciñski
Piotr Milo's
CLL
OffRL
65
98
0
23 May 2021
Towards Continual Reinforcement Learning: A Review and Perspectives
Khimya Khetarpal
Matthew D Riemer
Irina Rish
Doina Precup
CLL
OffRL
109
323
0
25 Dec 2020
Learning Cross-Domain Correspondence for Control with Dynamics Cycle-Consistency
Qiang Zhang
Tete Xiao
Alexei A. Efros
Lerrel Pinto
Xiaolong Wang
78
65
0
17 Dec 2020
Revisiting Rainbow: Promoting more Insightful and Inclusive Deep Reinforcement Learning Research
J. Obando-Ceron
Pablo Samuel Castro
OffRL
64
109
0
20 Nov 2020
MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning
Elise van der Pol
Daniel E. Worrall
H. V. Hoof
F. Oliehoek
Max Welling
BDL
AI4CE
75
162
0
30 Jun 2020
Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics
Arsenii Kuznetsov
Pavel Shvechikov
Alexander Grishin
Dmitry Vetrov
230
195
0
08 May 2020
Maxmin Q-learning: Controlling the Estimation Bias of Q-learning
Qingfeng Lan
Yangchen Pan
Alona Fyshe
Martha White
67
179
0
16 Feb 2020
Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards
Siyuan Li
Rui Wang
Minxue Tang
Chongjie Zhang
74
83
0
10 Oct 2019
Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement
André Barreto
Diana Borsa
John Quan
Tom Schaul
David Silver
Matteo Hessel
D. Mankowitz
Augustin Žídek
Rémi Munos
OffRL
107
164
0
30 Jan 2019
Soft Actor-Critic Algorithms and Applications
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
...
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
141
2,445
0
13 Dec 2018
Progress & Compress: A scalable framework for continual learning
Jonathan Richard Schwarz
Jelena Luketina
Wojciech M. Czarnecki
A. Grabska-Barwinska
Yee Whye Teh
Razvan Pascanu
R. Hadsell
CLL
125
889
0
16 May 2018
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
180
5,204
0
26 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
314
8,396
0
04 Jan 2018
An Optimal Online Method of Selecting Source Policies for Reinforcement Learning
Siyuan Li
Chongjie Zhang
OnRL
48
44
0
24 Sep 2017
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
825
11,937
0
09 Mar 2017
RL
2
^2
2
: Fast Reinforcement Learning via Slow Reinforcement Learning
Yan Duan
John Schulman
Xi Chen
Peter L. Bartlett
Ilya Sutskever
Pieter Abbeel
OffRL
99
1,027
0
09 Nov 2016
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
323
13,272
0
09 Sep 2015
1