ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.04175
  4. Cited By
Distral: Robust Multitask Reinforcement Learning

Distral: Robust Multitask Reinforcement Learning

13 July 2017
Yee Whye Teh
V. Bapst
Wojciech M. Czarnecki
John Quan
J. Kirkpatrick
R. Hadsell
N. Heess
Razvan Pascanu
ArXivPDFHTML

Papers citing "Distral: Robust Multitask Reinforcement Learning"

50 / 319 papers shown
Title
How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning
How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning
Max Weltevrede
Moritz A. Zanger
M. Spaan
Wendelin Bohmer
OffRL
FedML
10
0
0
22 May 2025
R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO
R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO
Huanjin Yao
Qixiang Yin
Jingyi Zhang
Min Yang
Yibo Wang
...
Fei Su
Li Shen
Minghui Qiu
Dacheng Tao
Jiaxing Huang
LRM
5
0
0
22 May 2025
Multi-CALF: A Policy Combination Approach with Statistical Guarantees
Multi-CALF: A Policy Combination Approach with Statistical Guarantees
Georgiy Malaniya
Anton Bolychev
Grigory Yaremenko
Anastasia Krasnaya
Pavel Osinenko
22
0
0
18 May 2025
Neural Multivariate Regression: Qualitative Insights from the Unconstrained Feature Model
Neural Multivariate Regression: Qualitative Insights from the Unconstrained Feature Model
George Andriopoulos
Soyuj Jung Basnet
Juan Guevara
Li Guo
Keith Ross
40
0
0
14 May 2025
Combining Bayesian Inference and Reinforcement Learning for Agent Decision Making: A Review
Combining Bayesian Inference and Reinforcement Learning for Agent Decision Making: A Review
Chengmin Zhou
Ville Kyrki
Pasi Fränti
Laura Ruotsalainen
BDL
AI4CE
47
0
0
12 May 2025
Harmonia: A Multi-Agent Reinforcement Learning Approach to Data Placement and Migration in Hybrid Storage Systems
Harmonia: A Multi-Agent Reinforcement Learning Approach to Data Placement and Migration in Hybrid Storage Systems
Rakesh Nadig
Vamanan Arulchelvan
Rahul Bera
Taha Shahroodi
Gagandeep Singh
Mohammad Sadrosadati
Jisung Park
O. Mutlu
Onur Mutlu
68
0
0
26 Mar 2025
Residual Policy Gradient: A Reward View of KL-regularized Objective
Pengcheng Wang
Xinghao Zhu
Yuxin Chen
Chenfeng Xu
Masayoshi Tomizuka
Chenran Li
45
0
0
14 Mar 2025
Towards Understanding the Benefit of Multitask Representation Learning in Decision Process
Rui Lu
Yang Yue
Andrew Zhao
S. Du
Gao Huang
OffRL
59
1
0
01 Mar 2025
Dynamically Local-Enhancement Planner for Large-Scale Autonomous Driving
Dynamically Local-Enhancement Planner for Large-Scale Autonomous Driving
Nanshan Deng
Weitao Zhou
Bo Zhang
Junze Wen
Kun Jiang
Zhong Cao
Ke Wang
41
0
0
28 Feb 2025
UNIDOOR: A Universal Framework for Action-Level Backdoor Attacks in Deep Reinforcement Learning
Oubo Ma
L. Du
Yang Dai
Chunyi Zhou
Qingming Li
Yuwen Pu
Shouling Ji
48
0
0
28 Jan 2025
RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning
RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning
Charles Xu
Qiyang Li
Jianlan Luo
Sergey Levine
OffRL
98
6
0
13 Dec 2024
GSL-PCD: Improving Generalist-Specialist Learning with Point Cloud
  Feature-based Task Partitioning
GSL-PCD: Improving Generalist-Specialist Learning with Point Cloud Feature-based Task Partitioning
Xiu Yuan
41
0
0
11 Nov 2024
Active Fine-Tuning of Generalist Policies
Active Fine-Tuning of Generalist Policies
Marco Bagatella
Jonas Hübotter
Georg Martius
Andreas Krause
39
0
0
07 Oct 2024
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
Ruohong Liu
Yuxin Pan
Linjie Xu
Lei Song
Jiang Bian
Pengcheng You
Yize Chen
48
1
0
03 Oct 2024
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining
Jie Cheng
Ruixi Qiao
Gang Xiong
Binhua Li
Yingwei Ma
Binhua Li
Yongbin Li
Yisheng Lv
OffRL
OnRL
LM&Ro
50
3
0
01 Oct 2024
Skills Regularized Task Decomposition for Multi-task Offline
  Reinforcement Learning
Skills Regularized Task Decomposition for Multi-task Offline Reinforcement Learning
Minjong Yoo
Sangwoo Cho
Honguk Woo
OffRL
45
10
0
28 Aug 2024
How to Solve Contextual Goal-Oriented Problems with Offline Datasets?
How to Solve Contextual Goal-Oriented Problems with Offline Datasets?
Ying Fan
Jingling Li
Adith Swaminathan
Aditya Modi
Ching-An Cheng
OffRL
72
0
0
14 Aug 2024
Model-Based Transfer Learning for Contextual Reinforcement Learning
Model-Based Transfer Learning for Contextual Reinforcement Learning
Jung-Hoon Cho
Vindula Jayawardana
Sirui Li
Cathy Wu
OffRL
60
0
0
08 Aug 2024
AdapMTL: Adaptive Pruning Framework for Multitask Learning Model
AdapMTL: Adaptive Pruning Framework for Multitask Learning Model
Mingcan Xiang
Steven Jiaxun Tang
Qizheng Yang
Hui Guan
Tongping Liu
VLM
46
0
0
07 Aug 2024
Language-Conditioned Offline RL for Multi-Robot Navigation
Language-Conditioned Offline RL for Multi-Robot Navigation
Steven D. Morad
Ajay Shankar
J. Blumenkamp
Amanda Prorok
LM&Ro
OffRL
48
6
0
29 Jul 2024
I Know How: Combining Prior Policies to Solve New Tasks
I Know How: Combining Prior Policies to Solve New Tasks
Malio Li
Elia Piccoli
Vincenzo Lomonaco
Davide Bacciu
CLL
41
0
0
14 Jun 2024
The Power of Active Multi-Task Learning in Reinforcement Learning from Human Feedback
The Power of Active Multi-Task Learning in Reinforcement Learning from Human Feedback
Ruitao Chen
Liwei Wang
75
1
0
18 May 2024
A Mixture-of-Experts Approach to Few-Shot Task Transfer in Open-Ended
  Text Worlds
A Mixture-of-Experts Approach to Few-Shot Task Transfer in Open-Ended Text Worlds
Christopher Cui
Xiangyu Peng
Mark O. Riedl
LLMAG
OffRL
MoE
41
1
0
09 May 2024
Shared learning of powertrain control policies for vehicle fleets
Shared learning of powertrain control policies for vehicle fleets
Lindsey Kerbel
B. Ayalew
Andrej Ivanco
33
0
0
27 Apr 2024
Distilling Privileged Information for Dubins Traveling Salesman Problems
  with Neighborhoods
Distilling Privileged Information for Dubins Traveling Salesman Problems with Neighborhoods
M. Shin
Su-Jeong Park
Seung-Keol Ryu
Heeyeon Kim
Han-Lim Choi
66
0
0
25 Apr 2024
Towards Multi-Morphology Controllers with Diversity and Knowledge
  Distillation
Towards Multi-Morphology Controllers with Diversity and Knowledge Distillation
Alican Mertan
Nick Cheney
36
0
0
22 Apr 2024
Efficient Multi-Task Reinforcement Learning via Task-Specific Action
  Correction
Efficient Multi-Task Reinforcement Learning via Task-Specific Action Correction
Jinyuan Feng
Min Chen
Zhiqiang Pu
Tenghai Qiu
Jianqiang Yi
32
2
0
09 Apr 2024
PhysReaction: Physically Plausible Real-Time Humanoid Reaction Synthesis
  via Forward Dynamics Guided 4D Imitation
PhysReaction: Physically Plausible Real-Time Humanoid Reaction Synthesis via Forward Dynamics Guided 4D Imitation
Yunze Liu
Changxi Chen
Chenjing Ding
Li Yi
39
6
0
01 Apr 2024
Reset & Distill: A Recipe for Overcoming Negative Transfer in Continual
  Reinforcement Learning
Reset & Distill: A Recipe for Overcoming Negative Transfer in Continual Reinforcement Learning
Hongjoon Ahn
Jinu Hyeon
Youngmin Oh
Bosun Hwang
Taesup Moon
CLL
OnRL
39
2
0
08 Mar 2024
Bidirectional Progressive Neural Networks with Episodic Return Progress
  for Emergent Task Sequencing and Robotic Skill Transfer
Bidirectional Progressive Neural Networks with Episodic Return Progress for Emergent Task Sequencing and Robotic Skill Transfer
S. E. Ada
Hanne Say
Emre Ugur
Erhan Öztop
41
1
0
06 Mar 2024
Enhancing Reinforcement Learning Agents with Local Guides
Enhancing Reinforcement Learning Agents with Local Guides
Paul Daoudi
Bogdan Robu
Christophe Prieur
Ludovic Dos Santos
M. Barlier
OnRL
31
3
0
21 Feb 2024
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
Quentin Gallouedec
E. Beeching
Clément Romac
Emmanuel Dellandrea
37
11
0
15 Feb 2024
Robust Analysis of Multi-Task Learning Efficiency: New Benchmarks on
  Light-Weighed Backbones and Effective Measurement of Multi-Task Learning
  Challenges by Feature Disentanglement
Robust Analysis of Multi-Task Learning Efficiency: New Benchmarks on Light-Weighed Backbones and Effective Measurement of Multi-Task Learning Challenges by Feature Disentanglement
Dayou Mao
Yuhao Chen
Yifan Wu
Maximilian Gilles
Alexander Wong
AAML
41
0
0
05 Feb 2024
Sharing Knowledge in Multi-Task Deep Reinforcement Learning
Sharing Knowledge in Multi-Task Deep Reinforcement Learning
Carlo DÉramo
Davide Tateo
Andrea Bonarini
Marcello Restelli
Jan Peters
59
125
0
17 Jan 2024
Not All Tasks Are Equally Difficult: Multi-Task Deep Reinforcement
  Learning with Dynamic Depth Routing
Not All Tasks Are Equally Difficult: Multi-Task Deep Reinforcement Learning with Dynamic Depth Routing
Jinmin He
Kai Li
Yifan Zang
Haobo Fu
Qiang Fu
Junliang Xing
Jian Cheng
MoE
41
5
0
22 Dec 2023
Prediction and Control in Continual Reinforcement Learning
Prediction and Control in Continual Reinforcement Learning
N. Anand
Doina Precup
OffRL
CLL
32
11
0
18 Dec 2023
Mastering Stacking of Diverse Shapes with Large-Scale Iterative
  Reinforcement Learning on Real Robots
Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots
Thomas Lampe
A. Abdolmaleki
Sarah Bechtle
Sandy H. Huang
Jost Tobias Springenberg
...
Markus Wulfmeier
Jingwei Zhang
Francesco Nori
N. Heess
Martin Riedmiller
OffRL
40
9
0
18 Dec 2023
Decoupling Meta-Reinforcement Learning with Gaussian Task Contexts and
  Skills
Decoupling Meta-Reinforcement Learning with Gaussian Task Contexts and Skills
Hongcai He
Anjie Zhu
Shuang Liang
Feiyu Chen
Jie Shao
OffRL
54
4
0
11 Dec 2023
Wireless Powered Metaverse: Joint Task Scheduling and Trajectory Design
  for Multi-Devices and Multi-UAVs
Wireless Powered Metaverse: Joint Task Scheduling and Trajectory Design for Multi-Devices and Multi-UAVs
Xiaojie Wang
Jiameng Li
Zhaolong Ning
Qingyang Song
Lei Guo
Abbas Jamalipour
18
18
0
28 Nov 2023
Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts
Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts
Ahmed Hendawy
Jan Peters
Carlo DÉramo
MoE
33
15
0
19 Nov 2023
Augmenting Unsupervised Reinforcement Learning with Self-Reference
Augmenting Unsupervised Reinforcement Learning with Self-Reference
Andrew Zhao
Erle Zhu
Rui Lu
Matthieu Lin
Yong-Jin Liu
Gao Huang
SSL
39
1
0
16 Nov 2023
Contrastive Modules with Temporal Attention for Multi-Task Reinforcement
  Learning
Contrastive Modules with Temporal Attention for Multi-Task Reinforcement Learning
Siming Lan
Rui Zhang
Qi Yi
Jiaming Guo
Shaohui Peng
...
Zidong Du
Xingui Hu
Xishan Zhang
Ling Li
Yunji Chen
24
8
0
02 Nov 2023
Combining Behaviors with the Successor Features Keyboard
Combining Behaviors with the Successor Features Keyboard
Wilka Carvalho
Andre Saraiva
Angelos Filos
Andrew Kyle Lampinen
Loic Matthey
Richard L. Lewis
Honglak Lee
Satinder Singh
Danilo Jimenez Rezende
Daniel Zoran
21
3
0
24 Oct 2023
Discovering Fatigued Movements for Virtual Character Animation
Discovering Fatigued Movements for Virtual Character Animation
N. Cheema
Rui Xu
Nam Hee Kim
Perttu Hämäläinen
Vladislav Golyanik
Marc Habermann
Christian Theobalt
Philipp Slusallek
34
4
0
12 Oct 2023
PolyTask: Learning Unified Policies through Behavior Distillation
PolyTask: Learning Unified Policies through Behavior Distillation
Siddhant Haldar
Lerrel Pinto
33
7
0
12 Oct 2023
Confronting Reward Model Overoptimization with Constrained RLHF
Confronting Reward Model Overoptimization with Constrained RLHF
Ted Moskovitz
Aaditya K. Singh
DJ Strouse
T. Sandholm
Ruslan Salakhutdinov
Anca D. Dragan
Stephen Marcus McAleer
50
48
0
06 Oct 2023
HarmonyDream: Task Harmonization Inside World Models
HarmonyDream: Task Harmonization Inside World Models
Haoyu Ma
Jialong Wu
Ningya Feng
Chenjun Xiao
Dong Li
Jianye Hao
Jianmin Wang
Mingsheng Long
41
7
0
30 Sep 2023
Distill Knowledge in Multi-task Reinforcement Learning with
  Optimal-Transport Regularization
Distill Knowledge in Multi-task Reinforcement Learning with Optimal-Transport Regularization
Bang Giang Le
Viet-Cuong Ta
OT
39
1
0
27 Sep 2023
Continual Robot Learning using Self-Supervised Task Inference
Continual Robot Learning using Self-Supervised Task Inference
Muhammad Burhan Hafez
Stefan Wermter
CLL
SSL
18
6
0
10 Sep 2023
IOB: Integrating Optimization Transfer and Behavior Transfer for
  Multi-Policy Reuse
IOB: Integrating Optimization Transfer and Behavior Transfer for Multi-Policy Reuse
Siyuan Li
Haoyang Li
Jin Zhang
Zhen Wang
Peng Liu
Chongjie Zhang
OffRL
33
1
0
14 Aug 2023
1234567
Next