Distral: Robust Multitask Reinforcement Learning


13 July 2017
Yee Whye Teh
V. Bapst
Wojciech M. Czarnecki
John Quan
J. Kirkpatrick
R. Hadsell
N. Heess
Razvan Pascanu

Papers citing "Distral: Robust Multitask Reinforcement Learning"

50 / 319 papers shown
Perception-Prediction-Reaction Agents for Deep Reinforcement Learning
Adam Stooke
Valentin Dalibard
Siddhant M. Jayakumar
Wojciech M. Czarnecki
Max Jaderberg
22
1
0
26 Jun 2020
Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch
Michael Wan
Tanmay Gangwani
Jian-wei Peng
36
19
0
12 Jun 2020
Transient Non-Stationarity and Generalisation in Deep Reinforcement Learning
Maximilian Igl
Gregory Farquhar
Jelena Luketina
Wendelin Boehmer
Shimon Whiteson
27
85
0
10 Jun 2020
Dual Policy Distillation
Kwei-Herng Lai
Daochen Zha
Yuening Li
Xia Hu
OffRL
20
9
0
07 Jun 2020
Language Conditioned Imitation Learning over Unstructured Data
Corey Lynch
P. Sermanet
LM&Ro
40
243
0
15 May 2020
A Distributional View on Multi-Objective Policy Optimization
A. Abdolmaleki
Sandy H. Huang
Leonard Hasenclever
Michael Neunert
H. F. Song
Martina Zambelli
M. Martins
N. Heess
R. Hadsell
Martin Riedmiller
26
74
0
15 May 2020
Learning Adaptive Exploration Strategies in Dynamic Environments Through Informed Policy Regularization
Pierre-Alexandre Kamienny
Matteo Pirotta
A. Lazaric
Thibault Lavril
Nicolas Usunier
Ludovic Denoyer
14
19
0
06 May 2020
Evolutionary Stochastic Policy Distillation
Hao Sun
Xinyu Pan
Bo Dai
Dahua Lin
Bolei Zhou
32
1
0
27 Apr 2020
Zero-Shot Compositional Policy Learning via Language Grounding
Tianshi Cao
Jingkang Wang
Yining Zhang
S. Manivasagam
LM&Ro
34
1
0
15 Apr 2020
Multi-Task Reinforcement Learning with Soft Modularization
Ruihan Yang
Huazhe Xu
Yi Wu
Xiaolong Wang
27
177
0
30 Mar 2020
Invariant Causal Prediction for Block MDPs
Amy Zhang
Clare Lyle
Shagun Sodhani
Angelos Filos
Marta Z. Kwiatkowska
Joelle Pineau
Y. Gal
Doina Precup
OffRL
AI4CE
OOD
37
139
0
12 Mar 2020
Learning Discrete State Abstractions With Deep Variational Inference
Ondrej Biza
Robert Platt
Jan-Willem van de Meent
Lawson L. S. Wong
BDL
18
12
0
09 Mar 2020
Environment-agnostic Multitask Learning for Natural Language Grounded Navigation
Xinze Wang
Vihan Jain
Eugene Ie
William Yang Wang
Zornitsa Kozareva
Sujith Ravi
LM&Ro
43
63
0
01 Mar 2020
Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement
Benjamin Eysenbach
Xinyang Geng
Sergey Levine
Ruslan Salakhutdinov
OffRL
18
86
0
25 Feb 2020
Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning
Noah Y. Siegel
Jost Tobias Springenberg
Felix Berkenkamp
A. Abdolmaleki
Michael Neunert
Thomas Lampe
Roland Hafner
Nicolas Heess
Martin Riedmiller
OffRL
22
282
0
19 Feb 2020
Self-Distillation Amplifies Regularization in Hilbert Space
H. Mobahi
Mehrdad Farajtabar
Peter L. Bartlett
40
229
0
13 Feb 2020
Learning State Abstractions for Transfer in Continuous Control
Kavosh Asadi
David Abel
Michael L. Littman
OffRL
30
7
0
08 Feb 2020
Periodic Intra-Ensemble Knowledge Distillation for Reinforcement Learning
Zhang-Wei Hong
P. Nagarajan
Guilherme J. Maeda
OffRL
23
4
0
01 Feb 2020
Gradient Surgery for Multi-Task Learning
Tianhe Yu
Saurabh Kumar
Abhishek Gupta
Sergey Levine
Karol Hausman
Chelsea Finn
41
1,175
0
19 Jan 2020
Population-Guided Parallel Policy Search for Reinforcement Learning
Whiyoung Jung
Giseung Park
Y. Sung
OffRL
24
38
0
09 Jan 2020
Joint Goal and Strategy Inference across Heterogeneous Demonstrators via Reward Network Distillation
Letian Chen
Rohan R. Paleja
Muyleng Ghuy
Matthew C. Gombolay
27
38
0
02 Jan 2020
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
43
188
0
23 Dec 2019
Zero-shot generalization using cascaded system-representations
A. Malik
OffRL
22
2
0
11 Dec 2019
12-in-1: Multi-Task Vision and Language Representation Learning
Jiasen Lu
Vedanuj Goswami
Marcus Rohrbach
Devi Parikh
Stefan Lee
VLM
ObjD
40
476
0
05 Dec 2019
Merging Deterministic Policy Gradient Estimations with Varied Bias-Variance Tradeoff for Effective Deep Reinforcement Learning
Gang Chen
25
4
0
24 Nov 2019
Collaborative Graph Walk for Semi-supervised Multi-Label Node Classification
Uchenna Akujuobi
Yufei Han
Qiannan Zhang
Xiangliang Zhang
25
16
0
22 Oct 2019
A Neural Entity Coreference Resolution Review
Nikolaos Stylianou
I. Vlahavas
26
38
0
21 Oct 2019
Regularization Matters in Policy Optimization
Zhuang Liu
Xuanlin Li
Bingyi Kang
Trevor Darrell
OffRL
28
33
0
21 Oct 2019
Multi-View Reinforcement Learning
Minne Li
Lisheng Wu
Haitham Bou-Ammar
Jun Wang
21
26
0
18 Oct 2019
Actor Critic with Differentially Private Critic
Jonathan Lebensold
William L. Hamilton
Borja Balle
Doina Precup
OffRL
25
9
0
14 Oct 2019
Ctrl-Z: Recovering from Instability in Reinforcement Learning
Vibhavari Dasagi
Jake Bruce
T. Peynot
Jurgen Leitner
25
10
0
09 Oct 2019
Recurrent Independent Mechanisms
Anirudh Goyal
Alex Lamb
Jordan Hoffmann
Shagun Sodhani
Sergey Levine
Yoshua Bengio
Bernhard Schölkopf
42
334
0
24 Sep 2019
Robust Opponent Modeling via Adversarial Ensemble Reinforcement Learning in Asymmetric Imperfect-Information Games
Macheng Shen
Jonathan P. How
AAML
29
19
0
18 Sep 2019
Rewarding Coreference Resolvers for Being Consistent with World Knowledge
Rahul Aralikatte
Heather Lent
Ana Valeria González
Daniel Hershcovich
Chen Qiu
Anders Sandholm
Michael Ringaard
Anders Søgaard
27
16
0
05 Sep 2019
Learning Action-Transferable Policy with Action Embedding
Yu Chen
Yingfeng Chen
Zhipeng Hu
Tianpei Yang
Changjie Fan
Yang Yu
Jianye Hao
24
0
0
05 Sep 2019
Universal Policies to Learn Them All
Hassam Sheikh
Ladislau Bölöni
OffRL
21
1
0
24 Aug 2019
Learning to Explore in Motion and Interaction Tasks
Miroslav Bogdanovic
Ludovic Righetti
9
1
0
10 Aug 2019
IR-VIC: Unsupervised Discovery of Sub-goals for Transfer in RL
Nirbhay Modhe
Prithvijit Chattopadhyay
Mohit Sharma
Abhishek Das
Devi Parikh
Dhruv Batra
Ramakrishna Vedantam
9
1
0
24 Jul 2019
Self-Attentional Credit Assignment for Transfer in Reinforcement Learning
Johan Ferret
Raphaël Marinier
M. Geist
Olivier Pietquin
OffRL
29
6
0
18 Jul 2019
DisCoRL: Continual Reinforcement Learning via Policy Distillation
Kalifou René Traoré
Hugo Caselles-Dupré
Timothée Lesort
Te Sun
Guanghang Cai
Natalia Díaz Rodríguez
David Filliat
OffRL
32
60
0
11 Jul 2019
A Model-based Approach for Sample-efficient Multi-task Reinforcement Learning
Nicholas C. Landolfi
G. Thomas
Tengyu Ma
OffRL
8
19
0
11 Jul 2019
BAM! Born-Again Multi-Task Networks for Natural Language Understanding
Kevin Clark
Minh-Thang Luong
Urvashi Khandelwal
Christopher D. Manning
Quoc V. Le
30
228
0
10 Jul 2019
Attentive Multi-Task Deep Reinforcement Learning
Timo Bram
Gino Brunner
Oliver Richter
Roger Wattenhofer
CLL
25
18
0
05 Jul 2019
Compositional Transfer in Hierarchical Reinforcement Learning
Markus Wulfmeier
A. Abdolmaleki
Roland Hafner
Jost Tobias Springenberg
Michael Neunert
Tim Hertweck
Thomas Lampe
Noah Y. Siegel
N. Heess
Martin Riedmiller
30
27
0
26 Jun 2019
Disentangled Skill Embeddings for Reinforcement Learning
Janith C. Petangoda
Sergio Pascual-Diaz
Vincent Adam
Peter Vrancx
Jordi Grau-Moya
DRL
OffRL
29
15
0
21 Jun 2019
Better transfer learning with inferred successor maps
T. Madarász
25
21
0
18 Jun 2019
Robust Reinforcement Learning for Continuous Control with Model Misspecification
D. Mankowitz
Nir Levine
Rae Jeong
Yuanyuan Shi
Jackie Kay
A. Abdolmaleki
Jost Tobias Springenberg
Timothy A. Mann
Todd Hester
Martin Riedmiller
OOD
14
118
0
18 Jun 2019
Unified Semantic Parsing with Weak Supervision
Priyanka Agrawal
Parag Jain
Ayushi Dalmia
Abhishek Bansal
Ashish R. Mittal
Karthik Sankaranarayanan
36
10
0
12 Jun 2019
Continual Reinforcement Learning deployed in Real-life using Policy Distillation and Sim2Real Transfer
Kalifou René Traoré
Hugo Caselles-Dupré
Timothée Lesort
Te Sun
Natalia Díaz Rodríguez
David Filliat
CLL
OffRL
28
44
0
11 Jun 2019
Transfer Learning by Modeling a Distribution over Policies
Disha Shrivastava
Eeshan Gunesh Dhekane
Riashat Islam
OOD
OffRL
14
0
0
09 Jun 2019