ResearchTrend.AI
Multi-task Deep Reinforcement Learning with PopArt


12 September 2018
Matteo Hessel
Hubert Soyer
L. Espeholt
Wojciech M. Czarnecki
Simon Schmitt
H. V. Hasselt
arXiv:1809.04474

Papers citing "Multi-task Deep Reinforcement Learning with PopArt"

50 / 72 papers shown
Multi-agent cooperation through learning-aware policy gradients
Alexander Meulemans
Seijin Kobayashi
J. Oswald
Nino Scherrer
Eric Elmoznino
Blake A. Richards
Guillaume Lajoie
Blaise Agüera y Arcas
João Sacramento
56
0
0
24 Oct 2024
How to Solve Contextual Goal-Oriented Problems with Offline Datasets?
Ying Fan
Jingling Li
Adith Swaminathan
Aditya Modi
Ching-An Cheng
OffRL
72
0
0
14 Aug 2024
Massively Multiagent Minigames for Training Generalist Agents
Kyoung Whan Choe
Ryan Sullivan
Joseph Suárez
AI4CE
34
0
0
07 Jun 2024
Robust Analysis of Multi-Task Learning Efficiency: New Benchmarks on Light-Weighed Backbones and Effective Measurement of Multi-Task Learning Challenges by Feature Disentanglement
Dayou Mao
Yuhao Chen
Yifan Wu
Maximilian Gilles
Alexander Wong
AAML
41
0
0
05 Feb 2024
Sharing Knowledge in Multi-Task Deep Reinforcement Learning
Carlo D'Eramo
Davide Tateo
Andrea Bonarini
Marcello Restelli
Jan Peters
59
124
0
17 Jan 2024
Decentralized Multi-Agent Reinforcement Learning with Global State Prediction
Josh Bloom
Pranjal Paliwal
Apratim Mukherjee
Carlo Pinciroli
30
3
0
22 Jun 2023
Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas
Udari Madhushani
Kevin R. McKee
J. Agapiou
Joel Z Leibo
Richard Everett
Thomas W. Anthony
Edward Hughes
K. Tuyls
Edgar A. Duéñez-Guzmán
49
2
0
01 May 2023
Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition
Yash Chandak
S. Thakoor
Z. Guo
Yunhao Tang
Rémi Munos
Will Dabney
Diana Borsa
24
2
0
01 May 2023
Launchpad: Learning to Schedule Using Offline and Online RL Methods
V. Venkataswamy
J. E. Grigsby
A. Grimshaw
Yanjun Qi
OffRL
OnRL
24
1
0
01 Dec 2022
Multi-Task Imitation Learning for Linear Dynamical Systems
Thomas T. Zhang
Katie Kang
Bruce D. Lee
Claire Tomlin
Sergey Levine
Stephen Tu
Nikolai Matni
41
23
0
01 Dec 2022
Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes
Aviral Kumar
Rishabh Agarwal
Xinyang Geng
George Tucker
Sergey Levine
OffRL
44
48
0
28 Nov 2022
Melting Pot 2.0
J. Agapiou
A. Vezhnevets
Edgar A. Duéñez-Guzmán
Jayd Matyas
Yiran Mao
...
Sukhdeep Singh
Julia Haas
Igor Mordatch
D. Mobbs
Joel Z Leibo
45
32
0
24 Nov 2022
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
Daniel Jarrett
Corentin Tallec
Florent Altché
Thomas Mesnard
Rémi Munos
Michal Valko
48
5
0
18 Nov 2022
Neural Regression For Scale-Varying Targets
Adam Khakhar
Jacob Buckman
24
1
0
14 Nov 2022
Auxiliary task discovery through generate-and-test
Banafsheh Rafiee
Sina Ghiassian
Jun Jin
R. Sutton
Jun Luo
Adam White
21
0
0
25 Oct 2022
PaCo: Parameter-Compositional Multi-Task Reinforcement Learning
Lingfeng Sun
Haichao Zhang
Wei Xu
Masayoshi Tomizuka
MoE
30
37
0
21 Oct 2022
RobustAnalog: Fast Variation-Aware Analog Circuit Design Via Multi-task RL
Wei Shi
Hanrui Wang
Jiaqi Gu
Mingjie Liu
David Z. Pan
Song Han
Nan Sun
16
14
0
13 Jul 2022
Don't Start From Scratch: Leveraging Prior Data to Automate Robotic Reinforcement Learning
Homer Walke
Jonathan Yang
Albert Yu
Aviral Kumar
Jedrzej Orbik
Avi Singh
Sergey Levine
OffRL
OnRL
29
32
0
11 Jul 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
22
68
0
16 Jun 2022
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search
Michał Zawalski
Michał Tyrolski
K. Czechowski
Tomasz Odrzygóźdź
Damian Stachura
Piotr Piękos
Yuhuai Wu
Łukasz Kuciński
Piotr Miłoś
LRM
21
9
0
01 Jun 2022
Constrained Reinforcement Learning for Short Video Recommendation
Qingpeng Cai
Ruohan Zhan
Chi Zhang
Jie Zheng
Guangwei Ding
Pinghua Gong
Dong Zheng
Peng Jiang
33
6
0
26 May 2022
Robust Losses for Learning Value Functions
Andrew Patterson
Victor Liao
Martha White
28
12
0
17 May 2022
A Generalist Agent
Scott E. Reed
Konrad Zolna
Emilio Parisotto
Sergio Gomez Colmenarejo
Alexander Novikov
...
Yutian Chen
R. Hadsell
Oriol Vinyals
Mahyar Bordbar
Nando de Freitas
LM&Ro
LLMAG
AI4CE
119
793
0
12 May 2022
Rapid Locomotion via Reinforcement Learning
G. Margolis
Ge Yang
Kartik Paigwar
Tao Chen
Pulkit Agrawal
49
229
0
05 May 2022
Learning List-wise Representation in Reinforcement Learning for Ads Allocation with Multiple Auxiliary Tasks
Zehua Wang
Guogang Liao
Xiaowen Shi
Xiaoxu Wu
Chuheng Zhang
Yongkang Wang
Xingxing Wang
Dong Wang
OffRL
21
4
0
02 Apr 2022
Nearly Minimax Algorithms for Linear Bandits with Shared Representation
Jiaqi Yang
Qi Lei
Jason D. Lee
S. Du
43
16
0
29 Mar 2022
Zipfian environments for Reinforcement Learning
Stephanie C. Y. Chan
Andrew Kyle Lampinen
Pierre Harvey Richemond
Felix Hill
OffRL
15
15
0
15 Mar 2022
On Steering Multi-Annotations per Sample for Multi-Task Learning
Yuan Li
Yiwen Guo
Qizhang Li
Hongzhi Zhang
W. Zuo
25
0
0
06 Mar 2022
Demystifying Reinforcement Learning in Time-Varying Systems
Pouya Hamadanian
Malte Schwarzkopf
Siddartha Sen
MohammadIman Alizadeh
45
1
0
14 Jan 2022
In Defense of the Unitary Scalarization for Deep Multi-Task Learning
Vitaly Kurin
Alessandro De Palma
Ilya Kostrikov
Shimon Whiteson
M. P. Kumar
41
74
0
11 Jan 2022
Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning
Wenlong Huang
Igor Mordatch
Pieter Abbeel
Deepak Pathak
43
63
0
04 Nov 2021
Statistical discrimination in learning agents
Edgar A. Duéñez-Guzmán
Kevin R. McKee
Yiran Mao
Ben Coppin
Silvia Chiappa
...
Yoram Bachrach
Suzanne Sadedin
William S. Isaac
K. Tuyls
Joel Z Leibo
47
7
0
21 Oct 2021
A Closer Look at Advantage-Filtered Behavioral Cloning in High-Noise Datasets
J. E. Grigsby
Yanjun Qi
OffRL
29
5
0
10 Oct 2021
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
35
77
0
16 Sep 2021
Auxiliary Task Update Decomposition: The Good, The Bad and The Neutral
Lucio Dery
Yann N. Dauphin
David Grangier
MoMe
21
29
0
25 Aug 2021
Open-Ended Learning Leads to Generally Capable Agents
Open-Ended Learning Team
Adam Stooke
Anuj Mahajan
Catarina Barros
Charlie Deck
...
Nicolas Porcel
Roberta Raileanu
Steph Hughes-Fitt
Valentin Dalibard
Wojciech M. Czarnecki
55
181
0
27 Jul 2021
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot
Joel Z Leibo
Edgar A. Duéñez-Guzmán
A. Vezhnevets
J. Agapiou
P. Sunehag
Raphael Köster
Jayd Matyas
Charlie Beattie
Igor Mordatch
T. Graepel
OffRL
58
104
0
14 Jul 2021
AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning
Erdun Gao
Fan Feng
Chaochao Lu
Sara Magliacane
Kun Zhang
44
66
0
06 Jul 2021
Evolving Hierarchical Memory-Prediction Machines in Multi-Task Reinforcement Learning
Stephen Kelly
Tatiana Voegerl
W. Banzhaf
C. Gondro
45
13
0
23 Jun 2021
On the Power of Multitask Representation Learning in Linear MDP
Rui Lu
Gao Huang
S. Du
27
28
0
15 Jun 2021
A Comprehensive Survey on Graph Anomaly Detection with Deep Learning
Xiaoxiao Ma
Jia Wu
Shan Xue
Jian Yang
Chuan Zhou
Quan Z. Sheng
Hui Xiong
Leman Akoglu
GNN
AI4TS
43
539
0
14 Jun 2021
Continual World: A Robotic Benchmark For Continual Reinforcement Learning
Maciej Wołczyk
Michał Zając
Razvan Pascanu
Łukasz Kuciński
Piotr Miłoś
CLL
OffRL
17
89
0
23 May 2021
MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale
Dmitry Kalashnikov
Jacob Varley
Yevgen Chebotar
Benjamin Swanson
Rico Jonschkowski
Chelsea Finn
Sergey Levine
Karol Hausman
OffRL
47
271
0
16 Apr 2021
Efficient Transformers in Reinforcement Learning using Actor-Learner Distillation
Emilio Parisotto
Ruslan Salakhutdinov
42
44
0
04 Apr 2021
TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL
Clément Romac
Rémy Portelas
Katja Hofmann
Pierre-Yves Oudeyer
27
21
0
17 Mar 2021
Return-Based Contrastive Representation Learning for Reinforcement Learning
Guoqing Liu
Chuheng Zhang
Li Zhao
Tao Qin
Jinhua Zhu
Jian Li
Nenghai Yu
Tie-Yan Liu
SSL
OffRL
19
47
0
22 Feb 2021
Zero-Shot Terrain Generalization for Visual Locomotion Policies
Alejandro Escontrela
George Yu
P. Xu
Atil Iscen
Jie Tan
11
17
0
11 Nov 2020
Gradient Vaccine: Investigating and Improving Multi-task Optimization in Massively Multilingual Models
Zirui Wang
Yulia Tsvetkov
Orhan Firat
Yuan Cao
33
196
0
12 Oct 2020
My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control
Vitaly Kurin
Maximilian Igl
Tim Rocktäschel
Wendelin Boehmer
Shimon Whiteson
AI4CE
27
88
0
05 Oct 2020
Multi-Task Learning with Deep Neural Networks: A Survey
M. Crawshaw
CVBM
55
610
0
10 Sep 2020