ResearchTrend.AI
Implicit Quantile Networks for Distributional Reinforcement Learning

14 June 2018
Will Dabney
Georg Ostrovski
David Silver
Rémi Munos
    OffRL

Papers citing "Implicit Quantile Networks for Distributional Reinforcement Learning"

50 / 112 papers shown
Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning
Adam R. Villaflor
Zheng Huang
Swapnil Pande
John M. Dolan
J. Schneider
OffRL
25
24
0
21 Jul 2022
Sampling Efficient Deep Reinforcement Learning through Preference-Guided Stochastic Exploration
Wenhui Huang
Cong Zhang
Jingda Wu
Xiangkun He
Jie Zhang
Chengqi Lv
16
8
0
20 Jun 2022
Towards Safe Reinforcement Learning via Constraining Conditional Value-at-Risk
Chengyang Ying
Xinning Zhou
Hang Su
Dong Yan
Ning Chen
Jun Zhu
24
41
0
09 Jun 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Rameswar Panda
Marc G. Bellemare
OffRL
OnRL
37
63
0
03 Jun 2022
Learning Generalizable Risk-Sensitive Policies to Coordinate in Decentralized Multi-Agent General-Sum Games
Ziyi Liu
Xian Guo
Yongchun Fang
26
0
0
31 May 2022
Distributed Multi-Agent Deep Reinforcement Learning for Robust Coordination against Noise
Yoshinari Motokawa
T. Sugawara
30
2
0
19 May 2022
Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach
Bobak Shahriari
A. Abdolmaleki
Arunkumar Byravan
A. Friesen
Siqi Liu
Jost Tobias Springenberg
N. Heess
Matthew W. Hoffman
Martin Riedmiller
OffRL
46
9
0
21 Apr 2022
Adaptive Risk-Tendency: Nano Drone Navigation in Cluttered Environments with Distributional Reinforcement Learning
Cheng Liu
E. Kampen
Guido de Croon
39
16
0
28 Mar 2022
Zipfian environments for Reinforcement Learning
Stephanie C. Y. Chan
Andrew Kyle Lampinen
Pierre Harvey Richemond
Felix Hill
OffRL
15
15
0
15 Mar 2022
Multivariate Quantile Function Forecaster
Kelvin K. Kan
François-Xavier Aubet
Tim Januschowski
Youngsuk Park
Konstantinos Benidis
Lars Ruthotto
Jan Gasthaus
AI4TS
39
22
0
23 Feb 2022
MCMARL: Parameterizing Value Function via Mixture of Categorical Distributions for Multi-Agent Reinforcement Learning
Jian Zhao
Mingyu Yang
Youpeng Zhao
Xu Hu
Wen-gang Zhou
Jiangcheng Zhu
Houqiang Li
25
3
0
21 Feb 2022
TransDreamer: Reinforcement Learning with Transformer World Models
Changgu Chen
Yi-Fu Wu
Jaesik Yoon
Sungjin Ahn
OffRL
34
91
0
19 Feb 2022
MuZero with Self-competition for Rate Control in VP9 Video Compression
Amol Mandhane
A. Zhernov
Maribeth Rauh
Chenjie Gu
Miaosen Wang
...
Jackson Broshear
Julian Schrittwieser
Thomas Hubert
Oriol Vinyals
Timothy A. Mann
37
44
0
14 Feb 2022
Deep Non-Crossing Quantiles through the Partial Derivative
Axel Brando
J. Gimeno
Jose A. Rodríguez-Serrano
Jordi Vitrià
48
13
0
30 Jan 2022
Towards Autonomous Satellite Communications: An AI-based Framework to Address System-level Challenges
J. Luis
Skylar Eiskowitz
Nils Pachler de la Osa
E. Crawley
B. Cameron
23
5
0
11 Dec 2021
Reinforcement Learning-based Switching Controller for a Milliscale Robot in a Constrained Environment
Abbas Tariverdi
Ulysse Côté-Allard
Kim Mathiassen
O. Elle
H. Kalvøy
Ø. Martinsen
J. Tørresen
16
4
0
27 Nov 2021
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
Yanqiu Wu
Xinyue Chen
Che Wang
Yiming Zhang
Keith Ross
OffRL
17
9
0
17 Nov 2021
GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving
Raphael Chekroun
Marin Toromanoff
Sascha Hornauer
Fabien Moutarde
39
60
0
16 Nov 2021
Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning
Yingjie Fei
Zhuoran Yang
Yudong Chen
Zhaoran Wang
51
46
0
06 Nov 2021
d3rlpy: An Offline Deep Reinforcement Learning Library
Takuma Seno
M. Imai
OffRL
GP
65
100
0
06 Nov 2021
Robust Dynamic Bus Control: A Distributional Multi-agent Reinforcement Learning Approach
Changyin Sun
Lijun Sun
22
6
0
02 Nov 2021
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
214
852
0
12 Oct 2021
Large Batch Experience Replay
Thibault Lahire
M. Geist
Emmanuel Rachelson
OffRL
56
13
0
04 Oct 2021
ADER:Adapting between Exploration and Robustness for Actor-Critic Methods
Bo Zhou
Kejiao Li
Hongsheng Zeng
Fan Wang
Hao Tian
OffRL
38
1
0
08 Sep 2021
Deep Reinforcement Learning at the Edge of the Statistical Precipice
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Aaron Courville
Marc G. Bellemare
OffRL
61
639
0
30 Aug 2021
Conservative Offline Distributional Reinforcement Learning
Yecheng Jason Ma
Dinesh Jayaraman
Osbert Bastani
OffRL
73
79
0
12 Jul 2021
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
27
6
0
07 Jul 2021
Simplifying Deep Reinforcement Learning via Self-Supervision
Daochen Zha
Kwei-Herng Lai
Kaixiong Zhou
Xia Hu
SSL
49
15
0
10 Jun 2021
MICo: Improved representations via sampling-based state similarity for Markov decision processes
Pablo Samuel Castro
Tyler Kastner
Prakash Panangaden
Mark Rowland
48
35
0
03 Jun 2021
Learning to drive from a world on rails
Di Chen
V. Koltun
Philipp Krahenbuhl
98
116
0
03 May 2021
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRL
ELM
39
52
0
26 Apr 2021
Off-Policy Risk Assessment in Contextual Bandits
Audrey Huang
Liu Leqi
Zachary Chase Lipton
Kamyar Azizzadenesheli
OffRL
29
36
0
18 Apr 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Min Zhang
OffRL
60
73
0
01 Jan 2021
POPO: Pessimistic Offline Policy Optimization
Qiang He
Xinwen Hou
OffRL
37
10
0
26 Dec 2020
A case for new neural network smoothness constraints
Mihaela Rosca
T. Weber
Arthur Gretton
S. Mohamed
AAML
35
48
0
14 Dec 2020
Deep Reinforcement Learning for Resource Constrained Multiclass Scheduling in Wireless Networks
Apostolos Avranas
Marios Kountouris
P. Ciblat
24
7
0
27 Nov 2020
Revisiting Rainbow: Promoting more Insightful and Inclusive Deep Reinforcement Learning Research
J. Obando-Ceron
Pablo Samuel Castro
OffRL
20
105
0
20 Nov 2020
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
53
823
0
05 Oct 2020
QR-MIX: Distributional Value Function Factorisation for Cooperative Multi-Agent Reinforcement Learning
Jian Hu
Seth Austin Harding
Haibin Wu
Siyue Hu
Shih-Wei Liao
29
9
0
09 Sep 2020
Hyperparameter Selection for Offline Reinforcement Learning
T. Paine
Cosmin Paduraru
Andrea Michi
Çağlar Gülçehre
Konrad Zolna
Alexander Novikov
Ziyun Wang
Nando de Freitas
GP
OffRL
49
146
0
17 Jul 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
65
225
0
01 Jun 2020
Adversarial Attacks on Reinforcement Learning based Energy Management Systems of Extended Range Electric Delivery Vehicles
Pengyue Wang
Yuante Li
Shashi Shekhar
W. Northrop
AAML
21
8
0
01 Jun 2020
An empirical investigation of the challenges of real-world reinforcement learning
Gabriel Dulac-Arnold
Nir Levine
D. Mankowitz
Jerry Li
Cosmin Paduraru
Sven Gowal
Todd Hester
OffRL
34
121
0
24 Mar 2020
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors
Jingliang Duan
Yang Guan
Shengbo Eben Li
Yangang Ren
B. Cheng
OffRL
25
174
0
09 Jan 2020
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
43
188
0
23 Dec 2019
End-to-End Model-Free Reinforcement Learning for Urban Driving using Implicit Affordances
Marin Toromanoff
É. Wirbel
Fabien Moutarde
OffRL
44
205
0
25 Nov 2019
Worst Cases Policy Gradients
Yichuan Tang
Jian Zhang
Ruslan Salakhutdinov
27
75
0
09 Nov 2019
Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping
Cristian Bodnar
A. Li
Karol Hausman
P. Pastor
Mrinal Kalakrishnan
OffRL
28
50
0
01 Oct 2019
Harnessing Structures for Value-Based Planning and Reinforcement Learning
Yuzhe Yang
Guo Zhang
Zhi Xu
Dina Katabi
OffRL
27
31
0
26 Sep 2019
Off-Policy Actor-Critic with Shared Experience Replay
Simon Schmitt
Matteo Hessel
Karen Simonyan
OffRL
27
68
0
25 Sep 2019