ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1806.02315
  4. Cited By
Randomized Value Functions via Multiplicative Normalizing Flows
v1v2v3 (latest)

Randomized Value Functions via Multiplicative Normalizing Flows

6 June 2018
Ahmed Touati
Harsh Satija
Joshua Romoff
Joelle Pineau
Pascal Vincent
ArXiv (abs)PDFHTML

Papers citing "Randomized Value Functions via Multiplicative Normalizing Flows"

22 / 22 papers shown
Title
Trust-MARL: Trust-Based Multi-Agent Reinforcement Learning Framework for Cooperative On-Ramp Merging Control in Heterogeneous Traffic Flow
Trust-MARL: Trust-Based Multi-Agent Reinforcement Learning Framework for Cooperative On-Ramp Merging Control in Heterogeneous Traffic Flow
Jie Pan
Tianyi Wang
Christian Claudel
Jing Shi
7
0
0
14 Jun 2025
Learning Uncertainty-Aware Temporally-Extended Actions
Learning Uncertainty-Aware Temporally-Extended Actions
Joongkyu Lee
Seung Joon Park
Yunhao Tang
Min-hwan Oh
55
2
0
08 Feb 2024
Bayesian Exploration Networks
Bayesian Exploration Networks
Matt Fellows
Brandon Kaplowitz
Christian Schroeder de Witt
Shimon Whiteson
BDL
92
4
0
24 Aug 2023
Constraining cosmological parameters from N-body simulations with
  Variational Bayesian Neural Networks
Constraining cosmological parameters from N-body simulations with Variational Bayesian Neural Networks
Héctor J. Hortúa
L. '. García
Leonardo Castañeda C.
BDL
53
4
0
09 Jan 2023
CEIP: Combining Explicit and Implicit Priors for Reinforcement Learning
  with Demonstrations
CEIP: Combining Explicit and Implicit Priors for Reinforcement Learning with Demonstrations
Kai Yan
Alex Schwing
Yu-Xiong Wang
OffRL
70
2
0
18 Oct 2022
Flow-based Recurrent Belief State Learning for POMDPs
Flow-based Recurrent Belief State Learning for POMDPs
Xiaoyu Chen
Yao Mu
Ping Luo
Sheng Li
Jianyu Chen
85
19
0
23 May 2022
TO-FLOW: Efficient Continuous Normalizing Flows with Temporal
  Optimization adjoint with Moving Speed
TO-FLOW: Efficient Continuous Normalizing Flows with Temporal Optimization adjoint with Moving Speed
Shian Du
Yihong Luo
Wei Chen
Jian Xu
Delu Zeng
95
8
0
19 Mar 2022
Improving the Diversity of Bootstrapped DQN by Replacing Priors With
  Noise
Improving the Diversity of Bootstrapped DQN by Replacing Priors With Noise
Li Meng
Morten Goodwin
Anis Yazidi
P. Engelstad
68
4
0
02 Mar 2022
Exploring More When It Needs in Deep Reinforcement Learning
Exploring More When It Needs in Deep Reinforcement Learning
Youtian Guo
Qitong Gao
29
0
0
28 Sep 2021
A Survey of Exploration Methods in Reinforcement Learning
A Survey of Exploration Methods in Reinforcement Learning
Susan Amin
Maziar Gomrokchi
Harsh Satija
H. V. Hoof
Doina Precup
OffRL
100
84
0
01 Sep 2021
Bayesian Bellman Operators
Bayesian Bellman Operators
M. Fellows
Kristian Hartikainen
Shimon Whiteson
OffRL
81
17
0
09 Jun 2021
Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning
Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning
Yue Wu
Shuangfei Zhai
Nitish Srivastava
J. Susskind
Jian Zhang
Ruslan Salakhutdinov
Hanlin Goh
EDLOffRLOnRL
80
190
0
17 May 2021
Out-of-Distribution Detection of Melanoma using Normalizing Flows
Out-of-Distribution Detection of Melanoma using Normalizing Flows
M. Valiuddin
C.G.A. Viviers
OODD
43
0
0
23 Mar 2021
Parameterized Indexed Value Function for Efficient Exploration in
  Reinforcement Learning
Parameterized Indexed Value Function for Efficient Exploration in Reinforcement Learning
Tian Tan
Zhihan Xiong
Vikranth Dwaracherla
17
5
0
23 Dec 2019
Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement
  Learning
Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning
T. Doan
Bogdan Mazoure
Moloud Abdar
A. Durand
Joelle Pineau
R. Devon Hjelm
73
15
0
17 Sep 2019
Benchmarking Bonus-Based Exploration Methods on the Arcade Learning
  Environment
Benchmarking Bonus-Based Exploration Methods on the Arcade Learning Environment
Adrien Ali Taïga
W. Fedus
Marlos C. Machado
Aaron Courville
Marc G. Bellemare
83
41
0
06 Aug 2019
Stochastic Neural Network with Kronecker Flow
Stochastic Neural Network with Kronecker Flow
Chin-Wei Huang
Ahmed Touati
Pascal Vincent
Gintare Karolina Dziugaite
Alexandre Lacoste
Aaron Courville
BDL
67
8
0
10 Jun 2019
Worst-Case Regret Bounds for Exploration via Randomized Value Functions
Worst-Case Regret Bounds for Exploration via Randomized Value Functions
Daniel Russo
OffRL
78
88
0
07 Jun 2019
Randomised Bayesian Least-Squares Policy Iteration
Randomised Bayesian Least-Squares Policy Iteration
Nikolaos Tziortziotis
Christos Dimitrakakis
Michalis Vazirgiannis
OffRL
49
1
0
06 Apr 2019
Off-Policy Deep Reinforcement Learning without Exploration
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto
David Meger
Doina Precup
OffRLBDL
296
1,626
0
07 Dec 2018
Successor Uncertainties: Exploration and Uncertainty in Temporal
  Difference Learning
Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning
David Janz
Jiri Hron
Przemysław Mazur
Katja Hofmann
José Miguel Hernández-Lobato
Sebastian Tschiatschek
163
52
0
15 Oct 2018
Randomized Prior Functions for Deep Reinforcement Learning
Randomized Prior Functions for Deep Reinforcement Learning
Ian Osband
John Aslanides
Albin Cassirer
UQCVBDL
90
380
0
08 Jun 2018
1