ResearchTrend.AI
Improving Exploration in Soft-Actor-Critic with Normalizing Flows Policies

6 June 2019
Patrick Nadeem Ward
Ariella Smofsky
A. Bose

Papers citing "Improving Exploration in Soft-Actor-Critic with Normalizing Flows Policies"

16 / 16 papers shown
Monotone, Bi-Lipschitz, and Polyak-Lojasiewicz Networks
Ruigang Wang
Krishnamurthy Dvijotham
I. Manchester
41
5
0
02 Feb 2024
LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning
Outongyi Lv
Bingxin Zhou
OffRL
44
0
0
05 Jul 2023
On Learning the Tail Quantiles of Driving Behavior Distributions via Quantile Regression and Flows
Jia Yu Tee
Oliver De Candido
Wolfgang Utschick
Philipp Geiger
27
0
0
22 May 2023
Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint
Taisuke Kobayashi
53
3
0
08 Mar 2023
CEIP: Combining Explicit and Implicit Priors for Reinforcement Learning with Demonstrations
Kai Yan
Alex Schwing
Yu-xiong Wang
OffRL
30
2
0
18 Oct 2022
Revisiting Discrete Soft Actor-Critic
Haibin Zhou
Zichuan Lin
Junyou Li
Qiang Fu
Wei Yang
Deheng Ye
51
12
0
21 Sep 2022
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on Exploration and Performance
Jakob J. Hollenstein
Sayantan Auddy
Matteo Saveriano
Erwan Renaudo
J. Piater
46
17
0
08 Jun 2022
Flow-based Recurrent Belief State Learning for POMDPs
Xiaoyu Chen
Yao Mu
Ping Luo
Sheng Li
Jianyu Chen
53
18
0
23 May 2022
TO-FLOW: Efficient Continuous Normalizing Flows with Temporal Optimization adjoint with Moving Speed
Shian Du
Yihong Luo
Wei Chen
Jian Xu
Delu Zeng
37
7
0
19 Mar 2022
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience
C. Banerjee
Zhiyong Chen
N. Noman
19
30
0
24 Sep 2021
Implicitly Regularized RL with Implicit Q-Values
Nino Vieillard
Marcin Andrychowicz
Anton Raichuk
Olivier Pietquin
M. Geist
OffRL
24
9
0
16 Aug 2021
Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences
Alan Chan
Hugo Silva
Sungsu Lim
Tadashi Kozuno
A. R. Mahmood
Martha White
25
29
0
17 Jul 2021
Composing Normalizing Flows for Inverse Problems
Jay Whang
Erik M. Lindgren
A. Dimakis
TPM
26
50
0
26 Feb 2020
Augmented Normalizing Flows: Bridging the Gap Between Generative Flows and Latent Variable Models
Chin-Wei Huang
Laurent Dinh
Aaron Courville
DRL
31
87
0
17 Feb 2020
Discrete and Continuous Action Representation for Practical RL in Video Games
Olivier Delalleau
Maxim Peter
Eloi Alonso
Adrien Logut
25
52
0
23 Dec 2019
Normalizing Flows for Probabilistic Modeling and Inference
George Papamakarios
Eric T. Nalisnick
Danilo Jimenez Rezende
S. Mohamed
Balaji Lakshminarayanan
TPM
AI4CE
67
1,635
0
05 Dec 2019