Improving Exploration in Soft-Actor-Critic with Normalizing Flows Policies

6 June 2019

Papers citing "Improving Exploration in Soft-Actor-Critic with Normalizing Flows Policies"

16 / 16 papers shown

Title
Monotone, Bi-Lipschitz, and Polyak-Lojasiewicz Networks Ruigang Wang Krishnamurthy Dvijotham I. Manchester 41 5 0 02 Feb 2024
LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning Outongyi Lv Bingxin Zhou OffRL 44 0 0 05 Jul 2023
On Learning the Tail Quantiles of Driving Behavior Distributions via Quantile Regression and Flows Jia Yu Tee Oliver De Candido Wolfgang Utschick Philipp Geiger 27 0 0 22 May 2023
Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint Taisuke Kobayashi 53 3 0 08 Mar 2023
CEIP: Combining Explicit and Implicit Priors for Reinforcement Learning with Demonstrations Kai Yan Alex Schwing Yu-xiong Wang OffRL 30 2 0 18 Oct 2022
Revisiting Discrete Soft Actor-Critic Haibin Zhou Zichuan Lin Junyou Li Qiang Fu Wei Yang Deheng Ye 51 12 0 21 Sep 2022
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on Exploration and Performance Jakob J. Hollenstein Sayantan Auddy Matteo Saveriano Erwan Renaudo J. Piater 46 17 0 08 Jun 2022
Flow-based Recurrent Belief State Learning for POMDPs Xiaoyu Chen Yao Mu Ping Luo Sheng Li Jianyu Chen 53 18 0 23 May 2022
TO-FLOW: Efficient Continuous Normalizing Flows with Temporal Optimization adjoint with Moving Speed Shian Du Yihong Luo Wei Chen Jian Xu Delu Zeng 37 7 0 19 Mar 2022
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience C. Banerjee Zhiyong Chen N. Noman 19 30 0 24 Sep 2021
Implicitly Regularized RL with Implicit Q-Values Nino Vieillard Marcin Andrychowicz Anton Raichuk Olivier Pietquin M. Geist OffRL 24 9 0 16 Aug 2021
Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences Alan Chan Hugo Silva Sungsu Lim Tadashi Kozuno A. R. Mahmood Martha White 25 29 0 17 Jul 2021
Composing Normalizing Flows for Inverse Problems Jay Whang Erik M. Lindgren A. Dimakis TPM 26 50 0 26 Feb 2020
Augmented Normalizing Flows: Bridging the Gap Between Generative Flows and Latent Variable Models Chin-Wei Huang Laurent Dinh Aaron Courville DRL 31 87 0 17 Feb 2020
Discrete and Continuous Action Representation for Practical RL in Video Games Olivier Delalleau Maxim Peter Eloi Alonso Adrien Logut 25 52 0 23 Dec 2019
Normalizing Flows for Probabilistic Modeling and Inference George Papamakarios Eric T. Nalisnick Danilo Jimenez Rezende S. Mohamed Balaji Lakshminarayanan TPM AI4CE 67 1,635 0 05 Dec 2019