v1v2v3 (latest)

CAQL: Continuous Action Q-Learning

26 September 2019

Moonkyung Ryu

Yinlam Chow

Ross Anderson

Christian Tjandraatmadja

Craig Boutilier

ArXiv (abs)PDF HTML

Papers citing "CAQL: Continuous Action Q-Learning"

26 / 26 papers shown

Title
When Deep Learning Meets Polyhedral Theory: A Survey Joey Huchette Gonzalo Muñoz Thiago Serra Calvin Tsay AI4CE 146 37 0 29 Apr 2023
Equivalent and Approximate Transformations of Deep Neural Networks Abhinav Kumar Thiago Serra Srikumar Ramalingam 52 21 0 27 May 2019
Challenges of Real-World Reinforcement Learning Gabriel Dulac-Arnold D. Mankowitz Todd Hester OffRL 79 548 0 29 Apr 2019
Strong mixed-integer programming formulations for trained neural networks Ross Anderson Joey Huchette Christian Tjandraatmadja J. Vielma 156 257 0 20 Nov 2018
Horizon: Facebook's Open Source Applied Reinforcement Learning Platform J. Gauci Edoardo Conti Yitao Liang Kittipat Virochsiri Yuchen He Zachary Kaden Vivek Narayanan Xiaohui Ye Zhengxing Chen Scott Fujimoto 60 139 0 01 Nov 2018
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation Dmitry Kalashnikov A. Irpan P. Pastor Julian Ibarz Alexander Herzog ... Deirdre Quillen E. Holly Mrinal Kalakrishnan Vincent Vanhoucke Sergey Levine 126 1,467 0 27 Jun 2018
A Lyapunov-based Approach to Safe Reinforcement Learning Yinlam Chow Ofir Nachum Edgar A. Duénez-Guzmán Mohammad Ghavamzadeh 158 506 0 20 May 2018
Planning and Learning with Stochastic Action Sets Craig Boutilier Alon Cohen Amit Daniely Avinatan Hassidim Yishay Mansour Ofer Meshi Martin Mladenov Dale Schuurmans OffRL 28 21 0 07 May 2018
Towards Fast Computation of Certified Robustness for ReLU Networks Tsui-Wei Weng Huan Zhang Hongge Chen Zhao Song Cho-Jui Hsieh Duane S. Boning Inderjit S. Dhillon Luca Daniel AAML 108 695 0 25 Apr 2018
Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods Deirdre Quillen Eric Jang Ofir Nachum Chelsea Finn Julian Ibarz Sergey Levine OOD OffRL 66 204 0 28 Feb 2018
Addressing Function Approximation Error in Actor-Critic Methods Scott Fujimoto H. V. Hoof David Meger OffRL 175 5,187 0 26 Feb 2018
Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator Maryam Fazel Rong Ge Sham Kakade M. Mesbahi 79 605 0 15 Jan 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor Tuomas Haarnoja Aurick Zhou Pieter Abbeel Sergey Levine 311 8,352 0 04 Jan 2018
Bounding and Counting Linear Regions of Deep Neural Networks Thiago Serra Christian Tjandraatmadja Srikumar Ramalingam MLT 65 250 0 06 Nov 2017
Provable defenses against adversarial examples via the convex outer adversarial polytope Eric Wong J. Zico Kolter AAML 125 1,503 0 02 Nov 2017
An approach to reachability analysis for feed-forward ReLU neural networks A. Lomuscio Lalit Maganti 65 359 0 22 Jun 2017
A unified view of entropy-regularized Markov decision processes Gergely Neu Anders Jonsson Vicencc Gómez 97 263 0 22 May 2017
Maximum Resilience of Artificial Neural Networks Chih-Hong Cheng Georg Nührenberg Harald Ruess AAML 109 284 0 28 Apr 2017
Reluplex: An Efficient SMT Solver for Verifying Deep Neural Networks Guy Katz Clark W. Barrett D. Dill Kyle D. Julian Mykel Kochenderfer AAML 318 1,873 0 03 Feb 2017
Input Convex Neural Networks Brandon Amos Lei Xu J. Zico Kolter 280 621 0 22 Sep 2016
Benchmarking Deep Reinforcement Learning for Continuous Control Yan Duan Xi Chen Rein Houthooft John Schulman Pieter Abbeel OffRL 82 1,694 0 22 Apr 2016
Continuous Deep Q-Learning with Model-based Acceleration S. Gu Timothy Lillicrap Ilya Sutskever Sergey Levine 91 1,013 0 02 Mar 2016
Asynchronous Methods for Deep Reinforcement Learning Volodymyr Mnih Adria Puigdomenech Badia M. Berk Mirza Alex Graves Timothy Lillicrap Tim Harley David Silver Koray Kavukcuoglu 199 8,859 0 04 Feb 2016
Deep Reinforcement Learning in Large Discrete Action Spaces Gabriel Dulac-Arnold Richard Evans H. V. Hasselt P. Sunehag Timothy Lillicrap Jonathan J. Hunt Timothy A. Mann T. Weber T. Degris Ben Coppin OffRL 71 574 0 24 Dec 2015
Deep Reinforcement Learning with Double Q-learning H. V. Hasselt A. Guez David Silver OffRL 170 7,641 0 22 Sep 2015
Playing Atari with Deep Reinforcement Learning Volodymyr Mnih Koray Kavukcuoglu David Silver Alex Graves Ioannis Antonoglou Daan Wierstra Martin Riedmiller 127 12,231 0 19 Dec 2013