v1v2v3 (latest)

Deep Reinforcement Learning with Double Q-learning

22 September 2015

David Silver

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 2,291 papers shown

Title
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance Yanqiu Wu Xinyue Chen Che Wang Yiming Zhang George Andriopoulos OffRL 65 9 0 17 Nov 2021
Compressive Features in Offline Reinforcement Learning for Recommender Systems Hung Nguyen Minh Nguyen Long Pham Jennifer Adorno Nieves OffRL 48 2 0 16 Nov 2021
Obstacle Avoidance for UAS in Continuous Action Space Using Deep Reinforcement Learning Jueming Hu Xuxi Yang Weichang Wang Peng Wei Lei Ying Yongming Liu 58 24 0 13 Nov 2021
Improving Experience Replay through Modeling of Similar Transitions' Sets Daniel Eugênio Neves João Pedro Oliveira Batisteli Eduardo Felipe Lopes Lucila Ishitani Zenilton K. G. Patrocínio OffRL 23 1 0 12 Nov 2021
AWD3: Dynamic Reduction of the Estimation Bias Dogan C. Cicek Enes Duran Baturay Saglam Kagan Kaya Furkan B. Mutlu Suleyman S. Kozat OffRL 26 7 0 12 Nov 2021
CubeTR: Learning to Solve The Rubiks Cube Using Transformers Mustafa Chasmai ViT 65 1 0 11 Nov 2021
Spatially and Seamlessly Hierarchical Reinforcement Learning for State Space and Policy space in Autonomous Driving Jaehyung Kim Jaeseung Jeong 18 0 0 10 Nov 2021
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning Jinning Li Chen Tang Masayoshi Tomizuka Wei Zhan OffRL 94 22 0 09 Nov 2021
d3rlpy: An Offline Deep Reinforcement Learning Library Takuma Seno M. Imai OffRL GP 111 106 0 06 Nov 2021
Cross Modality 3D Navigation Using Reinforcement Learning and Neural Style Transfer Cesare Magnetti Hadrien Reynaud Bernhard Kainz MedIm 22 0 0 05 Nov 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning Dhruv Shah Peng Xu Yao Lu Ted Xiao Alexander Toshev Sergey Levine Brian Ichter OffRL 81 43 0 04 Nov 2021
Balanced Q-learning: Combining the Influence of Optimistic and Pessimistic Targets Thommen George Karimpanal Hung Le Majid Abdolshah Santu Rana Sunil R. Gupta T. Tran Svetha Venkatesh 64 5 0 03 Nov 2021
Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay Dogan C. Cicek Enes Duran Baturay Saglam Furkan B. Mutlu Suleyman S. Kozat OffRL 35 11 0 02 Nov 2021
Investigation of Independent Reinforcement Learning Algorithms in Multi-Agent Environments Ken Ming Lee Sriram Ganapathi Subramanian Mark Crowley 60 11 0 01 Nov 2021
Decentralized Multi-Agent Reinforcement Learning: An Off-Policy Method Kuo Li Qing-Shan Jia OffRL 18 2 0 31 Oct 2021
Mastering Atari Games with Limited Data Weirui Ye Shao-Wei Liu Thanard Kurutach Pieter Abbeel Yang Gao VLM 149 242 0 30 Oct 2021
Bayesian Sequential Optimal Experimental Design for Nonlinear Models Using Policy Gradient Reinforcement Learning Wanggang Shen Xun Huan 62 40 0 28 Oct 2021
Cooperative Deep $Q$ -learning Framework for Environments Providing Image Feedback Krishnan Raghavan Vignesh Narayanan S. Jagannathan VLM OffRL 55 1 0 28 Oct 2021
Learning to Control using Image Feedback Krishnan Raghavan Vignesh Narayanan Jagannathan Saraangapani 35 0 0 28 Oct 2021
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment Tung M. Luu Chang D. Yoo 80 8 0 28 Oct 2021
Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates Litian Liang Yaosheng Xu Stephen Marcus McAleer Dailin Hu Alexander Ihler Pieter Abbeel Roy Fox 23 4 0 28 Oct 2021
Comparing Heuristics, Constraint Optimization, and Reinforcement Learning for an Industrial 2D Packing Problem S. Böhm Martin Neumayer Oliver Kramer Alexander Schiendorfer Alois Knoll OffRL 22 2 0 27 Oct 2021
The Difficulty of Passive Learning in Deep Reinforcement Learning Georg Ostrovski Pablo Samuel Castro Will Dabney OffRL 74 59 0 26 Oct 2021
Automating Control of Overestimation Bias for Reinforcement Learning Arsenii Kuznetsov Alexander Grishin Artem Tsypin Arsenii Ashukha Artur Kadurin Dmitry Vetrov OffRL 47 2 0 26 Oct 2021
Persona Authentication through Generative Dialogue Fengyi Tang Lifan Zeng Fei Wang Jiayu Zhou 97 8 0 25 Oct 2021
Deep Reinforcement Learning for Simultaneous Sensing and Channel Access in Cognitive Networks Yoel Bokobza R. Dabora Kobi Cohen 60 14 0 24 Oct 2021
A Reinforcement Learning Approach to Parameter Selection for Distributed Optimal Power Flow Tai-Yin Chiu Alyssa Kody Youngdae Kim Kibaek Kim Daniel K. Molzahn 41 21 0 22 Oct 2021
Deep Generative Models in Engineering Design: A Review Lyle Regenwetter Amin Heyrani Nobari Faez Ahmed 3DV AI4CE 136 192 0 21 Oct 2021
CIM-PPO:Proximal Policy Optimization with Liu-Correntropy Induced Metric Yunxiao Guo Han Long Xiaojun Duan Kaiyuan Feng Maochu Li Xiaying Ma 22 4 0 20 Oct 2021
Continuous Control with Action Quantization from Demonstrations Robert Dadashi Léonard Hussenot Damien Vincent Sertan Girgin Anton Raichuk Matthieu Geist Olivier Pietquin OffRL 103 23 0 19 Oct 2021
Reinforcement Learning-Based Coverage Path Planning with Implicit Cellular Decomposition Javad Heydari Olimpiya Saha Viswanath Ganapathy OffRL 35 16 0 18 Oct 2021
Damped Anderson Mixing for Deep Reinforcement Learning: Acceleration, Convergence, and Stabilization Ke Sun Yafei Wang Yi Liu Yingnan Zhao Bo Pan Shangling Jui Bei Jiang Linglong Kong 46 11 0 17 Oct 2021
Centroid Approximation for Bootstrap: Improving Particle Quality at Inference Mao Ye Qiang Liu 41 1 0 17 Oct 2021
SaLinA: Sequential Learning of Agents Ludovic Denoyer Alfredo De la Fuente S. Duong Jean-Baptiste Gaya Pierre-Alexandre Kamienny Daniel H. Thompson 94 11 0 15 Oct 2021
Urban traffic dynamic rerouting framework: A DRL-based model with fog-cloud architecture Runjia Du Sikai Chen Jiqian Dong Tiantian Chen Xiaowen Fu Samuel Labi 71 0 0 11 Oct 2021
Theoretically Principled Deep RL Acceleration via Nearest Neighbor Function Approximation Junhong Shen Lin F. Yang OffRL 51 17 0 09 Oct 2021
Training Transition Policies via Distribution Matching for Complex Tasks Ju-Seung Byun Andrew Perrault 55 6 0 08 Oct 2021
Medical Dead-ends and Learning to Identify High-risk States and Treatments Mehdi Fatemi Taylor W. Killian J. Subramanian Marzyeh Ghassemi OffRL 94 40 0 08 Oct 2021
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning Edoardo Cetin Oya Celiktutan OffRL 83 17 0 07 Oct 2021
Designing Composites with Target Effective Young's Modulus using Reinforcement Learning Aldair E. Gongora Siddharth Mysore Beichen Li Wan Shou Wojciech Matusik E. Morgan Keith A. Brown Emily Whiting AI4CE 62 9 0 07 Oct 2021
Explaining Deep Reinforcement Learning Agents In The Atari Domain through a Surrogate Model Alexander Sieusahai Matthew J. Guzdial 71 13 0 07 Oct 2021
Optimized Recommender Systems with Deep Reinforcement Learning Lucas Farris OffRL 25 0 0 06 Oct 2021
On The Transferability of Deep-Q Networks M. Sabatelli Pierre Geurts 83 2 0 06 Oct 2021
DeepEdge: A Deep Reinforcement Learning based Task Orchestrator for Edge Computing Baris Yamansavascilar A. C. Baktir Cagatay Sonmez Atay Ozgovde Cem Ersoy 49 25 0 05 Oct 2021
Deep reinforcement learning for guidewire navigation in coronary artery phantom Jihoon Kweon Kyunghwan Kim Chaehyuk Lee Hwi Kwon Jinwoo Park ... Inwook Back J. Roh Y. Moon Jaesoon Choi Young-Hak Kim OnRL 65 34 0 05 Oct 2021
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble Gaon An Seungyong Moon Jang-Hyun Kim Hyun Oh Song OffRL 185 283 0 04 Oct 2021
Multi-Agent Path Planning Using Deep Reinforcement Learning M. Çetinkaya 58 2 0 04 Oct 2021
A Cramér Distance perspective on Quantile Regression based Distributional Reinforcement Learning Alix Lhéritier Nicolas Bondoux 38 5 0 01 Oct 2021
Scalable Online Planning via Reinforcement Learning Fine-Tuning Arnaud Fickinger Hengyuan Hu Brandon Amos Stuart J. Russell Noam Brown 97 21 0 30 Sep 2021
Modeling Interactions of Autonomous Vehicles and Pedestrians with Deep Multi-Agent Reinforcement Learning for Collision Avoidance Raphael Trumpp Harald Bayerlein David Gesbert 38 18 0 30 Sep 2021