v1v2v3v4v5 (latest)

Trust Region Policy Optimization

19 February 2015

Pieter Abbeel

Papers citing "Trust Region Policy Optimization"

50 / 2,023 papers shown

Title
Reinforcement Learning for Joint Optimization of Multiple Rewards Mridul Agarwal Vaneet Aggarwal 87 16 0 06 Sep 2019
Adaptive Trust Region Policy Optimization: Global Convergence and Faster Rates for Regularized MDPs Lior Shani Yonathan Efroni Shie Mannor 94 176 0 06 Sep 2019
ACES -- Automatic Configuration of Energy Harvesting Sensors with Reinforcement Learning Francesco Fraternali Bharathan Balaji Yuvraj Agarwal Rajesh K. Gupta 21 44 0 04 Sep 2019
rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch Adam Stooke Pieter Abbeel OffRL 94 98 0 03 Sep 2019
Generalization in Transfer Learning S. E. Ada Emre Ugur H. L. Akin 76 18 0 03 Sep 2019
Neural Policy Gradient Methods: Global Optimality and Rates of Convergence Lingxiao Wang Qi Cai Zhuoran Yang Zhaoran Wang 115 242 0 29 Aug 2019
Tutorial and Survey on Probabilistic Graphical Model and Variational Inference in Deep Reinforcement Learning Xudong Sun B. Bischl BDL 75 9 0 25 Aug 2019
A Comparison of Action Spaces for Learning Manipulation Tasks Patrick Varin Lev Grossman S. Kuindersma 67 34 0 23 Aug 2019
Multi-Agent Manipulation via Locomotion using Hierarchical Sim2Real Ofir Nachum Michael Ahn Hugo Ponte S. Gu Vikash Kumar 68 91 0 13 Aug 2019
Generative Question Refinement with Deep Reinforcement Learning in Retrieval-based QA System Ye Liu Chenwei Zhang Xiaohui Yan Yi-Ju Chang Philip S. Yu 63 20 0 13 Aug 2019
Fast Adaptation with Meta-Reinforcement Learning for Trust Modelling in Human-Robot Interaction Yuan Gao E. Sibirtseva Ginevra Castellano Danica Kragic 74 21 0 12 Aug 2019
A Review of Cooperative Multi-Agent Deep Reinforcement Learning Afshin Oroojlooyjadid Davood Hajinezhad 126 439 0 11 Aug 2019
Trajectory-wise Control Variates for Variance Reduction in Policy Gradient Methods Ching-An Cheng Xinyan Yan Byron Boots 73 22 0 08 Aug 2019
Attention Control with Metric Learning Alignment for Image Set-based Recognition Xiaofeng Liu A. Marques J. You G. Giannakis CVBM 83 10 0 05 Aug 2019
Adaptive Stress Testing with Reward Augmentation for Autonomous Vehicle Validation Anthony Corso Peter Du Katherine Driggs-Campbell Mykel J. Kochenderfer 59 99 0 02 Aug 2019
Health-Informed Policy Gradients for Multi-Agent Reinforcement Learning R. Allen Jayesh K. Gupta Jaime Pena Yutai Zhou Javona White Bear Mykel J. Kochenderfer 58 7 0 02 Aug 2019
Neural Simplex Architecture Dung Phan Radu Grosu N. Jansen Nicola Paoletti S. Smolka Scott D. Stoller 87 62 0 01 Aug 2019
On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift Alekh Agarwal Sham Kakade Jason D. Lee G. Mahajan 142 321 0 01 Aug 2019
Optimal Attacks on Reinforcement Learning Policies Alessio Russo Alexandre Proutiere AAML 65 42 0 31 Jul 2019
Wasserstein Robust Reinforcement Learning Mohammed Abdullah Hang Ren Haitham Bou-Ammar Vladimir Milenkovic Rui Luo Mingtian Zhang Jun Wang 164 76 0 30 Jul 2019
Hindsight Trust Region Policy Optimization Hanbo Zhang Site Bai Xuguang Lan David Hsu Nanning Zheng 68 8 0 29 Jul 2019
Making Sense of Vision and Touch: Learning Multimodal Representations for Contact-Rich Tasks Michelle A. Lee Yuke Zhu Peter Zachares Matthew Tan K. Srinivasan Silvio Savarese Fei-Fei Li Animesh Garg Jeannette Bohg SSL 97 213 0 28 Jul 2019
Self-Imitation Learning of Locomotion Movements through Termination Curriculum Amin Babadi Kourosh Naderi Perttu Hämäläinen 60 7 0 27 Jul 2019
Deep Reinforcement Learning for Personalized Search Story Recommendation Jason Zhang Zhang Junming Yin Dongwon Lee Linhong Zhu 55 2 0 26 Jul 2019
Environment Probing Interaction Policies Wenxuan Zhou Lerrel Pinto Abhinav Gupta 61 67 0 26 Jul 2019
A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment Felix Leibfried Sergio Pascual-Diaz Jordi Grau-Moya 125 29 0 26 Jul 2019
An Information-theoretic On-line Learning Principle for Specialization in Hierarchical Decision-Making Systems Heinke Hihn Sebastian Gottwald Daniel A. Braun 99 16 0 26 Jul 2019
Differentiable Gaussian Process Motion Planning M. Bhardwaj Byron Boots Mustafa Mukadam 77 63 0 22 Jul 2019
Deep Reinforcement Learning for Autonomous Internet of Things: Model, Applications and Challenges Lei Lei Yue Tan Kan Zheng Shiwen Liu K. Zheng Xuemin Shen Shen OffRL 89 205 0 22 Jul 2019
Prioritized Guidance for Efficient Multi-Agent Reinforcement Learning Exploration Qisheng Wang Qichao Wang 52 1 0 18 Jul 2019
Efficient Autonomy Validation in Simulation with Adaptive Stress Testing Mark Koren Mykel Kochenderfer 41 47 0 16 Jul 2019
Seeker based Adaptive Guidance via Reinforcement Meta-Learning Applied to Asteroid Close Proximity Operations B. Gaudet R. Linares R. Furfaro 41 33 0 13 Jul 2019
Environment Reconstruction with Hidden Confounders for Reinforcement Learning based Recommendation Wenjie Shang Yang Yu Qingyang Li Zhiwei Qin Yiping Meng Jieping Ye CML 67 51 0 12 Jul 2019
Imitation-Projected Programmatic Reinforcement Learning A. Verma Hoang Minh Le Yisong Yue Swarat Chaudhuri 44 2 0 11 Jul 2019
Provably Efficient Reinforcement Learning with Linear Function Approximation Chi Jin Zhuoran Yang Zhaoran Wang Michael I. Jordan 147 561 0 11 Jul 2019
Safe Policy Improvement with Soft Baseline Bootstrapping Kimia Nadjahi Romain Laroche Rémi Tachet des Combes OffRL 70 36 0 11 Jul 2019
A Model-based Approach for Sample-efficient Multi-task Reinforcement Learning Nicholas C. Landolfi G. Thomas Tengyu Ma OffRL 64 19 0 11 Jul 2019
Trust-Region Variational Inference with Gaussian Mixture Models Oleg Arenz Mingjun Zhong Gerhard Neumann 87 20 0 10 Jul 2019
An Optimistic Perspective on Offline Reinforcement Learning Rishabh Agarwal Dale Schuurmans Mohammad Norouzi OffRL OnRL 126 70 0 10 Jul 2019
Deep Lagrangian Networks for end-to-end learning of energy-based control for under-actuated systems M. Lutter Kim D. Listmann Jan Peters PINN 84 75 0 10 Jul 2019
On-Policy Robot Imitation Learning from a Converging Supervisor Ashwin Balakrishna Brijen Thananjeyan Jonathan Lee Felix Li Arsh Zahed Joseph E. Gonzalez Ken Goldberg 141 17 0 08 Jul 2019
Deep Learning based Wireless Resource Allocation with Application to Vehicular Networks Le Liang Hao Ye Guanding Yu Geoffrey Ye Li 78 200 0 07 Jul 2019
A Review of Robot Learning for Manipulation: Challenges, Representations, and Algorithms Oliver Kroemer S. Niekum George Konidaris 153 369 0 06 Jul 2019
Entropic Regularization of Markov Decision Processes Boris Belousov Jan Peters 73 24 0 06 Jul 2019
Intrinsic Motivation Driven Intuitive Physics Learning using Deep Reinforcement Learning with Intrinsic Reward Normalization Jae-Woo Choi Sung-eui Yoon AI4CE PINN 67 3 0 06 Jul 2019
Dependency-aware Attention Control for Unconstrained Face Recognition with Image Sets Xiaofeng Liu B. Kumar Chao Yang Qingming Tang J. You CVBM 95 42 0 05 Jul 2019
Self-supervised Learning of Distance Functions for Goal-Conditioned Reinforcement Learning Srinivas Venkattaramanujam Eric Crawford T. Doan Doina Precup OffRL SSL 74 24 0 05 Jul 2019
Integration of Imitation Learning using GAIL and Reinforcement Learning using Task-achievement Rewards via Probabilistic Graphical Model Akira Kinose T. Taniguchi 107 20 0 03 Jul 2019
Benchmarking Model-Based Reinforcement Learning Tingwu Wang Xuchan Bao I. Clavera Jerrick Hoang Yeming Wen Eric D. Langlois Matthew Shunshi Zhang Guodong Zhang Pieter Abbeel Jimmy Ba OffRL 122 365 0 03 Jul 2019
Co-training for Policy Learning Jialin Song Ravi Lanka Yisong Yue M. Ono OffRL 66 20 0 03 Jul 2019