A dynamical clipping approach with task feedback for Proximal Policy
Optimization

A dynamical clipping approach with task feedback for Proximal Policy Optimization

12 December 2023

Jingzehua Xu

Shuai Zhang

Papers citing "A dynamical clipping approach with task feedback for Proximal Policy Optimization"

9 / 9 papers shown

Title
DTC: Deep Tracking Control Fabian Jenelten Junzhe He Farbod Farshidian Marco Hutter 60 80 0 27 Sep 2023
Tuning computer vision models with task rewards André Susano Pinto Alexander Kolesnikov Yuge Shi Lucas Beyer Xiaohua Zhai VLM 52 41 0 16 Feb 2023
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training Kimin Lee Laura M. Smith Pieter Abbeel OffRL 60 284 0 09 Jun 2021
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games Chao Yu Akash Velu Eugene Vinitsky Jiaxuan Gao Yu Wang Alexandre M. Bayen Yi Wu OffRL 134 1,250 0 02 Mar 2021
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces Garrett A. Warnell Nicholas R. Waytowich Vernon J. Lawhern Peter Stone 44 270 0 28 Sep 2017
Proximal Policy Optimization Algorithms John Schulman Filip Wolski Prafulla Dhariwal Alec Radford Oleg Klimov OffRL 496 19,019 0 20 Jul 2017
OpenAI Gym Greg Brockman Vicki Cheung Ludwig Pettersson Jonas Schneider John Schulman Jie Tang Wojciech Zaremba OffRL ODL 214 5,076 0 05 Jun 2016
Continuous control with deep reinforcement learning Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel N. Heess Tom Erez Yuval Tassa David Silver Daan Wierstra 320 13,237 0 09 Sep 2015
Playing Atari with Deep Reinforcement Learning Volodymyr Mnih Koray Kavukcuoglu David Silver Alex Graves Ioannis Antonoglou Daan Wierstra Martin Riedmiller 127 12,227 0 19 Dec 2013