ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2202.00079
  4. Cited By
You May Not Need Ratio Clipping in PPO

You May Not Need Ratio Clipping in PPO

31 January 2022
Mingfei Sun
Vitaly Kurin
Guoqing Liu
Sam Devlin
Tao Qin
Katja Hofmann
Shimon Whiteson
ArXivPDFHTML

Papers citing "You May Not Need Ratio Clipping in PPO"

12 / 12 papers shown
Title
No Representation, No Trust: Connecting Representation, Collapse, and
  Trust Issues in PPO
No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO
Skander Moalla
Andrea Miele
Razvan Pascanu
Çağlar Gülçehre
31
5
0
01 May 2024
RL-X: A Deep Reinforcement Learning Library (not only) for RoboCup
RL-X: A Deep Reinforcement Learning Library (not only) for RoboCup
Nico Bohlinger
Klaus Dorer
27
4
0
20 Oct 2023
Absolute Policy Optimization
Absolute Policy Optimization
Weiye Zhao
Feihan Li
Yifan Sun
Rui Chen
Tianhao Wei
Changliu Liu
52
4
0
20 Oct 2023
Universal Morphology Control via Contextual Modulation
Universal Morphology Control via Contextual Modulation
Zheng Xiong
Jacob Beck
Shimon Whiteson
33
13
0
22 Feb 2023
Trust-Region-Free Policy Optimization for Stochastic Policies
Trust-Region-Free Policy Optimization for Stochastic Policies
Mingfei Sun
Benjamin Ellis
Anuj Mahajan
Sam Devlin
Katja Hofmann
Shimon Whiteson
6
2
0
15 Feb 2023
Sample Dropout: A Simple yet Effective Variance Reduction Technique in
  Deep Policy Optimization
Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization
Zichuan Lin
Xiapeng Wu
Mingfei Sun
Deheng Ye
Qiang Fu
Wei Yang
Wei Liu
33
3
0
05 Feb 2023
Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement
  Learning
Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Haoxuan Pan
Deheng Ye
Xiaoming Duan
Qiang Fu
Wei Yang
Jianping He
Mingfei Sun
OffRL
25
2
0
20 Jan 2023
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement
  Learning
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Benjamin Ellis
Jonathan Cook
S. Moalla
Mikayel Samvelyan
Mingfei Sun
Anuj Mahajan
Jakob N. Foerster
Shimon Whiteson
33
84
0
14 Dec 2022
Inspector: Pixel-Based Automated Game Testing via Exploration,
  Detection, and Investigation
Inspector: Pixel-Based Automated Game Testing via Exploration, Detection, and Investigation
Guoqing Liu
Mengzhang Cai
Li Zhao
Tao Qin
Adrian Brown
Jimmy Bischoff
Tie-Yan Liu
24
8
0
18 Jul 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still
  Insufficient according to an Off-Policy Measure
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
41
8
0
20 May 2022
Trust Region Bounds for Decentralized PPO Under Non-stationarity
Trust Region Bounds for Decentralized PPO Under Non-stationarity
Mingfei Sun
Sam Devlin
Jacob Beck
Katja Hofmann
Shimon Whiteson
26
10
0
31 Jan 2022
Emergence of Locomotion Behaviours in Rich Environments
Emergence of Locomotion Behaviours in Rich Environments
N. Heess
TB Dhruva
S. Sriram
Jay Lemmon
J. Merel
...
Tom Erez
Ziyun Wang
S. M. Ali Eslami
Martin Riedmiller
David Silver
143
928
0
07 Jul 2017
1