Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping

5 November 2020
Yujing Hu
Weixun Wang
Hangtian Jia
Yixiang Wang
Yingfeng Chen
Jianye Hao
Feng Wu
Changjie Fan
    OffRL

Papers citing "Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping"

29 / 79 papers shown
Sample-efficient Real-time Planning with Curiosity Cross-Entropy Method and Contrastive Learning
Mostafa Kotb
C. Weber
S. Wermter
32
4
0
07 Mar 2023
Temporal Video-Language Alignment Network for Reward Shaping in Reinforcement Learning
Ziyuan Cao
Reshma A. Ramachandra
K. Yu
28
2
0
08 Feb 2023
Towards Skilled Population Curriculum for Multi-Agent Reinforcement Learning
R. Wang
Longtao Zheng
Wei Qiu
Bowei He
Bo An
Zinovi Rabinovich
Yujing Hu
Yingfeng Chen
Tangjie Lv
Changjie Fan
33
1
0
07 Feb 2023
Mind the Gap: Offline Policy Optimization for Imperfect Rewards
Jianxiong Li
Xiao Hu
Haoran Xu
Jingjing Liu
Xianyuan Zhan
Qing-Shan Jia
Ya Zhang
OffRL
38
19
0
03 Feb 2023
Beyond Inverted Pendulums: Task-optimal Simple Models of Legged Locomotion
Yu-Ming Chen
Jian-bo Hu
Michael Posa
38
6
0
05 Jan 2023
Computationally Efficient Reinforcement Learning: Targeted Exploration leveraging Simple Rules
L. D. Natale
B. Svetozarevic
Philipp Heer
Colin N. Jones
26
0
0
30 Nov 2022
Automatic Evaluation of Excavator Operators using Learned Reward Functions
Pranav Agarwal
M. Teichmann
Sheldon Andrews
Samira Ebrahimi Kahou
OffRL
30
2
0
15 Nov 2022
Redeeming Intrinsic Rewards via Constrained Optimization
Eric Chen
Zhang-Wei Hong
Joni Pajarinen
Pulkit Agrawal
OnRL
36
24
0
14 Nov 2022
Advice Conformance Verification by Reinforcement Learning agents for Human-in-the-Loop
Mudit Verma
Ayushi Kharkwal
Subbarao Kambhampati
14
4
0
07 Oct 2022
Generalization in Deep Reinforcement Learning for Robotic Navigation by Reward Shaping
Victor R. F. Miranda
A. A. Neto
G. Freitas
L. Mozelli
35
18
0
28 Sep 2022
Safe Reinforcement Learning with Contrastive Risk Prediction
Hanping Zhang
Yuhong Guo
OffRL
29
2
0
10 Sep 2022
Impact Makes a Sound and Sound Makes an Impact: Sound Guides Representations and Explorations
Xufeng Zhao
C. Weber
Muhammad Burhan Hafez
S. Wermter
25
8
0
04 Aug 2022
Auto-Encoding Adversarial Imitation Learning
Kaifeng Zhang
Rui Zhao
Ziming Zhang
Yang Gao
19
1
0
22 Jun 2022
Guarantees for Epsilon-Greedy Reinforcement Learning with Function Approximation
Christoph Dann
Yishay Mansour
M. Mohri
Ayush Sekhari
Karthik Sridharan
36
49
0
19 Jun 2022
Achieving Goals using Reward Shaping and Curriculum Learning
M. Anca
Jonathan D. Thomas
Dabal Pedamonti
M. Studley
Mark Hansen
12
1
0
06 Jun 2022
Soft Actor-Critic with Inhibitory Networks for Faster Retraining
J. Ide
Daria Mićović
Michael J. Guarino
K. Alcedo
D. Rosenbluth
Adrian P. Pope
13
3
0
07 Feb 2022
Learning Synthetic Environments and Reward Networks for Reinforcement Learning
Fabio Ferreira
Thomas Nierhoff
Andreas Saelinger
Frank Hutter
27
4
0
06 Feb 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning
Xidong Feng
Bo Liu
Jie Ren
Luo Mai
Rui Zhu
Haifeng Zhang
Jun Wang
Yaodong Yang
14
12
0
31 Dec 2021
Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles
Xinglong Zhang
Yaoqian Peng
Biao Luo
Wei Pan
Xin Xu
Haibin Xie
27
11
0
18 Dec 2021
On Assessing The Safety of Reinforcement Learning algorithms Using Formal Methods
Paulina Stevia Nouwou Mindom
Amin Nikanjam
Foutse Khomh
J. Mullins
AAML
28
3
0
08 Nov 2021
Discovering and Exploiting Sparse Rewards in a Learned Behavior Space
Giuseppe Paolo
Alexandre Coninx
Alban Laflaquière
Stéphane Doncieux
19
3
0
02 Nov 2021
ReLAX: Reinforcement Learning Agent eXplainer for Arbitrary Predictive Models
Kiran Purohit
Soumili Das
Jia Wang
He Zhu
Santu Rana
Gabriele Tolomei
CML
OffRL
41
36
0
22 Oct 2021
Core Challenges in Embodied Vision-Language Planning
Jonathan M Francis
Nariaki Kitamura
Felix Labelle
Xiaopeng Lu
Ingrid Navarro
Jean Oh
LM&Ro
49
45
0
26 Jun 2021
Heuristic-Guided Reinforcement Learning
Ching-An Cheng
Andrey Kolobov
Adith Swaminathan
OffRL
35
61
0
05 Jun 2021
Goal-constrained Sparse Reinforcement Learning for End-to-End Driving
Pranav Agarwal
Pierre de Beaucorps
Raoul de Charette
29
3
0
16 Mar 2021
Learning to Shape Rewards using a Game of Two Partners
D. Mguni
Taher Jafferjee
Jianhong Wang
Nicolas Perez Nieves
Tianpei Yang
...
Feifei Tong
Hui Chen
Jiangcheng Zhu
Jun Wang
Yaodong Yang
38
10
0
16 Mar 2021
Model-Augmented Q-learning
Youngmin Oh
Jinwoo Shin
Eunho Yang
Sung Ju Hwang
OffRL
14
1
0
07 Feb 2021
MDP Playground: An Analysis and Debug Testbed for Reinforcement Learning
Raghunandan Rajan
Jessica Lizeth Borja Diaz
Suresh Guttikonda
Fabio Ferreira
André Biedenkapp
Jan Ole von Hartz
Frank Hutter
33
3
0
17 Sep 2019