ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.02669
  4. Cited By
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping

Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping

5 November 2020
Yujing Hu
Weixun Wang
Hangtian Jia
Yixiang Wang
Yingfeng Chen
Jianye Hao
Feng Wu
Changjie Fan
    OffRL
ArXivPDFHTML

Papers citing "Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping"

50 / 79 papers shown
Title
CCL: Collaborative Curriculum Learning for Sparse-Reward Multi-Agent Reinforcement Learning via Co-evolutionary Task Evolution
CCL: Collaborative Curriculum Learning for Sparse-Reward Multi-Agent Reinforcement Learning via Co-evolutionary Task Evolution
Yufei Lin
Chengwei Ye
Jun Wang
Kangsheng Wang
Linuo Xu
Shuyan Liu
Zeyu Zhang
40
1
0
08 May 2025
Learning Explainable Dense Reward Shapes via Bayesian Optimization
Learning Explainable Dense Reward Shapes via Bayesian Optimization
Ryan Koo
Ian Yang
Vipul Raheja
Mingyi Hong
Kwang-Sung Jun
Dongyeop Kang
31
0
0
22 Apr 2025
Post-Convergence Sim-to-Real Policy Transfer: A Principled Alternative to Cherry-Picking
Post-Convergence Sim-to-Real Policy Transfer: A Principled Alternative to Cherry-Picking
Dylan Khor
Bowen Weng
40
1
0
21 Apr 2025
Towards Fully Automated Decision-Making Systems for Greenhouse Control: Challenges and Opportunities
Towards Fully Automated Decision-Making Systems for Greenhouse Control: Challenges and Opportunities
Yongshuai Liu
Taeyeong Choi
Xin Liu
AI4CE
61
0
0
27 Mar 2025
KEA: Keeping Exploration Alive by Proactively Coordinating Exploration Strategies
KEA: Keeping Exploration Alive by Proactively Coordinating Exploration Strategies
Shih-Min Yang
Martin Magnusson
J. A. Stork
Todor Stoyanov
49
0
0
23 Mar 2025
Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards
Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards
Zijing Hu
Fengda Zhang
Long Chen
Kun Kuang
Jiahui Li
Kaifeng Gao
Jun Xiao
X. Wang
Wenwu Zhu
EGVM
56
0
0
14 Mar 2025
Curiosity-Driven Imagination: Discovering Plan Operators and Learning Associated Policies for Open-World Adaptation
Pierrick Lorang
Hong Lu
Matthias Scheutz
46
0
0
06 Mar 2025
Closing the Intent-to-Behavior Gap via Fulfillment Priority Logic
Closing the Intent-to-Behavior Gap via Fulfillment Priority Logic
B. Mabsout
Abdelrahman AbdelGawad
R. Mancuso
40
1
0
04 Mar 2025
Reinforcement learning Based Automated Design of Differential Evolution Algorithm for Black-box Optimization
Reinforcement learning Based Automated Design of Differential Evolution Algorithm for Black-box Optimization
Xu Yang
Rui Wang
Kaiwen Li
Ling Wang
56
0
0
22 Jan 2025
Blockchain-assisted Demonstration Cloning for Multi-Agent Deep Reinforcement Learning
Blockchain-assisted Demonstration Cloning for Multi-Agent Deep Reinforcement Learning
Ahmed Alagha
Jamal Bentahar
Hadi Otrok
Shakti Singh
R. Mizouni
53
3
0
19 Jan 2025
Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning
Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning
Yun Qu
Yuhang Jiang
Boyuan Wang
Yixiu Mao
Cheems Wang
Chang-Shu Liu
Xiangyang Ji
80
2
0
10 Jan 2025
Fairness in Reinforcement Learning with Bisimulation Metrics
Fairness in Reinforcement Learning with Bisimulation Metrics
S. Rezaei-Shoshtari
Hanna Yurchyk
Scott Fujimoto
Doina Precup
D. Meger
85
0
0
03 Jan 2025
Bootstrapped Reward Shaping
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
R. Kulkarni
OffRL
28
0
0
03 Jan 2025
Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning Applications
Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning Applications
Sinan Ibrahim
Mostafa Mostafa
Ali Jnadi
Hadi Salloum
Pavel Osinenko
OffRL
52
12
0
31 Dec 2024
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Jasmine Bayrooti
Carl Henrik Ek
Amanda Prorok
42
0
0
07 Oct 2024
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
Ehsan Futuhi
Shayan Karimi
Chao Gao
Martin Müller
40
1
0
07 Oct 2024
Model-Based Reward Shaping for Adversarial Inverse Reinforcement
  Learning in Stochastic Environments
Model-Based Reward Shaping for Adversarial Inverse Reinforcement Learning in Stochastic Environments
S. Zhan
Qingyuan Wu
Philip Wang
Yixuan Wang
Ruochen Jiao
Chao Huang
Qi Zhu
39
1
0
04 Oct 2024
ELO-Rated Sequence Rewards: Advancing Reinforcement Learning Models
ELO-Rated Sequence Rewards: Advancing Reinforcement Learning Models
Qi Ju
Falin Hei
Zhemei Fang
Yunfeng Luo
37
0
0
05 Sep 2024
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
39
6
0
06 Aug 2024
Principal-Agent Reinforcement Learning
Principal-Agent Reinforcement Learning
Dima Ivanov
Paul Dutting
Inbal Talgam-Cohen
Tonghan Wang
David C. Parkes
42
3
0
25 Jul 2024
Automatic Environment Shaping is the Next Frontier in RL
Automatic Environment Shaping is the Next Frontier in RL
Younghyo Park
G. Margolis
Pulkit Agrawal
OffRL
40
3
0
23 Jul 2024
Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement
  Learning
Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement Learning
Davide Corsi
Davide Camponogara
Alessandro Farinelli
OffRL
46
2
0
30 May 2024
Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity
Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity
Yan Yang
Bin Gao
Ya-xiang Yuan
46
2
0
30 May 2024
Enhancing Q-Learning with Large Language Model Heuristics
Enhancing Q-Learning with Large Language Model Heuristics
Xiefeng Wu
LRM
32
0
0
06 May 2024
On the Sample Efficiency of Abstractions and Potential-Based Reward
  Shaping in Reinforcement Learning
On the Sample Efficiency of Abstractions and Potential-Based Reward Shaping in Reinforcement Learning
Giuseppe Canonaco
Leo Ardon
Alberto Pozanco
Daniel Borrajo
OffRL
31
1
0
11 Apr 2024
Extremum-Seeking Action Selection for Accelerating Policy Optimization
Extremum-Seeking Action Selection for Accelerating Policy Optimization
Ya-Chien Chang
Sicun Gao
40
0
0
02 Apr 2024
Decomposing Control Lyapunov Functions for Efficient Reinforcement
  Learning
Decomposing Control Lyapunov Functions for Efficient Reinforcement Learning
Antonio Lopez
David Fridovich-Keil
38
1
0
18 Mar 2024
Transformable Gaussian Reward Function for Socially-Aware Navigation
  with Deep Reinforcement Learning
Transformable Gaussian Reward Function for Socially-Aware Navigation with Deep Reinforcement Learning
Jinyeob Kim
Sumin Kang
Sungwoo Yang
Beomjoon Kim
Jargalbaatar Yura
Donghan Kim
169
1
0
22 Feb 2024
Auxiliary Reward Generation with Transition Distance Representation
  Learning
Auxiliary Reward Generation with Transition Distance Representation Learning
Siyuan Li
Shijie Han
Yingnan Zhao
B. Liang
Peng Liu
OffRL
38
0
0
12 Feb 2024
Principled Penalty-based Methods for Bilevel Reinforcement Learning and
  RLHF
Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF
Han Shen
Zhuoran Yang
Tianyi Chen
OffRL
40
14
0
10 Feb 2024
Reinforcement Learning from Bagged Reward
Reinforcement Learning from Bagged Reward
Yuting Tang
Xin-Qiang Cai
Yao-Xiang Ding
Qiyu Wu
Guoqing Liu
Masashi Sugiyama
OffRL
36
0
0
06 Feb 2024
Principal-Agent Reward Shaping in MDPs
Principal-Agent Reward Shaping in MDPs
Omer Ben-Porat
Yishay Mansour
Michal Moshkovitz
Boaz Taitler
45
10
0
30 Dec 2023
Human-AI Collaboration in Real-World Complex Environment with
  Reinforcement Learning
Human-AI Collaboration in Real-World Complex Environment with Reinforcement Learning
Md Saiful Islam
Srijita Das
S. Gottipati
William Duguay
Clodéric Mars
Jalal Arabneydi
Antoine Fagette
Matthew J. Guzdial
Matthew E. Taylor
41
1
0
23 Dec 2023
Toward Computationally Efficient Inverse Reinforcement Learning via
  Reward Shaping
Toward Computationally Efficient Inverse Reinforcement Learning via Reward Shaping
Lauren H. Cooke
Harvey Klyne
Edwin Zhang
Cassidy Laidlaw
Milind Tambe
Finale Doshi-Velez
21
2
0
15 Dec 2023
FoMo Rewards: Can we cast foundation models as reward functions?
FoMo Rewards: Can we cast foundation models as reward functions?
Ekdeep Singh Lubana
Johann Brehmer
P. D. Haan
Taco S. Cohen
OffRL
LRM
48
2
0
06 Dec 2023
Reward Shaping for Improved Learning in Real-time Strategy Game Play
Reward Shaping for Improved Learning in Real-time Strategy Game Play
John Kliem
Prithviraj Dasgupta
OffRL
19
1
0
27 Nov 2023
Guaranteeing Control Requirements via Reward Shaping in Reinforcement
  Learning
Guaranteeing Control Requirements via Reward Shaping in Reinforcement Learning
F. D. Lellis
M. Coraggio
G. Russo
Mirco Musolesi
Mario di Bernardo
OffRL
24
4
0
16 Nov 2023
Behavior Alignment via Reward Function Optimization
Behavior Alignment via Reward Function Optimization
Dhawal Gupta
Yash Chandak
Scott M. Jordan
Philip S. Thomas
Bruno Castro da Silva
31
10
0
29 Oct 2023
Learning Extrinsic Dexterity with Parameterized Manipulation Primitives
Learning Extrinsic Dexterity with Parameterized Manipulation Primitives
Shih-Min Yang
Martin Magnusson
J. A. Stork
Todor Stoyanov
47
6
0
26 Oct 2023
Learning to Terminate in Object Navigation
Learning to Terminate in Object Navigation
Yuhang Song
Anh Nguyen
Chun-Yi Lee
38
3
0
28 Sep 2023
Curiosity as a Self-Supervised Method to Improve Exploration in De novo
  Drug Design
Curiosity as a Self-Supervised Method to Improve Exploration in De novo Drug Design
M. Chadi
H. Mousannif
Ahmed Aamouche
BDL
52
2
0
24 Sep 2023
State2Explanation: Concept-Based Explanations to Benefit Agent Learning
  and User Understanding
State2Explanation: Concept-Based Explanations to Benefit Agent Learning and User Understanding
Devleena Das
Sonia Chernova
Been Kim
LRM
LLMAG
47
22
0
21 Sep 2023
On Reducing Undesirable Behavior in Deep Reinforcement Learning Models
On Reducing Undesirable Behavior in Deep Reinforcement Learning Models
Ophir M. Carmel
Guy Katz
38
0
0
06 Sep 2023
Deep Reinforcement Learning from Hierarchical Preference Design
Deep Reinforcement Learning from Hierarchical Preference Design
Alexander Bukharin
Yixiao Li
Pengcheng He
Tuo Zhao
17
0
0
06 Sep 2023
Loss Dynamics of Temporal Difference Reinforcement Learning
Loss Dynamics of Temporal Difference Reinforcement Learning
Blake Bordelon
P. Masset
Henry Kuo
Cengiz Pehlevan
AI4CE
21
0
0
10 Jul 2023
Expanding Versatility of Agile Locomotion through Policy Transitions
  Using Latent State Representation
Expanding Versatility of Agile Locomotion through Policy Transitions Using Latent State Representation
Guilherme Christmann
Ying-Sheng Luo
Jonathan Hans Soeseno
Wei-Chao Chen
25
2
0
14 Jun 2023
Interpretable Reward Redistribution in Reinforcement Learning: A Causal
  Approach
Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach
Yudi Zhang
Yali Du
Erdun Gao
Ziyan Wang
Jun Wang
Meng Fang
Mykola Pechenizkiy
CML
26
17
0
28 May 2023
Synthetically Generating Human-like Data for Sequential Decision Making
  Tasks via Reward-Shaped Imitation Learning
Synthetically Generating Human-like Data for Sequential Decision Making Tasks via Reward-Shaped Imitation Learning
Bryan C. Brandt
P. Dasgupta
28
1
0
14 Apr 2023
Bandit-Based Policy Invariant Explicit Shaping for Incorporating
  External Advice in Reinforcement Learning
Bandit-Based Policy Invariant Explicit Shaping for Incorporating External Advice in Reinforcement Learning
Yash Satsangi
Paniz Behboudian
OffRL
19
0
0
14 Apr 2023
Embedding Contextual Information through Reward Shaping in Multi-Agent
  Learning: A Case Study from Google Football
Embedding Contextual Information through Reward Shaping in Multi-Agent Learning: A Case Study from Google Football
Chaoyi Gu
V. D. Silva
Corentin Artaud
Rafael Pina
21
1
0
25 Mar 2023
12
Next