Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2011.02669
Cited By
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping
5 November 2020
Yujing Hu
Weixun Wang
Hangtian Jia
Yixiang Wang
Yingfeng Chen
Jianye Hao
Feng Wu
Changjie Fan
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping"
50 / 79 papers shown
Title
CCL: Collaborative Curriculum Learning for Sparse-Reward Multi-Agent Reinforcement Learning via Co-evolutionary Task Evolution
Yufei Lin
Chengwei Ye
Jun Wang
Kangsheng Wang
Linuo Xu
Shuyan Liu
Zeyu Zhang
40
1
0
08 May 2025
Learning Explainable Dense Reward Shapes via Bayesian Optimization
Ryan Koo
Ian Yang
Vipul Raheja
Mingyi Hong
Kwang-Sung Jun
Dongyeop Kang
31
0
0
22 Apr 2025
Post-Convergence Sim-to-Real Policy Transfer: A Principled Alternative to Cherry-Picking
Dylan Khor
Bowen Weng
40
1
0
21 Apr 2025
Towards Fully Automated Decision-Making Systems for Greenhouse Control: Challenges and Opportunities
Yongshuai Liu
Taeyeong Choi
Xin Liu
AI4CE
61
0
0
27 Mar 2025
KEA: Keeping Exploration Alive by Proactively Coordinating Exploration Strategies
Shih-Min Yang
Martin Magnusson
J. A. Stork
Todor Stoyanov
49
0
0
23 Mar 2025
Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards
Zijing Hu
Fengda Zhang
Long Chen
Kun Kuang
Jiahui Li
Kaifeng Gao
Jun Xiao
X. Wang
Wenwu Zhu
EGVM
56
0
0
14 Mar 2025
Curiosity-Driven Imagination: Discovering Plan Operators and Learning Associated Policies for Open-World Adaptation
Pierrick Lorang
Hong Lu
Matthias Scheutz
46
0
0
06 Mar 2025
Closing the Intent-to-Behavior Gap via Fulfillment Priority Logic
B. Mabsout
Abdelrahman AbdelGawad
R. Mancuso
43
1
0
04 Mar 2025
Reinforcement learning Based Automated Design of Differential Evolution Algorithm for Black-box Optimization
Xu Yang
Rui Wang
Kaiwen Li
Ling Wang
56
0
0
22 Jan 2025
Blockchain-assisted Demonstration Cloning for Multi-Agent Deep Reinforcement Learning
Ahmed Alagha
Jamal Bentahar
Hadi Otrok
Shakti Singh
R. Mizouni
53
3
0
19 Jan 2025
Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning
Yun Qu
Yuhang Jiang
Boyuan Wang
Yixiu Mao
Cheems Wang
Chang-Shu Liu
Xiangyang Ji
82
3
0
10 Jan 2025
Fairness in Reinforcement Learning with Bisimulation Metrics
S. Rezaei-Shoshtari
Hanna Yurchyk
Scott Fujimoto
Doina Precup
D. Meger
85
0
0
03 Jan 2025
Bootstrapped Reward Shaping
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
R. Kulkarni
OffRL
33
0
0
03 Jan 2025
Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning Applications
Sinan Ibrahim
Mostafa Mostafa
Ali Jnadi
Hadi Salloum
Pavel Osinenko
OffRL
52
14
0
31 Dec 2024
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Jasmine Bayrooti
Carl Henrik Ek
Amanda Prorok
42
0
0
07 Oct 2024
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
Ehsan Futuhi
Shayan Karimi
Chao Gao
Martin Müller
43
1
0
07 Oct 2024
Model-Based Reward Shaping for Adversarial Inverse Reinforcement Learning in Stochastic Environments
S. Zhan
Qingyuan Wu
Philip Wang
Yixuan Wang
Ruochen Jiao
Chao Huang
Qi Zhu
39
1
0
04 Oct 2024
ELO-Rated Sequence Rewards: Advancing Reinforcement Learning Models
Qi Ju
Falin Hei
Zhemei Fang
Yunfeng Luo
37
0
0
05 Sep 2024
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
44
6
0
06 Aug 2024
Principal-Agent Reinforcement Learning
Dima Ivanov
Paul Dutting
Inbal Talgam-Cohen
Tonghan Wang
David C. Parkes
42
3
0
25 Jul 2024
Automatic Environment Shaping is the Next Frontier in RL
Younghyo Park
G. Margolis
Pulkit Agrawal
OffRL
44
3
0
23 Jul 2024
Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement Learning
Davide Corsi
Davide Camponogara
Alessandro Farinelli
OffRL
46
2
0
30 May 2024
Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity
Yan Yang
Bin Gao
Ya-xiang Yuan
46
2
0
30 May 2024
Enhancing Q-Learning with Large Language Model Heuristics
Xiefeng Wu
LRM
32
0
0
06 May 2024
On the Sample Efficiency of Abstractions and Potential-Based Reward Shaping in Reinforcement Learning
Giuseppe Canonaco
Leo Ardon
Alberto Pozanco
Daniel Borrajo
OffRL
31
1
0
11 Apr 2024
Extremum-Seeking Action Selection for Accelerating Policy Optimization
Ya-Chien Chang
Sicun Gao
40
0
0
02 Apr 2024
Decomposing Control Lyapunov Functions for Efficient Reinforcement Learning
Antonio Lopez
David Fridovich-Keil
38
1
0
18 Mar 2024
Transformable Gaussian Reward Function for Socially-Aware Navigation with Deep Reinforcement Learning
Jinyeob Kim
Sumin Kang
Sungwoo Yang
Beomjoon Kim
Jargalbaatar Yura
Donghan Kim
181
1
0
22 Feb 2024
Auxiliary Reward Generation with Transition Distance Representation Learning
Siyuan Li
Shijie Han
Yingnan Zhao
B. Liang
Peng Liu
OffRL
40
0
0
12 Feb 2024
Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF
Han Shen
Zhuoran Yang
Tianyi Chen
OffRL
40
14
0
10 Feb 2024
Reinforcement Learning from Bagged Reward
Yuting Tang
Xin-Qiang Cai
Yao-Xiang Ding
Qiyu Wu
Guoqing Liu
Masashi Sugiyama
OffRL
36
0
0
06 Feb 2024
Principal-Agent Reward Shaping in MDPs
Omer Ben-Porat
Yishay Mansour
Michal Moshkovitz
Boaz Taitler
45
10
0
30 Dec 2023
Human-AI Collaboration in Real-World Complex Environment with Reinforcement Learning
Md Saiful Islam
Srijita Das
S. Gottipati
William Duguay
Clodéric Mars
Jalal Arabneydi
Antoine Fagette
Matthew J. Guzdial
Matthew E. Taylor
41
1
0
23 Dec 2023
Toward Computationally Efficient Inverse Reinforcement Learning via Reward Shaping
Lauren H. Cooke
Harvey Klyne
Edwin Zhang
Cassidy Laidlaw
Milind Tambe
Finale Doshi-Velez
21
2
0
15 Dec 2023
FoMo Rewards: Can we cast foundation models as reward functions?
Ekdeep Singh Lubana
Johann Brehmer
P. D. Haan
Taco S. Cohen
OffRL
LRM
48
2
0
06 Dec 2023
Reward Shaping for Improved Learning in Real-time Strategy Game Play
John Kliem
Prithviraj Dasgupta
OffRL
19
1
0
27 Nov 2023
Guaranteeing Control Requirements via Reward Shaping in Reinforcement Learning
F. D. Lellis
M. Coraggio
G. Russo
Mirco Musolesi
Mario di Bernardo
OffRL
27
4
0
16 Nov 2023
Behavior Alignment via Reward Function Optimization
Dhawal Gupta
Yash Chandak
Scott M. Jordan
Philip S. Thomas
Bruno Castro da Silva
31
10
0
29 Oct 2023
Learning Extrinsic Dexterity with Parameterized Manipulation Primitives
Shih-Min Yang
Martin Magnusson
J. A. Stork
Todor Stoyanov
47
6
0
26 Oct 2023
Learning to Terminate in Object Navigation
Yuhang Song
Anh Nguyen
Chun-Yi Lee
38
3
0
28 Sep 2023
Curiosity as a Self-Supervised Method to Improve Exploration in De novo Drug Design
M. Chadi
H. Mousannif
Ahmed Aamouche
BDL
52
2
0
24 Sep 2023
State2Explanation: Concept-Based Explanations to Benefit Agent Learning and User Understanding
Devleena Das
Sonia Chernova
Been Kim
LRM
LLMAG
47
22
0
21 Sep 2023
On Reducing Undesirable Behavior in Deep Reinforcement Learning Models
Ophir M. Carmel
Guy Katz
38
0
0
06 Sep 2023
Deep Reinforcement Learning from Hierarchical Preference Design
Alexander Bukharin
Yixiao Li
Pengcheng He
Tuo Zhao
19
0
0
06 Sep 2023
Loss Dynamics of Temporal Difference Reinforcement Learning
Blake Bordelon
P. Masset
Henry Kuo
Cengiz Pehlevan
AI4CE
21
0
0
10 Jul 2023
Expanding Versatility of Agile Locomotion through Policy Transitions Using Latent State Representation
Guilherme Christmann
Ying-Sheng Luo
Jonathan Hans Soeseno
Wei-Chao Chen
28
2
0
14 Jun 2023
Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach
Yudi Zhang
Yali Du
Erdun Gao
Ziyan Wang
Jun Wang
Meng Fang
Mykola Pechenizkiy
CML
26
17
0
28 May 2023
Synthetically Generating Human-like Data for Sequential Decision Making Tasks via Reward-Shaped Imitation Learning
Bryan C. Brandt
P. Dasgupta
28
1
0
14 Apr 2023
Bandit-Based Policy Invariant Explicit Shaping for Incorporating External Advice in Reinforcement Learning
Yash Satsangi
Paniz Behboudian
OffRL
19
0
0
14 Apr 2023
Embedding Contextual Information through Reward Shaping in Multi-Agent Learning: A Case Study from Google Football
Chaoyi Gu
V. D. Silva
Corentin Artaud
Rafael Pina
21
1
0
25 Mar 2023
1
2
Next