ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.09281
  4. Cited By
Dealing with Sparse Rewards in Reinforcement Learning

Dealing with Sparse Rewards in Reinforcement Learning

21 October 2019
J. Hare
ArXivPDFHTML

Papers citing "Dealing with Sparse Rewards in Reinforcement Learning"

26 / 26 papers shown
Title
Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards
Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards
Zijing Hu
Fengda Zhang
Long Chen
Kun Kuang
Jiahui Li
Kaifeng Gao
Jun Xiao
X. Wang
Wenwu Zhu
EGVM
56
0
0
14 Mar 2025
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Kaiyan Zhao
Yiming Wang
Yuyang Chen
Yan Li
Leong Hou U
Xiaoguang Niu
36
1
0
27 Oct 2024
Automated Rewards via LLM-Generated Progress Functions
Automated Rewards via LLM-Generated Progress Functions
Vishnu Sarukkai
Brennan Shacklett
Zander Majercik
Kush S. Bhatia
Christopher Ré
Kayvon Fatahalian
28
1
0
11 Oct 2024
AdaMemento: Adaptive Memory-Assisted Policy Optimization for
  Reinforcement Learning
AdaMemento: Adaptive Memory-Assisted Policy Optimization for Reinforcement Learning
Renye Yan
Yaozhong Gan
You Wu
Junliang Xing
Ling Liangn
Yeshang Zhu
Yimao Cai
OffRL
26
1
0
06 Oct 2024
Exploratory Optimal Stopping: A Singular Control Formulation
Exploratory Optimal Stopping: A Singular Control Formulation
Jodi Dianetti
Giorgio Ferrari
Renyuan Xu
31
3
0
18 Aug 2024
Take a Step and Reconsider: Sequence Decoding for Self-Improved Neural
  Combinatorial Optimization
Take a Step and Reconsider: Sequence Decoding for Self-Improved Neural Combinatorial Optimization
Jonathan Pirnay
D. G. Grimm
BDL
53
3
0
24 Jul 2024
On shallow planning under partial observability
On shallow planning under partial observability
Randy Lefebvre
Audrey Durand
OffRL
33
0
0
22 Jul 2024
Feasibility Consistent Representation Learning for Safe Reinforcement
  Learning
Feasibility Consistent Representation Learning for Safe Reinforcement Learning
Zhepeng Cen
Yi-Fan Yao
Zuxin Liu
Ding Zhao
OffRL
40
3
0
20 May 2024
A Survey on Self-Evolution of Large Language Models
A Survey on Self-Evolution of Large Language Models
Zhengwei Tao
Ting-En Lin
Xiancai Chen
Hangyu Li
Yuchuan Wu
Yongbin Li
Zhi Jin
Fei Huang
Dacheng Tao
Jingren Zhou
LRM
LM&Ro
57
22
0
22 Apr 2024
Code as Reward: Empowering Reinforcement Learning with VLMs
Code as Reward: Empowering Reinforcement Learning with VLMs
David Venuto
Sami Nur Islam
Martin Klissarov
Doina Precup
Sherry Yang
Ankit Anand
VLM
28
9
0
07 Feb 2024
Learning Neural Traffic Rules
Learning Neural Traffic Rules
Xuan Zhang
Xifeng Gao
Kui Wu
Zherong Pan
27
0
0
03 Dec 2023
Behavior Alignment via Reward Function Optimization
Behavior Alignment via Reward Function Optimization
Dhawal Gupta
Yash Chandak
Scott M. Jordan
Philip S. Thomas
Bruno Castro da Silva
19
10
0
29 Oct 2023
Neural Combinatorial Optimization with Heavy Decoder: Toward Large Scale
  Generalization
Neural Combinatorial Optimization with Heavy Decoder: Toward Large Scale Generalization
Fu Luo
Xi Lin
Fei Liu
Qingfu Zhang
Zhenkun Wang
35
57
0
12 Oct 2023
Collaborative Route Planning of UAVs, Workers and Cars for Crowdsensing
  in Disaster Response
Collaborative Route Planning of UAVs, Workers and Cars for Crowdsensing in Disaster Response
Lei Han
Chunyu Tu
Zhiwen Yu
Zhiyong Yu
Weihua Shan
Liang Wang
Bin Guo
14
2
0
21 Aug 2023
Diversity is Strength: Mastering Football Full Game with Interactive
  Reinforcement Learning of Multiple AIs
Diversity is Strength: Mastering Football Full Game with Interactive Reinforcement Learning of Multiple AIs
Chenglu Sun
Shuo Shen
Sijia Xu
Weidong Zhang
22
1
0
28 Jun 2023
Deep Reinforcement Learning for mmWave Initial Beam Alignment
Deep Reinforcement Learning for mmWave Initial Beam Alignment
Daniel Tandler
Sebastian Dörner
Marc Gauger
S. Brink
14
2
0
17 Feb 2023
WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments
WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments
Xi Chen
Tianyuan Shi
Qing Zhao
Yuchen Sun
Yunfei Gao
Xiangjun Wang
25
2
0
14 Oct 2022
AutoCAT: Reinforcement Learning for Automated Exploration of
  Cache-Timing Attacks
AutoCAT: Reinforcement Learning for Automated Exploration of Cache-Timing Attacks
Mulong Luo
Wenjie Xiong
G. G. Lee
Yueying Li
Xiaomeng Yang
Amy Zhang
Yuandong Tian
Hsien-Hsin S. Lee
G. E. Suh
AAML
40
10
0
17 Aug 2022
HTRON:Efficient Outdoor Navigation with Sparse Rewards via Heavy Tailed
  Adaptive Reinforce Algorithm
HTRON:Efficient Outdoor Navigation with Sparse Rewards via Heavy Tailed Adaptive Reinforce Algorithm
K. Weerakoon
Souradip Chakraborty
N. Karapetyan
A. Sathyamoorthy
Amrit Singh Bedi
Tianyi Zhou
20
15
0
08 Jul 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in
  Environments with Sparse Rewards: What and When to Share?
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
19
9
0
24 Feb 2022
Discovering and Exploiting Sparse Rewards in a Learned Behavior Space
Discovering and Exploiting Sparse Rewards in a Learned Behavior Space
Giuseppe Paolo
Alexandre Coninx
Alban Laflaquière
Stéphane Doncieux
14
3
0
02 Nov 2021
A Reinforcement Learning Approach to Parameter Selection for Distributed
  Optimal Power Flow
A Reinforcement Learning Approach to Parameter Selection for Distributed Optimal Power Flow
Tai-Yin Chiu
Alyssa Kody
Youngdae Kim
Kibaek Kim
Daniel K. Molzahn
11
20
0
22 Oct 2021
ScheduleNet: Learn to solve multi-agent scheduling problems with
  reinforcement learning
ScheduleNet: Learn to solve multi-agent scheduling problems with reinforcement learning
Junyoung Park
Sanjar Bakhtiyar
Jinkyoo Park
13
38
0
06 Jun 2021
Goal-constrained Sparse Reinforcement Learning for End-to-End Driving
Goal-constrained Sparse Reinforcement Learning for End-to-End Driving
Pranav Agarwal
Pierre de Beaucorps
Raoul de Charette
26
3
0
16 Mar 2021
Locally Persistent Exploration in Continuous Control Tasks with Sparse
  Rewards
Locally Persistent Exploration in Continuous Control Tasks with Sparse Rewards
Susan Amin
Maziar Gomrokchi
Hossein Aboutalebi
Harsh Satija
Doina Precup
14
16
0
26 Dec 2020
HMRL: Hyper-Meta Learning for Sparse Reward Reinforcement Learning
  Problem
HMRL: Hyper-Meta Learning for Sparse Reward Reinforcement Learning Problem
Yun Hua
Xiangfeng Wang
Bo Jin
Wenhao Li
Junchi Yan
Xiaofeng He
H. Zha
OffRL
8
9
0
11 Feb 2020
1