1910.09281
Dealing with Sparse Rewards in Reinforcement Learning
21 October 2019
J. Hare
Papers citing
"Dealing with Sparse Rewards in Reinforcement Learning"
26 / 26 papers shown
Title
Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards
Zijing Hu
Fengda Zhang
Long Chen
Kun Kuang
Jiahui Li
Kaifeng Gao
Jun Xiao
X. Wang
Wenwu Zhu
EGVM
56
0
0
14 Mar 2025
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Kaiyan Zhao
Yiming Wang
Yuyang Chen
Yan Li
Leong Hou U
Xiaoguang Niu
36
1
0
27 Oct 2024
Automated Rewards via LLM-Generated Progress Functions
Vishnu Sarukkai
Brennan Shacklett
Zander Majercik
Kush S. Bhatia
Christopher Ré
Kayvon Fatahalian
28
1
0
11 Oct 2024
AdaMemento: Adaptive Memory-Assisted Policy Optimization for Reinforcement Learning
Renye Yan
Yaozhong Gan
You Wu
Junliang Xing
Ling Liang
Yeshang Zhu
Yimao Cai
OffRL
26
1
0
06 Oct 2024
Exploratory Optimal Stopping: A Singular Control Formulation
Jodi Dianetti
Giorgio Ferrari
Renyuan Xu
31
3
0
18 Aug 2024
Take a Step and Reconsider: Sequence Decoding for Self-Improved Neural Combinatorial Optimization
Jonathan Pirnay
D. G. Grimm
BDL
53
3
0
24 Jul 2024
On shallow planning under partial observability
Randy Lefebvre
Audrey Durand
OffRL
33
0
0
22 Jul 2024
Feasibility Consistent Representation Learning for Safe Reinforcement Learning
Zhepeng Cen
Yi-Fan Yao
Zuxin Liu
Ding Zhao
OffRL
40
3
0
20 May 2024
A Survey on Self-Evolution of Large Language Models
Zhengwei Tao
Ting-En Lin
Xiancai Chen
Hangyu Li
Yuchuan Wu
Yongbin Li
Zhi Jin
Fei Huang
Dacheng Tao
Jingren Zhou
LRM
LM&Ro
57
22
0
22 Apr 2024
Code as Reward: Empowering Reinforcement Learning with VLMs
David Venuto
Sami Nur Islam
Martin Klissarov
Doina Precup
Sherry Yang
Ankit Anand
VLM
28
9
0
07 Feb 2024
Learning Neural Traffic Rules
Xuan Zhang
Xifeng Gao
Kui Wu
Zherong Pan
27
0
0
03 Dec 2023
Behavior Alignment via Reward Function Optimization
Dhawal Gupta
Yash Chandak
Scott M. Jordan
Philip S. Thomas
Bruno Castro da Silva
19
10
0
29 Oct 2023
Neural Combinatorial Optimization with Heavy Decoder: Toward Large Scale Generalization
Fu Luo
Xi Lin
Fei Liu
Qingfu Zhang
Zhenkun Wang
35
57
0
12 Oct 2023
Collaborative Route Planning of UAVs, Workers and Cars for Crowdsensing in Disaster Response
Lei Han
Chunyu Tu
Zhiwen Yu
Zhiyong Yu
Weihua Shan
Liang Wang
Bin Guo
14
2
0
21 Aug 2023
Diversity is Strength: Mastering Football Full Game with Interactive Reinforcement Learning of Multiple AIs
Chenglu Sun
Shuo Shen
Sijia Xu
Weidong Zhang
22
1
0
28 Jun 2023
Deep Reinforcement Learning for mmWave Initial Beam Alignment
Daniel Tandler
Sebastian Dörner
Marc Gauger
S. Brink
14
2
0
17 Feb 2023
WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments
Xi Chen
Tianyuan Shi
Qing Zhao
Yuchen Sun
Yunfei Gao
Xiangjun Wang
25
2
0
14 Oct 2022
AutoCAT: Reinforcement Learning for Automated Exploration of Cache-Timing Attacks
Mulong Luo
Wenjie Xiong
G. G. Lee
Yueying Li
Xiaomeng Yang
Amy Zhang
Yuandong Tian
Hsien-Hsin S. Lee
G. E. Suh
AAML
40
10
0
17 Aug 2022
HTRON: Efficient Outdoor Navigation with Sparse Rewards via Heavy Tailed Adaptive Reinforce Algorithm
K. Weerakoon
Souradip Chakraborty
N. Karapetyan
A. Sathyamoorthy
Amrit Singh Bedi
Tianyi Zhou
20
15
0
08 Jul 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
19
9
0
24 Feb 2022
Discovering and Exploiting Sparse Rewards in a Learned Behavior Space
Giuseppe Paolo
Alexandre Coninx
Alban Laflaquière
Stéphane Doncieux
14
3
0
02 Nov 2021
A Reinforcement Learning Approach to Parameter Selection for Distributed Optimal Power Flow
Tai-Yin Chiu
Alyssa Kody
Youngdae Kim
Kibaek Kim
Daniel K. Molzahn
11
20
0
22 Oct 2021
ScheduleNet: Learn to solve multi-agent scheduling problems with reinforcement learning
Junyoung Park
Sanjar Bakhtiyar
Jinkyoo Park
13
38
0
06 Jun 2021
Goal-constrained Sparse Reinforcement Learning for End-to-End Driving
Pranav Agarwal
Pierre de Beaucorps
Raoul de Charette
26
3
0
16 Mar 2021
Locally Persistent Exploration in Continuous Control Tasks with Sparse Rewards
Susan Amin
Maziar Gomrokchi
Hossein Aboutalebi
Harsh Satija
Doina Precup
14
16
0
26 Dec 2020
HMRL: Hyper-Meta Learning for Sparse Reward Reinforcement Learning Problem
Yun Hua
Xiangfeng Wang
Bo Jin
Wenhao Li
Junchi Yan
Xiaofeng He
H. Zha
OffRL
8
9
0
11 Feb 2020