ResearchTrend.AI
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain

14 September 2021
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
    OffRL

Papers citing "Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain"

39 / 39 papers shown
Leveraging Partial SMILES Validation Scheme for Enhanced Drug Design in Reinforcement Learning Frameworks
Xinyu Wang
Jinbo Bi
Minghu Song
CLL
62
0
0
01 May 2025
Multi-Agent Reinforcement Learning for Resources Allocation Optimization: A Survey
Mohamad Abdul Hady
Siyi Hu
Mahardhika Pratama
Jimmy Cao
Ryszard Kowalczyk
24
0
0
29 Apr 2025
Inverse RL Scene Dynamics Learning for Nonlinear Predictive Control in Autonomous Vehicles
Sorin Grigorescu
Mihai V. Zaha
AI4CE
34
0
0
02 Apr 2025
A Survey of Reinforcement Learning-Based Motion Planning for Autonomous Driving: Lessons Learned from a Driving Task Perspective
Zhuoren Li
Guizhe Jin
Ran Yu
Z. Chen
Nan I. Li
...
Lu Xiong
Bo Leng
Jia Hu
I. Kolmanovsky
Dimitar Filev
51
0
0
31 Mar 2025
Adventurer: Exploration with BiGAN for Deep Reinforcement Learning
Yongshuai Liu
Xin Liu
GAN
97
2
0
24 Mar 2025
CPIG: Leveraging Consistency Policy with Intention Guidance for Multi-agent Exploration
Y. Fu
Yuanheng Zhu
Haoran Li
Zijie Zhao
Jiajun Chai
Dongbin Zhao
37
0
0
06 Nov 2024
Multi-Agent Deep Q-Network with Layer-based Communication Channel for Autonomous Internal Logistics Vehicle Scheduling in Smart Manufacturing
Mohammad Feizabadi
Arman Hosseini
Zakaria Yahouni
28
0
0
01 Nov 2024
TF-DDRL: A Transformer-enhanced Distributed DRL Technique for Scheduling IoT Applications in Edge and Cloud Computing Environments
Zhiyu Wang
M. Goudarzi
Rajkumar Buyya
OffRL
30
3
0
18 Oct 2024
A Safety Modulator Actor-Critic Method in Model-Free Safe Reinforcement Learning and Application in UAV Hovering
Qihan Qi
Xinsong Yang
Gang Xia
Daniel W. C. Ho
Pengyang Tang
26
0
0
09 Oct 2024
GravMAD: Grounded Spatial Value Maps Guided Action Diffusion for Generalized 3D Manipulation
Yangtao Chen
Zixuan Chen
Junhui Yin
Jing Huo
Pinzhuo Tian
Jieqi Shi
Yang Gao
LM&Ro
44
2
0
30 Sep 2024
Multi-agent Reinforcement Learning for Dynamic Dispatching in Material Handling Systems
Xian Yeow Lee
Haiyan Wang
Daisuke Katsumata
Takaharu Matsui
Chetan Gupta
29
1
0
27 Sep 2024
The Exploration-Exploitation Dilemma Revisited: An Entropy Perspective
Renye Yan
Yaozhong Gan
You Wu
Ling Liang
Junliang Xing
Yimao Cai
Ru Huang
30
1
0
19 Aug 2024
VL-TGS: Trajectory Generation and Selection using Vision Language Models in Mapless Outdoor Environments
Daeun Song
Jing Liang
Xuesu Xiao
Dinesh Manocha
48
4
0
05 Aug 2024
Discretizing Continuous Action Space with Unimodal Probability Distributions for On-Policy Reinforcement Learning
Yuanyang Zhu
Zhi Wang
Yuanheng Zhu
Chunlin Chen
Dongbin Zhao
16
0
0
01 Aug 2024
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Chenjia Bai
Rushuai Yang
Qiaosheng Zhang
Kang Xu
Yi Chen
Ting Xiao
Xuelong Li
OffRL
40
3
0
25 May 2024
Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning
Qiaosheng Zhang
Chenjia Bai
Shuyue Hu
Zhen Wang
Xuelong Li
37
1
0
30 Apr 2024
FPGA Divide-and-Conquer Placement using Deep Reinforcement Learning
Shang Wang
Deepak Ranganatha Sastry Mamillapalli
Tianpei Yang
Matthew E. Taylor
36
0
0
11 Apr 2024
Emergent Braitenberg-style Behaviours for Navigating the ViZDoom 'My Way Home' Labyrinth
Caleidgh Bayer
Robert J. Smith
M. Heywood
26
0
0
09 Apr 2024
Multi-Fidelity Reinforcement Learning for Time-Optimal Quadrotor Re-planning
Gilhyun Ryou
Geoffrey Wang
S. Karaman
48
3
0
13 Mar 2024
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback
Shihan Dou
Yan Liu
Haoxiang Jia
Limao Xiong
Enyu Zhou
...
Tao Ji
Rui Zheng
Qi Zhang
Xuanjing Huang
Tao Gui
LLMAG
59
30
0
02 Feb 2024
Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey on Hybrid Algorithms
Pengyi Li
Jianye Hao
Hongyao Tang
Xian Fu
Yan Zheng
Ke Tang
36
9
0
22 Jan 2024
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments
Jinyi Liu
Zhi Wang
Yan Zheng
Jianye Hao
Chenjia Bai
Junjie Ye
Zhen Wang
Haiyin Piao
Yang Sun
27
6
0
19 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
30
8
0
15 Dec 2023
Small batch deep reinforcement learning
J. Obando-Ceron
Marc G. Bellemare
Pablo Samuel Castro
VLM
32
14
0
05 Oct 2023
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning
Jinyi Liu
Y. Ma
Jianye Hao
Yujing Hu
Yan Zheng
Tangjie Lv
Changjie Fan
OffRL
44
2
0
27 Jun 2023
Large Sequence Models for Sequential Decision-Making: A Survey
Muning Wen
Runji Lin
Hanjing Wang
Yaodong Yang
Ying Wen
Luo Mai
J. Wang
Haifeng Zhang
Weinan Zhang
LM&Ro
LRM
37
35
0
24 Jun 2023
Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
Kai-Wen Zhao
Yi-An Ma
Jianye Hao
Jinyi Liu
Yan Zheng
Zhaopeng Meng
OffRL
OnRL
18
12
0
12 Jun 2023
Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning
Wenhao Li
Dan Qiao
Baoxiang Wang
Xiangfeng Wang
Bo Jin
H. Zha
35
5
0
18 May 2023
Behavior Contrastive Learning for Unsupervised Skill Discovery
Rushuai Yang
Chenjia Bai
Hongyi Guo
Siyuan Li
Bin Zhao
Zhen Wang
Peng Liu
Xuelong Li
SSL
27
16
0
08 May 2023
SVDE: Scalable Value-Decomposition Exploration for Cooperative Multi-Agent Reinforcement Learning
Shuhan Qi
Shuhao Zhang
Qiang-qiang Wang
Jia-jia Zhang
Jing Xiao
X. Wang
26
0
0
16 Mar 2023
Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning
T. Kanazawa
Chetan Gupta
24
0
0
15 Mar 2023
Progress and summary of reinforcement learning on energy management of MPS-EV
Jincheng Hu
Yang Lin
Liang Chu
Zhuoran Hou
Jihan Li
Jingjing Jiang
Yuanjian Zhang
18
11
0
08 Nov 2022
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model
Yifu Yuan
Jianye Hao
Fei Ni
Yao Mu
Yan Zheng
Yujing Hu
Jinyi Liu
Yingfeng Chen
Changjie Fan
71
12
0
02 Oct 2022
A Policy Resonance Approach to Solve the Problem of Responsibility Diffusion in Multiagent Reinforcement Learning
Qing Fu
Tenghai Qiu
Jianqiang Yi
Zhiqiang Pu
Xiaolin Ai
Wanmai Yuan
21
0
0
16 Aug 2022
Image Augmentation Based Momentum Memory Intrinsic Reward for Sparse Reward Visual Scenes
Zheng Fang
Biao Zhao
Guizhong Liu
16
2
0
19 May 2022
Towards Safe Reinforcement Learning with a Safety Editor Policy
Haonan Yu
Wei-ping Xu
Haichao Zhang
OffRL
66
31
0
28 Jan 2022
Curious Explorer: a provable exploration strategy in Policy Learning
M. Miani
Maurizio Parton
M. Romito
37
0
0
29 Jun 2021
MAVEN: Multi-Agent Variational Exploration
Anuj Mahajan
Tabish Rashid
Mikayel Samvelyan
Shimon Whiteson
DRL
135
355
0
16 Oct 2019
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning
Y. Gal
Zoubin Ghahramani
UQCV
BDL
285
9,136
0
06 Jun 2015