2302.01477
A Reduction-based Framework for Sequential Decision Making with Delayed Feedback
3 February 2023
Yunchang Yang
Hangshi Zhong
Tianhao Wu
B. Liu
Liwei Wang
S. Du
OffRL
Papers citing "A Reduction-based Framework for Sequential Decision Making with Delayed Feedback"
11 / 11 papers shown
Rainbow Delay Compensation: A Multi-Agent Reinforcement Learning Framework for Mitigating Delayed Observation
Songchen Fu
Siang Chen
Shaojing Zhao
Letian Bai
Ta Li
Yonghong Yan
96
0
0
06 May 2025
Delayed Feedback in Generalised Linear Bandits Revisited
Benjamin Howson
Ciara Pike-Burke
Sarah Filippi
27
15
0
21 Jul 2022
Model-Based Reinforcement Learning for Offline Zero-Sum Markov Games
Yuling Yan
Gen Li
Yuxin Chen
Jianqing Fan
OffRL
58
11
0
08 Jun 2022
Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback
Tiancheng Jin
Tal Lancewicki
Haipeng Luo
Yishay Mansour
Aviv A. Rosenberg
86
21
0
31 Jan 2022
When is Offline Two-Player Zero-Sum Markov Game Solvable?
Qiwen Cui
S. Du
OffRL
64
29
0
10 Jan 2022
Can Reinforcement Learning Find Stackelberg-Nash Equilibria in General-Sum Markov Games with Myopic Followers?
Han Zhong
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
90
30
0
27 Dec 2021
Stochastic Multi-Armed Bandits with Unrestricted Delay Distributions
Tal Lancewicki
Shahar Segal
Tomer Koren
Yishay Mansour
38
40
0
04 Jun 2021
Learning Adversarial Markov Decision Processes with Delayed Feedback
Tal Lancewicki
Aviv A. Rosenberg
Yishay Mansour
48
32
0
29 Dec 2020
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Kai Zhang
Zhuoran Yang
Tamer Basar
123
1,197
0
24 Nov 2019
An Optimal Algorithm for Adversarial Bandits with Arbitrary Delays
Julian Zimmert
Yevgeny Seldin
30
51
0
14 Oct 2019
Batched Multi-armed Bandits Problem
Zijun Gao
Yanjun Han
Zhimei Ren
Zhengqing Zhou
94
140
0
03 Apr 2019