ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.01477
  4. Cited By
A Reduction-based Framework for Sequential Decision Making with Delayed
  Feedback

A Reduction-based Framework for Sequential Decision Making with Delayed Feedback

3 February 2023
Yunchang Yang
Hangshi Zhong
Tianhao Wu
B. Liu
Liwei Wang
S. Du
    OffRL
ArXivPDFHTML

Papers citing "A Reduction-based Framework for Sequential Decision Making with Delayed Feedback"

11 / 11 papers shown
Title
Rainbow Delay Compensation: A Multi-Agent Reinforcement Learning Framework for Mitigating Delayed Observation
Rainbow Delay Compensation: A Multi-Agent Reinforcement Learning Framework for Mitigating Delayed Observation
Songchen Fu
Siang Chen
Shaojing Zhao
Letian Bai
Ta Li
Yonghong Yan
96
0
0
06 May 2025
Delayed Feedback in Generalised Linear Bandits Revisited
Delayed Feedback in Generalised Linear Bandits Revisited
Benjamin Howson
Ciara Pike-Burke
Sarah Filippi
27
15
0
21 Jul 2022
Model-Based Reinforcement Learning for Offline Zero-Sum Markov Games
Model-Based Reinforcement Learning for Offline Zero-Sum Markov Games
Yuling Yan
Gen Li
Yuxin Chen
Jianqing Fan
OffRL
58
11
0
08 Jun 2022
Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback
Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback
Tiancheng Jin
Tal Lancewicki
Haipeng Luo
Yishay Mansour
Aviv A. Rosenberg
86
21
0
31 Jan 2022
When is Offline Two-Player Zero-Sum Markov Game Solvable?
When is Offline Two-Player Zero-Sum Markov Game Solvable?
Qiwen Cui
S. Du
OffRL
64
29
0
10 Jan 2022
Can Reinforcement Learning Find Stackelberg-Nash Equilibria in
  General-Sum Markov Games with Myopic Followers?
Can Reinforcement Learning Find Stackelberg-Nash Equilibria in General-Sum Markov Games with Myopic Followers?
Han Zhong
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
90
30
0
27 Dec 2021
Stochastic Multi-Armed Bandits with Unrestricted Delay Distributions
Stochastic Multi-Armed Bandits with Unrestricted Delay Distributions
Tal Lancewicki
Shahar Segal
Tomer Koren
Yishay Mansour
38
40
0
04 Jun 2021
Learning Adversarial Markov Decision Processes with Delayed Feedback
Learning Adversarial Markov Decision Processes with Delayed Feedback
Tal Lancewicki
Aviv A. Rosenberg
Yishay Mansour
48
32
0
29 Dec 2020
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and
  Algorithms
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Kai Zhang
Zhuoran Yang
Tamer Basar
123
1,197
0
24 Nov 2019
An Optimal Algorithm for Adversarial Bandits with Arbitrary Delays
An Optimal Algorithm for Adversarial Bandits with Arbitrary Delays
Julian Zimmert
Yevgeny Seldin
30
51
0
14 Oct 2019
Batched Multi-armed Bandits Problem
Batched Multi-armed Bandits Problem
Zijun Gao
Yanjun Han
Zhimei Ren
Zhengqing Zhou
94
140
0
03 Apr 2019
1