ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.03743
  4. Cited By
Reinforcement Learning in Reward-Mixing MDPs

Reinforcement Learning in Reward-Mixing MDPs

7 October 2021
Jeongyeol Kwon
Yonathan Efroni
C. Caramanis
Shie Mannor
ArXivPDFHTML

Papers citing "Reinforcement Learning in Reward-Mixing MDPs"

14 / 14 papers shown
Title
A Classification View on Meta Learning Bandits
A Classification View on Meta Learning Bandits
Mirco Mutti
Jeongyeol Kwon
Shie Mannor
Aviv Tamar
23
0
0
06 Apr 2025
Test-Time Regret Minimization in Meta Reinforcement Learning
Test-Time Regret Minimization in Meta Reinforcement Learning
Mirco Mutti
Aviv Tamar
23
4
0
04 Jun 2024
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy
  Evaluation
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation
Jeongyeol Kwon
Shie Mannor
C. Caramanis
Yonathan Efroni
OffRL
37
2
0
03 Jun 2024
Tractable Optimality in Episodic Latent MABs
Tractable Optimality in Episodic Latent MABs
Jeongyeol Kwon
Yonathan Efroni
C. Caramanis
Shie Mannor
50
3
0
05 Oct 2022
Reward-Mixing MDPs with a Few Latent Contexts are Learnable
Reward-Mixing MDPs with a Few Latent Contexts are Learnable
Jeongyeol Kwon
Yonathan Efroni
C. Caramanis
Shie Mannor
29
5
0
05 Oct 2022
Learning in Observable POMDPs, without Computationally Intractable
  Oracles
Learning in Observable POMDPs, without Computationally Intractable Oracles
Noah Golowich
Ankur Moitra
Dhruv Rohatgi
24
26
0
07 Jun 2022
Reinforcement Learning with Brain-Inspired Modulation can Improve
  Adaptation to Environmental Changes
Reinforcement Learning with Brain-Inspired Modulation can Improve Adaptation to Environmental Changes
Eric Chalmers
Artur Luczak
25
3
0
19 May 2022
When Is Partially Observable Reinforcement Learning Not Scary?
When Is Partially Observable Reinforcement Learning Not Scary?
Qinghua Liu
Alan Chung
Csaba Szepesvári
Chi Jin
14
92
0
19 Apr 2022
Understanding Curriculum Learning in Policy Optimization for Online
  Combinatorial Optimization
Understanding Curriculum Learning in Policy Optimization for Online Combinatorial Optimization
Runlong Zhou
Zelin He
Yuandong Tian
Yi Wu
S. Du
OffRL
18
3
0
11 Feb 2022
Coordinated Attacks against Contextual Bandits: Fundamental Limits and
  Defense Mechanisms
Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms
Jeongyeol Kwon
Yonathan Efroni
C. Caramanis
Shie Mannor
AAML
45
6
0
30 Jan 2022
Planning in Observable POMDPs in Quasipolynomial Time
Planning in Observable POMDPs in Quasipolynomial Time
Noah Golowich
Ankur Moitra
Dhruv Rohatgi
21
27
0
12 Jan 2022
Provably Efficient Reinforcement Learning with Linear Function
  Approximation Under Adaptivity Constraints
Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints
Chi Jin
Zhuoran Yang
Zhaoran Wang
OffRL
122
166
0
06 Jan 2021
Reward-Free Exploration for Reinforcement Learning
Reward-Free Exploration for Reinforcement Learning
Chi Jin
A. Krishnamurthy
Max Simchowitz
Tiancheng Yu
OffRL
109
194
0
07 Feb 2020
Learning mixtures of structured distributions over discrete domains
Learning mixtures of structured distributions over discrete domains
Siu On Chan
Ilias Diakonikolas
Rocco A. Servedio
Xiaorui Sun
67
83
0
02 Oct 2012
1