ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2207.06272
  4. Cited By
Hindsight Learning for MDPs with Exogenous Inputs

Hindsight Learning for MDPs with Exogenous Inputs

13 July 2022
Sean R. Sinclair
Felipe Vieira Frujeri
Ching-An Cheng
Luke Marshall
Hugo Barbalho
Jingling Li
Jennifer Neville
Ishai Menache
Adith Swaminathan
ArXivPDFHTML

Papers citing "Hindsight Learning for MDPs with Exogenous Inputs"

24 / 24 papers shown
Title
Improving Online Algorithms via ML Predictions
Improving Online Algorithms via ML Predictions
Ravi Kumar
Manish Purohit
Zoya Svitkina
67
318
0
25 Jul 2024
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Pouya Hamadanian
Arash Nasr-Esfahany
Malte Schwarzkopf
Siddartha Sen
MohammadIman Alizadeh
CLL
OffRL
108
0
0
04 Feb 2023
Sample-Efficient Reinforcement Learning in the Presence of Exogenous
  Information
Sample-Efficient Reinforcement Learning in the Presence of Exogenous Information
Yonathan Efroni
Dylan J. Foster
Dipendra Kumar Misra
A. Krishnamurthy
John Langford
OffRL
69
25
0
09 Jun 2022
The Statistical Complexity of Interactive Decision Making
The Statistical Complexity of Interactive Decision Making
Dylan J. Foster
Sham Kakade
Jian Qian
Alexander Rakhlin
370
180
0
27 Dec 2021
Stateful Offline Contextual Policy Evaluation and Learning
Stateful Offline Contextual Policy Evaluation and Learning
Nathan Kallus
Angela Zhou
OffRL
39
7
0
19 Oct 2021
Heuristic-Guided Reinforcement Learning
Heuristic-Guided Reinforcement Learning
Ching-An Cheng
Andrey Kolobov
Adith Swaminathan
OffRL
74
62
0
05 Jun 2021
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale
  of Pessimism
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism
Paria Rashidinejad
Banghua Zhu
Cong Ma
Jiantao Jiao
Stuart J. Russell
OffRL
215
289
0
22 Mar 2021
Universal Trading for Order Execution with Oracle Policy Distillation
Universal Trading for Order Execution with Oracle Policy Distillation
Yuchen Fang
Kan Ren
Weiqing Liu
Dong Zhou
Weinan Zhang
Jiang Bian
Yong Yu
Tie-Yan Liu
OffRL
76
46
0
28 Jan 2021
Robust Asymmetric Learning in POMDPs
Robust Asymmetric Learning in POMDPs
Andrew Warrington
J. Lavington
Adam Scibior
Mark Schmidt
Frank Wood
35
31
0
31 Dec 2020
PowerNet: Multi-agent Deep Reinforcement Learning for Scalable Powergrid
  Control
PowerNet: Multi-agent Deep Reinforcement Learning for Scalable Powergrid Control
Dong Chen
Kaian Chen
Tianshu Chu
Rui Yao
F. Qiu
Kaixiang Lin
53
67
0
24 Nov 2020
OR-Gym: A Reinforcement Learning Library for Operations Research
  Problems
OR-Gym: A Reinforcement Learning Library for Operations Research Problems
Christian D. Hubbs
Hector D. Perez
Owais Sarwar
N. Sahinidis
I. Grossmann
J. Wassick
OffRL
AI4CE
39
74
0
14 Aug 2020
Queueing Network Controls via Deep Reinforcement Learning
Queueing Network Controls via Deep Reinforcement Learning
J. Dai
Mark O. Gluzman
OffRL
104
51
0
31 Jul 2020
Training Agents using Upside-Down Reinforcement Learning
Training Agents using Upside-Down Reinforcement Learning
R. Srivastava
Pranav Shyam
Filipe Wall Mutz
Wojciech Ja'skowski
Jürgen Schmidhuber
OffRL
68
126
0
05 Dec 2019
Online Allocation and Pricing: Constant Regret via Bellman Inequalities
Online Allocation and Pricing: Constant Regret via Bellman Inequalities
Alberto Vera
Siddhartha Banerjee
I. Gurvich
OffRL
43
50
0
14 Jun 2019
Reinforcement Learning for Integer Programming: Learning to Cut
Reinforcement Learning for Integer Programming: Learning to Cut
Yunhao Tang
Shipra Agrawal
Yuri Faenza
AI4CE
58
172
0
11 Jun 2019
Off-Policy Deep Reinforcement Learning without Exploration
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto
David Meger
Doina Precup
OffRL
BDL
226
1,613
0
07 Dec 2018
Learning Scheduling Algorithms for Data Processing Clusters
Learning Scheduling Algorithms for Data Processing Clusters
Hongzi Mao
Malte Schwarzkopf
S. Venkatakrishnan
Zili Meng
Mohammad Alizadeh
OffRL
78
646
0
03 Oct 2018
Solving a New 3D Bin Packing Problem with Deep Reinforcement Learning
  Method
Solving a New 3D Bin Packing Problem with Deep Reinforcement Learning Method
Haoyuan Hu
Xiaodong Zhang
Xiaowei Yan
Longfei Wang
Yinghui Xu
57
129
0
20 Aug 2017
Thinking Fast and Slow with Deep Learning and Tree Search
Thinking Fast and Slow with Deep Learning and Tree Search
Thomas W. Anthony
Zheng Tian
David Barber
100
396
0
23 May 2017
Neural Combinatorial Optimization with Reinforcement Learning
Neural Combinatorial Optimization with Reinforcement Learning
Irwan Bello
Hieu H. Pham
Quoc V. Le
Mohammad Norouzi
Samy Bengio
158
1,490
0
29 Nov 2016
Deep Reinforcement Learning with Double Q-learning
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
167
7,641
0
22 Sep 2015
Pointer Networks
Pointer Networks
Oriol Vinyals
Meire Fortunato
Navdeep Jaitly
118
3,055
0
09 Jun 2015
Reinforcement and Imitation Learning via Interactive No-Regret Learning
Reinforcement and Imitation Learning via Interactive No-Regret Learning
Stéphane Ross
J. Andrew Bagnell
OffRL
147
264
0
23 Jun 2014
Polynomial-Time Approximation Schemes for Knapsack and Related Counting
  Problems using Branching Programs
Polynomial-Time Approximation Schemes for Knapsack and Related Counting Problems using Branching Programs
Parikshit Gopalan
Adam R. Klivans
Raghu Meka
56
27
0
18 Aug 2010
1