2406.05064
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
7 June 2024
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert Nowak
Papers citing
"Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning"
12 / 12 papers shown
Title
HVAC-DPT: A Decision Pretrained Transformer for HVAC Control
Anaïs Berkes
AI4CE
69
0
0
29 Nov 2024
Efficient Frameworks for Generalized Low-Rank Matrix Bandit Problems
Yue Kang
Cho-Jui Hsieh
T. C. Lee
37
17
0
14 Jan 2024
Self-supervised Pretraining for Decision Foundation Model: Formulation, Pipeline and Challenges
Xiaoqian Liu
Jianbin Jiao
Junge Zhang
OffRL
LRM
34
2
0
29 Dec 2023
Multi-task Representation Learning for Pure Exploration in Bilinear Bandits
Subhojyoti Mukherjee
Qiaomin Xie
Josiah P. Hanna
Robert D. Nowak
40
6
0
01 Nov 2023
Transformers are Provably Optimal In-context Estimators for Wireless Communications
Vishnu Teja Kunde
Vicram Rajagopalan
Chandra Shekhara Kaushik Valmeekam
Krishna R. Narayanan
S. Shakkottai
D. Kalathil
J. Chamberland
35
4
0
01 Nov 2023
Structured State Space Models for In-Context Reinforcement Learning
Chris Xiaoxuan Lu
Yannick Schroecker
Albert Gu
Emilio Parisotto
Jakob N. Foerster
Satinder Singh
Feryal M. P. Behbahani
AI4TS
94
81
0
07 Mar 2023
Tractable Optimality in Episodic Latent MABs
Jeongyeol Kwon
Yonathan Efroni
C. Caramanis
Shie Mannor
48
3
0
05 Oct 2022
Partially Observable Markov Decision Processes in Robotics: A Survey
M. Lauri
David Hsu
J. Pajarinen
53
95
0
21 Sep 2022
A Review of Safe Reinforcement Learning: Methods, Theory and Applications
Shangding Gu
Longyu Yang
Yali Du
Guang Chen
Florian Walter
Jun Wang
Alois C. Knoll
OffRL
AI4TS
115
237
0
20 May 2022
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
219
413
0
16 Feb 2021
An Introduction to Deep Reinforcement Learning
Vincent François-Lavet
Peter Henderson
Riashat Islam
Marc G. Bellemare
Joelle Pineau
OffRL
AI4CE
80
1,231
0
30 Nov 2018
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
314
11,681
0
09 Mar 2017