ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2406.05064
  4. Cited By
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning

Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning

7 June 2024
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert Nowak
ArXivPDFHTML

Papers citing "Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning"

12 / 12 papers shown
Title
HVAC-DPT: A Decision Pretrained Transformer for HVAC Control
HVAC-DPT: A Decision Pretrained Transformer for HVAC Control
Anaïs Berkes
AI4CE
69
0
0
29 Nov 2024
Efficient Frameworks for Generalized Low-Rank Matrix Bandit Problems
Efficient Frameworks for Generalized Low-Rank Matrix Bandit Problems
Yue Kang
Cho-Jui Hsieh
T. C. Lee
37
17
0
14 Jan 2024
Self-supervised Pretraining for Decision Foundation Model: Formulation,
  Pipeline and Challenges
Self-supervised Pretraining for Decision Foundation Model: Formulation, Pipeline and Challenges
Xiaoqian Liu
Jianbin Jiao
Junge Zhang
OffRL
LRM
34
2
0
29 Dec 2023
Multi-task Representation Learning for Pure Exploration in Bilinear
  Bandits
Multi-task Representation Learning for Pure Exploration in Bilinear Bandits
Subhojyoti Mukherjee
Qiaomin Xie
Josiah P. Hanna
Robert D. Nowak
40
6
0
01 Nov 2023
Transformers are Provably Optimal In-context Estimators for Wireless Communications
Transformers are Provably Optimal In-context Estimators for Wireless Communications
Vishnu Teja Kunde
Vicram Rajagopalan
Chandra Shekhara Kaushik Valmeekam
Krishna R. Narayanan
S. Shakkottai
D. Kalathil
J. Chamberland
35
4
0
01 Nov 2023
Structured State Space Models for In-Context Reinforcement Learning
Structured State Space Models for In-Context Reinforcement Learning
Chris Xiaoxuan Lu
Yannick Schroecker
Albert Gu
Emilio Parisotto
Jakob N. Foerster
Satinder Singh
Feryal M. P. Behbahani
AI4TS
94
81
0
07 Mar 2023
Tractable Optimality in Episodic Latent MABs
Tractable Optimality in Episodic Latent MABs
Jeongyeol Kwon
Yonathan Efroni
C. Caramanis
Shie Mannor
48
3
0
05 Oct 2022
Partially Observable Markov Decision Processes in Robotics: A Survey
Partially Observable Markov Decision Processes in Robotics: A Survey
M. Lauri
David Hsu
J. Pajarinen
53
95
0
21 Sep 2022
A Review of Safe Reinforcement Learning: Methods, Theory and
  Applications
A Review of Safe Reinforcement Learning: Methods, Theory and Applications
Shangding Gu
Longyu Yang
Yali Du
Guang Chen
Florian Walter
Jun Wang
Alois C. Knoll
OffRL
AI4TS
115
237
0
20 May 2022
COMBO: Conservative Offline Model-Based Policy Optimization
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
219
413
0
16 Feb 2021
An Introduction to Deep Reinforcement Learning
An Introduction to Deep Reinforcement Learning
Vincent François-Lavet
Peter Henderson
Riashat Islam
Marc G. Bellemare
Joelle Pineau
OffRL
AI4CE
80
1,231
0
30 Nov 2018
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
314
11,681
0
09 Mar 2017
1