ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.16507
  4. Cited By
Policy Gradient Optimization of Thompson Sampling Policies

Policy Gradient Optimization of Thompson Sampling Policies

30 June 2020
Seungki Min
C. Moallemi
Daniel Russo
ArXivPDFHTML

Papers citing "Policy Gradient Optimization of Thompson Sampling Policies"

2 / 2 papers shown
Title
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
806
11,866
0
09 Mar 2017
RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning
RL2^22: Fast Reinforcement Learning via Slow Reinforcement Learning
Yan Duan
John Schulman
Xi Chen
Peter L. Bartlett
Ilya Sutskever
Pieter Abbeel
OffRL
76
1,015
0
09 Nov 2016
1