ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.05628
  4. Cited By
An Alternative Softmax Operator for Reinforcement Learning
v1v2v3v4v5 (latest)

An Alternative Softmax Operator for Reinforcement Learning

16 December 2016
Kavosh Asadi
Michael L. Littman
ArXiv (abs)PDFHTML

Papers citing "An Alternative Softmax Operator for Reinforcement Learning"

3 / 3 papers shown
Title
Bridging the Gap Between Value and Policy Based Reinforcement Learning
Bridging the Gap Between Value and Policy Based Reinforcement Learning
Ofir Nachum
Mohammad Norouzi
Kelvin Xu
Dale Schuurmans
174
476
0
28 Feb 2017
Algorithms for multi-armed bandit problems
Algorithms for multi-armed bandit problems
Volodymyr Kuleshov
Doina Precup
150
351
0
25 Feb 2014
Apprenticeship Learning using Inverse Reinforcement Learning and
  Gradient Methods
Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods
Gergely Neu
Csaba Szepesvári
84
243
0
20 Jun 2012
1