ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2012.04687
  4. Cited By
Diluted Near-Optimal Expert Demonstrations for Guiding Dialogue
  Stochastic Policy Optimisation

Diluted Near-Optimal Expert Demonstrations for Guiding Dialogue Stochastic Policy Optimisation

25 November 2020
Thibault Cordier
Tanguy Urvoy
L. Rojas-Barahona
F. Lefèvre
ArXivPDFHTML

Papers citing "Diluted Near-Optimal Expert Demonstrations for Guiding Dialogue Stochastic Policy Optimisation"

2 / 2 papers shown
Title
What Does The User Want? Information Gain for Hierarchical Dialogue
  Policy Optimisation
What Does The User Want? Information Gain for Hierarchical Dialogue Policy Optimisation
Christian Geishauser
Songbo Hu
Hsien-Chin Lin
Nurul Lubis
Michael Heck
Shutong Feng
Carel van Niekerk
Milica Gavsić
OffRL
18
3
0
15 Sep 2021
Semi-Supervised Dialogue Policy Learning via Stochastic Reward
  Estimation
Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation
Xinting Huang
Jianzhong Qi
Yu Sun
Rui Zhang
OffRL
69
18
0
09 May 2020
1