ResearchTrend.AI
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-Oriented Dialogue Policy Learning

28 February 2022
Wai-Chung Kwan
Hongru Wang
Huimin Wang
Kam-Fai Wong
    OffRL

Papers citing "A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-Oriented Dialogue Policy Learning"

6 / 6 papers shown
Title
Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition
David M. Chan
Shalini Ghosh
Hitesh Tulsiani
Ariya Rastrow
Björn Hoffmeister
28
1
0
04 Jan 2024
TOD-Flow: Modeling the Structure of Task-Oriented Dialogues
Sungryull Sohn
Yiwei Lyu
Anthony Z. Liu
Lajanugen Logeswaran
Dong-Ki Kim
Dongsub Shim
Honglak Lee
21
3
0
07 Dec 2023
Why Guided Dialog Policy Learning performs well? Understanding the role of adversarial learning and its alternative
Sho Shimoyama
Tetsuro Morimura
Kenshi Abe
Toda Takamichi
Yuta Tomomatsu
Masakazu Sugiyama
Asahi Hentona
Yuuki Azuma
Hirotaka Ninomiya
OffRL
21
0
0
13 Jul 2023
Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation
Xinting Huang
Jianzhong Qi
Yu Sun
Rui Zhang
OffRL
61
18
0
09 May 2020
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
317
11,681
0
09 Mar 2017
A Sequence-to-Sequence Model for User Simulation in Spoken Dialogue Systems
Layla El Asri
Jing He
Kaheer Suleman
57
117
0
30 Jun 2016