Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management

1 July 2017
Pei-hao Su
Paweł Budzianowski
Stefan Ultes
Milica Gasic
S. Young
    OffRL

Papers citing "Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management"

41 / 41 papers shown
Successor Features for Efficient Multisubject Controlled Text Generation
Mengyao Cao
Mehdi Fatemi
Jackie Chi Kit Cheung
Samira Shabanian
BDL
89
0
0
03 Nov 2023
End-to-End Evaluation of a Spoken Dialogue System for Learning Basic Mathematics
Eda Okur
Saurav Sahay
Roddy Fuentes Alba
L. Nachman
73
6
0
07 Nov 2022
NLU for Game-based Learning in Real: Initial Evaluations
Eda Okur
Saurav Sahay
L. Nachman
LLMAG
37
2
0
27 May 2022
Taming Continuous Posteriors for Latent Variational Dialogue Policies
Marin Vlastelica
P. Ernst
Gyuri Szarvas
BDL, OffRL
75
1
0
16 May 2022
Data Augmentation with Paraphrase Generation and Entity Extraction for Multimodal Dialogue System
Eda Okur
Saurav Sahay
L. Nachman
55
25
0
09 May 2022
GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-Supervised Learning and Explicit Policy Injection
Wanwei He
Yinpei Dai
Yinhe Zheng
Yuchuan Wu
Zhen Cao
...
Min Yang
Feiling Huang
Luo Si
Jian Sun
Yongbin Li
VLM
140
159
0
29 Nov 2021
Reinforcement Explanation Learning
Siddhant Agarwal
Owais Iqbal
Sree Aditya Buridi
Madda Manjusha
Abir Das
FAtt
40
0
0
26 Nov 2021
Transferable Dialogue Systems and User Simulators
Bo-Hsiang Tseng
Yinpei Dai
Florian Kreyssig
Bill Byrne
106
54
0
25 Jul 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Min Zhang
OffRL
123
75
0
01 Jan 2021
Data-Efficient Methods for Dialogue Systems
Igor Shalyminov
75
0
0
05 Dec 2020
Diluted Near-Optimal Expert Demonstrations for Guiding Dialogue Stochastic Policy Optimisation
Thibault Cordier
Tanguy Urvoy
L. Rojas-Barahona
F. Lefèvre
128
5
0
25 Nov 2020
Human-centric Dialog Training via Offline Reinforcement Learning
Natasha Jaques
J. Shen
Asma Ghandeharioun
Craig Ferguson
Àgata Lapedriza
Noah J. Jones
S. Gu
Rosalind W. Picard
OffRL
86
96
0
12 Oct 2020
Structured Hierarchical Dialogue Policy with Graph Neural Networks
Zhi Chen
Xiaoyuan Liu
Lu Chen
Kai Yu
BDL, OffRL
38
3
0
22 Sep 2020
Distributed Structured Actor-Critic Reinforcement Learning for Universal Dialogue Management
Zhi Chen
Lu Chen
Xiaoyuan Liu
Kai Yu
87
20
0
22 Sep 2020
Rethinking Supervised Learning and Reinforcement Learning in Task-Oriented Dialogue Systems
Ziming Li
Julia Kiseleva
Maarten de Rijke
OffRL
59
22
0
21 Sep 2020
Document-editing Assistants and Model-based Reinforcement Learning as a Path to Conversational AI
Katya Kudashkina
P. Pilarski
R. Sutton
KELM
84
6
0
27 Aug 2020
A Survey on Dialog Management: Recent Advances and Challenges
Yinpei Dai
Huihua Yu
Yixuan Jiang
Chengguang Tang
Yongbin Li
Jian Sun
OffRL, VLM
81
20
0
05 May 2020
Multi-Domain Dialogue Acts and Response Co-Generation
Kai Wang
Junfeng Tian
Rui Wang
Xiaojun Quan
Jianxing Yu
74
60
0
26 Apr 2020
Show Us the Way: Learning to Manage Dialog from Demonstrations
Gabriel Gordon-Hall
P. Gorinski
Gerasimos Lampouras
Ignacio Iacobacci
OffRL
111
11
0
17 Apr 2020
Hierarchical Reinforcement Learning for Open-Domain Dialog
Abdelrhman Saleh
Natasha Jaques
Asma Ghandeharioun
J. Shen
Rosalind W. Picard
OffRL
88
59
0
17 Sep 2019
How to Build User Simulators to Train RL-based Dialog Systems
Weiyan Shi
Kun Qian
Xuewei Wang
Zhou Yu
OffRL
72
65
0
03 Sep 2019
Modeling Multi-Action Policy for Task-Oriented Dialogues
Lei Shu
Hu Xu
Bing-Quan Liu
Piero Molino
39
9
0
30 Aug 2019
Reinforcement Learning for Personalized Dialogue Management
Floris den Hengst
Mark Hoogendoorn
F. V. Harmelen
Joost Bosman
OffRL
49
20
0
01 Aug 2019
Collaborative Multi-Agent Dialogue Model Training Via Reinforcement Learning
Alexandros Papangelis
Yi-Chia Wang
Piero Molino
Gokhan Tur
89
32
0
11 Jul 2019
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog
Natasha Jaques
Asma Ghandeharioun
J. Shen
Craig Ferguson
Àgata Lapedriza
Noah J. Jones
S. Gu
Rosalind W. Picard
OffRL
159
344
0
30 Jun 2019
AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning
Lu Chen
Zhi Chen
Bowen Tan
Sishan Long
Milica Gasic
Kai Yu
79
35
0
27 May 2019
Rethinking Action Spaces for Reinforcement Learning in End-to-end Dialog Agents with Latent Variable Models
Tiancheng Zhao
Kaige Xie
M. Eskénazi
107
142
0
23 Feb 2019
Addressing Objects and Their Relations: The Conversational Entity Dialogue Model
Stefan Ultes
Paweł Budzianowski
I. Casanueva
L. Rojas-Barahona
Bo-Hsiang Tseng
Yen-Chen Wu
S. Young
Milica Gasic
50
17
0
05 Jan 2019
Sample-Efficient Policy Learning based on Completely Behavior Cloning
Qiming Zou
Ling Wang
K. Lu
Yu Li
OffRL
54
0
0
09 Nov 2018
Expert-augmented actor-critic for ViZDoom and Montezumas Revenge
Michal Garmulewicz
Henryk Michalewski
Piotr Milos
78
8
0
10 Sep 2018
Goal-oriented Dialogue Policy Learning from Failures
Keting Lu
Shiqi Zhang
Xiaoping Chen
OffRL
46
29
0
20 Aug 2018
Adversarial Learning of Task-Oriented Neural Dialog Models
Bing-Quan Liu
Ian Lane
OffRL
70
39
0
30 May 2018
Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems
Bing-Quan Liu
Gokhan Tur
Dilek Z. Hakkani-Tür
Pararth Shah
Larry Heck
OffRL
64
160
0
18 Apr 2018
The Rapidly Changing Landscape of Conversational Agents
V. Mathur
Arpit Singh
LM&Ro, LLMAG
54
8
0
22 Mar 2018
Feudal Reinforcement Learning for Dialogue Management in Large Domains
I. Casanueva
Paweł Budzianowski
Pei-hao Su
Stefan Ultes
L. Rojas-Barahona
Bo-Hsiang Tseng
Milica Gasic
66
49
0
08 Mar 2018
Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces
Gellert Weisz
Paweł Budzianowski
Pei-hao Su
Milica Gasic
47
83
0
11 Feb 2018
Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning
Tianmin Shu
Caiming Xiong
R. Socher
OffRL
167
140
0
20 Dec 2017
End-to-End Offline Goal-Oriented Dialog Policy Learning via Policy Gradient
Li Zhou
Kevin Small
Oleg Rokhlenko
Charles Elkan
OffRL
78
42
0
07 Dec 2017
A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management
I. Casanueva
Paweł Budzianowski
Pei-hao Su
N. Mrksic
Tsung-Hsien Wen
Stefan Ultes
L. Rojas-Barahona
S. Young
Milica Gasic
OffRL
159
54
0
29 Nov 2017
Iterative Policy Learning in End-to-End Trainable Task-Oriented Neural Dialog Models
Bing-Quan Liu
Ian Lane
OffRL
82
98
0
18 Sep 2017
Sub-domain Modelling for Dialogue Management with Hierarchical Reinforcement Learning
Paweł Budzianowski
Stefan Ultes
Pei-hao Su
N. Mrksic
Tsung-Hsien Wen
I. Casanueva
L. Rojas-Barahona
Milica Gasic
92
49
0
19 Jun 2017