1707.00130
Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management
1 July 2017
Pei-hao Su
Paweł Budzianowski
Stefan Ultes
Milica Gasic
S. Young
OffRL
Papers citing "Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management"
41 / 41 papers shown
Successor Features for Efficient Multisubject Controlled Text Generation
Mengyao Cao
Mehdi Fatemi
Jackie Chi Kit Cheung
Samira Shabanian
BDL
89
0
0
03 Nov 2023
End-to-End Evaluation of a Spoken Dialogue System for Learning Basic Mathematics
Eda Okur
Saurav Sahay
Roddy Fuentes Alba
L. Nachman
73
6
0
07 Nov 2022
NLU for Game-based Learning in Real: Initial Evaluations
Eda Okur
Saurav Sahay
L. Nachman
LLMAG
37
2
0
27 May 2022
Taming Continuous Posteriors for Latent Variational Dialogue Policies
Marin Vlastelica
P. Ernst
Gyuri Szarvas
BDL
OffRL
75
1
0
16 May 2022
Data Augmentation with Paraphrase Generation and Entity Extraction for Multimodal Dialogue System
Eda Okur
Saurav Sahay
L. Nachman
55
25
0
09 May 2022
GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-Supervised Learning and Explicit Policy Injection
Wanwei He
Yinpei Dai
Yinhe Zheng
Yuchuan Wu
Zhen Cao
...
Min Yang
Feiling Huang
Luo Si
Jian Sun
Yongbin Li
VLM
140
159
0
29 Nov 2021
Reinforcement Explanation Learning
Siddhant Agarwal
Owais Iqbal
Sree Aditya Buridi
Madda Manjusha
Abir Das
FAtt
40
0
0
26 Nov 2021
Transferable Dialogue Systems and User Simulators
Bo-Hsiang Tseng
Yinpei Dai
Florian Kreyssig
Bill Byrne
106
54
0
25 Jul 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Min Zhang
OffRL
123
75
0
01 Jan 2021
Data-Efficient Methods for Dialogue Systems
Igor Shalyminov
75
0
0
05 Dec 2020
Diluted Near-Optimal Expert Demonstrations for Guiding Dialogue Stochastic Policy Optimisation
Thibault Cordier
Tanguy Urvoy
L. Rojas-Barahona
F. Lefèvre
128
5
0
25 Nov 2020
Human-centric Dialog Training via Offline Reinforcement Learning
Natasha Jaques
J. Shen
Asma Ghandeharioun
Craig Ferguson
Àgata Lapedriza
Noah J. Jones
S. Gu
Rosalind W. Picard
OffRL
86
96
0
12 Oct 2020
Structured Hierarchical Dialogue Policy with Graph Neural Networks
Zhi Chen
Xiaoyuan Liu
Lu Chen
Kai Yu
BDL
OffRL
38
3
0
22 Sep 2020
Distributed Structured Actor-Critic Reinforcement Learning for Universal Dialogue Management
Zhi Chen
Lu Chen
Xiaoyuan Liu
Kai Yu
87
20
0
22 Sep 2020
Rethinking Supervised Learning and Reinforcement Learning in Task-Oriented Dialogue Systems
Ziming Li
Julia Kiseleva
Maarten de Rijke
OffRL
59
22
0
21 Sep 2020
Document-editing Assistants and Model-based Reinforcement Learning as a Path to Conversational AI
Katya Kudashkina
P. Pilarski
R. Sutton
KELM
84
6
0
27 Aug 2020
A Survey on Dialog Management: Recent Advances and Challenges
Yinpei Dai
Huihua Yu
Yixuan Jiang
Chengguang Tang
Yongbin Li
Jian Sun
OffRL
VLM
81
20
0
05 May 2020
Multi-Domain Dialogue Acts and Response Co-Generation
Kai Wang
Junfeng Tian
Rui Wang
Xiaojun Quan
Jianxing Yu
74
60
0
26 Apr 2020
Show Us the Way: Learning to Manage Dialog from Demonstrations
Gabriel Gordon-Hall
P. Gorinski
Gerasimos Lampouras
Ignacio Iacobacci
OffRL
111
11
0
17 Apr 2020
Hierarchical Reinforcement Learning for Open-Domain Dialog
Abdelrhman Saleh
Natasha Jaques
Asma Ghandeharioun
J. Shen
Rosalind W. Picard
OffRL
88
59
0
17 Sep 2019
How to Build User Simulators to Train RL-based Dialog Systems
Weiyan Shi
Kun Qian
Xuewei Wang
Zhou Yu
OffRL
72
65
0
03 Sep 2019
Modeling Multi-Action Policy for Task-Oriented Dialogues
Lei Shu
Hu Xu
Bing-Quan Liu
Piero Molino
39
9
0
30 Aug 2019
Reinforcement Learning for Personalized Dialogue Management
Floris den Hengst
Mark Hoogendoorn
F. V. Harmelen
Joost Bosman
OffRL
49
20
0
01 Aug 2019
Collaborative Multi-Agent Dialogue Model Training Via Reinforcement Learning
Alexandros Papangelis
Yi-Chia Wang
Piero Molino
Gokhan Tur
89
32
0
11 Jul 2019
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog
Natasha Jaques
Asma Ghandeharioun
J. Shen
Craig Ferguson
Àgata Lapedriza
Noah J. Jones
S. Gu
Rosalind W. Picard
OffRL
159
344
0
30 Jun 2019
AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning
Lu Chen
Zhi Chen
Bowen Tan
Sishan Long
Milica Gasic
Kai Yu
79
35
0
27 May 2019
Rethinking Action Spaces for Reinforcement Learning in End-to-end Dialog Agents with Latent Variable Models
Tiancheng Zhao
Kaige Xie
M. Eskénazi
107
142
0
23 Feb 2019
Addressing Objects and Their Relations: The Conversational Entity Dialogue Model
Stefan Ultes
Paweł Budzianowski
I. Casanueva
L. Rojas-Barahona
Bo-Hsiang Tseng
Yen-Chen Wu
S. Young
Milica Gasic
50
17
0
05 Jan 2019
Sample-Efficient Policy Learning based on Completely Behavior Cloning
Qiming Zou
Ling Wang
K. Lu
Yu Li
OffRL
54
0
0
09 Nov 2018
Expert-augmented actor-critic for ViZDoom and Montezumas Revenge
Michal Garmulewicz
Henryk Michalewski
Piotr Milos
78
8
0
10 Sep 2018
Goal-oriented Dialogue Policy Learning from Failures
Keting Lu
Shiqi Zhang
Xiaoping Chen
OffRL
46
29
0
20 Aug 2018
Adversarial Learning of Task-Oriented Neural Dialog Models
Bing-Quan Liu
Ian Lane
OffRL
70
39
0
30 May 2018
Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems
Bing-Quan Liu
Gokhan Tur
Dilek Z. Hakkani-Tür
Pararth Shah
Larry Heck
OffRL
64
160
0
18 Apr 2018
The Rapidly Changing Landscape of Conversational Agents
V. Mathur
Arpit Singh
LM&Ro
LLMAG
54
8
0
22 Mar 2018
Feudal Reinforcement Learning for Dialogue Management in Large Domains
I. Casanueva
Paweł Budzianowski
Pei-hao Su
Stefan Ultes
L. Rojas-Barahona
Bo-Hsiang Tseng
Milica Gasic
66
49
0
08 Mar 2018
Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces
Gellert Weisz
Paweł Budzianowski
Pei-hao Su
Milica Gasic
47
83
0
11 Feb 2018
Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning
Tianmin Shu
Caiming Xiong
R. Socher
OffRL
167
140
0
20 Dec 2017
End-to-End Offline Goal-Oriented Dialog Policy Learning via Policy Gradient
Li Zhou
Kevin Small
Oleg Rokhlenko
Charles Elkan
OffRL
78
42
0
07 Dec 2017
A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management
I. Casanueva
Paweł Budzianowski
Pei-hao Su
N. Mrksic
Tsung-Hsien Wen
Stefan Ultes
L. Rojas-Barahona
S. Young
Milica Gasic
OffRL
159
54
0
29 Nov 2017
Iterative Policy Learning in End-to-End Trainable Task-Oriented Neural Dialog Models
Bing-Quan Liu
Ian Lane
OffRL
82
98
0
18 Sep 2017
Sub-domain Modelling for Dialogue Management with Hierarchical Reinforcement Learning
Paweł Budzianowski
Stefan Ultes
Pei-hao Su
N. Mrksic
Tsung-Hsien Wen
I. Casanueva
L. Rojas-Barahona
Milica Gasic
92
49
0
19 Jun 2017