2009.09781
Cited By
Rethinking Supervised Learning and Reinforcement Learning in Task-Oriented Dialogue Systems
21 September 2020
Ziming Li
Julia Kiseleva
Maarten de Rijke
OffRL
Papers citing
"Rethinking Supervised Learning and Reinforcement Learning in Task-Oriented Dialogue Systems"
22 / 22 papers shown
Title
Guided Dialog Policy Learning without Adversarial Learning in the Loop
Ziming Li
Sungjin Lee
Baolin Peng
Jinchao Li
Julia Kiseleva
Maarten de Rijke
Shahin Shayandeh
Jianfeng Gao
111
14
0
07 Apr 2020
Guided Dialog Policy Learning: Reward Estimation for Multi-Domain Task-Oriented Dialog
Ryuichi Takanobu
Hanlin Zhu
Minlie Huang
73
91
0
28 Aug 2019
ConvLab: Multi-Domain End-to-End Dialog System Platform
Sungjin Lee
Qi Zhu
Ryuichi Takanobu
Xiang Li
Yaoqin Zhang
...
Jinchao Li
Baolin Peng
Xiujun Li
Minlie Huang
Jianfeng Gao
VLM
74
110
0
18 Apr 2019
MultiWOZ -- A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling
Paweł Budzianowski
Tsung-Hsien Wen
Bo-Hsiang Tseng
I. Casanueva
Stefan Ultes
Osman Ramadan
Milica Gasic
147
1,311
0
29 Sep 2018
Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning
Shang-Yu Su
Xiujun Li
Jianfeng Gao
Jingjing Liu
Yun-Nung Chen
OffRL
53
67
0
28 Aug 2018
Adversarial Learning of Task-Oriented Neural Dialog Models
Bing Liu
Ian Lane
OffRL
39
39
0
30 May 2018
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems
Zachary Chase Lipton
Xiujun Li
Jianfeng Gao
Lihong Li
Faisal Ahmed
Li Deng
55
171
0
15 Nov 2017
A Survey on Dialogue Systems: Recent Advances and New Frontiers
Hongshen Chen
Xiaorui Liu
Dawei Yin
Jiliang Tang
VLM
LLMAG
63
700
0
06 Nov 2017
Adversarial Advantage Actor-Critic Model for Task-Completion Dialogue Policy Learning
Baolin Peng
Xiujun Li
Jianfeng Gao
Jingjing Liu
Yun-Nung Chen
Kam-Fai Wong
51
65
0
31 Oct 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
444
18,931
0
20 Jul 2017
Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management
Pei-hao Su
Paweł Budzianowski
Stefan Ultes
Milica Gasic
S. Young
OffRL
108
129
0
01 Jul 2017
Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning
Baolin Peng
Xiujun Li
Lihong Li
Jianfeng Gao
Asli Celikyilmaz
Sungjin Lee
Kam-Fai Wong
BDL
58
190
0
10 Apr 2017
End-to-End Task-Completion Neural Dialogue Systems
Xiujun Li
Yun-Nung Chen
Lihong Li
Jianfeng Gao
Asli Celikyilmaz
64
368
0
03 Mar 2017
Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning
Jason D. Williams
Kavosh Asadi
Geoffrey Zweig
OffRL
62
335
0
10 Feb 2017
Wasserstein GAN
Martín Arjovsky
Soumith Chintala
Léon Bottou
GAN
163
4,825
0
26 Jan 2017
Sample Efficient Actor-Critic with Experience Replay
Ziyu Wang
V. Bapst
N. Heess
Volodymyr Mnih
Rémi Munos
Koray Kavukcuoglu
Nando de Freitas
97
761
0
03 Nov 2016
Categorical Reparameterization with Gumbel-Softmax
Eric Jang
S. Gu
Ben Poole
BDL
277
5,360
0
03 Nov 2016
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access
Bhuwan Dhingra
Lihong Li
Xiujun Li
Jianfeng Gao
Yun-Nung Chen
Faisal Ahmed
Li Deng
62
303
0
03 Sep 2016
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
131
3,098
0
10 Jun 2016
On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems
Pei-hao Su
Milica Gasic
N. Mrksic
L. Rojas-Barahona
Stefan Ultes
David Vandyke
Tsung-Hsien Wen
S. Young
OffRL
61
170
0
24 May 2016
Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks
Jason Weston
Antoine Bordes
S. Chopra
Alexander M. Rush
Bart van Merriënboer
Armand Joulin
Tomas Mikolov
LRM
ELM
133
1,180
0
19 Feb 2015
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Kyunghyun Cho
Bart van Merriënboer
Çağlar Gülçehre
Dzmitry Bahdanau
Fethi Bougares
Holger Schwenk
Yoshua Bengio
AIMat
825
23,310
0
03 Jun 2014