v1v2 (latest)

BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems

15 November 2017

Li Deng

Papers citing "BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems"

22 / 22 papers shown

Title
Recent Advances and Challenges in Task-oriented Dialog System Zheng Zhang Ryuichi Takanobu Qi Zhu Minlie Huang Xiaoyan Zhu LLMAG 118 177 0 17 Mar 2020
A User Simulator for Task-Completion Dialogues Xiujun Li Zachary Chase Lipton Bhuwan Dhingra Lihong Li Jianfeng Gao Yun-Nung Chen OffRL 69 167 0 17 Dec 2016
Overcoming catastrophic forgetting in neural networks J. Kirkpatrick Razvan Pascanu Neil C. Rabinowitz J. Veness Guillaume Desjardins ... A. Grabska-Barwinska Demis Hassabis Claudia Clopath D. Kumaran R. Hadsell CLL 374 7,561 0 02 Dec 2016
Policy Networks with Two-Stage Training for Dialogue Systems Mehdi Fatemi Layla El Asri Hannes Schulz Jing He Kaheer Suleman OffRL 68 108 0 10 Jun 2016
Unifying Count-Based Exploration and Intrinsic Motivation Marc G. Bellemare S. Srinivasan Georg Ostrovski Tom Schaul D. Saxton Rémi Munos 179 1,483 0 06 Jun 2016
End-to-end LSTM-based dialog control optimized with supervised and reinforcement learning Jason D. Williams Geoffrey Zweig OffRL 50 155 0 03 Jun 2016
VIME: Variational Information Maximizing Exploration Rein Houthooft Xi Chen Yan Duan John Schulman F. Turck Pieter Abbeel 69 78 0 31 May 2016
A Network-based End-to-End Trainable Task-oriented Dialogue System Tsung-Hsien Wen David Vandyke N. Mrksic Milica Gasic L. Rojas-Barahona Pei-hao Su Stefan Ultes S. Young 77 1,109 0 15 Apr 2016
Deep Exploration via Bootstrapped DQN Ian Osband Charles Blundell Alexander Pritzel Benjamin Van Roy 121 1,313 0 15 Feb 2016
SimpleDS: A Simple Deep Reinforcement Learning Dialogue System Heriberto Cuayáhuitl OffRL 42 85 0 18 Jan 2016
Dueling Network Architectures for Deep Reinforcement Learning Ziyun Wang Tom Schaul Matteo Hessel H. V. Hasselt Marc Lanctot Nando de Freitas OffRL 91 3,768 0 20 Nov 2015
Prioritized Experience Replay Tom Schaul John Quan Ioannis Antonoglou David Silver OffRL 223 3,797 0 18 Nov 2015
Generative Concatenative Nets Jointly Learn to Write and Classify Reviews Zachary Chase Lipton Sharad Vikram Julian McAuley BDL 103 33 0 11 Nov 2015
Deep Reinforcement Learning with Double Q-learning H. V. Hasselt A. Guez David Silver OffRL 170 7,662 0 22 Sep 2015
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models Bradly C. Stadie Sergey Levine Pieter Abbeel 92 505 0 03 Jul 2015
Weight Uncertainty in Neural Networks Charles Blundell Julien Cornebise Koray Kavukcuoglu Daan Wierstra UQCV BDL 192 1,892 0 20 May 2015
Adam: A Method for Stochastic Optimization Diederik P. Kingma Jimmy Ba ODL 2.0K 150,312 0 22 Dec 2014
Auto-Encoding Variational Bayes Diederik P. Kingma Max Welling BDL 455 16,923 0 20 Dec 2013
Playing Atari with Deep Reinforcement Learning Volodymyr Mnih Koray Kavukcuoglu David Silver Alex Graves Ioannis Antonoglou Daan Wierstra Martin Riedmiller 129 12,265 0 19 Dec 2013
(More) Efficient Reinforcement Learning via Posterior Sampling Ian Osband Daniel Russo Benjamin Van Roy 129 535 0 04 Jun 2013
A Bayesian Sampling Approach to Exploration in Reinforcement Learning J. Asmuth Lihong Li Michael L. Littman A. Nouri David Wingate BDL 95 189 0 09 May 2012
An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email M. Walker 92 244 0 01 Jun 2011