1711.05715
Cited By
v1
v2 (latest)
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems
15 November 2017
Zachary Chase Lipton
Xiujun Li
Jianfeng Gao
Lihong Li
Faisal Ahmed
Li Deng
Papers citing
"BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems"
22 / 22 papers shown
Title
Recent Advances and Challenges in Task-oriented Dialog System
Zheng Zhang
Ryuichi Takanobu
Qi Zhu
Minlie Huang
Xiaoyan Zhu
LLMAG
118
177
0
17 Mar 2020
A User Simulator for Task-Completion Dialogues
Xiujun Li
Zachary Chase Lipton
Bhuwan Dhingra
Lihong Li
Jianfeng Gao
Yun-Nung Chen
OffRL
69
167
0
17 Dec 2016
Overcoming catastrophic forgetting in neural networks
J. Kirkpatrick
Razvan Pascanu
Neil C. Rabinowitz
J. Veness
Guillaume Desjardins
...
A. Grabska-Barwinska
Demis Hassabis
Claudia Clopath
D. Kumaran
R. Hadsell
CLL
374
7,561
0
02 Dec 2016
Policy Networks with Two-Stage Training for Dialogue Systems
Mehdi Fatemi
Layla El Asri
Hannes Schulz
Jing He
Kaheer Suleman
OffRL
68
108
0
10 Jun 2016
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
179
1,483
0
06 Jun 2016
End-to-end LSTM-based dialog control optimized with supervised and reinforcement learning
Jason D. Williams
Geoffrey Zweig
OffRL
50
155
0
03 Jun 2016
VIME: Variational Information Maximizing Exploration
Rein Houthooft
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
69
78
0
31 May 2016
A Network-based End-to-End Trainable Task-oriented Dialogue System
Tsung-Hsien Wen
David Vandyke
N. Mrksic
Milica Gasic
L. Rojas-Barahona
Pei-hao Su
Stefan Ultes
S. Young
77
1,109
0
15 Apr 2016
Deep Exploration via Bootstrapped DQN
Ian Osband
Charles Blundell
Alexander Pritzel
Benjamin Van Roy
121
1,313
0
15 Feb 2016
SimpleDS: A Simple Deep Reinforcement Learning Dialogue System
Heriberto Cuayáhuitl
OffRL
42
85
0
18 Jan 2016
Dueling Network Architectures for Deep Reinforcement Learning
Ziyu Wang
Tom Schaul
Matteo Hessel
H. V. Hasselt
Marc Lanctot
Nando de Freitas
OffRL
91
3,768
0
20 Nov 2015
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
223
3,797
0
18 Nov 2015
Generative Concatenative Nets Jointly Learn to Write and Classify Reviews
Zachary Chase Lipton
Sharad Vikram
Julian McAuley
BDL
103
33
0
11 Nov 2015
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
170
7,662
0
22 Sep 2015
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models
Bradly C. Stadie
Sergey Levine
Pieter Abbeel
92
505
0
03 Jul 2015
Weight Uncertainty in Neural Networks
Charles Blundell
Julien Cornebise
Koray Kavukcuoglu
Daan Wierstra
UQCV
BDL
192
1,892
0
20 May 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.0K
150,312
0
22 Dec 2014
Auto-Encoding Variational Bayes
Diederik P. Kingma
Max Welling
BDL
455
16,923
0
20 Dec 2013
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
129
12,265
0
19 Dec 2013
(More) Efficient Reinforcement Learning via Posterior Sampling
Ian Osband
Daniel Russo
Benjamin Van Roy
129
535
0
04 Jun 2013
A Bayesian Sampling Approach to Exploration in Reinforcement Learning
J. Asmuth
Lihong Li
Michael L. Littman
A. Nouri
David Wingate
BDL
95
189
0
09 May 2012
An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email
M. Walker
92
244
0
01 Jun 2011