ResearchTrend.AI
arXiv:1711.05715 (v2, latest)

BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems

15 November 2017
Zachary Chase Lipton
Xiujun Li
Jianfeng Gao
Lihong Li
Faisal Ahmed
Li Deng

Papers citing "BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems"

22 / 22 papers shown
Recent Advances and Challenges in Task-oriented Dialog System
Zheng Zhang
Ryuichi Takanobu
Qi Zhu
Minlie Huang
Xiaoyan Zhu
LLMAG
118
177
0
17 Mar 2020
A User Simulator for Task-Completion Dialogues
Xiujun Li
Zachary Chase Lipton
Bhuwan Dhingra
Lihong Li
Jianfeng Gao
Yun-Nung Chen
OffRL
69
167
0
17 Dec 2016
Overcoming catastrophic forgetting in neural networks
J. Kirkpatrick
Razvan Pascanu
Neil C. Rabinowitz
J. Veness
Guillaume Desjardins
...
A. Grabska-Barwinska
Demis Hassabis
Claudia Clopath
D. Kumaran
R. Hadsell
CLL
374
7,561
0
02 Dec 2016
Policy Networks with Two-Stage Training for Dialogue Systems
Mehdi Fatemi
Layla El Asri
Hannes Schulz
Jing He
Kaheer Suleman
OffRL
68
108
0
10 Jun 2016
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
179
1,483
0
06 Jun 2016
End-to-end LSTM-based dialog control optimized with supervised and reinforcement learning
Jason D. Williams
Geoffrey Zweig
OffRL
50
155
0
03 Jun 2016
VIME: Variational Information Maximizing Exploration
Rein Houthooft
Xi Chen
Yan Duan
John Schulman
F. De Turck
Pieter Abbeel
69
78
0
31 May 2016
A Network-based End-to-End Trainable Task-oriented Dialogue System
Tsung-Hsien Wen
David Vandyke
N. Mrksic
Milica Gasic
L. Rojas-Barahona
Pei-hao Su
Stefan Ultes
S. Young
77
1,109
0
15 Apr 2016
Deep Exploration via Bootstrapped DQN
Ian Osband
Charles Blundell
Alexander Pritzel
Benjamin Van Roy
121
1,313
0
15 Feb 2016
SimpleDS: A Simple Deep Reinforcement Learning Dialogue System
Heriberto Cuayáhuitl
OffRL
42
85
0
18 Jan 2016
Dueling Network Architectures for Deep Reinforcement Learning
Ziyu Wang
Tom Schaul
Matteo Hessel
H. V. Hasselt
Marc Lanctot
Nando de Freitas
OffRL
91
3,768
0
20 Nov 2015
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
223
3,797
0
18 Nov 2015
Generative Concatenative Nets Jointly Learn to Write and Classify Reviews
Zachary Chase Lipton
Sharad Vikram
Julian McAuley
BDL
103
33
0
11 Nov 2015
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
170
7,662
0
22 Sep 2015
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models
Bradly C. Stadie
Sergey Levine
Pieter Abbeel
92
505
0
03 Jul 2015
Weight Uncertainty in Neural Networks
Charles Blundell
Julien Cornebise
Koray Kavukcuoglu
Daan Wierstra
UQCV BDL
192
1,892
0
20 May 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.0K
150,312
0
22 Dec 2014
Auto-Encoding Variational Bayes
Diederik P. Kingma
Max Welling
BDL
455
16,923
0
20 Dec 2013
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
129
12,265
0
19 Dec 2013
(More) Efficient Reinforcement Learning via Posterior Sampling
Ian Osband
Daniel Russo
Benjamin Van Roy
129
535
0
04 Jun 2013
A Bayesian Sampling Approach to Exploration in Reinforcement Learning
J. Asmuth
Lihong Li
Michael L. Littman
A. Nouri
David Wingate
BDL
95
189
0
09 May 2012
An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email
M. Walker
92
244
0
01 Jun 2011