ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1808.09442
  4. Cited By
Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning

Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning

28 August 2018
Shang-Yu Su
Xiujun Li
Jianfeng Gao
Jingjing Liu
Yun-Nung Chen
    OffRL
ArXivPDFHTML

Papers citing "Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning"

22 / 22 papers shown
Title
Recent Advances and Challenges in Task-oriented Dialog System
Recent Advances and Challenges in Task-oriented Dialog System
Zheng Zhang
Ryuichi Takanobu
Qi Zhu
Minlie Huang
Xiaoyan Zhu
LLMAG
100
176
0
17 Mar 2020
Subgoal Discovery for Hierarchical Dialogue Policy Learning
Subgoal Discovery for Hierarchical Dialogue Policy Learning
Da Tang
Xiujun Li
Jianfeng Gao
Chong-Jun Wang
Lihong Li
Tony Jebara
39
50
0
20 Apr 2018
Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy
  Learning
Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning
Baolin Peng
Xiujun Li
Jianfeng Gao
Jingjing Liu
Kam-Fai Wong
Shang-Yu Su
OffRL
57
156
0
18 Jan 2018
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems
Zachary Chase Lipton
Xiujun Li
Jianfeng Gao
Lihong Li
Faisal Ahmed
Li Deng
55
171
0
15 Nov 2017
Adversarial Advantage Actor-Critic Model for Task-Completion Dialogue
  Policy Learning
Adversarial Advantage Actor-Critic Model for Task-Completion Dialogue Policy Learning
Baolin Peng
Xiujun Li
Jianfeng Gao
Jingjing Liu
Yun-Nung Chen
Kam-Fai Wong
51
65
0
31 Oct 2017
Iterative Policy Learning in End-to-End Trainable Task-Oriented Neural
  Dialog Models
Iterative Policy Learning in End-to-End Trainable Task-Oriented Neural Dialog Models
Bing-Quan Liu
Ian Lane
OffRL
38
98
0
18 Sep 2017
Imagination-Augmented Agents for Deep Reinforcement Learning
Imagination-Augmented Agents for Deep Reinforcement Learning
T. Weber
S. Racanière
David P. Reichert
Lars Buesing
A. Guez
...
Razvan Pascanu
Peter W. Battaglia
Demis Hassabis
David Silver
Daan Wierstra
LM&Ro
72
552
0
19 Jul 2017
Sub-domain Modelling for Dialogue Management with Hierarchical
  Reinforcement Learning
Sub-domain Modelling for Dialogue Management with Hierarchical Reinforcement Learning
Paweł Budzianowski
Stefan Ultes
Pei-hao Su
N. Mrksic
Tsung-Hsien Wen
I. Casanueva
L. Rojas-Barahona
Milica Gasic
52
49
0
19 Jun 2017
Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep
  Reinforcement Learning
Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning
Baolin Peng
Xiujun Li
Lihong Li
Jianfeng Gao
Asli Celikyilmaz
Sungjin Lee
Kam-Fai Wong
BDL
55
190
0
10 Apr 2017
End-to-End Task-Completion Neural Dialogue Systems
End-to-End Task-Completion Neural Dialogue Systems
Xiujun Li
Yun-Nung Chen
Lihong Li
Jianfeng Gao
Asli Celikyilmaz
58
367
0
03 Mar 2017
Hybrid Code Networks: practical and efficient end-to-end dialog control
  with supervised and reinforcement learning
Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning
Jason D. Williams
Kavosh Asadi
Geoffrey Zweig
OffRL
59
335
0
10 Feb 2017
The Predictron: End-To-End Learning and Planning
The Predictron: End-To-End Learning and Planning
David Silver
H. V. Hasselt
Matteo Hessel
Tom Schaul
A. Guez
...
Gabriel Dulac-Arnold
David P. Reichert
Neil C. Rabinowitz
André Barreto
T. Degris
52
289
0
28 Dec 2016
A User Simulator for Task-Completion Dialogues
A User Simulator for Task-Completion Dialogues
Xiujun Li
Zachary Chase Lipton
Bhuwan Dhingra
Lihong Li
Jianfeng Gao
Yun-Nung Chen
OffRL
45
164
0
17 Dec 2016
Towards End-to-End Reinforcement Learning of Dialogue Agents for
  Information Access
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access
Bhuwan Dhingra
Lihong Li
Xiujun Li
Jianfeng Gao
Yun-Nung Chen
Faisal Ahmed
Li Deng
59
303
0
03 Sep 2016
Neural Belief Tracker: Data-Driven Dialogue State Tracking
Neural Belief Tracker: Data-Driven Dialogue State Tracking
N. Mrksic
Diarmuid Ó Séaghdha
Tsung-Hsien Wen
Blaise Thomson
S. Young
79
482
0
12 Jun 2016
Continuously Learning Neural Dialogue Management
Continuously Learning Neural Dialogue Management
Pei-hao Su
Milica Gasic
N. Mrksic
L. Rojas-Barahona
Stefan Ultes
David Vandyke
Tsung-Hsien Wen
S. Young
63
122
0
08 Jun 2016
Towards End-to-End Learning for Dialog State Tracking and Management
  using Deep Reinforcement Learning
Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning
Tiancheng Zhao
M. Eskénazi
53
264
0
08 Jun 2016
Learning End-to-End Goal-Oriented Dialog
Learning End-to-End Goal-Oriented Dialog
Antoine Bordes
Y-Lan Boureau
Jason Weston
74
781
0
24 May 2016
A Network-based End-to-End Trainable Task-oriented Dialogue System
A Network-based End-to-End Trainable Task-oriented Dialogue System
Tsung-Hsien Wen
David Vandyke
N. Mrksic
Milica Gasic
L. Rojas-Barahona
Pei-hao Su
Stefan Ultes
S. Young
62
1,104
0
15 Apr 2016
Continuous Deep Q-Learning with Model-based Acceleration
Continuous Deep Q-Learning with Model-based Acceleration
S. Gu
Timothy Lillicrap
Ilya Sutskever
Sergey Levine
69
1,010
0
02 Mar 2016
Value Iteration Networks
Value Iteration Networks
Aviv Tamar
Yi Wu
G. Thomas
Sergey Levine
Pieter Abbeel
71
650
0
09 Feb 2016
Semantically Conditioned LSTM-based Natural Language Generation for
  Spoken Dialogue Systems
Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems
Tsung-Hsien Wen
Milica Gasic
N. Mrksic
Pei-hao Su
David Vandyke
S. Young
95
948
0
07 Aug 2015
1