ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1804.06512
  4. Cited By
Dialogue Learning with Human Teaching and Feedback in End-to-End
  Trainable Task-Oriented Dialogue Systems

Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems

18 April 2018
Bing-Quan Liu
Gokhan Tur
Dilek Z. Hakkani-Tür
Pararth Shah
Larry Heck
    OffRL
ArXivPDFHTML

Papers citing "Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems"

23 / 23 papers shown
Title
MIDAS: Multi-level Intent, Domain, And Slot Knowledge Distillation for Multi-turn NLU
MIDAS: Multi-level Intent, Domain, And Slot Knowledge Distillation for Multi-turn NLU
Yan Li
So-Eon Kim
Seong-Bae Park
S. Han
56
1
0
15 Aug 2024
End-to-End Optimization of Task-Oriented Dialogue Model with Deep
  Reinforcement Learning
End-to-End Optimization of Task-Oriented Dialogue Model with Deep Reinforcement Learning
Bing-Quan Liu
Gokhan Tur
Dilek Z. Hakkani-Tür
Pararth Shah
Larry Heck
OffRL
44
58
0
29 Nov 2017
Iterative Policy Learning in End-to-End Trainable Task-Oriented Neural
  Dialog Models
Iterative Policy Learning in End-to-End Trainable Task-Oriented Neural Dialog Models
Bing-Quan Liu
Ian Lane
OffRL
38
98
0
18 Sep 2017
An End-to-End Trainable Neural Network Model with Belief Tracking for
  Task-Oriented Dialog
An End-to-End Trainable Neural Network Model with Belief Tracking for Task-Oriented Dialog
Bing-Quan Liu
Ian Lane
30
97
0
20 Aug 2017
Sample-efficient Actor-Critic Reinforcement Learning with Supervised
  Data for Dialogue Management
Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management
Pei-hao Su
Paweł Budzianowski
Stefan Ultes
Milica Gasic
S. Young
OffRL
103
129
0
01 Jul 2017
Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep
  Reinforcement Learning
Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning
Baolin Peng
Xiujun Li
Lihong Li
Jianfeng Gao
Asli Celikyilmaz
Sungjin Lee
Kam-Fai Wong
BDL
55
190
0
10 Apr 2017
End-to-End Task-Completion Neural Dialogue Systems
End-to-End Task-Completion Neural Dialogue Systems
Xiujun Li
Yun-Nung Chen
Lihong Li
Jianfeng Gao
Asli Celikyilmaz
44
367
0
03 Mar 2017
Hybrid Code Networks: practical and efficient end-to-end dialog control
  with supervised and reinforcement learning
Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning
Jason D. Williams
Kavosh Asadi
Geoffrey Zweig
OffRL
49
335
0
10 Feb 2017
A Copy-Augmented Sequence-to-Sequence Architecture Gives Good
  Performance on Task-Oriented Dialogue
A Copy-Augmented Sequence-to-Sequence Architecture Gives Good Performance on Task-Oriented Dialogue
Mihail Eric
Christopher D. Manning
BDL
62
155
0
15 Jan 2017
Gated End-to-End Memory Networks
Gated End-to-End Memory Networks
Julien Perez
Fei Liu
LRM
25
103
0
13 Oct 2016
Joint Online Spoken Language Understanding and Language Modeling with
  Recurrent Neural Networks
Joint Online Spoken Language Understanding and Language Modeling with Recurrent Neural Networks
Bing-Quan Liu
Ian Lane
25
105
0
06 Sep 2016
Towards End-to-End Reinforcement Learning of Dialogue Agents for
  Information Access
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access
Bhuwan Dhingra
Lihong Li
Xiujun Li
Jianfeng Gao
Yun-Nung Chen
Faisal Ahmed
Li Deng
46
303
0
03 Sep 2016
Query-Reduction Networks for Question Answering
Query-Reduction Networks for Question Answering
Minjoon Seo
Sewon Min
Ali Farhadi
Hannaneh Hajishirzi
LRM
ReLM
38
18
0
14 Jun 2016
Neural Belief Tracker: Data-Driven Dialogue State Tracking
Neural Belief Tracker: Data-Driven Dialogue State Tracking
N. Mrksic
Diarmuid Ó Séaghdha
Tsung-Hsien Wen
Blaise Thomson
S. Young
79
482
0
12 Jun 2016
Towards End-to-End Learning for Dialog State Tracking and Management
  using Deep Reinforcement Learning
Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning
Tiancheng Zhao
M. Eskénazi
53
264
0
08 Jun 2016
End-to-end LSTM-based dialog control optimized with supervised and
  reinforcement learning
End-to-end LSTM-based dialog control optimized with supervised and reinforcement learning
Jason D. Williams
Geoffrey Zweig
OffRL
25
154
0
03 Jun 2016
Learning End-to-End Goal-Oriented Dialog
Learning End-to-End Goal-Oriented Dialog
Antoine Bordes
Y-Lan Boureau
Jason Weston
64
779
0
24 May 2016
On-line Active Reward Learning for Policy Optimisation in Spoken
  Dialogue Systems
On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems
Pei-hao Su
Milica Gasic
N. Mrksic
L. Rojas-Barahona
Stefan Ultes
David Vandyke
Tsung-Hsien Wen
S. Young
OffRL
51
170
0
24 May 2016
A Network-based End-to-End Trainable Task-oriented Dialogue System
A Network-based End-to-End Trainable Task-oriented Dialogue System
Tsung-Hsien Wen
David Vandyke
N. Mrksic
Milica Gasic
L. Rojas-Barahona
Pei-hao Su
Stefan Ultes
S. Young
50
1,104
0
15 Apr 2016
A Persona-Based Neural Conversation Model
A Persona-Based Neural Conversation Model
Jiwei Li
Michel Galley
Chris Brockett
Georgios P. Spithourakis
Jianfeng Gao
W. Dolan
76
1,036
0
19 Mar 2016
Building End-To-End Dialogue Systems Using Generative Hierarchical
  Neural Network Models
Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models
Iulian Serban
Alessandro Sordoni
Yoshua Bengio
Aaron Courville
Joelle Pineau
AILaw
103
1,752
0
17 Jul 2015
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
678
149,474
0
22 Dec 2014
A Reduction of Imitation Learning and Structured Prediction to No-Regret
  Online Learning
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
145
3,196
0
02 Nov 2010
1