arXiv: 1902.08858

Rethinking Action Spaces for Reinforcement Learning in End-to-end Dialog Agents with Latent Variable Models

23 February 2019
Tiancheng Zhao
Kaige Xie
M. Eskénazi

Papers citing "Rethinking Action Spaces for Reinforcement Learning in End-to-end Dialog Agents with Latent Variable Models"

43 / 43 papers shown
Let's Negotiate! A Survey of Negotiation Dialogue Systems
Haolan Zhan
Yufei Wang
Tao Feng
Yuncheng Hua
Suraj Sharma
Zhuang Li
Lizhen Qu
Zhaleh Semnani Azad
Ingrid Zukerman
Gholamreza Haffari
LLMAG
89
29
0
02 Feb 2024
Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy Optimization
Yangyang Zhao
Zhenyu Wang
Mehdi Dastani
Shihan Wang
24
0
0
05 May 2023
Deep RL with Hierarchical Action Exploration for Dialogue Generation
Itsugun Cho
Ryota Takahashi
Yusaku Yanase
Hiroaki Saito
28
2
0
22 Mar 2023
KRLS: Improving End-to-End Response Generation in Task Oriented Dialog with Reinforced Keywords Learning
Xiao Yu
Qingyang Wu
Kun Qian
Zhou Yu
OffRL
21
11
0
30 Nov 2022
Jointly Reinforced User Simulator and Task-oriented Dialog System with Simplified Generative Architecture
Abhishek Sethi
Zhijian Ou
Yi Huang
Junlan Feng
RALM
21
1
0
13 Oct 2022
Dialogue Evaluation with Offline Reinforcement Learning
Nurul Lubis
Christian Geishauser
Hsien-Chin Lin
Carel van Niekerk
Michael Heck
Shutong Feng
Milica Gašić
OffRL
27
4
0
02 Sep 2022
Post-processing Networks: Method for Optimizing Pipeline Task-oriented Dialogue Systems using Reinforcement Learning
Atsumoto Ohashi
Ryuichiro Higashinaka
OffRL
24
7
0
25 Jul 2022
A Mixture-of-Expert Approach to RL-based Dialogue Management
Yinlam Chow
Azamat Tulepbergenov
Ofir Nachum
Moonkyung Ryu
Mohammad Ghavamzadeh
Craig Boutilier
MoE
25
14
0
31 May 2022
RSTGen: Imbuing Fine-Grained Interpretable Control into Long-Form Text Generators
Rilwan A. Adewoyin
Ritabrata Dutta
Yulan He
27
2
0
25 May 2022
CORAL: Contextual Response Retrievability Loss Function for Training Dialog Generation Models
Bishal Santra
Ravi Ghadia
Manish Gupta
Pawan Goyal
OffRL
23
0
0
21 May 2022
BORT: Back and Denoising Reconstruction for End-to-End Task-Oriented Dialog
Haipeng Sun
Junwei Bao
Youzheng Wu
Xiaodong He
21
30
0
05 May 2022
Structure Extraction in Task-Oriented Dialogues with Slot Clustering
Liang Qiu
Chien-Sheng Wu
Wenhao Liu
Caiming Xiong
27
8
0
28 Feb 2022
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-Oriented Dialogue Policy Learning
Wai-Chung Kwan
Hongru Wang
Huimin Wang
Kam-Fai Wong
OffRL
38
43
0
28 Feb 2022
OpenViDial 2.0: A Larger-Scale, Open-Domain Dialogue Generation Dataset with Visual Contexts
Shuhe Wang
Yuxian Meng
Xiaoya Li
Xiaofei Sun
Rongbin Ouyang
Jiwei Li
MLLM
VLM
32
21
0
27 Sep 2021
Reinforced Natural Language Interfaces via Entropy Decomposition
Xiaoran Wu
Yipeng Kang
LLMAG
30
0
0
23 Sep 2021
EmoWOZ: A Large-Scale Corpus and Labelling Scheme for Emotion Recognition in Task-Oriented Dialogue Systems
Shutong Feng
Nurul Lubis
Christian Geishauser
Hsien-Chin Lin
Michael Heck
Carel van Niekerk
Milica Gašić
53
21
0
10 Sep 2021
Variational Latent-State GPT for Semi-Supervised Task-Oriented Dialog Systems
Hong Liu
Yucheng Cai
Zhenru Lin
Zhijian Ou
Yi Huang
Junlan Feng
DRL
VLM
BDL
36
18
0
09 Sep 2021
Task-Oriented Dialogue System as Natural Language Generation
Weizhi Wang
Zhirui Zhang
Junliang Guo
Yinpei Dai
Boxing Chen
Weihua Luo
36
32
0
31 Aug 2021
Transferable Dialogue Systems and User Simulators
Bo-Hsiang Tseng
Yinpei Dai
Florian Kreyssig
Bill Byrne
19
53
0
25 Jul 2021
Survey on reinforcement learning for language processing
Víctor Uc Cetina
Nicolás Navarro-Guerrero
A. Martín-González
C. Weber
S. Wermter
OffRL
31
101
0
12 Apr 2021
When is it permissible for artificial intelligence to lie? A trust-based approach
Tae Wan Kim
Tong Lu
Kyusong Lee
Zhaoqi Cheng
Yanhan Tang
J. N. Hooker
24
4
0
09 Mar 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Min Zhang
OffRL
60
73
0
01 Jan 2021
OpenViDial: A Large-Scale, Open-Domain Dialogue Dataset with Visual Contexts
Yuxian Meng
Shuhe Wang
Qinghong Han
Xiaofei Sun
Fei Wu
Rui Yan
Jiwei Li
32
28
0
30 Dec 2020
UBAR: Towards Fully End-to-End Task-Oriented Dialog Systems with GPT-2
Yunyi Yang
Yunhao Li
Xiaojun Quan
35
189
0
07 Dec 2020
DLGNet-Task: An End-to-end Neural Network Framework for Modeling Multi-turn Multi-domain Task-Oriented Dialogue
O. Olabiyi
P. Bhattarai
C. Bayan Bruss
Zachary Kulis
21
2
0
04 Oct 2020
A Probabilistic End-To-End Task-Oriented Dialog Model with Latent Belief States towards Semi-Supervised Learning
Yichi Zhang
Zhijian Ou
Huixin Wang
Junlan Feng
RALM
29
67
0
17 Sep 2020
Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data
Michael Cogswell
Jiasen Lu
Rishabh Jain
Stefan Lee
Devi Parikh
Dhruv Batra
VLM
EgoV
39
15
0
24 Jul 2020
Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations
Sam Coope
Tyler Farghly
D. Gerz
Ivan Vulić
Matthew Henderson
27
62
0
18 May 2020
SOLOIST: Building Task Bots at Scale with Transfer Learning and Machine Teaching
Baolin Peng
Chunyuan Li
Jinchao Li
Shahin Shayandeh
Lars Liden
Jianfeng Gao
33
125
0
11 May 2020
Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation
Xinting Huang
Jianzhong Qi
Yu Sun
Rui Zhang
OffRL
69
18
0
09 May 2020
A Survey on Dialog Management: Recent Advances and Challenges
Yinpei Dai
Huihua Yu
Yixuan Jiang
Chengguang Tang
Yongbin Li
Jian Sun
OffRL
VLM
32
20
0
05 May 2020
A Simple Language Model for Task-Oriented Dialogue
Ehsan Hosseini-Asl
Bryan McCann
Chien-Sheng Wu
Semih Yavuz
R. Socher
31
526
0
02 May 2020
Multi-Domain Dialogue Acts and Response Co-Generation
Kai Wang
Junfeng Tian
Rui Wang
Xiaojun Quan
Jianxing Yu
8
58
0
26 Apr 2020
Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition
Ryuichi Takanobu
Runze Liang
Minlie Huang
LLMAG
19
54
0
08 Apr 2020
ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems
Qi Zhu
Zheng Zhang
Yan Fang
Xiang Li
Ryuichi Takanobu
Jinchao Li
Baolin Peng
Jianfeng Gao
Xiaoyan Zhu
Minlie Huang
21
105
0
12 Feb 2020
MALA: Cross-Domain Dialogue Generation with Action Learning
Xinting Huang
Jianzhong Qi
Yu Sun
Rui Zhang
LRM
28
18
0
18 Dec 2019
Task-Oriented Dialog Systems that Consider Multiple Appropriate Responses under the Same Context
Yichi Zhang
Zhijian Ou
Zhou Yu
27
182
0
24 Nov 2019
ConveRT: Efficient and Accurate Conversational Representations from Transformers
Matthew Henderson
I. Casanueva
Nikola Mrkšić
Pei-hao Su
Tsung-Hsien
Ivan Vulić
23
196
0
09 Nov 2019
Alternating Recurrent Dialog Model with Large-scale Pre-trained Language Models
Qingyang Wu
Yichi Zhang
Yu Li
Zhou Yu
VLM
22
63
0
09 Oct 2019
Hello, It's GPT-2 -- How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems
Paweł Budzianowski
Ivan Vulić
34
308
0
12 Jul 2019
Target-Guided Open-Domain Conversation
Jianheng Tang
Tiancheng Zhao
Chenyan Xiong
Xiaodan Liang
Eric Xing
Zhiting Hu
22
133
0
28 May 2019
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
220
1,328
0
05 Jun 2016
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
220
7,930
0
17 Aug 2015