ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.03809
  4. Cited By
Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward
  Decomposition

Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition

8 April 2020
Ryuichi Takanobu
Runze Liang
Minlie Huang
    LLMAG
ArXivPDFHTML

Papers citing "Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition"

27 / 27 papers shown
Title
Recent Advances and Challenges in Task-oriented Dialog System
Recent Advances and Challenges in Task-oriented Dialog System
Zheng Zhang
Ryuichi Takanobu
Qi Zhu
Minlie Huang
Xiaoyan Zhu
LLMAG
100
176
0
17 Mar 2020
Countering Language Drift via Visual Grounding
Countering Language Drift via Visual Grounding
Jason D. Lee
Kyunghyun Cho
Douwe Kiela
80
66
0
10 Sep 2019
How to Build User Simulators to Train RL-based Dialog Systems
How to Build User Simulators to Train RL-based Dialog Systems
Weiyan Shi
Kun Qian
Xuewei Wang
Zhou Yu
OffRL
37
63
0
03 Sep 2019
Guided Dialog Policy Learning: Reward Estimation for Multi-Domain
  Task-Oriented Dialog
Guided Dialog Policy Learning: Reward Estimation for Multi-Domain Task-Oriented Dialog
Ryuichi Takanobu
Hanlin Zhu
Minlie Huang
68
90
0
28 Aug 2019
Collaborative Multi-Agent Dialogue Model Training Via Reinforcement
  Learning
Collaborative Multi-Agent Dialogue Model Training Via Reinforcement Learning
Alexandros Papangelis
Yi-Chia Wang
Piero Molino
Gokhan Tur
31
32
0
11 Jul 2019
Budgeted Policy Learning for Task-Oriented Dialogue Systems
Budgeted Policy Learning for Task-Oriented Dialogue Systems
Zhirui Zhang
Xiujun Li
Jianfeng Gao
Enhong Chen
OffRL
44
36
0
02 Jun 2019
ConvLab: Multi-Domain End-to-End Dialog System Platform
ConvLab: Multi-Domain End-to-End Dialog System Platform
Sungjin Lee
Qi Zhu
Ryuichi Takanobu
Xiang Li
Yaoqin Zhang
...
Jinchao Li
Baolin Peng
Xiujun Li
Minlie Huang
Jianfeng Gao
VLM
70
110
0
18 Apr 2019
Rethinking Action Spaces for Reinforcement Learning in End-to-end Dialog
  Agents with Latent Variable Models
Rethinking Action Spaces for Reinforcement Learning in End-to-end Dialog Agents with Latent Variable Models
Tiancheng Zhao
Kaige Xie
M. Eskénazi
49
142
0
23 Feb 2019
User Modeling for Task Oriented Dialogues
User Modeling for Task Oriented Dialogues
Izzeddin Gur
Dilek Z. Hakkani-Tür
Gokhan Tur
Pararth Shah
100
58
0
11 Nov 2018
MultiWOZ -- A Large-Scale Multi-Domain Wizard-of-Oz Dataset for
  Task-Oriented Dialogue Modelling
MultiWOZ -- A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling
Paweł Budzianowski
Tsung-Hsien Wen
Bo-Hsiang Tseng
I. Casanueva
Stefan Ultes
Osman Ramadan
Milica Gasic
130
1,306
0
29 Sep 2018
Learning to Collaborate: Multi-Scenario Ranking via Multi-Agent
  Reinforcement Learning
Learning to Collaborate: Multi-Scenario Ranking via Multi-Agent Reinforcement Learning
Jun Feng
Heng Li
Minlie Huang
Shichen Liu
Wenwu Ou
Zhirong Wang
Xiaoyan Zhu
36
70
0
17 Sep 2018
Decoupling Strategy and Generation in Negotiation Dialogues
Decoupling Strategy and Generation in Negotiation Dialogues
He He
Derek Chen
Anusha Balakrishnan
Percy Liang
43
178
0
29 Aug 2018
Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning
Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning
Shang-Yu Su
Xiujun Li
Jianfeng Gao
Jingjing Liu
Yun-Nung Chen
OffRL
43
67
0
28 Aug 2018
Neural User Simulation for Corpus-based Policy Optimisation for Spoken
  Dialogue Systems
Neural User Simulation for Corpus-based Policy Optimisation for Spoken Dialogue Systems
Florian Kreyssig
I. Casanueva
Paweł Budzianowski
Milica Gasic
56
74
0
17 May 2018
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent
  Reinforcement Learning
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
118
1,662
0
30 Mar 2018
Iterative Policy Learning in End-to-End Trainable Task-Oriented Neural
  Dialog Models
Iterative Policy Learning in End-to-End Trainable Task-Oriented Neural Dialog Models
Bing-Quan Liu
Ian Lane
OffRL
38
98
0
18 Sep 2017
Natural Language Does Not Emerge Ñaturally' in Multi-Agent Dialog
Natural Language Does Not Emerge Ñaturally' in Multi-Agent Dialog
Satwik Kottur
José M. F. Moura
Stefan Lee
Dhruv Batra
LLMAG
53
218
0
26 Jun 2017
Deal or No Deal? End-to-End Learning for Negotiation Dialogues
Deal or No Deal? End-to-End Learning for Negotiation Dialogues
M. Lewis
Denis Yarats
Yann N. Dauphin
Devi Parikh
Dhruv Batra
LLMAG
55
412
0
16 Jun 2017
Hybrid Reward Architecture for Reinforcement Learning
Hybrid Reward Architecture for Reinforcement Learning
H. V. Seijen
Mehdi Fatemi
Joshua Romoff
Romain Laroche
Tavian Barnes
Jeffrey Tsang
26
252
0
13 Jun 2017
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Ryan J. Lowe
Yi Wu
Aviv Tamar
J. Harb
Pieter Abbeel
Igor Mordatch
116
4,441
0
07 Jun 2017
Counterfactual Multi-Agent Policy Gradients
Counterfactual Multi-Agent Policy Gradients
Jakob N. Foerster
Gregory Farquhar
Triantafyllos Afouras
Nantas Nardelli
Shimon Whiteson
54
2,062
0
24 May 2017
Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep
  Reinforcement Learning
Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning
Baolin Peng
Xiujun Li
Lihong Li
Jianfeng Gao
Asli Celikyilmaz
Sungjin Lee
Kam-Fai Wong
BDL
55
190
0
10 Apr 2017
Learning Cooperative Visual Dialog Agents with Deep Reinforcement
  Learning
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning
Abhishek Das
Satwik Kottur
J. M. F. Moura
Stefan Lee
Dhruv Batra
OffRL
101
425
0
20 Mar 2017
A Sequence-to-Sequence Model for User Simulation in Spoken Dialogue
  Systems
A Sequence-to-Sequence Model for User Simulation in Spoken Dialogue Systems
Layla El Asri
Jing He
Kaheer Suleman
92
118
0
30 Jun 2016
Policy Networks with Two-Stage Training for Dialogue Systems
Policy Networks with Two-Stage Training for Dialogue Systems
Mehdi Fatemi
Layla El Asri
Hannes Schulz
Jing He
Kaheer Suleman
OffRL
30
108
0
10 Jun 2016
On-line Active Reward Learning for Policy Optimisation in Spoken
  Dialogue Systems
On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems
Pei-hao Su
Milica Gasic
N. Mrksic
L. Rojas-Barahona
Stefan Ultes
David Vandyke
Tsung-Hsien Wen
S. Young
OffRL
54
170
0
24 May 2016
The Complexity of Decentralized Control of Markov Decision Processes
The Complexity of Decentralized Control of Markov Decision Processes
D. Bernstein
S. Zilberstein
N. Immerman
54
1,588
0
16 Jan 2013
1