Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2202.13675
Cited By
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-Oriented Dialogue Policy Learning
28 February 2022
Wai-Chung Kwan
Hongru Wang
Huimin Wang
Kam-Fai Wong
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-Oriented Dialogue Policy Learning"
25 / 25 papers shown
Title
Planning with Diffusion Models for Target-Oriented Dialogue Systems
Hanwen Du
B. Peng
Xia Ning
25
0
0
23 Apr 2025
Simulating Before Planning: Constructing Intrinsic User World Model for User-Tailored Dialogue Policy Planning
Tao He
Lizi Liao
Ming Liu
Bing Qin
32
0
0
18 Apr 2025
Do We Really Need Curated Malicious Data for Safety Alignment in Multi-modal Large Language Models?
Yanbo Wang
Jiyang Guan
Jian Liang
Ran He
51
0
0
14 Apr 2025
Policy Learning with a Natural Language Action Space: A Causal Approach
Bohan Zhang
Yixin Wang
Paramveer S. Dhillon
CML
46
0
0
24 Feb 2025
Infusing Emotions into Task-oriented Dialogue Systems: Understanding, Management, and Generation
Shutong Feng
Hsien-chin Lin
Christian Geishauser
Nurul Lubis
Carel van Niekerk
Michael Heck
Benjamin Ruppik
Renato Vukovic
Milica Gašić
31
3
0
05 Aug 2024
Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue
Huifang Du
Shuqin Li
Minghao Wu
Xuejing Feng
Yuan-Fang Li
Haofen Wang
OffRL
86
1
0
20 Jun 2024
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
Shentao Yang
Tianqi Chen
Mingyuan Zhou
EGVM
34
22
0
13 Feb 2024
Multi-User Chat Assistant (MUCA): a Framework Using LLMs to Facilitate Group Conversations
Manqing Mao
Paishun Ting
Yijian Xiang
Mingyang Xu
Julia Chen
Jianzhe Lin
LLMAG
33
6
0
10 Jan 2024
Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition
David M. Chan
Shalini Ghosh
Hitesh Tulsiani
Ariya Rastrow
Björn Hoffmeister
28
1
0
04 Jan 2024
Learning Top-k Subtask Planning Tree based on Discriminative Representation Pre-training for Decision Making
Jingqing Ruan
Kaishen Wang
Qingyang Zhang
Dengpeng Xing
Bo Xu
33
0
0
18 Dec 2023
TOD-Flow: Modeling the Structure of Task-Oriented Dialogues
Sungryull Sohn
Yiwei Lyu
Anthony Z. Liu
Lajanugen Logeswaran
Dong-Ki Kim
Dongsub Shim
Honglak Lee
28
3
0
07 Dec 2023
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
Marwa Abdulhai
Isadora White
Charles Burton Snell
Charles Sun
Joey Hong
Yuexiang Zhai
Kelvin Xu
Sergey Levine
LLMAG
OffRL
LRM
31
31
0
30 Nov 2023
A Survey of the Evolution of Language Model-Based Dialogue Systems
Hongru Wang
Lingzhi Wang
Yiming Du
Liang Chen
Jing Zhou
Yufei Wang
Kam-Fai Wong
LRM
61
20
0
28 Nov 2023
End-to-end Task-oriented Dialogue: A Survey of Tasks, Methods, and Future Directions
Libo Qin
Wenbo Pan
Qiguang Chen
Lizi Liao
Zhou Yu
Yue Zhang
Wanxiang Che
Min Li
26
11
0
15 Nov 2023
Plug-and-Play Policy Planner for Large Language Model Powered Dialogue Agents
Yang Deng
Wenxuan Zhang
Wai Lam
See-Kiong Ng
Tat-Seng Chua
LM&Ro
LLMAG
16
34
0
01 Nov 2023
Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and Alignment
Boyang Xue
Weichao Wang
Hongru Wang
Fei Mi
Rui Wang
Yasheng Wang
Lifeng Shang
Xin Jiang
Qun Liu
Kam-Fai Wong
KELM
HILM
216
15
0
12 Oct 2023
Dialog Action-Aware Transformer for Dialog Policy Learning
Huimin Wang
Wai-Chung Kwan
Kam-Fai Wong
OffRL
16
1
0
05 Sep 2023
JoTR: A Joint Transformer and Reinforcement Learning Framework for Dialog Policy Learning
Wai-Chung Kwan
Huimin Wang
Hongru Wang
Zezhong Wang
Xian Wu
Yefeng Zheng
Kam-Fai Wong
OffRL
17
0
0
01 Sep 2023
Why Guided Dialog Policy Learning performs well? Understanding the role of adversarial learning and its alternative
Sho Shimoyama
Tetsuro Morimura
Kenshi Abe
Toda Takamichi
Yuta Tomomatsu
Masakazu Sugiyama
Asahi Hentona
Yuuki Azuma
Hirotaka Ninomiya
OffRL
26
0
0
13 Jul 2023
Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy Optimization
Yangyang Zhao
Zhenyu Wang
Mehdi Dastani
Shihan Wang
19
0
0
05 May 2023
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems
Yihao Feng
Shentao Yang
Shujian Zhang
Jianguo Zhang
Caiming Xiong
Mi Zhou
Haiquan Wang
OffRL
28
24
0
20 Feb 2023
KddRES: A Multi-level Knowledge-driven Dialogue Dataset for Restaurant Towards Customized Dialogue System
Hongru Wang
Min Li
Zimo Zhou
Gabriel Pui Cheong Fung
Kam-Fai Wong
21
5
0
17 Nov 2020
Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation
Xinting Huang
Jianzhong Qi
Yu Sun
Rui Zhang
OffRL
69
18
0
09 May 2020
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
341
11,684
0
09 Mar 2017
A Sequence-to-Sequence Model for User Simulation in Spoken Dialogue Systems
Layla El Asri
Jing He
Kaheer Suleman
57
117
0
30 Jun 2016
1