1609.00777
Cited By
v1
v2
v3 (latest)
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access
3 September 2016
Bhuwan Dhingra
Lihong Li
Xiujun Li
Jianfeng Gao
Yun-Nung Chen
Faisal Ahmed
Li Deng
Github (185★)
Papers citing
"Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access"
50 / 149 papers shown
Title
Process-Supervised Reinforcement Learning for Code Generation
Yufan Ye
Ting Zhang
Wenbin Jiang
Hua Huang
OffRL
LRM
SyDa
112
1
0
03 Feb 2025
Leveraging Knowledge Graph Embedding for Effective Conversational Recommendation
Yunwen Xia
Hui Fang
Zhiyuan Zhao
Xuelong Li
42
0
0
02 Aug 2024
ChatShop: Interactive Information Seeking with Language Agents
Sanxing Chen
Sam Wiseman
Bhuwan Dhingra
KELM
99
11
0
15 Apr 2024
Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation
Meng Cao
Lei Shu
Lei Yu
Yun Zhu
Nevan Wichers
Yinxiao Liu
Lei Meng
OffRL
ALM
49
7
0
14 Jan 2024
Conversational Question Answering with Reformulations over Knowledge Graph
Lihui Liu
Blaine Hill
Boxin Du
Fei Wang
Hanghang Tong
64
5
0
27 Dec 2023
End-to-end Task-oriented Dialogue: A Survey of Tasks, Methods, and Future Directions
Libo Qin
Wenbo Pan
Qiguang Chen
Lizi Liao
Zhou Yu
Yue Zhang
Wanxiang Che
Min Li
93
13
0
15 Nov 2023
Successor Features for Efficient Multisubject Controlled Text Generation
Mengyao Cao
Mehdi Fatemi
Jackie Chi Kit Cheung
Samira Shabanian
BDL
82
0
0
03 Nov 2023
The Past, Present and Better Future of Feedback Learning in Large Language Models for Subjective Human Preferences and Values
Hannah Rose Kirk
Andrew M. Bean
Bertie Vidgen
Paul Röttger
Scott A. Hale
ALM
113
50
0
11 Oct 2023
Probing the Multi-turn Planning Capabilities of LLMs via 20 Question Games
Yizhe Zhang
Jiarui Lu
Navdeep Jaitly
LRM
ELM
71
13
0
02 Oct 2023
LLMRec: Benchmarking Large Language Models on Recommendation Task
Junling Liu
Chao-Hong Liu
Peilin Zhou
Qichen Ye
Dading Chong
...
Yueqi Xie
Dongyuan Li
Shoujin Wang
Chenyu You
Philip S. Yu
ALM
LRM
71
34
0
23 Aug 2023
End-to-End Evaluation of a Spoken Dialogue System for Learning Basic Mathematics
Eda Okur
Saurav Sahay
Roddy Fuentes Alba
L. Nachman
70
6
0
07 Nov 2022
Understanding the Limits of Poisoning Attacks in Episodic Reinforcement Learning
A. Rangi
Haifeng Xu
Long Tran-Thanh
M. Franceschetti
AAML
OffRL
70
24
0
29 Aug 2022
REKnow: Enhanced Knowledge for Joint Entity and Relation Extraction
Sheng Zhang
Patrick Ng
Zhiguo Wang
Bing Xiang
63
5
0
10 Jun 2022
NLU for Game-based Learning in Real: Initial Evaluations
Eda Okur
Saurav Sahay
L. Nachman
LLMAG
31
2
0
27 May 2022
Data Augmentation with Paraphrase Generation and Entity Extraction for Multimodal Dialogue System
Eda Okur
Saurav Sahay
L. Nachman
55
25
0
09 May 2022
Locally Aggregated Feature Attribution on Natural Language Model Understanding
Shenmin Zhang
Jin Wang
Haitao Jiang
Rui Song
FAtt
69
3
0
22 Apr 2022
Recent Progress in Conversational AI
Zijun Xue
Ruirui Li
Mingda Li
VLM
LLMAG
63
2
0
08 Apr 2022
Conversational Agents: Theory and Applications
M. Wahde
M. Virgolin
LLMAG
68
26
0
07 Feb 2022
Findings from Experiments of On-line Joint Reinforcement Learning of Semantic Parser and Dialogue Manager with real Users
Matthieu Riou
Bassam Jabaian
Stéphane Huet
F. Lefèvre
OffRL
50
0
0
25 Oct 2021
Knowledge Graph-enhanced Sampling for Conversational Recommender System
Mengyuan Zhao
Xiaowen Huang
Lixi Zhu
Jitao Sang
Jian Yu
45
3
0
13 Oct 2021
Feudal Reinforcement Learning by Reading Manuals
Kai Wang
Zhonghao Wang
Mo Yu
Humphrey Shi
OffRL
77
0
0
13 Oct 2021
Reinforced Natural Language Interfaces via Entropy Decomposition
Xiaoran Wu
Yipeng Kang
LLMAG
70
0
0
23 Sep 2021
A Neural Conversation Generation Model via Equivalent Shared Memory Investigation
Changzhen Ji
Yating Zhang
Xiaozhong Liu
Adam Jatowt
Changlong Sun
Conghui Zhu
Tiejun Zhao
53
1
0
20 Aug 2021
WeaSuL: Weakly Supervised Dialogue Policy Learning: Reward Estimation for Multi-turn Dialogue
Anant Khandelwal
OffRL
60
6
0
01 Aug 2021
Transferable Dialogue Systems and User Simulators
Bo-Hsiang Tseng
Yinpei Dai
Florian Kreyssig
Bill Byrne
96
54
0
25 Jul 2021
Training like Playing: A Reinforcement Learning And Knowledge Graph-based framework for building Automatic Consultation System in Medical Field
Yining Huang
Meilian Chen
Keke Tang
55
3
0
14 Jun 2021
High-Quality Diversification for Task-Oriented Dialogue Systems
Zhiwen Tang
Hrishikesh Kulkarni
Grace Hui Yang
43
9
0
02 Jun 2021
Conversational Question Answering: A Survey
Munazza Zaib
Wei Emma Zhang
Quan Z. Sheng
A. Mahmood
Yang Zhang
89
91
0
02 Jun 2021
Modeling Text-visual Mutual Dependency for Multi-modal Dialog Generation
Shuhe Wang
Yuxian Meng
Xiaofei Sun
Leilei Gan
Rongbin Ouyang
Rui Yan
Tianwei Zhang
Jiwei Li
66
15
0
30 May 2021
Recent Advances in Deep Learning Based Dialogue Systems: A Systematic Survey
Jinjie Ni
Tom Young
Vlad Pandelea
Fuzhao Xue
Min Zhang
225
279
0
10 May 2021
Grounding Dialogue Systems via Knowledge Graph Aware Decoding with Pre-trained Transformers
Debanjan Chaudhuri
Md. Rony
Jens Lehmann
66
12
0
30 Mar 2021
Reward Poisoning in Reinforcement Learning: Attacks Against Unknown Learners in Unknown Environments
Amin Rakhsha
Xuezhou Zhang
Xiaojin Zhu
Adish Singla
AAML
OffRL
86
37
0
16 Feb 2021
Converse, Focus and Guess -- Towards Multi-Document Driven Dialogue
Han Liu
Caixia Yuan
Xiaojie Wang
Yushu Yang
Huixing Jiang
Zhongyuan Wang
106
1
0
04 Feb 2021
Automatic Curriculum Learning With Over-repetition Penalty for Dialogue Policy Learning
Yangyang Zhao
Zhenyu Wang
Zhenhua Huang
OffRL
150
19
0
28 Dec 2020
Graph-Evolving Meta-Learning for Low-Resource Medical Dialogue Generation
Shuai Lin
Pan Zhou
Xiaodan Liang
Jianheng Tang
Ruihui Zhao
Ziliang Chen
Liang Lin
MedIm
81
55
0
22 Dec 2020
VisualHints: A Visual-Lingual Environment for Multimodal Reinforcement Learning
Thomas Carta
Subhajit Chaudhury
Kartik Talamadupula
Michiaki Tatsubori
24
3
0
26 Oct 2020
Improving Dialog Systems for Negotiation with Personality Modeling
Runzhe Yang
Jingxiao Chen
Karthik Narasimhan
106
51
0
20 Oct 2020
MEEP: An Open-Source Platform for Human-Human Dialog Collection and End-to-End Agent Training
Arkady Arkhangorodsky
Amittai Axelrod
Christopher Chu
Scot Fang
Yiqi Huang
Ajay Nagesh
Xing Shi
Boliang Zhang
Kevin Knight
LLMAG
VLM
AuLLM
16
2
0
09 Oct 2020
Towards Topic-Guided Conversational Recommender System
Kun Zhou
Yuanhang Zhou
Wayne Xin Zhao
Xiaoke Wang
Ji-Rong Wen
67
204
0
08 Oct 2020
Bootstrapped Q-learning with Context Relevant Observation Pruning to Generalize in Text-based Games
Subhajit Chaudhury
Daiki Kimura
Kartik Talamadupula
Michiaki Tatsubori
Asim Munawar
Ryuki Tachibana
OffRL
53
10
0
24 Sep 2020
SUMBT+LaRL: Effective Multi-domain End-to-end Neural Task-oriented Dialog System
Hwaran Lee
Seokhwan Jo
Hyungjun Kim
Sangkeun Jung
Tae-Yoon Kim
OffRL
91
6
0
22 Sep 2020
Rethinking Supervised Learning and Reinforcement Learning in Task-Oriented Dialogue Systems
Ziming Li
Julia Kiseleva
Maarten de Rijke
OffRL
59
22
0
21 Sep 2020
Robust Conversational AI with Grounded Text Generation
Jianfeng Gao
Baolin Peng
Chunyuan Li
Jinchao Li
Shahin Shayandeh
Lars Liden
H. Shum
71
21
0
07 Sep 2020
Document-editing Assistants and Model-based Reinforcement Learning as a Path to Conversational AI
Katya Kudashkina
P. Pilarski
R. Sutton
KELM
82
6
0
27 Aug 2020
An Overview of Natural Language State Representation for Reinforcement Learning
Brielen Madureira
David Schlangen
OffRL
78
9
0
19 Jul 2020
Optimizing Interactive Systems via Data-Driven Objectives
Ziming Li
Julia Kiseleva
A. Grotov
Maarten de Rijke
Harrie Oosterhuis
OffRL
38
3
0
19 Jun 2020
Report from the NSF Future Directions Workshop, Toward User-Oriented Agents: Research Directions and Challenges
M. Eskénazi
Tiancheng Zhao
LLMAG
AI4TS
AI4CE
90
9
0
10 Jun 2020
Offline and Online Satisfaction Prediction in Open-Domain Conversational Systems
J. Choi
Ali Ahmadvand
Eugene Agichtein
OffRL
57
28
0
02 Jun 2020
Variational Reward Estimator Bottleneck: Learning Robust Reward Estimator for Multi-Domain Task-Oriented Dialog
Jeiyoon Park
Chanhee Lee
Kuekyeng Kim
Heuiseok Lim
OffRL
40
0
0
31 May 2020
Is Your Goal-Oriented Dialog Model Performing Really Well? Empirical Analysis of System-wise Evaluation
Ryuichi Takanobu
Qi Zhu
Jinchao Li
Baolin Peng
Jianfeng Gao
Minlie Huang
51
43
0
15 May 2020