CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning
18 April 2022
Siddharth Verma
Justin Fu
Mengjiao Yang
Sergey Levine
OffRL
Papers citing
"CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning"
22 / 22 papers shown
Prompt Optimization with Logged Bandit Data
Haruka Kiyohara
Daniel Yiming Cao
Yuta Saito
Thorsten Joachims
229
0
0
03 Apr 2025
Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance
Mitsuhiko Nakamoto
Oier Mees
Aviral Kumar
Sergey Levine
OffRL
152
19
0
17 Oct 2024
Reinforcement learning
Florentin Wörgötter
86
2,529
0
16 May 2024
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRL
OnRL
148
1,836
0
08 Jun 2020
Towards a Human-like Open-Domain Chatbot
Daniel De Freitas
Minh-Thang Luong
David R. So
Jamie Hall
Noah Fiedel
...
Zi Yang
Apoorv Kulshreshtha
Gaurav Nemade
Yifeng Lu
Quoc V. Le
120
939
0
27 Jan 2020
Behavior Regularized Offline Reinforcement Learning
Yifan Wu
George Tucker
Ofir Nachum
OffRL
100
690
0
26 Nov 2019
Hierarchical Reinforcement Learning for Open-Domain Dialog
Abdelrhman Saleh
Natasha Jaques
Asma Ghandeharioun
J. Shen
Rosalind W. Picard
OffRL
86
59
0
17 Sep 2019
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog
Natasha Jaques
Asma Ghandeharioun
J. Shen
Craig Ferguson
Àgata Lapedriza
Noah J. Jones
S. Gu
Rosalind W. Picard
OffRL
133
343
0
30 Jun 2019
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRL
OnRL
142
1,067
0
03 Jun 2019
Rethinking Action Spaces for Reinforcement Learning in End-to-end Dialog Agents with Latent Variable Models
Tiancheng Zhao
Kaige Xie
M. Eskénazi
88
142
0
23 Feb 2019
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto
David Meger
Doina Precup
OffRL
BDL
262
1,625
0
07 Dec 2018
Neural Approaches to Conversational AI
Jianfeng Gao
Michel Galley
Lihong Li
154
676
0
21 Sep 2018
Decoupling Strategy and Generation in Negotiation Dialogues
He He
Derek Chen
Anusha Balakrishnan
Percy Liang
69
184
0
29 Aug 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
319
8,432
0
04 Jan 2018
Deal or No Deal? End-to-End Learning for Negotiation Dialogues
M. Lewis
Denis Yarats
Yann N. Dauphin
Devi Parikh
Dhruv Batra
LLMAG
107
415
0
16 Jun 2017
Sequence-to-Sequence Generation for Spoken Dialogue via Deep Syntax Trees and Strings
Ondřej Dušek
Filip Jurčíček
70
187
0
17 Jun 2016
Continuously Learning Neural Dialogue Management
Pei-hao Su
Milica Gasic
N. Mrksic
L. Rojas-Barahona
Stefan Ultes
David Vandyke
Tsung-Hsien Wen
S. Young
91
122
0
08 Jun 2016
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
288
1,341
0
05 Jun 2016
Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation
Iulian Serban
Tim Klinger
Gerald Tesauro
Kartik Talamadupula
Bowen Zhou
Yoshua Bengio
Aaron Courville
80
190
0
02 Jun 2016
A Diversity-Promoting Objective Function for Neural Conversation Models
Jiwei Li
Michel Galley
Chris Brockett
Jianfeng Gao
W. Dolan
149
2,406
0
11 Oct 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
336
13,295
0
09 Sep 2015
Learning from Real Users: Rating Dialogue Success with Neural Networks for Reinforcement Learning in Spoken Dialogue Systems
Pei-hao Su
David Vandyke
Milica Gasic
Dongho Kim
N. Mrksic
Tsung-Hsien Wen
S. Young
73
70
0
13 Aug 2015