1702.03334
Cited By
Batch Policy Gradient Methods for Improving Neural Conversation Models
10 February 2017
Kirthevasan Kandasamy
Yoram Bachrach
Ryota Tomioka
Daniel Tarlow
David Carter
OffRL
Papers citing "Batch Policy Gradient Methods for Improving Neural Conversation Models"
5 / 5 papers shown
Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning
Pan Lu
Liang Qiu
Kai-Wei Chang
Ying Nian Wu
Song-Chun Zhu
Tanmay Rajpurohit
Peter Clark
Ashwin Kalyan
ReLM
LRM
55
269
0
29 Sep 2022
Deep Learning Based Chatbot Models
Richard Csaky
29
46
0
23 Aug 2019
Machine Comprehension by Text-to-Text Neural Question Generation
Xingdi Yuan
Tong Wang
Çağlar Gülçehre
Alessandro Sordoni
Philip Bachman
Sandeep Subramanian
Saizheng Zhang
Adam Trischler
OOD
53
187
0
04 May 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,503
0
25 Jan 2017
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
217
1,327
0
05 Jun 2016