ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1702.03334
  4. Cited By
Batch Policy Gradient Methods for Improving Neural Conversation Models

Batch Policy Gradient Methods for Improving Neural Conversation Models

10 February 2017
Kirthevasan Kandasamy
Yoram Bachrach
Ryota Tomioka
Daniel Tarlow
David Carter
    OffRL
ArXivPDFHTML

Papers citing "Batch Policy Gradient Methods for Improving Neural Conversation Models"

5 / 5 papers shown
Title
Dynamic Prompt Learning via Policy Gradient for Semi-structured
  Mathematical Reasoning
Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning
Pan Lu
Liang Qiu
Kai-Wei Chang
Ying Nian Wu
Song-Chun Zhu
Tanmay Rajpurohit
Peter Clark
Ashwin Kalyan
ReLM
LRM
55
269
0
29 Sep 2022
Deep Learning Based Chatbot Models
Deep Learning Based Chatbot Models
Richard Csaky
29
46
0
23 Aug 2019
Machine Comprehension by Text-to-Text Neural Question Generation
Machine Comprehension by Text-to-Text Neural Question Generation
Xingdi Yuan
Tong Wang
Çağlar Gülçehre
Alessandro Sordoni
Philip Bachman
Sandeep Subramanian
Saizheng Zhang
Adam Trischler
OOD
53
187
0
04 May 2017
Deep Reinforcement Learning: An Overview
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,503
0
25 Jan 2017
Deep Reinforcement Learning for Dialogue Generation
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
220
1,327
0
05 Jun 2016
1