ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2209.07873
  4. Cited By
Adaptive Natural Language Generation for Task-oriented Dialogue via
  Reinforcement Learning

Adaptive Natural Language Generation for Task-oriented Dialogue via Reinforcement Learning

16 September 2022
Atsumoto Ohashi
Ryuichiro Higashinaka
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Adaptive Natural Language Generation for Task-oriented Dialogue via Reinforcement Learning"

15 / 15 papers shown
Title
Robustness Testing of Language Understanding in Task-Oriented Dialog
Robustness Testing of Language Understanding in Task-Oriented Dialog
Jiexi Liu
Ryuichi Takanobu
Jiaxin Wen
Dazhen Wan
Hongguang Li
Weiran Nie
Cheng Li
Wei Peng
Minlie Huang
ELM
98
48
0
30 Dec 2020
Few-shot Natural Language Generation for Task-Oriented Dialog
Few-shot Natural Language Generation for Task-Oriented Dialog
Baolin Peng
Chenguang Zhu
Chunyuan Li
Xiujun Li
Jinchao Li
Michael Zeng
Jianfeng Gao
80
201
0
27 Feb 2020
ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and
  Diagnosing Dialogue Systems
ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems
Qi Zhu
Zheng Zhang
Yan Fang
Xiang Li
Ryuichi Takanobu
Jinchao Li
Baolin Peng
Jianfeng Gao
Xiaoyan Zhu
Minlie Huang
93
106
0
12 Feb 2020
Fine-Tuning Language Models from Human Preferences
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
474
1,766
0
18 Sep 2019
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human
  Preferences in Dialog
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog
Natasha Jaques
Asma Ghandeharioun
J. Shen
Craig Ferguson
Àgata Lapedriza
Noah J. Jones
S. Gu
Rosalind W. Picard
OffRL
130
343
0
30 Jun 2019
A Survey of Reinforcement Learning Informed by Natural Language
A Survey of Reinforcement Learning Informed by Natural Language
Jelena Luketina
Nantas Nardelli
Gregory Farquhar
Jakob N. Foerster
Jacob Andreas
Edward Grefenstette
Shimon Whiteson
Tim Rocktaschel
LM&RoKELMOffRLLRM
84
282
0
10 Jun 2019
Rethinking Action Spaces for Reinforcement Learning in End-to-end Dialog
  Agents with Latent Variable Models
Rethinking Action Spaces for Reinforcement Learning in End-to-end Dialog Agents with Latent Variable Models
Tiancheng Zhao
Kaige Xie
M. Eskénazi
75
142
0
23 Feb 2019
Learning from Dialogue after Deployment: Feed Yourself, Chatbot!
Learning from Dialogue after Deployment: Feed Yourself, Chatbot!
Braden Hancock
Antoine Bordes
Pierre-Emmanuel Mazaré
Jason Weston
124
194
0
16 Jan 2019
MultiWOZ -- A Large-Scale Multi-Domain Wizard-of-Oz Dataset for
  Task-Oriented Dialogue Modelling
MultiWOZ -- A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling
Paweł Budzianowski
Tsung-Hsien Wen
Bo-Hsiang Tseng
I. Casanueva
Stefan Ultes
Osman Ramadan
Milica Gasic
184
1,324
0
29 Sep 2018
Controllable Neural Story Plot Generation via Reward Shaping
Controllable Neural Story Plot Generation via Reward Shaping
Pradyumna Tambwekar
Murtaza Dhuliawala
Lara J. Martin
Animesh Mehta
Brent Harrison
Mark O. Riedl
77
88
0
27 Sep 2018
BanditSum: Extractive Summarization as a Contextual Bandit
BanditSum: Extractive Summarization as a Contextual Bandit
Yue Dong
Songlin Yang
Eric Crawford
H. V. Hoof
Jackie C.K. Cheung
60
181
0
25 Sep 2018
Neural Approaches to Conversational AI
Neural Approaches to Conversational AI
Jianfeng Gao
Michel Galley
Lihong Li
103
676
0
21 Sep 2018
Google's Neural Machine Translation System: Bridging the Gap between
  Human and Machine Translation
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Zhiwen Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
911
6,796
0
26 Sep 2016
Sequence Level Training with Recurrent Neural Networks
Sequence Level Training with Recurrent Neural Networks
MarcÁurelio Ranzato
S. Chopra
Michael Auli
Wojciech Zaremba
102
1,620
0
20 Nov 2015
High-Dimensional Continuous Control Using Generalized Advantage
  Estimation
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
125
3,438
0
08 Jun 2015
1