ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.05565
  4. Cited By
Survey on reinforcement learning for language processing
v1v2v3 (latest)

Survey on reinforcement learning for language processing

12 April 2021
Víctor Uc Cetina
Nicolás Navarro-Guerrero
A. Martín-González
C. Weber
S. Wermter
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Survey on reinforcement learning for language processing"

33 / 83 papers shown
Title
End-to-End Task-Completion Neural Dialogue Systems
End-to-End Task-Completion Neural Dialogue Systems
Xiujun Li
Yun-Nung Chen
Lihong Li
Jianfeng Gao
Asli Celikyilmaz
96
370
0
03 Mar 2017
Learning Conversational Systems that Interleave Task and Non-Task
  Content
Learning Conversational Systems that Interleave Task and Non-Task Content
Zhou Yu
A. Black
Alexander I. Rudnicky
68
51
0
01 Mar 2017
Maximum-Likelihood Augmented Discrete Generative Adversarial Networks
Maximum-Likelihood Augmented Discrete Generative Adversarial Networks
Tong Che
Yanran Li
Ruixiang Zhang
R. Devon Hjelm
Wenjie Li
Yangqiu Song
Yoshua Bengio
GAN
79
235
0
26 Feb 2017
Tackling Error Propagation through Reinforcement Learning: A Case of
  Greedy Dependency Parsing
Tackling Error Propagation through Reinforcement Learning: A Case of Greedy Dependency Parsing
Minh Le
Antske Fokkens
48
18
0
22 Feb 2017
A User Simulator for Task-Completion Dialogues
A User Simulator for Task-Completion Dialogues
Xiujun Li
Zachary Chase Lipton
Bhuwan Dhingra
Lihong Li
Jianfeng Gao
Yun-Nung Chen
OffRL
87
167
0
17 Dec 2016
Dual Learning for Machine Translation
Dual Learning for Machine Translation
Yingce Xia
Di He
Tao Qin
Liwei Wang
Nenghai Yu
Tie-Yan Liu
Wei-Ying Ma
AI4CE
91
850
0
01 Nov 2016
Learning to Translate in Real-time with Neural Machine Translation
Learning to Translate in Real-time with Neural Machine Translation
Jiatao Gu
Graham Neubig
Kyunghyun Cho
Victor O.K. Li
87
219
0
03 Oct 2016
Google's Neural Machine Translation System: Bridging the Gap between
  Human and Machine Translation
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Zhiwen Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
918
6,799
0
26 Sep 2016
SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
Lantao Yu
Weinan Zhang
Jun Wang
Yong Yu
GAN
72
2,406
0
18 Sep 2016
Enriching Word Vectors with Subword Information
Enriching Word Vectors with Subword Information
Piotr Bojanowski
Edouard Grave
Armand Joulin
Tomas Mikolov
NAISSLVLM
234
9,986
0
15 Jul 2016
Deep Reinforcement Learning for Dialogue Generation
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
288
1,339
0
05 Jun 2016
Improving Information Extraction by Acquiring External Evidence with
  Reinforcement Learning
Improving Information Extraction by Acquiring External Evidence with Reinforcement Learning
Karthik Narasimhan
Adam Yala
Regina Barzilay
OffRL
92
152
0
25 Mar 2016
Value Iteration Networks
Value Iteration Networks
Aviv Tamar
Yi Wu
G. Thomas
Sergey Levine
Pieter Abbeel
82
656
0
09 Feb 2016
Asynchronous Methods for Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
210
8,881
0
04 Feb 2016
Bandit Structured Prediction for Learning from Partial Feedback in
  Statistical Machine Translation
Bandit Structured Prediction for Learning from Partial Feedback in Statistical Machine Translation
Artem Sokolov
Stefan Riezler
Tanguy Urvoy
68
22
0
18 Jan 2016
A Survey of Available Corpora for Building Data-Driven Dialogue Systems
A Survey of Available Corpora for Building Data-Driven Dialogue Systems
Iulian Serban
Ryan J. Lowe
Peter Henderson
Laurent Charlin
Joelle Pineau
63
342
0
17 Dec 2015
Deep Reinforcement Learning with a Natural Language Action Space
Deep Reinforcement Learning with a Natural Language Action Space
Ji He
Jianshu Chen
Xiaodong He
Jianfeng Gao
Lihong Li
Li Deng
Mari Ostendorf
108
246
0
14 Nov 2015
Generating Text with Deep Reinforcement Learning
Generating Text with Deep Reinforcement Learning
Hongyu Guo
AIMat
44
50
0
30 Oct 2015
Continuous control with deep reinforcement learning
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
330
13,289
0
09 Sep 2015
Language Understanding for Text-based Games Using Deep Reinforcement
  Learning
Language Understanding for Text-based Games Using Deep Reinforcement Learning
Karthik Narasimhan
Tejas D. Kulkarni
Regina Barzilay
OffRL
104
361
0
30 Jun 2015
Skip-Thought Vectors
Skip-Thought Vectors
Ryan Kiros
Yukun Zhu
Ruslan Salakhutdinov
R. Zemel
Antonio Torralba
R. Urtasun
Sanja Fidler
SSL
226
2,412
0
22 Jun 2015
Scheduled Sampling for Sequence Prediction with Recurrent Neural
  Networks
Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks
Samy Bengio
Oriol Vinyals
Navdeep Jaitly
Noam M. Shazeer
156
2,039
0
09 Jun 2015
Trust Region Policy Optimization
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
283
6,801
0
19 Feb 2015
Sequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
450
20,606
0
10 Sep 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
580
27,338
0
01 Sep 2014
Learning Phrase Representations using RNN Encoder-Decoder for
  Statistical Machine Translation
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Kyunghyun Cho
B. V. Merrienboer
Çağlar Gülçehre
Dzmitry Bahdanau
Fethi Bougares
Holger Schwenk
Yoshua Bengio
AIMat
1.1K
23,396
0
03 Jun 2014
Distributed Representations of Sentences and Documents
Distributed Representations of Sentences and Documents
Quoc V. Le
Tomas Mikolov
FaML
265
9,250
0
16 May 2014
Learning to Win by Reading Manuals in a Monte-Carlo Framework
Learning to Win by Reading Manuals in a Monte-Carlo Framework
S. Branavan
David Silver
Regina Barzilay
111
191
0
18 Jan 2014
Efficient Estimation of Word Representations in Vector Space
Efficient Estimation of Word Representations in Vector Space
Tomas Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
693
31,553
0
16 Jan 2013
Optimizing Dialogue Management with Reinforcement Learning: Experiments
  with the NJFun System
Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System
Michael Kearns
Diane Litman
Satinder Singh
M. Walker
OffRL
96
426
0
03 Jun 2011
An Application of Reinforcement Learning to Dialogue Strategy Selection
  in a Spoken Dialogue System for Email
An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email
M. Walker
94
244
0
01 Jun 2011
A Contextual-Bandit Approach to Personalized News Article Recommendation
A Contextual-Bandit Approach to Personalized News Article Recommendation
Lihong Li
Wei Chu
John Langford
Robert Schapire
473
2,957
0
28 Feb 2010
Search-based Structured Prediction
Search-based Structured Prediction
Hal Daumé
John Langford
Daniel Marcu
GNN
142
586
0
04 Jul 2009
Previous
12