1607.07086
Cited By
An Actor-Critic Algorithm for Sequence Prediction
24 July 2016
Dzmitry Bahdanau
Philemon Brakel
Kelvin Xu
Anirudh Goyal
Ryan J. Lowe
Joelle Pineau
Aaron Courville
Yoshua Bengio
Papers citing
"An Actor-Critic Algorithm for Sequence Prediction"
12 / 362 papers shown
Title
Learning to Decode for Future Success
Jiwei Li
Will Monroe
Dan Jurafsky
26
58
0
23 Jan 2017
Adversarial Learning for Neural Dialogue Generation
Jiwei Li
Will Monroe
Tianlin Shi
Sébastien Jean
Alan Ritter
Dan Jurafsky
21
897
0
23 Jan 2017
Comprehension-guided referring expressions
Ruotian Luo
Gregory Shakhnarovich
ObjD
29
171
0
12 Jan 2017
Self-critical Sequence Training for Image Captioning
Steven J. Rennie
E. Marcheret
Youssef Mroueh
Jerret Ross
Vaibhava Goel
11
1,877
0
02 Dec 2016
Improved Image Captioning via Policy Gradient optimization of SPIDEr
Siqi Liu
Zhenhai Zhu
Ning Ye
S. Guadarrama
Kevin Patrick Murphy
31
440
0
01 Dec 2016
A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models
Chelsea Finn
Paul Christiano
Pieter Abbeel
Sergey Levine
OffRL
AI4CE
GAN
19
350
0
11 Nov 2016
Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control
Natasha Jaques
S. Gu
Dzmitry Bahdanau
José Miguel Hernández-Lobato
Richard Turner
Douglas Eck
30
169
0
09 Nov 2016
Professor Forcing: A New Algorithm for Training Recurrent Networks
Alex Lamb
Anirudh Goyal
Ying Zhang
Saizheng Zhang
Aaron Courville
Yoshua Bengio
GAN
60
588
0
27 Oct 2016
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,746
0
26 Sep 2016
SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
Lantao Yu
Weinan Zhang
Jun Wang
Yong Yu
GAN
20
2,385
0
18 Sep 2016
Reward Augmented Maximum Likelihood for Neural Structured Prediction
Mohammad Norouzi
Samy Bengio
Z. Chen
Navdeep Jaitly
M. Schuster
Yonghui Wu
Dale Schuurmans
24
252
0
01 Sep 2016
Sequence-to-Sequence Learning as Beam-Search Optimization
Sam Wiseman
Alexander M. Rush
44
589
0
09 Jun 2016