Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1607.07086
Cited By
An Actor-Critic Algorithm for Sequence Prediction
24 July 2016
Dzmitry Bahdanau
Philemon Brakel
Kelvin Xu
Anirudh Goyal
Ryan J. Lowe
Joelle Pineau
Aaron Courville
Yoshua Bengio
Re-assign community
ArXiv
PDF
HTML
Papers citing
"An Actor-Critic Algorithm for Sequence Prediction"
50 / 362 papers shown
Title
Neural Particle Smoothing for Sampling from Conditional Sequence Models
Chu-cheng Lin
Jason Eisner
BDL
19
12
0
28 Apr 2018
Interactive Language Acquisition with One-shot Visual Concept Learning through a Conversational Game
Haichao Zhang
Haonan Yu
Wenyuan Xu
LLMAG
45
8
0
26 Apr 2018
No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling
Xin Eric Wang
Wenhu Chen
Yuan-fang Wang
William Yang Wang
11
157
0
24 Apr 2018
A Stable and Effective Learning Strategy for Trainable Greedy Decoding
Yun Chen
V. Li
Kyunghyun Cho
Samuel R. Bowman
28
28
0
21 Apr 2018
Can Neural Machine Translation be Improved with User Feedback?
Julia Kreutzer
Shahram Khadivi
E. Matusov
Stefan Riezler
14
93
0
16 Apr 2018
Learning How to Self-Learn: Enhancing Self-Training Using Neural Reinforcement Learning
Chenhua Chen
Yue Zhang
SSL
22
11
0
16 Apr 2018
Actor-Critic based Training Framework for Abstractive Summarization
Piji Li
Lidong Bing
Wai Lam
OffRL
12
49
0
28 Mar 2018
A Survey on Neural Network-Based Summarization Methods
Yue Dong
AILaw
AI4TS
27
34
0
19 Mar 2018
Unpaired Image Captioning by Language Pivoting
Jiuxiang Gu
Chenyu You
Jianfei Cai
G. Wang
24
82
0
14 Mar 2018
Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents
Kaipeng Zhang
Zhuoran Yang
Han Liu
Tong Zhang
Tamer Basar
40
582
0
23 Feb 2018
Efficient Collaborative Multi-Agent Deep Reinforcement Learning for Large-Scale Fleet Management
Kaixiang Lin
Renyu Zhao
Zhe Xu
Jiayu Zhou
10
8
0
18 Feb 2018
DP-GAN: Diversity-Promoting Generative Adversarial Network for Generating Informative and Diversified Text
Jingjing Xu
Xuancheng Ren
Junyang Lin
Xu Sun
25
144
0
05 Feb 2018
MaskGAN: Better Text Generation via Filling in the______
W. Fedus
Ian Goodfellow
Andrew M. Dai
24
468
0
23 Jan 2018
Expected Policy Gradients for Reinforcement Learning
K. Ciosek
Shimon Whiteson
50
51
0
10 Jan 2018
Improving End-to-End Speech Recognition with Policy Learning
Yingbo Zhou
Caiming Xiong
R. Socher
19
40
0
19 Dec 2017
End-to-End Offline Goal-Oriented Dialog Policy Learning via Policy Gradient
Li Zhou
Kevin Small
Oleg Rokhlenko
Charles Elkan
OffRL
14
41
0
07 Dec 2017
Video Captioning via Hierarchical Reinforcement Learning
Xin Eric Wang
Wenhu Chen
Jiawei Wu
Yuan-fang Wang
William Yang Wang
24
228
0
29 Nov 2017
Plan, Attend, Generate: Planning for Sequence-to-Sequence Models
Francis Dutil
Çağlar Gülçehre
Adam Trischler
Yoshua Bengio
18
12
0
28 Nov 2017
Neural Text Generation: A Practical Guide
Ziang Xie
6
46
0
27 Nov 2017
Modeling Past and Future for Neural Machine Translation
Zaixiang Zheng
Hao Zhou
Shujian Huang
Lili Mou
Xinyu Dai
Jiajun Chen
Zhaopeng Tu
29
48
0
27 Nov 2017
Classical Structured Prediction Losses for Sequence to Sequence Learning
Sergey Edunov
Myle Ott
Michael Auli
David Grangier
MarcÁurelio Ranzato
AIMat
56
185
0
14 Nov 2017
ACtuAL: Actor-Critic Under Adversarial Learning
Anirudh Goyal
Nan Rosemary Ke
Alex Lamb
R. Devon Hjelm
C. Pal
Joelle Pineau
Yoshua Bengio
GAN
20
9
0
13 Nov 2017
Paraphrase Generation with Deep Reinforcement Learning
Zichao Li
Xin Jiang
Lifeng Shang
Hang Li
OffRL
16
213
0
01 Nov 2017
DCN+: Mixed Objective and Deep Residual Coattention for Question Answering
Caiming Xiong
Victor Zhong
R. Socher
29
109
0
31 Oct 2017
Stack-Captioning: Coarse-to-Fine Learning for Image Captioning
Jiuxiang Gu
Jianfei Cai
G. Wang
Tsuhan Chen
27
178
0
11 Sep 2017
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
35
2,775
0
19 Aug 2017
Deconvolutional Paragraph Representation Learning
Yizhe Zhang
Dinghan Shen
Guoyin Wang
Zhe Gan
Ricardo Henao
Lawrence Carin
SSL
AI4TS
17
76
0
16 Aug 2017
A Continuous Relaxation of Beam Search for End-to-end Training of Neural Sequence Models
Kartik Goyal
Graham Neubig
Chris Dyer
Taylor Berg-Kirkpatrick
3DV
39
40
0
01 Aug 2017
A Shared Task on Bandit Learning for Machine Translation
Artem Sokolov
Julia Kreutzer
Kellen Sunderland
Pavel Danchenko
Witold Szymaniak
Hagen Fürstenau
Stefan Riezler
27
16
0
27 Jul 2017
Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback
Khanh Nguyen
Hal Daumé
Jordan L. Boyd-Graber
30
135
0
24 Jul 2017
Neural Sequence Model Training via
α
α
α
-divergence Minimization
Sotetsu Koyamada
Yuta Kikuchi
Atsunori Kanemura
S. Maeda
S. Ishii
65
0
0
30 Jun 2017
Actor-Critic Sequence Training for Image Captioning
Li Zhang
Flood Sung
Feng Liu
Tao Xiang
S. Gong
Yongxin Yang
Timothy M. Hospedales
16
111
0
29 Jun 2017
Generative Bridging Network in Neural Sequence Prediction
Wenhu Chen
Guanlin Li
Shuo Ren
Shujie Liu
Zhirui Zhang
Mu Li
M. Zhou
20
10
0
28 Jun 2017
Neural Machine Translation with Gumbel-Greedy Decoding
Jiatao Gu
Daniel Jiwoong Im
V. Li
17
35
0
22 Jun 2017
Towards Neural Phrase-based Machine Translation
Po-Sen Huang
Chong-Jun Wang
Sitao Huang
Dengyong Zhou
Li Deng
19
3
0
17 Jun 2017
SEARNN: Training RNNs with Global-Local Losses
Rémi Leblond
Jean-Baptiste Alayrac
A. Osokin
Simon Lacoste-Julien
19
52
0
14 Jun 2017
Plan, Attend, Generate: Character-level Neural Machine Translation with Planning in the Decoder
Çağlar Gülçehre
Francis Dutil
Adam Trischler
Yoshua Bengio
11
7
0
13 Jun 2017
Reinforcement Learning for Learning Rate Control
Chang Xu
Tao Qin
G. Wang
Tie-Yan Liu
16
34
0
31 May 2017
Listen, Interact and Talk: Learning to Speak via Interaction
Haichao Zhang
Haonan Yu
Wenyuan Xu
23
13
0
28 May 2017
Ask the Right Questions: Active Question Reformulation with Reinforcement Learning
Christian Buck
Jannis Bulian
Massimiliano Ciaramita
Wojciech Gajewski
Andrea Gesmundo
N. Houlsby
Wei Wang
17
165
0
22 May 2017
Softmax Q-Distribution Estimation for Structured Prediction: A Theoretical Interpretation for RAML
Xuezhe Ma
Pengcheng Yin
J. Liu
Graham Neubig
Eduard H. Hovy
20
19
0
19 May 2017
Machine Comprehension by Text-to-Text Neural Question Generation
Xingdi Yuan
Tong Wang
Çağlar Gülçehre
Alessandro Sordoni
Philip Bachman
Sandeep Subramanian
Saizheng Zhang
Adam Trischler
OOD
47
187
0
04 May 2017
Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner
Tseng-Hung Chen
Yuan-Hong Liao
Ching-Yao Chuang
W. Hsu
Jianlong Fu
Min Sun
23
141
0
02 May 2017
Differentiable Scheduled Sampling for Credit Assignment
Kartik Goyal
Chris Dyer
Taylor Berg-Kirkpatrick
24
40
0
23 Apr 2017
Adversarial Neural Machine Translation
Lijun Wu
Yingce Xia
Li Zhao
Fei Tian
Tao Qin
Jianhuang Lai
Tie-Yan Liu
GAN
AAML
19
133
0
20 Apr 2017
End-to-end optimization of goal-driven and visually grounded dialogue systems
Florian Strub
H. D. Vries
Jérémie Mary
Bilal Piot
Aaron Courville
Olivier Pietquin
OffRL
22
138
0
15 Mar 2017
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction
Wen Sun
Arun Venkatraman
Geoffrey J. Gordon
Byron Boots
J. Andrew Bagnell
24
232
0
03 Mar 2017
Batch Policy Gradient Methods for Improving Neural Conversation Models
Kirthevasan Kandasamy
Yoram Bachrach
Ryota Tomioka
Daniel Tarlow
David Carter
OffRL
16
37
0
10 Feb 2017
Trainable Greedy Decoding for Neural Machine Translation
Jiatao Gu
Kyunghyun Cho
V. Li
21
73
0
08 Feb 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,503
0
25 Jan 2017
Previous
1
2
3
4
5
6
7
8
Next