ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1607.07086
  4. Cited By
An Actor-Critic Algorithm for Sequence Prediction

An Actor-Critic Algorithm for Sequence Prediction

24 July 2016
Dzmitry Bahdanau
Philemon Brakel
Kelvin Xu
Anirudh Goyal
Ryan J. Lowe
Joelle Pineau
Aaron Courville
Yoshua Bengio
ArXivPDFHTML

Papers citing "An Actor-Critic Algorithm for Sequence Prediction"

50 / 362 papers shown
Title
Neural Particle Smoothing for Sampling from Conditional Sequence Models
Neural Particle Smoothing for Sampling from Conditional Sequence Models
Chu-cheng Lin
Jason Eisner
BDL
19
12
0
28 Apr 2018
Interactive Language Acquisition with One-shot Visual Concept Learning
  through a Conversational Game
Interactive Language Acquisition with One-shot Visual Concept Learning through a Conversational Game
Haichao Zhang
Haonan Yu
Wenyuan Xu
LLMAG
45
8
0
26 Apr 2018
No Metrics Are Perfect: Adversarial Reward Learning for Visual
  Storytelling
No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling
Xin Eric Wang
Wenhu Chen
Yuan-fang Wang
William Yang Wang
11
157
0
24 Apr 2018
A Stable and Effective Learning Strategy for Trainable Greedy Decoding
A Stable and Effective Learning Strategy for Trainable Greedy Decoding
Yun Chen
V. Li
Kyunghyun Cho
Samuel R. Bowman
28
28
0
21 Apr 2018
Can Neural Machine Translation be Improved with User Feedback?
Can Neural Machine Translation be Improved with User Feedback?
Julia Kreutzer
Shahram Khadivi
E. Matusov
Stefan Riezler
14
93
0
16 Apr 2018
Learning How to Self-Learn: Enhancing Self-Training Using Neural
  Reinforcement Learning
Learning How to Self-Learn: Enhancing Self-Training Using Neural Reinforcement Learning
Chenhua Chen
Yue Zhang
SSL
22
11
0
16 Apr 2018
Actor-Critic based Training Framework for Abstractive Summarization
Actor-Critic based Training Framework for Abstractive Summarization
Piji Li
Lidong Bing
Wai Lam
OffRL
12
49
0
28 Mar 2018
A Survey on Neural Network-Based Summarization Methods
A Survey on Neural Network-Based Summarization Methods
Yue Dong
AILaw
AI4TS
27
34
0
19 Mar 2018
Unpaired Image Captioning by Language Pivoting
Unpaired Image Captioning by Language Pivoting
Jiuxiang Gu
Chenyu You
Jianfei Cai
G. Wang
24
82
0
14 Mar 2018
Fully Decentralized Multi-Agent Reinforcement Learning with Networked
  Agents
Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents
Kaipeng Zhang
Zhuoran Yang
Han Liu
Tong Zhang
Tamer Basar
40
582
0
23 Feb 2018
Efficient Collaborative Multi-Agent Deep Reinforcement Learning for
  Large-Scale Fleet Management
Efficient Collaborative Multi-Agent Deep Reinforcement Learning for Large-Scale Fleet Management
Kaixiang Lin
Renyu Zhao
Zhe Xu
Jiayu Zhou
10
8
0
18 Feb 2018
DP-GAN: Diversity-Promoting Generative Adversarial Network for
  Generating Informative and Diversified Text
DP-GAN: Diversity-Promoting Generative Adversarial Network for Generating Informative and Diversified Text
Jingjing Xu
Xuancheng Ren
Junyang Lin
Xu Sun
25
144
0
05 Feb 2018
MaskGAN: Better Text Generation via Filling in the______
MaskGAN: Better Text Generation via Filling in the______
W. Fedus
Ian Goodfellow
Andrew M. Dai
24
468
0
23 Jan 2018
Expected Policy Gradients for Reinforcement Learning
Expected Policy Gradients for Reinforcement Learning
K. Ciosek
Shimon Whiteson
50
51
0
10 Jan 2018
Improving End-to-End Speech Recognition with Policy Learning
Improving End-to-End Speech Recognition with Policy Learning
Yingbo Zhou
Caiming Xiong
R. Socher
19
40
0
19 Dec 2017
End-to-End Offline Goal-Oriented Dialog Policy Learning via Policy
  Gradient
End-to-End Offline Goal-Oriented Dialog Policy Learning via Policy Gradient
Li Zhou
Kevin Small
Oleg Rokhlenko
Charles Elkan
OffRL
14
41
0
07 Dec 2017
Video Captioning via Hierarchical Reinforcement Learning
Video Captioning via Hierarchical Reinforcement Learning
Xin Eric Wang
Wenhu Chen
Jiawei Wu
Yuan-fang Wang
William Yang Wang
24
228
0
29 Nov 2017
Plan, Attend, Generate: Planning for Sequence-to-Sequence Models
Plan, Attend, Generate: Planning for Sequence-to-Sequence Models
Francis Dutil
Çağlar Gülçehre
Adam Trischler
Yoshua Bengio
18
12
0
28 Nov 2017
Neural Text Generation: A Practical Guide
Neural Text Generation: A Practical Guide
Ziang Xie
6
46
0
27 Nov 2017
Modeling Past and Future for Neural Machine Translation
Modeling Past and Future for Neural Machine Translation
Zaixiang Zheng
Hao Zhou
Shujian Huang
Lili Mou
Xinyu Dai
Jiajun Chen
Zhaopeng Tu
29
48
0
27 Nov 2017
Classical Structured Prediction Losses for Sequence to Sequence Learning
Classical Structured Prediction Losses for Sequence to Sequence Learning
Sergey Edunov
Myle Ott
Michael Auli
David Grangier
MarcÁurelio Ranzato
AIMat
56
185
0
14 Nov 2017
ACtuAL: Actor-Critic Under Adversarial Learning
ACtuAL: Actor-Critic Under Adversarial Learning
Anirudh Goyal
Nan Rosemary Ke
Alex Lamb
R. Devon Hjelm
C. Pal
Joelle Pineau
Yoshua Bengio
GAN
20
9
0
13 Nov 2017
Paraphrase Generation with Deep Reinforcement Learning
Paraphrase Generation with Deep Reinforcement Learning
Zichao Li
Xin Jiang
Lifeng Shang
Hang Li
OffRL
16
213
0
01 Nov 2017
DCN+: Mixed Objective and Deep Residual Coattention for Question
  Answering
DCN+: Mixed Objective and Deep Residual Coattention for Question Answering
Caiming Xiong
Victor Zhong
R. Socher
29
109
0
31 Oct 2017
Stack-Captioning: Coarse-to-Fine Learning for Image Captioning
Stack-Captioning: Coarse-to-Fine Learning for Image Captioning
Jiuxiang Gu
Jianfei Cai
G. Wang
Tsuhan Chen
27
178
0
11 Sep 2017
A Brief Survey of Deep Reinforcement Learning
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
35
2,775
0
19 Aug 2017
Deconvolutional Paragraph Representation Learning
Deconvolutional Paragraph Representation Learning
Yizhe Zhang
Dinghan Shen
Guoyin Wang
Zhe Gan
Ricardo Henao
Lawrence Carin
SSL
AI4TS
17
76
0
16 Aug 2017
A Continuous Relaxation of Beam Search for End-to-end Training of Neural
  Sequence Models
A Continuous Relaxation of Beam Search for End-to-end Training of Neural Sequence Models
Kartik Goyal
Graham Neubig
Chris Dyer
Taylor Berg-Kirkpatrick
3DV
39
40
0
01 Aug 2017
A Shared Task on Bandit Learning for Machine Translation
A Shared Task on Bandit Learning for Machine Translation
Artem Sokolov
Julia Kreutzer
Kellen Sunderland
Pavel Danchenko
Witold Szymaniak
Hagen Fürstenau
Stefan Riezler
27
16
0
27 Jul 2017
Reinforcement Learning for Bandit Neural Machine Translation with
  Simulated Human Feedback
Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback
Khanh Nguyen
Hal Daumé
Jordan L. Boyd-Graber
30
135
0
24 Jul 2017
Neural Sequence Model Training via $α$-divergence Minimization
Neural Sequence Model Training via ααα-divergence Minimization
Sotetsu Koyamada
Yuta Kikuchi
Atsunori Kanemura
S. Maeda
S. Ishii
65
0
0
30 Jun 2017
Actor-Critic Sequence Training for Image Captioning
Actor-Critic Sequence Training for Image Captioning
Li Zhang
Flood Sung
Feng Liu
Tao Xiang
S. Gong
Yongxin Yang
Timothy M. Hospedales
16
111
0
29 Jun 2017
Generative Bridging Network in Neural Sequence Prediction
Generative Bridging Network in Neural Sequence Prediction
Wenhu Chen
Guanlin Li
Shuo Ren
Shujie Liu
Zhirui Zhang
Mu Li
M. Zhou
20
10
0
28 Jun 2017
Neural Machine Translation with Gumbel-Greedy Decoding
Neural Machine Translation with Gumbel-Greedy Decoding
Jiatao Gu
Daniel Jiwoong Im
V. Li
17
35
0
22 Jun 2017
Towards Neural Phrase-based Machine Translation
Towards Neural Phrase-based Machine Translation
Po-Sen Huang
Chong-Jun Wang
Sitao Huang
Dengyong Zhou
Li Deng
19
3
0
17 Jun 2017
SEARNN: Training RNNs with Global-Local Losses
SEARNN: Training RNNs with Global-Local Losses
Rémi Leblond
Jean-Baptiste Alayrac
A. Osokin
Simon Lacoste-Julien
19
52
0
14 Jun 2017
Plan, Attend, Generate: Character-level Neural Machine Translation with
  Planning in the Decoder
Plan, Attend, Generate: Character-level Neural Machine Translation with Planning in the Decoder
Çağlar Gülçehre
Francis Dutil
Adam Trischler
Yoshua Bengio
11
7
0
13 Jun 2017
Reinforcement Learning for Learning Rate Control
Reinforcement Learning for Learning Rate Control
Chang Xu
Tao Qin
G. Wang
Tie-Yan Liu
16
34
0
31 May 2017
Listen, Interact and Talk: Learning to Speak via Interaction
Listen, Interact and Talk: Learning to Speak via Interaction
Haichao Zhang
Haonan Yu
Wenyuan Xu
23
13
0
28 May 2017
Ask the Right Questions: Active Question Reformulation with
  Reinforcement Learning
Ask the Right Questions: Active Question Reformulation with Reinforcement Learning
Christian Buck
Jannis Bulian
Massimiliano Ciaramita
Wojciech Gajewski
Andrea Gesmundo
N. Houlsby
Wei Wang
17
165
0
22 May 2017
Softmax Q-Distribution Estimation for Structured Prediction: A
  Theoretical Interpretation for RAML
Softmax Q-Distribution Estimation for Structured Prediction: A Theoretical Interpretation for RAML
Xuezhe Ma
Pengcheng Yin
J. Liu
Graham Neubig
Eduard H. Hovy
20
19
0
19 May 2017
Machine Comprehension by Text-to-Text Neural Question Generation
Machine Comprehension by Text-to-Text Neural Question Generation
Xingdi Yuan
Tong Wang
Çağlar Gülçehre
Alessandro Sordoni
Philip Bachman
Sandeep Subramanian
Saizheng Zhang
Adam Trischler
OOD
47
187
0
04 May 2017
Show, Adapt and Tell: Adversarial Training of Cross-domain Image
  Captioner
Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner
Tseng-Hung Chen
Yuan-Hong Liao
Ching-Yao Chuang
W. Hsu
Jianlong Fu
Min Sun
23
141
0
02 May 2017
Differentiable Scheduled Sampling for Credit Assignment
Differentiable Scheduled Sampling for Credit Assignment
Kartik Goyal
Chris Dyer
Taylor Berg-Kirkpatrick
24
40
0
23 Apr 2017
Adversarial Neural Machine Translation
Adversarial Neural Machine Translation
Lijun Wu
Yingce Xia
Li Zhao
Fei Tian
Tao Qin
Jianhuang Lai
Tie-Yan Liu
GAN
AAML
19
133
0
20 Apr 2017
End-to-end optimization of goal-driven and visually grounded dialogue
  systems
End-to-end optimization of goal-driven and visually grounded dialogue systems
Florian Strub
H. D. Vries
Jérémie Mary
Bilal Piot
Aaron Courville
Olivier Pietquin
OffRL
22
138
0
15 Mar 2017
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential
  Prediction
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction
Wen Sun
Arun Venkatraman
Geoffrey J. Gordon
Byron Boots
J. Andrew Bagnell
24
232
0
03 Mar 2017
Batch Policy Gradient Methods for Improving Neural Conversation Models
Batch Policy Gradient Methods for Improving Neural Conversation Models
Kirthevasan Kandasamy
Yoram Bachrach
Ryota Tomioka
Daniel Tarlow
David Carter
OffRL
16
37
0
10 Feb 2017
Trainable Greedy Decoding for Neural Machine Translation
Trainable Greedy Decoding for Neural Machine Translation
Jiatao Gu
Kyunghyun Cho
V. Li
21
73
0
08 Feb 2017
Deep Reinforcement Learning: An Overview
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,503
0
25 Jan 2017
Previous
12345678
Next