An Actor-Critic Algorithm for Sequence Prediction

24 July 2016
Dzmitry Bahdanau
Philemon Brakel
Kelvin Xu
Anirudh Goyal
Ryan J. Lowe
Joelle Pineau
Aaron Courville
Yoshua Bengio

Papers citing "An Actor-Critic Algorithm for Sequence Prediction"

50 / 362 papers shown
Learning to Teach with Dynamic Loss Functions
Lijun Wu
Fei Tian
Yingce Xia
Yang Fan
Tao Qin
Jianhuang Lai
Tie-Yan Liu
17
111
0
29 Oct 2018
Learning to Discriminate Noises for Incorporating External Information in Neural Machine Translation
Zaixiang Zheng
Shujian Huang
Zewei Sun
Rongxiang Weng
Xinyu Dai
Jiajun Chen
17
8
0
24 Oct 2018
Deep Reinforcement Learning
Yuxi Li
VLM
OffRL
28
144
0
15 Oct 2018
Learning to Encode Text as Human-Readable Summaries using Generative Adversarial Networks
Yau-Shian Wang
Hung-yi Lee
21
64
0
05 Oct 2018
Seq2Slate: Re-ranking and Slate Optimization with RNNs
Irwan Bello
Sayali Kulkarni
Sagar Jain
Craig Boutilier
Ed H. Chi
Elad Eban
Xiyang Luo
Alan Mackey
Ofer Meshi
30
91
0
04 Oct 2018
Learning to Segment Inputs for NMT Favors Character-Level Processing
Julia Kreutzer
Artem Sokolov
11
31
0
02 Oct 2018
Optimal Completion Distillation for Sequence Learning
S. Sabour
William Chan
Mohammad Norouzi
19
45
0
02 Oct 2018
Efficient Sequence Labeling with Actor-Critic Training
Saeed Najafi
Colin Cherry
Grzegorz Kondrak
16
7
0
30 Sep 2018
Learning to Coordinate Multiple Reinforcement Learning Agents for Diverse Query Reformulation
Rodrigo Nogueira
Jannis Bulian
Massimiliano Ciaramita
21
11
0
27 Sep 2018
BanditSum: Extractive Summarization as a Contextual Bandit
Yue Dong
Songlin Yang
Eric Crawford
H. V. Hoof
Jackie C.K. Cheung
26
181
0
25 Sep 2018
Finite Sample Analysis of the GTD Policy Evaluation Algorithms in Markov Setting
Yue Wang
Wei-neng Chen
Yuting Liu
Zhi-Ming Ma
Tie-Yan Liu
OffRL
4
39
0
21 Sep 2018
Target Transfer Q-Learning and Its Convergence Analysis
Yue Wang
Qi Meng
Wei Cheng
Yuting Liu
Zhiming Ma
Tie-Yan Liu
14
30
0
21 Sep 2018
FRAGE: Frequency-Agnostic Word Representation
Chengyue Gong
Di He
Xu Tan
Tao Qin
Liwei Wang
Tie-Yan Liu
OOD
28
144
0
18 Sep 2018
Generating Informative and Diverse Conversational Responses via Adversarial Information Maximization
Yizhe Zhang
Michel Galley
Jianfeng Gao
Zhe Gan
Xiujun Li
Chris Brockett
W. Dolan
32
294
0
16 Sep 2018
Curriculum-Based Neighborhood Sampling For Sequence Prediction
James O'Neill
Danushka Bollegala
24
1
0
16 Sep 2018
Closed-Book Training to Improve Summarization Encoder Memory
Yichen Jiang
Joey Tianyi Zhou
RALM
29
28
0
12 Sep 2018
Towards one-shot learning for rare-word translation with external experts
Ngoc-Quan Pham
Jan Niehues
A. Waibel
AAML
8
24
0
10 Sep 2018
Greedy Search with Probabilistic N-gram Matching for Neural Machine Translation
Chenze Shao
Yang Feng
Xilin Chen
19
28
0
10 Sep 2018
A Deep Reinforced Sequence-to-Set Model for Multi-Label Text Classification
Pengcheng Yang
Shuming Ma
Wenjie Qu
Junyang Lin
Qi Su
Xu Sun
VLM
19
7
0
10 Sep 2018
Adversarial Reprogramming of Text Classification Neural Networks
Paarth Neekhara
Shehzeen Samarah Hussain
Shlomo Dubnov
F. Koushanfar
AAML
SILM
21
9
0
06 Sep 2018
Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation
Zhiting Hu
Haoran Shi
Bowen Tan
Wentao Wang
Zichao Yang
...
Zhengzhong Liu
Xiaodan Liang
Wangrong Zhu
Devendra Singh Sachan
Eric P. Xing
VLM
17
56
0
04 Sep 2018
Imitation Learning for Neural Morphological String Transduction
Peter Makarov
Simon Clematide
AI4CE
25
33
0
31 Aug 2018
A Study of Reinforcement Learning for Neural Machine Translation
Lijun Wu
Fei Tian
Tao Qin
Jianhuang Lai
Tie-Yan Liu
OffRL
27
182
0
27 Aug 2018
Reinforcement Learning for Relation Classification from Noisy Data
Jun Feng
Minlie Huang
Li Zhao
Yang Yang
Xiaoyan Zhu
NoLa
14
340
0
24 Aug 2018
Approximate Distribution Matching for Sequence-to-Sequence Learning
Wenhu Chen
Guanlin Li
Shujie Liu
Zhirui Zhang
Mu Li
M. Zhou
OOD
BDL
19
0
0
24 Aug 2018
Proximal Policy Optimization and its Dynamic Version for Sequence Generation
Yi-Lin Tuan
Jinzhi Zhang
Yujia Li
Hung-yi Lee
13
10
0
24 Aug 2018
XL-NBT: A Cross-lingual Neural Belief Tracking Framework
Wenhu Chen
Jianshu Chen
Yu-Chuan Su
Xin Eric Wang
Dong Yu
Xifeng Yan
William Yang Wang
19
33
0
19 Aug 2018
Improving Conditional Sequence Generative Adversarial Networks by Stepwise Evaluation
Yi-Lin Tuan
Hung-yi Lee
GAN
22
55
0
16 Aug 2018
Regularizing Neural Machine Translation by Target-bidirectional Agreement
Zhirui Zhang
Shuo Ren
Shujie Liu
Mu Li
M. Zhou
Tong Xu
37
116
0
13 Aug 2018
Pervasive Attention: 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction
Maha Elbayad
Laurent Besacier
Jakob Verbeek
HAI
8
82
0
11 Aug 2018
Auto-Encoding Variational Neural Machine Translation
Bryan Eikema
Wilker Aziz
DRL
BDL
14
39
0
27 Jul 2018
Latent Alignment and Variational Attention
Yuntian Deng
Yoon Kim
Justin T. Chiu
Demi Guo
Alexander M. Rush
BDL
18
110
0
10 Jul 2018
A New Approach for Resource Scheduling with Deep Reinforcement Learning
Yufei Ye
Xiaoqin Ren
Jin Wang
Lingxiao Xu
Wenxia Guo
Wenqiang Huang
Wenhong Tian
OffRL
11
16
0
21 Jun 2018
On Accurate Evaluation of GANs for Language Generation
Stanislau Semeniuta
Aliaksei Severyn
Sylvain Gelly
EGVM
39
81
0
13 Jun 2018
Double Path Networks for Sequence to Sequence Learning
Kaitao Song
Xu Tan
Di He
Jianfeng Lu
Tao Qin
Tie-Yan Liu
24
14
0
13 Jun 2018
Sparse Stochastic Zeroth-Order Optimization with an Application to Bandit Structured Prediction
Artem Sokolov
Julian Hitschler
Mayumi Ohta
Stefan Riezler
13
7
0
12 Jun 2018
Explaining and Generalizing Back-Translation through Wake-Sleep
Ryan Cotterell
Julia Kreutzer
113
39
0
12 Jun 2018
Towards Binary-Valued Gates for Robust LSTM Training
Zhuohan Li
Di He
Fei Tian
Wei-neng Chen
Tao Qin
Liwei Wang
Tie-Yan Liu
MQ
10
47
0
08 Jun 2018
Dense Information Flow for Neural Machine Translation
Yanyao Shen
Xu Tan
Di He
Tao Qin
Tie-Yan Liu
AI4CE
20
34
0
03 Jun 2018
Efficient Entropy for Policy Gradient with Multidimensional Action Space
Yiming Zhang
Q. Vuong
Kenny Song
Xiao-Yue Gong
Keith Ross
27
17
0
02 Jun 2018
Truncated Horizon Policy Search: Combining Reinforcement Learning & Imitation Learning
Wen Sun
J. Andrew Bagnell
Byron Boots
17
93
0
29 May 2018
Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting
Yen-Chun Chen
Joey Tianyi Zhou
BDL
25
582
0
28 May 2018
Reliability and Learnability of Human Bandit Feedback for Sequence-to-Sequence Reinforcement Learning
Julia Kreutzer
Joshua Uyheng
Stefan Riezler
28
85
0
27 May 2018
Zero-Shot Dual Machine Translation
L. Sestorain
Massimiliano Ciaramita
Christian Buck
Thomas Hofmann
26
23
0
25 May 2018
Deep Reinforcement Learning For Sequence to Sequence Models
Yaser Keneshloo
Tian Shi
Naren Ramakrishnan
Chandan K. Reddy
AIMat
3DV
OffRL
30
208
0
24 May 2018
Hybrid Retrieval-Generation Reinforced Agent for Medical Image Report Generation
Yuan Li
Xiaodan Liang
Zhiting Hu
Eric P. Xing
MedIm
13
327
0
21 May 2018
Leveraging Grammar and Reinforcement Learning for Neural Program Synthesis
Rudy Bunel
Matthew J. Hausknecht
Jacob Devlin
Rishabh Singh
Pushmeet Kohli
NAI
16
216
0
11 May 2018
Multimodal Machine Translation with Reinforcement Learning
Xin-Yao Qian
Ziyi Zhong
Jieli Zhou
13
14
0
07 May 2018
A Reinforcement Learning Approach to Interactive-Predictive Neural Machine Translation
Tsz Kin Lam
Julia Kreutzer
Stefan Riezler
19
31
0
03 May 2018
From Credit Assignment to Entropy Regularization: Two New Algorithms for Neural Sequence Prediction
Zihang Dai
Qizhe Xie
Eduard H. Hovy
29
6
0
29 Apr 2018