Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1607.07086
Cited By
An Actor-Critic Algorithm for Sequence Prediction
24 July 2016
Dzmitry Bahdanau
Philemon Brakel
Kelvin Xu
Anirudh Goyal
Ryan J. Lowe
Joelle Pineau
Aaron Courville
Yoshua Bengio
Re-assign community
ArXiv
PDF
HTML
Papers citing
"An Actor-Critic Algorithm for Sequence Prediction"
50 / 362 papers shown
Title
Generative Question Refinement with Deep Reinforcement Learning in Retrieval-based QA System
Ye Liu
Chenwei Zhang
Xiaohui Yan
Yi-Ju Chang
Philip S. Yu
24
18
0
13 Aug 2019
A review on Deep Reinforcement Learning for Fluid Mechanics
Paul Garnier
J. Viquerat
Jean Rabault
A. Larcher
A. Kuhnle
E. Hachem
AI4CE
24
253
0
12 Aug 2019
Joey NMT: A Minimalist NMT Toolkit for Novices
Julia Kreutzer
Jasmijn Bastings
Stefan Riezler
MoE
22
115
0
29 Jul 2019
Deep Reinforcement Learning for Personalized Search Story Recommendation
Jason Zhang
Zhang
Junming Yin
Dongwon Lee
Linhong Zhu
33
2
0
26 Jul 2019
Interactive-Predictive Neural Machine Translation through Reinforcement and Imitation
Tsz Kin Lam
Shigehiko Schamoni
Stefan Riezler
AI4CE
22
18
0
04 Jul 2019
Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation
Chenze Shao
Yang Feng
Jinchao Zhang
Fandong Meng
Xilin Chen
Jie Zhou
24
42
0
22 Jun 2019
A Study of State Aliasing in Structured Prediction with RNNs
Layla El Asri
Adam Trischler
20
1
0
21 Jun 2019
Scheduled Sampling for Transformers
Tsvetomila Mihaylova
André F. T. Martins
17
64
0
18 Jun 2019
Calibration, Entropy Rates, and Memory in Language Models
M. Braverman
Xinyi Chen
Sham Kakade
Karthik Narasimhan
Cyril Zhang
Yi Zhang
11
38
0
11 Jun 2019
Towards Amortized Ranking-Critical Training for Collaborative Filtering
Sam Lobel
Chunyuan Li
Jianfeng Gao
Lawrence Carin
9
14
0
10 Jun 2019
Improving Neural Language Modeling via Adversarial Training
Dilin Wang
Chengyue Gong
Qiang Liu
AAML
35
115
0
10 Jun 2019
This Email Could Save Your Life: Introducing the Task of Email Subject Line Generation
Rui Zhang
Joel R. Tetreault
20
73
0
08 Jun 2019
Figure Captioning with Reasoning and Sequence-Level Training
Charles C. Chen
Ruiyi Zhang
Eunyee Koh
Sungchul Kim
Scott D. Cohen
Tong Yu
Ryan Rossi
Razvan Bunescu
AIMat
23
38
0
07 Jun 2019
Generative Adversarial Networks in Computer Vision: A Survey and Taxonomy
Zhengwei Wang
Qi She
T. Ward
MedIm
EGVM
29
90
0
04 Jun 2019
Transcribing Content from Structural Images with Spotlight Mechanism
Yu Yin
Zhenya Huang
Enhong Chen
Qi Liu
Fuzheng Zhang
Xing Xie
Guoping Hu
11
22
0
27 May 2019
Simple and Effective Curriculum Pointer-Generator Networks for Reading Comprehension over Long Narratives
Yi Tay
Shuohang Wang
Anh Tuan Luu
Jie Fu
Minh C. Phan
Xingdi Yuan
J. Rao
S. Hui
Aston Zhang
31
107
0
26 May 2019
Action Assembly: Sparse Imitation Learning for Text Based Games with Combinatorial Action Spaces
Chen Tessler
Tom Zahavy
Deborah Cohen
D. Mankowitz
Shie Mannor
30
17
0
23 May 2019
Exploiting Cognitive Structure for Adaptive Learning
Qi Liu
Shiwei Tong
Chuanren Liu
Hongke Zhao
Enhong Chen
Haiping Ma
Shijin Wang
12
118
0
23 May 2019
Synchronous Bidirectional Neural Machine Translation
Long Zhou
Jiajun Zhang
Chengqing Zong
14
106
0
13 May 2019
Context-Dependent Semantic Parsing over Temporally Structured Data
Charles C. Chen
Razvan Bunescu
10
3
0
01 May 2019
Dynamic Past and Future for Neural Machine Translation
Zaixiang Zheng
Shujian Huang
Zhaopeng Tu
Xinyu Dai
Jiajun Chen
35
30
0
21 Apr 2019
Actor-Critic Instance Segmentation
Nikita Araslanov
Constantin Rothkopf
Stefan Roth
EgoV
ISeg
25
17
0
10 Apr 2019
Clinically Accurate Chest X-Ray Report Generation
Guanxiong Liu
T. Hsu
Matthew B. A. McDermott
Willie Boag
W. Weng
Peter Szolovits
Marzyeh Ghassemi
MedIm
25
271
0
04 Apr 2019
Differentiable Sampling with Flexible Reference Word Order for Neural Machine Translation
Weijia Xu
Xing Niu
Marine Carpuat
16
10
0
04 Apr 2019
Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement
W. Kool
H. V. Hoof
Max Welling
60
214
0
14 Mar 2019
CoaCor: Code Annotation for Code Retrieval with Reinforcement Learning
Ziyu Yao
Jayavardhan Reddy Peddamail
Huan Sun
14
100
0
13 Mar 2019
Catalyst.RL: A Distributed Framework for Reproducible RL Research
Sergey Kolesnikov
Oleksii Hrinchuk
OffRL
17
8
0
28 Feb 2019
Synchronous Bidirectional Inference for Neural Sequence Generation
Jiajun Zhang
Long Zhou
Yang Zhao
Chengqing Zong
24
36
0
24 Feb 2019
Non-Monotonic Sequential Text Generation
Sean Welleck
Kianté Brantley
Hal Daumé
Kyunghyun Cho
44
129
0
05 Feb 2019
Improving Sequence-to-Sequence Learning via Optimal Transport
Liqun Chen
Yizhe Zhang
Ruiyi Zhang
Chenyang Tao
Zhe Gan
Haichao Zhang
Bai Li
Dinghan Shen
Changyou Chen
Lawrence Carin
OT
11
92
0
18 Jan 2019
Learning to Selectively Transfer: Reinforced Transfer Learning for Deep Text Matching
Chen Qu
Feng Ji
Minghui Qiu
Liu Yang
Zhiyu Min
Haiqing Chen
Jun Huang
W. Bruce Croft
11
39
0
30 Dec 2018
Non-Autoregressive Neural Machine Translation with Enhanced Decoder Input
Junliang Guo
Xu Tan
Di He
Tao Qin
Linli Xu
Tie-Yan Liu
16
125
0
23 Dec 2018
Search-Guided, Lightly-supervised Training of Structured Prediction Energy Networks
Pedram Rooshenas
Dongxu Zhang
Gopal Sharma
Andrew McCallum
19
10
0
22 Dec 2018
Sentence-wise Smooth Regularization for Sequence to Sequence Learning
Chengyue Gong
Xu Tan
Di He
Tao Qin
AI4TS
32
8
0
12 Dec 2018
Counterfactual Critic Multi-Agent Training for Scene Graph Generation
Long Chen
Hanwang Zhang
Jun Xiao
Xiangnan He
Shiliang Pu
Shih-Fu Chang
19
159
0
06 Dec 2018
Neural Abstractive Text Summarization with Sequence-to-Sequence Models
Tian Shi
Yaser Keneshloo
Naren Ramakrishnan
Chandan K. Reddy
33
225
0
05 Dec 2018
Bach2Bach: Generating Music Using A Deep Reinforcement Learning Approach
Nikhil Kotecha
6
11
0
03 Dec 2018
An Introduction to Deep Reinforcement Learning
Vincent François-Lavet
Peter Henderson
Riashat Islam
Marc G. Bellemare
Joelle Pineau
OffRL
AI4CE
80
1,234
0
30 Nov 2018
Connecting the Dots Between MLE and RL for Sequence Prediction
Bowen Tan
Zhiting Hu
Zichao Yang
Ruslan Salakhutdinov
Eric P. Xing
22
24
0
24 Nov 2018
Neural Machine Translation with Adequacy-Oriented Learning
X. Kong
Zhaopeng Tu
Shuming Shi
Eduard H. Hovy
Tong Zhang
OffRL
25
26
0
21 Nov 2018
Representation Learning of Pedestrian Trajectories Using Actor-Critic Sequence-to-Sequence Autoencoder
Ka-Ho Chow
Anish Hiranandani
Yifeng Zhang
Shueng-Han Gary Chan
37
4
0
20 Nov 2018
Seq2Seq Mimic Games: A Signaling Perspective
Juan Leni
J. Levine
J. Quigley
LLMAG
22
1
0
15 Nov 2018
Image Captioning Based on a Hierarchical Attention Mechanism and Policy Gradient Optimization
Shiyang Yan
Yuan Xie
F. Wu
Jeremy S. Smith
Wenjin Lu
Bailing Zhang
14
5
0
13 Nov 2018
Modeling Local Dependence in Natural Language with Multi-channel Recurrent Neural Networks
Chang Xu
Weiran Huang
Hongwei Wang
G. Wang
Tie-Yan Liu
11
13
0
13 Nov 2018
Promising Accurate Prefix Boosting for sequence-to-sequence ASR
M. Baskar
Lukás Burget
Yi-Chen Chen
Hung-yi Lee
Takaaki Hori
Lin-Shan Lee
11
16
0
07 Nov 2018
Neural Phrase-to-Phrase Machine Translation
Jiangtao Feng
Lingpeng Kong
Po-Sen Huang
Chong-Jun Wang
Da Huang
Jiayuan Mao
Kan Qiao
Dengyong Zhou
AIMat
16
14
0
06 Nov 2018
Value-based Search in Execution Space for Mapping Instructions to Programs
Dor Muhlgay
Jonathan Herzig
Jonathan Berant
22
6
0
02 Nov 2018
Analysing Dropout and Compounding Errors in Neural Language Models
James OÑeill
Danushka Bollegala
20
1
0
02 Nov 2018
Sequence Generation with Guider Network
Ruiyi Zhang
Changyou Chen
Zhe Gan
Wenlin Wang
Liqun Chen
Dinghan Shen
Guoyin Wang
Lawrence Carin
3DV
14
4
0
02 Nov 2018
Towards Coherent and Cohesive Long-form Text Generation
W. Cho
Pengchuan Zhang
Yizhe Zhang
Xiujun Li
Michel Galley
Chris Brockett
Mengdi Wang
Jianfeng Gao
24
0
0
01 Nov 2018
Previous
1
2
3
4
5
6
7
8
Next