ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1607.07086
  4. Cited By
An Actor-Critic Algorithm for Sequence Prediction

An Actor-Critic Algorithm for Sequence Prediction

24 July 2016
Dzmitry Bahdanau
Philemon Brakel
Kelvin Xu
Anirudh Goyal
Ryan J. Lowe
Joelle Pineau
Aaron Courville
Yoshua Bengio
ArXivPDFHTML

Papers citing "An Actor-Critic Algorithm for Sequence Prediction"

50 / 362 papers shown
Title
Generative Question Refinement with Deep Reinforcement Learning in
  Retrieval-based QA System
Generative Question Refinement with Deep Reinforcement Learning in Retrieval-based QA System
Ye Liu
Chenwei Zhang
Xiaohui Yan
Yi-Ju Chang
Philip S. Yu
24
18
0
13 Aug 2019
A review on Deep Reinforcement Learning for Fluid Mechanics
A review on Deep Reinforcement Learning for Fluid Mechanics
Paul Garnier
J. Viquerat
Jean Rabault
A. Larcher
A. Kuhnle
E. Hachem
AI4CE
24
253
0
12 Aug 2019
Joey NMT: A Minimalist NMT Toolkit for Novices
Joey NMT: A Minimalist NMT Toolkit for Novices
Julia Kreutzer
Jasmijn Bastings
Stefan Riezler
MoE
22
115
0
29 Jul 2019
Deep Reinforcement Learning for Personalized Search Story Recommendation
Deep Reinforcement Learning for Personalized Search Story Recommendation
Jason Zhang
Zhang
Junming Yin
Dongwon Lee
Linhong Zhu
33
2
0
26 Jul 2019
Interactive-Predictive Neural Machine Translation through Reinforcement
  and Imitation
Interactive-Predictive Neural Machine Translation through Reinforcement and Imitation
Tsz Kin Lam
Shigehiko Schamoni
Stefan Riezler
AI4CE
22
18
0
04 Jul 2019
Retrieving Sequential Information for Non-Autoregressive Neural Machine
  Translation
Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation
Chenze Shao
Yang Feng
Jinchao Zhang
Fandong Meng
Xilin Chen
Jie Zhou
24
42
0
22 Jun 2019
A Study of State Aliasing in Structured Prediction with RNNs
A Study of State Aliasing in Structured Prediction with RNNs
Layla El Asri
Adam Trischler
20
1
0
21 Jun 2019
Scheduled Sampling for Transformers
Scheduled Sampling for Transformers
Tsvetomila Mihaylova
André F. T. Martins
17
64
0
18 Jun 2019
Calibration, Entropy Rates, and Memory in Language Models
Calibration, Entropy Rates, and Memory in Language Models
M. Braverman
Xinyi Chen
Sham Kakade
Karthik Narasimhan
Cyril Zhang
Yi Zhang
11
38
0
11 Jun 2019
Towards Amortized Ranking-Critical Training for Collaborative Filtering
Towards Amortized Ranking-Critical Training for Collaborative Filtering
Sam Lobel
Chunyuan Li
Jianfeng Gao
Lawrence Carin
9
14
0
10 Jun 2019
Improving Neural Language Modeling via Adversarial Training
Improving Neural Language Modeling via Adversarial Training
Dilin Wang
Chengyue Gong
Qiang Liu
AAML
35
115
0
10 Jun 2019
This Email Could Save Your Life: Introducing the Task of Email Subject
  Line Generation
This Email Could Save Your Life: Introducing the Task of Email Subject Line Generation
Rui Zhang
Joel R. Tetreault
20
73
0
08 Jun 2019
Figure Captioning with Reasoning and Sequence-Level Training
Figure Captioning with Reasoning and Sequence-Level Training
Charles C. Chen
Ruiyi Zhang
Eunyee Koh
Sungchul Kim
Scott D. Cohen
Tong Yu
Ryan Rossi
Razvan Bunescu
AIMat
23
38
0
07 Jun 2019
Generative Adversarial Networks in Computer Vision: A Survey and
  Taxonomy
Generative Adversarial Networks in Computer Vision: A Survey and Taxonomy
Zhengwei Wang
Qi She
T. Ward
MedIm
EGVM
29
90
0
04 Jun 2019
Transcribing Content from Structural Images with Spotlight Mechanism
Transcribing Content from Structural Images with Spotlight Mechanism
Yu Yin
Zhenya Huang
Enhong Chen
Qi Liu
Fuzheng Zhang
Xing Xie
Guoping Hu
11
22
0
27 May 2019
Simple and Effective Curriculum Pointer-Generator Networks for Reading
  Comprehension over Long Narratives
Simple and Effective Curriculum Pointer-Generator Networks for Reading Comprehension over Long Narratives
Yi Tay
Shuohang Wang
Anh Tuan Luu
Jie Fu
Minh C. Phan
Xingdi Yuan
J. Rao
S. Hui
Aston Zhang
31
107
0
26 May 2019
Action Assembly: Sparse Imitation Learning for Text Based Games with
  Combinatorial Action Spaces
Action Assembly: Sparse Imitation Learning for Text Based Games with Combinatorial Action Spaces
Chen Tessler
Tom Zahavy
Deborah Cohen
D. Mankowitz
Shie Mannor
30
17
0
23 May 2019
Exploiting Cognitive Structure for Adaptive Learning
Exploiting Cognitive Structure for Adaptive Learning
Qi Liu
Shiwei Tong
Chuanren Liu
Hongke Zhao
Enhong Chen
Haiping Ma
Shijin Wang
12
118
0
23 May 2019
Synchronous Bidirectional Neural Machine Translation
Synchronous Bidirectional Neural Machine Translation
Long Zhou
Jiajun Zhang
Chengqing Zong
14
106
0
13 May 2019
Context-Dependent Semantic Parsing over Temporally Structured Data
Context-Dependent Semantic Parsing over Temporally Structured Data
Charles C. Chen
Razvan Bunescu
10
3
0
01 May 2019
Dynamic Past and Future for Neural Machine Translation
Dynamic Past and Future for Neural Machine Translation
Zaixiang Zheng
Shujian Huang
Zhaopeng Tu
Xinyu Dai
Jiajun Chen
35
30
0
21 Apr 2019
Actor-Critic Instance Segmentation
Actor-Critic Instance Segmentation
Nikita Araslanov
Constantin Rothkopf
Stefan Roth
EgoV
ISeg
25
17
0
10 Apr 2019
Clinically Accurate Chest X-Ray Report Generation
Clinically Accurate Chest X-Ray Report Generation
Guanxiong Liu
T. Hsu
Matthew B. A. McDermott
Willie Boag
W. Weng
Peter Szolovits
Marzyeh Ghassemi
MedIm
25
271
0
04 Apr 2019
Differentiable Sampling with Flexible Reference Word Order for Neural
  Machine Translation
Differentiable Sampling with Flexible Reference Word Order for Neural Machine Translation
Weijia Xu
Xing Niu
Marine Carpuat
16
10
0
04 Apr 2019
Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for
  Sampling Sequences Without Replacement
Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement
W. Kool
H. V. Hoof
Max Welling
60
214
0
14 Mar 2019
CoaCor: Code Annotation for Code Retrieval with Reinforcement Learning
CoaCor: Code Annotation for Code Retrieval with Reinforcement Learning
Ziyu Yao
Jayavardhan Reddy Peddamail
Huan Sun
14
100
0
13 Mar 2019
Catalyst.RL: A Distributed Framework for Reproducible RL Research
Catalyst.RL: A Distributed Framework for Reproducible RL Research
Sergey Kolesnikov
Oleksii Hrinchuk
OffRL
17
8
0
28 Feb 2019
Synchronous Bidirectional Inference for Neural Sequence Generation
Synchronous Bidirectional Inference for Neural Sequence Generation
Jiajun Zhang
Long Zhou
Yang Zhao
Chengqing Zong
24
36
0
24 Feb 2019
Non-Monotonic Sequential Text Generation
Non-Monotonic Sequential Text Generation
Sean Welleck
Kianté Brantley
Hal Daumé
Kyunghyun Cho
44
129
0
05 Feb 2019
Improving Sequence-to-Sequence Learning via Optimal Transport
Improving Sequence-to-Sequence Learning via Optimal Transport
Liqun Chen
Yizhe Zhang
Ruiyi Zhang
Chenyang Tao
Zhe Gan
Haichao Zhang
Bai Li
Dinghan Shen
Changyou Chen
Lawrence Carin
OT
11
92
0
18 Jan 2019
Learning to Selectively Transfer: Reinforced Transfer Learning for Deep
  Text Matching
Learning to Selectively Transfer: Reinforced Transfer Learning for Deep Text Matching
Chen Qu
Feng Ji
Minghui Qiu
Liu Yang
Zhiyu Min
Haiqing Chen
Jun Huang
W. Bruce Croft
11
39
0
30 Dec 2018
Non-Autoregressive Neural Machine Translation with Enhanced Decoder
  Input
Non-Autoregressive Neural Machine Translation with Enhanced Decoder Input
Junliang Guo
Xu Tan
Di He
Tao Qin
Linli Xu
Tie-Yan Liu
16
125
0
23 Dec 2018
Search-Guided, Lightly-supervised Training of Structured Prediction
  Energy Networks
Search-Guided, Lightly-supervised Training of Structured Prediction Energy Networks
Pedram Rooshenas
Dongxu Zhang
Gopal Sharma
Andrew McCallum
19
10
0
22 Dec 2018
Sentence-wise Smooth Regularization for Sequence to Sequence Learning
Sentence-wise Smooth Regularization for Sequence to Sequence Learning
Chengyue Gong
Xu Tan
Di He
Tao Qin
AI4TS
32
8
0
12 Dec 2018
Counterfactual Critic Multi-Agent Training for Scene Graph Generation
Counterfactual Critic Multi-Agent Training for Scene Graph Generation
Long Chen
Hanwang Zhang
Jun Xiao
Xiangnan He
Shiliang Pu
Shih-Fu Chang
19
159
0
06 Dec 2018
Neural Abstractive Text Summarization with Sequence-to-Sequence Models
Neural Abstractive Text Summarization with Sequence-to-Sequence Models
Tian Shi
Yaser Keneshloo
Naren Ramakrishnan
Chandan K. Reddy
33
225
0
05 Dec 2018
Bach2Bach: Generating Music Using A Deep Reinforcement Learning Approach
Bach2Bach: Generating Music Using A Deep Reinforcement Learning Approach
Nikhil Kotecha
6
11
0
03 Dec 2018
An Introduction to Deep Reinforcement Learning
An Introduction to Deep Reinforcement Learning
Vincent François-Lavet
Peter Henderson
Riashat Islam
Marc G. Bellemare
Joelle Pineau
OffRL
AI4CE
80
1,234
0
30 Nov 2018
Connecting the Dots Between MLE and RL for Sequence Prediction
Connecting the Dots Between MLE and RL for Sequence Prediction
Bowen Tan
Zhiting Hu
Zichao Yang
Ruslan Salakhutdinov
Eric P. Xing
22
24
0
24 Nov 2018
Neural Machine Translation with Adequacy-Oriented Learning
Neural Machine Translation with Adequacy-Oriented Learning
X. Kong
Zhaopeng Tu
Shuming Shi
Eduard H. Hovy
Tong Zhang
OffRL
25
26
0
21 Nov 2018
Representation Learning of Pedestrian Trajectories Using Actor-Critic
  Sequence-to-Sequence Autoencoder
Representation Learning of Pedestrian Trajectories Using Actor-Critic Sequence-to-Sequence Autoencoder
Ka-Ho Chow
Anish Hiranandani
Yifeng Zhang
Shueng-Han Gary Chan
37
4
0
20 Nov 2018
Seq2Seq Mimic Games: A Signaling Perspective
Seq2Seq Mimic Games: A Signaling Perspective
Juan Leni
J. Levine
J. Quigley
LLMAG
22
1
0
15 Nov 2018
Image Captioning Based on a Hierarchical Attention Mechanism and Policy
  Gradient Optimization
Image Captioning Based on a Hierarchical Attention Mechanism and Policy Gradient Optimization
Shiyang Yan
Yuan Xie
F. Wu
Jeremy S. Smith
Wenjin Lu
Bailing Zhang
14
5
0
13 Nov 2018
Modeling Local Dependence in Natural Language with Multi-channel
  Recurrent Neural Networks
Modeling Local Dependence in Natural Language with Multi-channel Recurrent Neural Networks
Chang Xu
Weiran Huang
Hongwei Wang
G. Wang
Tie-Yan Liu
11
13
0
13 Nov 2018
Promising Accurate Prefix Boosting for sequence-to-sequence ASR
Promising Accurate Prefix Boosting for sequence-to-sequence ASR
M. Baskar
Lukás Burget
Yi-Chen Chen
Hung-yi Lee
Takaaki Hori
Lin-Shan Lee
11
16
0
07 Nov 2018
Neural Phrase-to-Phrase Machine Translation
Neural Phrase-to-Phrase Machine Translation
Jiangtao Feng
Lingpeng Kong
Po-Sen Huang
Chong-Jun Wang
Da Huang
Jiayuan Mao
Kan Qiao
Dengyong Zhou
AIMat
16
14
0
06 Nov 2018
Value-based Search in Execution Space for Mapping Instructions to
  Programs
Value-based Search in Execution Space for Mapping Instructions to Programs
Dor Muhlgay
Jonathan Herzig
Jonathan Berant
22
6
0
02 Nov 2018
Analysing Dropout and Compounding Errors in Neural Language Models
Analysing Dropout and Compounding Errors in Neural Language Models
James OÑeill
Danushka Bollegala
20
1
0
02 Nov 2018
Sequence Generation with Guider Network
Sequence Generation with Guider Network
Ruiyi Zhang
Changyou Chen
Zhe Gan
Wenlin Wang
Liqun Chen
Dinghan Shen
Guoyin Wang
Lawrence Carin
3DV
14
4
0
02 Nov 2018
Towards Coherent and Cohesive Long-form Text Generation
Towards Coherent and Cohesive Long-form Text Generation
W. Cho
Pengchuan Zhang
Yizhe Zhang
Xiujun Li
Michel Galley
Chris Brockett
Mengdi Wang
Jianfeng Gao
24
0
0
01 Nov 2018
Previous
12345678
Next