Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1607.07086
Cited By
An Actor-Critic Algorithm for Sequence Prediction
24 July 2016
Dzmitry Bahdanau
Philemon Brakel
Kelvin Xu
Anirudh Goyal
Ryan J. Lowe
Joelle Pineau
Aaron Courville
Yoshua Bengio
Re-assign community
ArXiv
PDF
HTML
Papers citing
"An Actor-Critic Algorithm for Sequence Prediction"
50 / 362 papers shown
Title
Recall@k Surrogate Loss with Large Batches and Similarity Mixup
Yash J. Patel
Giorgos Tolias
Jirí Matas
VLM
39
54
0
25 Aug 2021
Learning to Control DC Motor for Micromobility in Real Time with Reinforcement Learning
Bibek Poudel
Thomas Watson
Weizi Li
21
12
0
31 Jul 2021
Mixed Cross Entropy Loss for Neural Machine Translation
Haoran Li
Wei Lu
21
16
0
30 Jun 2021
Deep Multiagent Reinforcement Learning: Challenges and Directions
Annie Wong
Thomas Bäck
Anna V. Kononova
Aske Plaat
AI4CE
34
88
0
29 Jun 2021
A Reinforcement Learning Approach for Sequential Spatial Transformer Networks
Fatemeh Azimi
Federico Raue
Jörn Hees
Andreas Dengel
12
3
0
27 Jun 2021
Revisiting the Weaknesses of Reinforcement Learning for Neural Machine Translation
Samuel Kiegeland
Julia Kreutzer
AAML
37
46
0
16 Jun 2021
Sequence-Level Training for Non-Autoregressive Neural Machine Translation
Chenze Shao
Yang Feng
Jinchao Zhang
Fandong Meng
Jie Zhou
28
28
0
15 Jun 2021
Efficient (Soft) Q-Learning for Text Generation with Limited Good Data
Han Guo
Bowen Tan
Zhengzhong Liu
Eric P. Xing
Zhiting Hu
OffRL
33
33
0
14 Jun 2021
Guiding Teacher Forcing with Seer Forcing for Neural Machine Translation
Yang Feng
Shuhao Gu
Dengji Guo
Zhengxin Yang
Chenze Shao
11
13
0
12 Jun 2021
Energy-Based Models for Code Generation under Compilability Constraints
Tomasz Korbak
Hady ElSahar
Marc Dymetman
Germán Kruszewski
27
13
0
09 Jun 2021
Diversity driven Query Rewriting in Search Advertising
Akash Kumar Mohankumar
Nikit Begwani
Amit Singh
12
24
0
07 Jun 2021
Search from History and Reason for Future: Two-stage Reasoning on Temporal Knowledge Graphs
Zixuan Li
Xiaolong Jin
Saiping Guan
Wei Li
Jiafeng Guo
Yuanzhuo Wang
Xueqi Cheng
36
105
0
01 Jun 2021
PoBRL: Optimizing Multi-Document Summarization by Blending Reinforcement Learning Policies
Andy Su
Difei Su
John M.Mulvey
H. Poor
16
3
0
18 May 2021
Machine Translation Decoding beyond Beam Search
Rémi Leblond
Jean-Baptiste Alayrac
Laurent Sifre
Miruna Pislar
Jean-Baptiste Lespiau
Ioannis Antonoglou
Karen Simonyan
Oriol Vinyals
29
69
0
12 Apr 2021
Creativity and Machine Learning: A Survey
Giorgio Franceschelli
Mirco Musolesi
VLM
AI4CE
34
40
0
06 Apr 2021
Attention Forcing for Machine Translation
Qingyun Dou
Yiting Lu
Potsawee Manakul
Xixin Wu
Mark J. F. Gales
31
7
0
02 Apr 2021
AutoLoss-Zero: Searching Loss Functions from Scratch for Generic Tasks
Hao Li
Tianwen Fu
Jifeng Dai
Hongsheng Li
Gao Huang
Xizhou Zhu
32
29
0
25 Mar 2021
Alleviate Exposure Bias in Sequence Prediction \\ with Recurrent Neural Networks
Liping Yuan
Jiangtao Feng
Xiaoqing Zheng
Xuanjing Huang
27
1
0
22 Mar 2021
Set-to-Sequence Methods in Machine Learning: a Review
Mateusz Jurewicz
Leon Derczynski
BDL
27
9
0
17 Mar 2021
Simpson's Bias in NLP Training
Fei Yuan
Longtu Zhang
Bojun Huang
Yaobo Liang
AI4CE
8
3
0
13 Mar 2021
Exploring Supervised and Unsupervised Rewards in Machine Translation
Julia Ive
Zixu Wang
M. Fomicheva
Lucia Specia
11
2
0
22 Feb 2021
Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm
S. Khodadadian
Zaiwei Chen
S. T. Maguluri
CML
OffRL
71
26
0
18 Feb 2021
Finite Sample Analysis of Two-Time-Scale Natural Actor-Critic Algorithm
S. Khodadadian
Thinh T. Doan
Justin Romberg
S. T. Maguluri
35
42
0
26 Jan 2021
Fast Sequence Generation with Multi-Agent Reinforcement Learning
Longteng Guo
Jing Liu
Xinxin Zhu
Hanqing Lu
LRM
53
6
0
24 Jan 2021
Enriching Non-Autoregressive Transformer with Syntactic and SemanticStructures for Neural Machine Translation
Ye Liu
Yao Wan
Jianguo Zhang
Wenting Zhao
Philip S. Yu
25
23
0
22 Jan 2021
Adversarial Machine Learning in Text Analysis and Generation
I. Alsmadi
AAML
24
5
0
14 Jan 2021
SDA: Improving Text Generation with Self Data Augmentation
Ping Yu
Ruiyi Zhang
Yang Zhao
Yizhe Zhang
Chunyuan Li
Changyou Chen
33
2
0
02 Jan 2021
A Distributional Approach to Controlled Text Generation
Muhammad Khalifa
Hady ElSahar
Marc Dymetman
23
117
0
21 Dec 2020
Contrastive Learning with Adversarial Perturbations for Conditional Text Generation
Seanie Lee
Dong Bok Lee
Sung Ju Hwang
27
106
0
14 Dec 2020
Reinforced Multi-Teacher Selection for Knowledge Distillation
Fei Yuan
Linjun Shou
J. Pei
Wutao Lin
Ming Gong
Yan Fu
Daxin Jiang
15
121
0
11 Dec 2020
A Hybrid Approach for Improved Low Resource Neural Machine Translation using Monolingual Data
Idris Abdulmumin
B. Galadanci
Abubakar Isa
Habeebah Adamu Kakudi
Ismaila Idris Sinan
16
6
0
14 Nov 2020
Multi-Agent Decentralized Belief Propagation on Graphs
Yitao Chen
Deepanshu Vasal
6
1
0
06 Nov 2020
Offline Reinforcement Learning from Human Feedback in Real-World Sequence-to-Sequence Tasks
Julia Kreutzer
Stefan Riezler
Carolin (Haas) Lawrence
RALM
OffRL
6
15
0
04 Nov 2020
Loss Bounds for Approximate Influence-Based Abstraction
E. Congeduti
A. Mey
F. Oliehoek
13
9
0
03 Nov 2020
Turn-level Dialog Evaluation with Dialog-level Weak Signals for Bot-Human Hybrid Customer Service Systems
Ruofeng Wen
11
0
0
25 Oct 2020
Improving Text Generation with Student-Forcing Optimal Transport
Guoyin Wang
Chunyuan Li
Jianqiao Li
Hao Fu
Yuh-Chen Lin
...
Ruiyi Zhang
Wenlin Wang
Dinghan Shen
Qian Yang
Lawrence Carin
OT
30
17
0
12 Oct 2020
Adversarial Grammatical Error Correction
Vipul Raheja
Dimitrios Alikaniotis
8
10
0
06 Oct 2020
Goal-directed Generation of Discrete Structures with Conditional Generative Models
Amina Mollaysa
Brooks Paige
Alexandros Kalousis
37
8
0
05 Oct 2020
A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation
Dinghan Shen
Ming Zheng
Yelong Shen
Yanru Qu
Weizhu Chen
AAML
29
130
0
29 Sep 2020
Energy-Based Reranking: Improving Neural Machine Translation Using Energy-Based Models
Sumanta Bhattacharyya
Pedram Rooshenas
Subhajit Naskar
Simeng Sun
Mohit Iyyer
Andrew McCallum
37
57
0
20 Sep 2020
Transfer Learning in Deep Reinforcement Learning: A Survey
Zhuangdi Zhu
Kaixiang Lin
Anil K. Jain
Jiayu Zhou
OffRL
LRM
30
562
0
16 Sep 2020
Autoregressive Knowledge Distillation through Imitation Learning
Alexander Lin
Jeremy Wohlwend
Howard Chen
Tao Lei
42
37
0
15 Sep 2020
Learning to summarize from human feedback
Nisan Stiennon
Long Ouyang
Jeff Wu
Daniel M. Ziegler
Ryan J. Lowe
Chelsea Voss
Alec Radford
Dario Amodei
Paul Christiano
ALM
31
1,984
0
02 Sep 2020
Optimizing AD Pruning of Sponsored Search with Reinforcement Learning
Yijiang Lian
Zhijie Chen
Xin Pei
Shuang Li
Yifei Wang
...
Liang Yuan
Hanju Guan
Ke-feng Zhang
Zhigang Li
Xiaochun Liu
18
3
0
05 Aug 2020
Learning Optimal Tree Models Under Beam Search
Jingwei Zhuo
Xinhang Li
Wei Dai
Ziru Xu
Han Li
Jian Xu
Kun Gai
19
57
0
27 Jun 2020
Off-Policy Self-Critical Training for Transformer in Visual Paragraph Generation
Shiyang Yan
Yang Hua
N. Robertson
OffRL
11
0
0
21 Jun 2020
Implicit Kernel Attention
Kyungwoo Song
Yohan Jung
Dongjun Kim
Il-Chul Moon
8
16
0
11 Jun 2020
MLE-guided parameter search for task loss minimization in neural sequence modeling
Sean Welleck
Kyunghyun Cho
18
10
0
04 Jun 2020
Using Context in Neural Machine Translation Training Objectives
Danielle Saunders
Felix Stahlberg
Bill Byrne
11
20
0
04 May 2020
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
Yue Wu
Weitong Zhang
Pan Xu
Quanquan Gu
90
146
0
04 May 2020
Previous
1
2
3
4
5
6
7
8
Next