1608.01281
Learning Online Alignments with Continuous Rewards Policy Gradient
3 August 2016
Yuping Luo
Chung-Cheng Chiu
Navdeep Jaitly
Ilya Sutskever
OffRL
Papers citing
"Learning Online Alignments with Continuous Rewards Policy Gradient"
25 / 25 papers shown
Title
Anticipation-Free Training for Simultaneous Machine Translation
Chih-Chiang Chang
Shun-Po Chuang
Hung-yi Lee
66
7
0
30 Jan 2022
Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition
Hirofumi Inaguma
Tatsuya Kawahara
125
14
0
28 Feb 2021
Concentrated Document Topic Model
Hao Lei
Ying Chen
29
1
0
06 Feb 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Min Zhang
OffRL
123
75
0
01 Jan 2021
SimulMT to SimulST: Adapting Simultaneous Text Translation to End-to-End Simultaneous Speech Translation
Xutai Ma
J. Pino
Philipp Koehn
72
97
0
03 Nov 2020
Online Versus Offline NMT Quality: An In-depth Analysis on English-German and German-English
Maha Elbayad
M. Ustaszewski
Emmanuelle Esperança-Rodier
Francis Brunet Manquat
Jakob Verbeek
Laurent Besacier
OffRL
79
10
0
01 Jun 2020
Efficient Wait-k Models for Simultaneous Machine Translation
Maha Elbayad
Laurent Besacier
Jakob Verbeek
VLM
78
80
0
18 May 2020
Imputer: Sequence Modelling via Imputation and Dynamic Programming
William Chan
Chitwan Saharia
Geoffrey E. Hinton
Mohammad Norouzi
Navdeep Jaitly
BDL
AI4TS
97
116
0
20 Feb 2020
Monotonic Multihead Attention
Xutai Ma
J. Pino
James Cross
Liezl Puzon
Jiatao Gu
95
140
0
26 Sep 2019
Incremental Learning with Maximum Entropy Regularization: Rethinking Forgetting and Intransigence
Dahyun Kim
Jihwan Bae
Yeonsik Jo
Jonghyun Choi
OOD
CLL
75
20
0
03 Feb 2019
Maximum-Entropy Fine-Grained Classification
Abhimanyu Dubey
O. Gupta
Ramesh Raskar
Nikhil Naik
93
157
0
16 Sep 2018
Deep Lip Reading: a comparison of models and an online application
Triantafyllos Afouras
Joon Son Chung
Andrew Zisserman
75
119
0
15 Jun 2018
On Modular Training of Neural Acoustics-to-Word Model for LVCSR
Zhehuai Chen
Qi Liu
Hao Li
Kai Yu
81
29
0
03 Mar 2018
Interpretable Counting for Visual Question Answering
Alexander R. Trott
Caiming Xiong
R. Socher
106
71
0
23 Dec 2017
Improving End-to-End Speech Recognition with Policy Learning
Yingbo Zhou
Caiming Xiong
R. Socher
79
40
0
19 Dec 2017
Monotonic Chunkwise Attention
Chung-Cheng Chiu
Colin Raffel
98
256
0
14 Dec 2017
Plan, Attend, Generate: Planning for Sequence-to-Sequence Models
Francis Dutil
Çağlar Gülçehre
Adam Trischler
Yoshua Bengio
56
12
0
28 Nov 2017
An online sequence-to-sequence model for noisy speech recognition
Chung-Cheng Chiu
Dieterich Lawson
Yuping Luo
George Tucker
Kevin Swersky
Ilya Sutskever
Navdeep Jaitly
57
7
0
16 Jun 2017
Learning Hard Alignments with Variational Inference
Dieterich Lawson
Chung-Cheng Chiu
George Tucker
Colin Raffel
Kevin Swersky
Navdeep Jaitly
DRL
62
29
0
16 May 2017
Online and Linear-Time Attention by Enforcing Monotonic Alignments
Colin Raffel
Minh-Thang Luong
Peter J. Liu
Ron J. Weiss
Douglas Eck
114
261
0
03 Apr 2017
Training a Subsampling Mechanism in Expectation
Colin Raffel
Dieterich Lawson
65
4
0
22 Feb 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
346
1,550
0
25 Jan 2017
Regularizing Neural Networks by Penalizing Confident Output Distributions
Gabriel Pereyra
George Tucker
J. Chorowski
Lukasz Kaiser
Geoffrey E. Hinton
NoLa
229
1,142
0
23 Jan 2017
Towards better decoding and language model integration in sequence to sequence models
J. Chorowski
Navdeep Jaitly
106
370
0
08 Dec 2016
Learning to Translate in Real-time with Neural Machine Translation
Jiatao Gu
Graham Neubig
Kyunghyun Cho
Victor O.K. Li
100
219
0
03 Oct 2016