ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1608.01281
  4. Cited By
Learning Online Alignments with Continuous Rewards Policy Gradient

Learning Online Alignments with Continuous Rewards Policy Gradient

3 August 2016
Yuping Luo
Chung-Cheng Chiu
Navdeep Jaitly
Ilya Sutskever
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Learning Online Alignments with Continuous Rewards Policy Gradient"

25 / 25 papers shown
Title
Anticipation-Free Training for Simultaneous Machine Translation
Anticipation-Free Training for Simultaneous Machine Translation
Chih-Chiang Chang
Shun-Po Chuang
Hung-yi Lee
66
7
0
30 Jan 2022
Alignment Knowledge Distillation for Online Streaming Attention-based
  Speech Recognition
Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition
Hirofumi Inaguma
Tatsuya Kawahara
125
14
0
28 Feb 2021
Concentrated Document Topic Model
Concentrated Document Topic Model
Hao Lei
Ying Chen
29
1
0
06 Feb 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Min Zhang
OffRL
123
75
0
01 Jan 2021
SimulMT to SimulST: Adapting Simultaneous Text Translation to End-to-End
  Simultaneous Speech Translation
SimulMT to SimulST: Adapting Simultaneous Text Translation to End-to-End Simultaneous Speech Translation
Xutai Ma
J. Pino
Philipp Koehn
72
97
0
03 Nov 2020
Online Versus Offline NMT Quality: An In-depth Analysis on
  English-German and German-English
Online Versus Offline NMT Quality: An In-depth Analysis on English-German and German-English
Maha Elbayad
M. Ustaszewski
Emmanuelle Esperancca-Rodier
Francis Brunet Manquat
Jakob Verbeek
Laurent Besacier
OffRL
79
10
0
01 Jun 2020
Efficient Wait-k Models for Simultaneous Machine Translation
Efficient Wait-k Models for Simultaneous Machine Translation
Maha Elbayad
Laurent Besacier
Jakob Verbeek
VLM
78
80
0
18 May 2020
Imputer: Sequence Modelling via Imputation and Dynamic Programming
Imputer: Sequence Modelling via Imputation and Dynamic Programming
William Chan
Chitwan Saharia
Geoffrey E. Hinton
Mohammad Norouzi
Navdeep Jaitly
BDLAI4TS
97
116
0
20 Feb 2020
Monotonic Multihead Attention
Monotonic Multihead Attention
Xutai Ma
J. Pino
James Cross
Liezl Puzon
Jiatao Gu
95
140
0
26 Sep 2019
Incremental Learning with Maximum Entropy Regularization: Rethinking
  Forgetting and Intransigence
Incremental Learning with Maximum Entropy Regularization: Rethinking Forgetting and Intransigence
Dahyun Kim
Jihwan Bae
Yeonsik Jo
Jonghyun Choi
OODCLL
75
20
0
03 Feb 2019
Maximum-Entropy Fine-Grained Classification
Maximum-Entropy Fine-Grained Classification
Abhimanyu Dubey
O. Gupta
Ramesh Raskar
Nikhil Naik
93
157
0
16 Sep 2018
Deep Lip Reading: a comparison of models and an online application
Deep Lip Reading: a comparison of models and an online application
Triantafyllos Afouras
Joon Son Chung
Andrew Zisserman
75
119
0
15 Jun 2018
On Modular Training of Neural Acoustics-to-Word Model for LVCSR
On Modular Training of Neural Acoustics-to-Word Model for LVCSR
Zhehuai Chen
Qi Liu
Hao Li
Kai Yu
81
29
0
03 Mar 2018
Interpretable Counting for Visual Question Answering
Interpretable Counting for Visual Question Answering
Alexander R. Trott
Caiming Xiong
R. Socher
106
71
0
23 Dec 2017
Improving End-to-End Speech Recognition with Policy Learning
Improving End-to-End Speech Recognition with Policy Learning
Yingbo Zhou
Caiming Xiong
R. Socher
79
40
0
19 Dec 2017
Monotonic Chunkwise Attention
Monotonic Chunkwise Attention
Chung-Cheng Chiu
Colin Raffel
98
256
0
14 Dec 2017
Plan, Attend, Generate: Planning for Sequence-to-Sequence Models
Plan, Attend, Generate: Planning for Sequence-to-Sequence Models
Francis Dutil
Çağlar Gülçehre
Adam Trischler
Yoshua Bengio
56
12
0
28 Nov 2017
An online sequence-to-sequence model for noisy speech recognition
An online sequence-to-sequence model for noisy speech recognition
Chung-Cheng Chiu
Dieterich Lawson
Yuping Luo
George Tucker
Kevin Swersky
Ilya Sutskever
Navdeep Jaitly
57
7
0
16 Jun 2017
Learning Hard Alignments with Variational Inference
Learning Hard Alignments with Variational Inference
Dieterich Lawson
Chung-Cheng Chiu
George Tucker
Colin Raffel
Kevin Swersky
Navdeep Jaitly
DRL
62
29
0
16 May 2017
Online and Linear-Time Attention by Enforcing Monotonic Alignments
Online and Linear-Time Attention by Enforcing Monotonic Alignments
Colin Raffel
Minh-Thang Luong
Peter J. Liu
Ron J. Weiss
Douglas Eck
114
261
0
03 Apr 2017
Training a Subsampling Mechanism in Expectation
Training a Subsampling Mechanism in Expectation
Colin Raffel
Dieterich Lawson
65
4
0
22 Feb 2017
Deep Reinforcement Learning: An Overview
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRLVLM
346
1,550
0
25 Jan 2017
Regularizing Neural Networks by Penalizing Confident Output
  Distributions
Regularizing Neural Networks by Penalizing Confident Output Distributions
Gabriel Pereyra
George Tucker
J. Chorowski
Lukasz Kaiser
Geoffrey E. Hinton
NoLa
229
1,142
0
23 Jan 2017
Towards better decoding and language model integration in sequence to
  sequence models
Towards better decoding and language model integration in sequence to sequence models
J. Chorowski
Navdeep Jaitly
106
370
0
08 Dec 2016
Learning to Translate in Real-time with Neural Machine Translation
Learning to Translate in Real-time with Neural Machine Translation
Jiatao Gu
Graham Neubig
Kyunghyun Cho
Victor O.K. Li
100
219
0
03 Oct 2016
1