ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1608.01281
  4. Cited By
Learning Online Alignments with Continuous Rewards Policy Gradient

Learning Online Alignments with Continuous Rewards Policy Gradient

3 August 2016
Yuping Luo
Chung-Cheng Chiu
Navdeep Jaitly
Ilya Sutskever
    OffRL
ArXivPDFHTML

Papers citing "Learning Online Alignments with Continuous Rewards Policy Gradient"

10 / 10 papers shown
Title
A Survey on Deep Reinforcement Learning for Audio-Based Applications
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Min Zhang
OffRL
54
73
0
01 Jan 2021
SimulMT to SimulST: Adapting Simultaneous Text Translation to End-to-End
  Simultaneous Speech Translation
SimulMT to SimulST: Adapting Simultaneous Text Translation to End-to-End Simultaneous Speech Translation
Xutai Ma
J. Pino
Philipp Koehn
17
94
0
03 Nov 2020
Efficient Wait-k Models for Simultaneous Machine Translation
Efficient Wait-k Models for Simultaneous Machine Translation
Maha Elbayad
Laurent Besacier
Jakob Verbeek
VLM
24
77
0
18 May 2020
Imputer: Sequence Modelling via Imputation and Dynamic Programming
Imputer: Sequence Modelling via Imputation and Dynamic Programming
William Chan
Chitwan Saharia
Geoffrey E. Hinton
Mohammad Norouzi
Navdeep Jaitly
BDL
AI4TS
21
114
0
20 Feb 2020
Monotonic Multihead Attention
Monotonic Multihead Attention
Xutai Ma
J. Pino
James Cross
Liezl Puzon
Jiatao Gu
25
137
0
26 Sep 2019
Maximum-Entropy Fine-Grained Classification
Maximum-Entropy Fine-Grained Classification
Abhimanyu Dubey
O. Gupta
Ramesh Raskar
Nikhil Naik
28
156
0
16 Sep 2018
An online sequence-to-sequence model for noisy speech recognition
An online sequence-to-sequence model for noisy speech recognition
Chung-Cheng Chiu
Dieterich Lawson
Yuping Luo
George Tucker
Kevin Swersky
Ilya Sutskever
Navdeep Jaitly
19
7
0
16 Jun 2017
Online and Linear-Time Attention by Enforcing Monotonic Alignments
Online and Linear-Time Attention by Enforcing Monotonic Alignments
Colin Raffel
Minh-Thang Luong
Peter J. Liu
Ron J. Weiss
Douglas Eck
32
255
0
03 Apr 2017
Deep Reinforcement Learning: An Overview
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,503
0
25 Jan 2017
Towards better decoding and language model integration in sequence to
  sequence models
Towards better decoding and language model integration in sequence to sequence models
J. Chorowski
Navdeep Jaitly
17
368
0
08 Dec 2016
1