ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1410.8206
115
788
v1v2v3v4 (latest)

Addressing the Rare Word Problem in Neural Machine Translation

30 October 2014
Thang Luong
Ilya Sutskever
Quoc V. Le
Oriol Vinyals
Wojciech Zaremba
    AIMatAAML
ArXiv (abs)PDFHTML
Abstract

Neural Machine Translation (NMT) has recently attracted a lot of attention due to the very high performance achieved by deep neural networks in other domains. An inherent weakness in existing NMT systems is their inability to correctly translate rare words: end-to-end NMTs tend to have relatively small vocabularies with a single "unknown-word" symbol representing every possible out-of-vocabulary (OOV) word. In this paper, we propose and implement a simple technique to address this problem. We train an NMT system on data that is augmented by the output of a word alignment algorithm, allowing the NMT system to output, for each OOV word in the target sentence, its corresponding word in the source sentence. This information is later utilized in a post-processing step that translates every OOV word using a dictionary. Our experiments on the WMT'14 English to French translation task show that this simple method provides a substantial improvement over an equivalent NMT system that does not use this technique. The performance of our system achieves a BLEU score of 37.5, which improves upon the previous best end-to-end NMT by 2.7 points. Our NMT system is the first to surpass the existing state-of-the-art performance on a WMT'14 contest task.

View on arXiv
Comments on this paper