arXiv:1808.10802

The MeMAD Submission to the WMT18 Multimodal Translation Task

31 August 2018
Stig-Arne Grönroos
B. Huet
M. Kurimo
Jorma T. Laaksonen
B. Mérialdo
Phu-Cuong Pham
Mats Sjöberg
U. Sulubacak
Jörg Tiedemann
Raphael Troncy
Raúl Vázquez
Abstract

This paper describes the MeMAD project entry to the WMT Multimodal Machine Translation Shared Task. We propose adapting the Transformer neural machine translation (NMT) architecture to a multi-modal setting. We also describe the preliminary experiments with text-only translation systems that led us to this choice. Our system is the top-scoring one for both English-to-German and English-to-French, according to the automatic metrics for flickr18. Our experiments show that the effect of the visual features in our system is small; our largest gains come from the quality of the underlying text-only NMT system. We also find that appropriate use of additional data is effective.
