ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.21172
38
0

TAT-R1: Terminology-Aware Translation with Reinforcement Learning and Word Alignment

27 May 2025
Zheng Li
Mao Zheng
Mingyang Song
Wenjie Yang
ArXiv (abs)PDFHTML
Main:8 Pages
4 Figures
Bibliography:3 Pages
3 Tables
Abstract

Recently, deep reasoning large language models(LLMs) like DeepSeek-R1 have made significant progress in tasks such as mathematics and coding. Inspired by this, several studies have employed reinforcement learning(RL) to enhance models' deep reasoning capabilities and improve machine translation(MT) quality. However, the terminology translation, an essential task in MT, remains unexplored in deep reasoning LLMs. In this paper, we propose \textbf{TAT-R1}, a terminology-aware translation model trained with reinforcement learning and word alignment. Specifically, we first extract the keyword translation pairs using a word alignment model. Then we carefully design three types of rule-based alignment rewards with the extracted alignment relationships. With those alignment rewards, the RL-trained translation model can learn to focus on the accurate translation of key information, including terminology in the source text. Experimental results show the effectiveness of TAT-R1. Our model significantly improves terminology translation accuracy compared to the baseline models while maintaining comparable performance on general translation tasks. In addition, we conduct detailed ablation studies of the DeepSeek-R1-like training paradigm for machine translation and reveal several key findings.

View on arXiv
@article{li2025_2505.21172,
  title={ TAT-R1: Terminology-Aware Translation with Reinforcement Learning and Word Alignment },
  author={ Zheng Li and Mao Zheng and Mingyang Song and Wenjie Yang },
  journal={arXiv preprint arXiv:2505.21172},
  year={ 2025 }
}
Comments on this paper