DeepSeek-R1 vs. o3-mini: How Well can Reasoning LLMs Evaluate MT and Summarization?

v1v2v3 (latest)

DeepSeek-R1 vs. o3-mini: How Well can Reasoning LLMs Evaluate MT and Summarization?

10 April 2025

Daniil Larionov

Sotaro Takeshita

Christoph Leiter

Christian Greisinger

ArXiv (abs)PDF HTML

Papers citing "DeepSeek-R1 vs. o3-mini: How Well can Reasoning LLMs Evaluate MT and Summarization?"

1 / 1 papers shown

Title
ExTrans: Multilingual Deep Reasoning Translation via Exemplar-Enhanced Reinforcement Learning Jiaan Wang Fandong Meng Jie Zhou LRM 42 0 0 19 May 2025

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.