2503.03308
Cited By
The Box is in the Pen: Evaluating Commonsense Reasoning in Neural Machine Translation
5 March 2025
Jie He, Tao Wang, Deyi Xiong, Qun Liu
ELM, LRM
Papers citing "The Box is in the Pen: Evaluating Commonsense Reasoning in Neural Machine Translation"
19 / 19 papers shown
When Debate Fails: Bias Reinforcement in Large Language Models
Jihwan Oh, Minchan Jeong, Jongwoo Ko, Se-Young Yun
LLMAG, AI4CE
49 | 0 | 0
21 Mar 2025

New Trends for Modern Machine Translation with Large Reasoning Models
Sinuo Liu, Chenyang Lyu, Mingyang Wu, Longyue Wang, Weihua Luo, Kaifu Zhang, Zifu Shang
LRM
65 | 2 | 0
13 Mar 2025

Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation
Yingfeng Luo, Tong Zheng, Yongyu Mu, Yangqiu Song, Qinghong Zhang, ..., Ziqiang Xu, Peinan Feng, Xiaoqian Liu, Tong Xiao, Jingbo Zhu
AI4CE
170 | 0 | 0
09 Mar 2025

LLM-based Translation Inference with Iterative Bilingual Understanding
Andong Chen, Kehai Chen, Yang Xiang, Xuefeng Bai, Muyun Yang, Yang Feng, T. Zhao, Min Zhang
LRM
84 | 5 | 0
31 Dec 2024

The Mystery of Compositional Generalization in Graph-based Generative Commonsense Reasoning
Xiyan Fu, Anette Frank
LRM
33 | 0 | 0
08 Oct 2024

Meta-RTL: Reinforcement-Based Meta-Transfer Learning for Low-Resource Commonsense Reasoning
Yu Fu, Jie He, Yifan Yang, Qun Liu, Deyi Xiong
OffRL, LRM
45 | 0 | 0
27 Sep 2024

The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic Preference Optimization Dataset Generation
Samee Arif, Sualeha Farid, Abdul Hameed Azeemi, Awais Athar, Agha Ali Raza
LLMAG
24 | 7 | 0
16 Aug 2024

DUAL-REFLECT: Enhancing Large Language Models for Reflective Translation through Dual Learning Feedback Mechanisms
Andong Chen, Lianzhang Lou, Kehai Chen, Xuefeng Bai, Yang Xiang, Muyun Yang, Tiejun Zhao, Min Zhang
VLM
47 | 12 | 0
11 Jun 2024

A Picture Is Worth a Graph: Blueprint Debate on Graph for Multimodal Reasoning
Changmeng Zheng, Dayong Liang, Wengyu Zhang, Xiao Wei, Tat-Seng Chua, Qing Li
40 | 1 | 0
22 Mar 2024

OpenEval: Benchmarking Chinese LLMs across Capability, Alignment and Safety
Chuang Liu, Linhao Yu, Jiaxuan Li, Renren Jin, Yufei Huang, ..., Tao Liu, Jinwang Song, Hongying Zan, Sun Li, Deyi Xiong
ELM
37 | 7 | 0
18 Mar 2024

Rethinking Human-like Translation Strategy: Integrating Drift-Diffusion Model with Large Language Models for Machine Translation
Hongbin Na, Zimu Wang, M. Maimaiti, Tong Chen, Wei Wang, Tao Shen, Ling Chen
LRM
25 | 5 | 0
16 Feb 2024

Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives
Wenqi Zhang, Yongliang Shen, Linjuan Wu, Qiuying Peng, Jun Wang, Y. Zhuang, Weiming Lu
LRM, LLMAG
40 | 48 | 0
04 Jan 2024

CRoW: Benchmarking Commonsense Reasoning in Real-World Tasks
Mete Ismayilzada, Debjit Paul, Syrielle Montariol, Mor Geva, Antoine Bosselut
LRM
25 | 5 | 0
23 Oct 2023

Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate
Tian Liang, Zhiwei He, Wenxiang Jiao, Xing Wang, Rui Wang, Yujiu Yang, Zhaopeng Tu, Shuming Shi
LLMAG, LRM
37 | 401 | 0
30 May 2023

Exploring Human-Like Translation Strategy with Large Language Models
Zhiwei He, Tian Liang, Wenxiang Jiao, Zhuosheng Zhang, Yujiu Yang, Rui Wang, Zhaopeng Tu, Shuming Shi, Xing Wang
26 | 39 | 0
06 May 2023

RuCoLA: Russian Corpus of Linguistic Acceptability
Vladislav Mikhailov, T. Shamardina, Max Ryabinin, A. Pestova, I. Smurov, Ekaterina Artemova
30 | 28 | 0
23 Oct 2022

On the Limits of Minimal Pairs in Contrastive Evaluation
Jannis Vamvas, Rico Sennrich
49 | 16 | 0
15 Sep 2021

Incorporating External Knowledge into Machine Reading for Generative Question Answering
Bin Bi, Chen Henry Wu, Ming Yan, Wei Wang, Jiangnan Xia, Chenliang Li
RALM
176 | 44 | 0
06 Sep 2019

Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu, M. Schuster, Z. Chen, Quoc V. Le, Mohammad Norouzi, ..., Alex Rudnick, Oriol Vinyals, G. Corrado, Macduff Hughes, J. Dean
AIMat
716 | 6,746 | 0
26 Sep 2016