T$^\star$: Progressive Block Scaling for Masked Diffusion Language Models Through Trajectory Aware Reinforcement Learning

30 March 2026

Hanchen Xia

Baoyou Chen

Yutang Ge

Guojiang Zhao

Siyu Zhu

LRM

AI4CE

Main:4 Pages

4 Figures

Bibliography:2 Pages

2 Tables

Appendix:2 Pages

Abstract

We present T$^\star$, a simple TraceRL-based training curriculum for progressive block-size scaling in masked diffusion language models (MDMs). Starting from an AR-initialized small-block MDM, T$^\star$ transitions smoothly to larger blocks, enabling higher-parallelism decoding with minimal performance degradation on math reasoning benchmarks. Further analysis suggests that T$^\star$ may converge to an alternative decoding schedule that achieves comparable performance.
