Johnny: Structuring Representation Space to Enhance Machine Abstract Reasoning Ability

13 May 2025

Main:13 Pages

18 Figures

Bibliography:2 Pages

6 Tables

Abstract

This paper thoroughly investigates the challenges of enhancing AI's abstract reasoning capabilities, with a particular focus on Raven's Progressive Matrices (RPM) tasks involving complex human-like concepts. Firstly, it dissects the empirical reality that traditional end-to-end RPM-solving models heavily rely on option pool configurations, highlighting that this dependency constrains the model's reasoning capabilities. To address this limitation, the paper proposes the Johnny architecture - a novel representation space-based framework for RPM-solving. Through the synergistic operation of its Representation Extraction Module and Reasoning Module, Johnny significantly enhances reasoning performance by supplementing primitive negative option configurations with a learned representation space. Furthermore, to strengthen the model's capacity for capturing positional relationships among local features, the paper introduces the Spin-Transformer network architecture, accompanied by a lightweight Straw Spin-Transformer variant that reduces computational overhead through parameter sharing and attention mechanism optimization. Experimental evaluations demonstrate that both Johnny and Spin-Transformer achieve superior performance on RPM tasks, offering innovative methodologies for advancing AI's abstract reasoning capabilities.

View on arXiv

@article{song2025_2506.01970,
  title={ Johnny: Structuring Representation Space to Enhance Machine Abstract Reasoning Ability },
  author={ Ruizhuo Song and Beiming Yuan },
  journal={arXiv preprint arXiv:2506.01970},
  year={ 2025 }
}

Comments on this paper