Leveraging Neural Graph Compilers in Machine Learning Research for Edge-Cloud Systems

28 April 2025

Alireza Furutanpey

Carmen Walser

Philipp Raith

Pantelis A. Frangoudis

Schahram Dustdar

GNN

NAI

ArXiv PDF HTML

Abstract

This work presents a comprehensive evaluation of neural network graph compilers across heterogeneous hardware platforms, addressing the critical gap between theoretical optimization techniques and practical deployment scenarios. We demonstrate how vendor-specific optimizations can invalidate relative performance comparisons between architectural archetypes, with performance advantages sometimes completely reversing after compilation. Our systematic analysis reveals that graph compilers exhibit performance patterns highly dependent on both neural architecture and batch sizes. Through fine-grained block-level experimentation, we establish that vendor-specific compilers can leverage repeated patterns in simple architectures, yielding disproportionate throughput gains as model depth increases. We introduce novel metrics to quantify a compiler's ability to mitigate performance friction as batch size increases. Our methodology bridges the gap between academic research and practical deployment by incorporating compiler effects throughout the research process, providing actionable insights for practitioners navigating complex optimization landscapes across heterogeneous hardware environments.

View on arXiv

@article{furutanpey2025_2504.20198,
  title={ Leveraging Neural Graph Compilers in Machine Learning Research for Edge-Cloud Systems },
  author={ Alireza Furutanpey and Carmen Walser and Philipp Raith and Pantelis A. Frangoudis and Schahram Dustdar },
  journal={arXiv preprint arXiv:2504.20198},
  year={ 2025 }
}

Comments on this paper