47
0

VirnyFlow: A Design Space for Responsible Model Development

Main:8 Pages
10 Figures
Bibliography:2 Pages
9 Tables
Appendix:6 Pages
Abstract

Developing machine learning (ML) models requires a deep understanding of real-world problems, which are inherently multi-objective. In this paper, we present VirnyFlow, the first design space for responsible model development, designed to assist data scientists in building ML pipelines that are tailored to the specific context of their problem. Unlike conventional AutoML frameworks, VirnyFlow enables users to define customized optimization criteria, perform comprehensive experimentation across pipeline stages, and iteratively refine models in alignment with real-world constraints. Our system integrates evaluation protocol definition, multi-objective Bayesian optimization, cost-aware multi-armed bandits, query optimization, and distributed parallelism into a unified architecture. We show that VirnyFlow significantly outperforms state-of-the-art AutoML systems in both optimization quality and scalability across five real-world benchmarks, offering a flexible, efficient, and responsible alternative to black-box automation in ML development.

View on arXiv
@article{herasymuk2025_2506.01584,
  title={ VirnyFlow: A Design Space for Responsible Model Development },
  author={ Denys Herasymuk and Nazar Protsiv and Julia Stoyanovich },
  journal={arXiv preprint arXiv:2506.01584},
  year={ 2025 }
}
Comments on this paper