Domain-Specialized Object Detection via Model-Level Mixtures of Experts

Svetlana Pavlitska
Malte Stüven
Beyza Keskin
J. Marius Zöllner
Main: 7 pages · Bibliography: 1 page · 6 figures · 6 tables
Abstract

Mixture-of-Experts (MoE) models provide a structured approach to combining specialized neural networks and offer greater interpretability than conventional ensembles. While MoEs have been successfully applied to image classification and semantic segmentation, their use in object detection remains limited due to challenges in merging dense and structured predictions. In this work, we investigate model-level mixtures of object detectors and analyze their suitability for improving performance and interpretability in object detection. We propose an MoE architecture that combines YOLO-based detectors trained on semantically disjoint data subsets, with a learned gating network that dynamically weights expert contributions. We study different strategies for fusing detection outputs and for training the gating mechanism, including balancing losses to prevent expert collapse. Experiments on the BDD100K dataset demonstrate that the proposed MoE consistently outperforms standard ensemble approaches and provides insights into expert specialization across domains, highlighting model-level MoEs as a viable alternative to traditional ensembling for object detection. Our code is available at this https URL.
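The gating-and-fusion scheme the abstract describes can be sketched minimally as follows. All names, the linear gating form, the squared-deviation balancing loss, and the weighted score fusion are illustrative assumptions for exposition, not the paper's exact formulation.

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax over expert logits.
    e = np.exp(z - z.max())
    return e / e.sum()

def gate_weights(features, W):
    # Gating network (hypothetical minimal form): a linear map over
    # image-level features followed by softmax, yielding one weight
    # per expert detector.
    return softmax(W @ features)

def balancing_loss(weights_batch):
    # Auxiliary loss penalizing deviation of average expert usage from
    # uniform, discouraging collapse onto a single expert (one common
    # variant; the paper studies several balancing losses).
    usage = weights_batch.mean(axis=0)           # mean weight per expert
    uniform = np.full_like(usage, 1.0 / len(usage))
    return float(((usage - uniform) ** 2).sum())

def fuse_scores(expert_scores, weights):
    # Weighted fusion of per-expert confidence scores for a matched box;
    # real detection fusion must also merge box coordinates and apply NMS.
    return float(np.dot(weights, expert_scores))
```

A gating network trained end to end with such a balancing term keeps all experts active while still letting the weights specialize per input domain.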
