Convolutional Neural Networks (CNNs) have gained high popularity as a tool for computer vision tasks and for that reason are used in various applications. There are many different concepts, like single shot detectors, that have been published for detecting objects in images or video streams. However, CNNs suffer from disadvantages regarding the deployment on embedded platforms such as re-configurable hardware like Field Programmable Gate Arrays (FPGAs). Due to the high computational intensity, memory requirements and arithmetic conditions, a variety of strategies for running CNNs on FPGAs have been developed. The following methods showcase our best practice approaches for a TinyYOLOv3 detector network on a XILINX Artix-7 FPGA using techniques like fusion of batch normalization, filter pruning and post training network quantization.
View on arXiv@article{wunderlich2025_2505.13575, title={ An Overview of Arithmetic Adaptations for Inference of Convolutional Neural Networks on Re-configurable Hardware }, author={ Ilkay Wunderlich and Benjamin Koch and Sven Schönfeld }, journal={arXiv preprint arXiv:2505.13575}, year={ 2025 } }