FINN: A Framework for Fast, Scalable Binarized Neural Network Inference

1 December 2016

Papers citing "FINN: A Framework for Fast, Scalable Binarized Neural Network Inference"

50 / 222 papers shown

Title
Compact and Efficient Neural Networks for Image Recognition Based on Learned 2D Separable Transform Maxim Vashkevich Egor Krivalcevich 19 0 0 10 May 2025
Dynamic Tsetlin Machine Accelerators for On-Chip Training at the Edge using FPGAs Gang Mao Tousif Rahman Sidharth Maheshwari Bob Pattison Zhuang Shao R. Shafik Alex Yakovlev 29 0 0 28 Apr 2025
NeuraLUT-Assemble: Hardware-aware Assembling of Sub-Neural Networks for Efficient LUT Inference Marta Andronic George A. Constantinides 46 0 0 01 Apr 2025
Real-Time Multi-Object Tracking using YOLOv8 and SORT on a SoC FPGA Michal Danilowicz T. Kryjak VOT 58 0 0 17 Mar 2025
nanoML for Human Activity Recognition Alan T. L. Bacellar Mugdha P. Jadhao Shashank Nag P. Lima F. M. G. França L. John BDL 29 0 0 13 Feb 2025
TreeLUT: An Efficient Alternative to Deep Neural Networks for Inference Acceleration Using Gradient Boosted Decision Trees Alireza Khataei Kia Bazargan 28 1 0 02 Jan 2025
LUTMUL: Exceed Conventional FPGA Roofline Limit by LUT-based Efficient Multiplication for Neural Network Inference Yanyue Xie Zhengang Li Dana Diaconu Suranga Handagala M. Leeser Xue Lin 69 0 0 01 Nov 2024
CGRA4ML: A Framework to Implement Modern Neural Networks for Scientific Edge Computing G. Abarajithan Zhenghua Ma Zepeng Li Shrideep Koparkar Ravidu Munasinghe Francesco Restuccia Ryan Kastner 22 1 0 28 Aug 2024
H2PIPE: High throughput CNN Inference on FPGAs with High-Bandwidth Memory Mario Doumet Marius Stan Mathew Hall Vaughn Betz 21 1 0 17 Aug 2024
PowerYOLO: Mixed Precision Model for Hardware Efficient Object Detection with Event Data Dominika Przewlocka-Rus T. Kryjak M. Gorgon 29 0 0 11 Jul 2024
Learning Interpretable Differentiable Logic Networks Chang Yue N. Jha NAI AI4CE 29 0 0 04 Jul 2024
Fast, Scalable, Energy-Efficient Non-element-wise Matrix Multiplication on FPGA Xuqi Zhu Huaizhi Zhang JunKyu Lee Jiacheng Zhu Chandrajit Pal S. Saha Klaus D. McDonald-Maier X. Zhai 21 0 0 02 Jul 2024
PolyLUT-Add: FPGA-based LUT Inference with Wide Inputs Binglei Lou Richard Rademacher David Boland Philip H. W. Leong 41 4 0 07 Jun 2024
Fusing Depthwise and Pointwise Convolutions for Efficient Inference on GPUs Fareed Qararyah M. Azhar Mohammad Ali Maleki Pedro Trancoso 29 1 0 30 Apr 2024
SMOF: Streaming Modern CNNs on FPGAs with Smart Off-Chip Eviction Petros Toupas Zhewen Yu C. Bouganis Dimitrios Tzovaras 25 0 0 27 Mar 2024
Architectural Implications of Neural Network Inference for High Data-Rate, Low-Latency Scientific Applications Olivia Weng Alexander Redding Nhan Tran Javier Mauricio Duarte Ryan Kastner 32 4 0 13 Mar 2024
NASH: Neural Architecture Search for Hardware-Optimized Machine Learning Models Mengfei Ji Yuchun Chang Baolin Zhang Zaid Al-Ars 19 0 0 04 Mar 2024
MATADOR: Automated System-on-Chip Tsetlin Machine Design Generation for Edge Applications Tousif Rahman Gang Mao Sidharth Maheshwari R. Shafik Alexandre Yakovlev 14 2 0 03 Mar 2024
NeuraLUT: Hiding Neural Network Density in Boolean Synthesizable Functions Marta Andronic George A. Constantinides 26 5 0 29 Feb 2024
Quantised Neural Network Accelerators for Low-Power IDS in Automotive Networks Shashwat Khandelwal Anneliese Walsh Shanker Shreejith 21 2 0 19 Jan 2024
Exploring Highly Quantised Neural Networks for Intrusion Detection in Automotive CAN Shashwat Khandelwal Shanker Shreejith 18 0 0 19 Jan 2024
A Lightweight FPGA-based IDS-ECU Architecture for Automotive CAN Shashwat Khandelwal Shanker Shreejith 11 13 0 19 Jan 2024
A Lightweight Multi-Attack CAN Intrusion Detection System on Hybrid FPGAs Shashwat Khandelwal Shanker Shreejith 15 11 0 19 Jan 2024
Exploration of Activation Fault Reliability in Quantized Systolic Array-Based DNN Accelerators Mahdi Taheri N. Cherezova M. S. Ansari M. Jenihhin A. Mahani Masoud Daneshtalab J. Raik 26 12 0 17 Jan 2024
Understanding the Potential of FPGA-Based Spatial Acceleration for Large Language Model Inference Hongzheng Chen Jiahao Zhang Yixiao Du Shaojie Xiang Zichao Yue Niansong Zhang Yaohui Cai Zhiru Zhang 55 34 0 23 Dec 2023
An Encoding Framework for Binarized Images using HyperDimensional Computing Laura Smets W. V. Leekwijck Ing Jyh Tsang Steven Latré 16 2 0 01 Dec 2023
When Side-Channel Attacks Break the Black-Box Property of Embedded Artificial Intelligence Benoît Coqueret Mathieu Carbone Olivier Sentieys Gabriel Zaid 58 2 0 23 Nov 2023
Shedding the Bits: Pushing the Boundaries of Quantization with Minifloats on FPGAs Shivam Aggarwal Hans Jakob Damsgaard Alessandro Pappalardo Giuseppe Franco Thomas B. Preußer Michaela Blott Tulika Mitra MQ 24 5 0 21 Nov 2023
Quantization-aware Neural Architectural Search for Intrusion Detection R. Acharya Laurens Le Jeune N. Mentens F. Ganji Domenic Forte 8 0 0 07 Nov 2023
Cost-Driven Hardware-Software Co-Optimization of Machine Learning Pipelines Ravit Sharma W. Romaszkan Feiqian Zhu Puneet Gupta Ankur Mehta 27 0 0 11 Oct 2023
Resilience of Deep Learning applications: a systematic literature review of analysis and hardening techniques C. Bolchini Qiyuan Chen Xianhao Chen AAML 15 0 0 27 Sep 2023
PolyLUT: Learning Piecewise Polynomials for Ultra-Low Latency FPGA LUT-based Inference Marta Andronic George A. Constantinides 30 17 0 05 Sep 2023
MST-compression: Compressing and Accelerating Binary Neural Networks with Minimum Spanning Tree Quang Hieu Vo Linh-Tam Tran Sung-Ho Bae Lokwon Kim Choong Seon Hong MQ 40 1 0 26 Aug 2023
A2Q: Accumulator-Aware Quantization with Guaranteed Overflow Avoidance Ian Colbert Alessandro Pappalardo Jakoba Petri-Koenig MQ 24 9 0 25 Aug 2023
FPGA Resource-aware Structured Pruning for Real-Time Neural Networks Benjamin Ramhorst Vladimir Loncar George A. Constantinides 33 4 0 09 Aug 2023
Mercury: An Automated Remote Side-channel Attack to Nvidia Deep Learning Accelerator Xi-ai Yan Xiaoxuan Lou Guowen Xu Han Qiu Shangwei Guo Chip Hong Chang Tianwei Zhang AAML 19 7 0 02 Aug 2023
Mitigating Memory Wall Effects in CNN Engines with On-the-Fly Weights Generation Stylianos I. Venieris Javier Fernandez-Marques Nicholas D. Lane MQ 27 3 0 25 Jul 2023
A Survey of Spiking Neural Network Accelerator on FPGA Murat Isik 18 16 0 08 Jul 2023
Binary domain generalization for sparsifying binary neural networks Riccardo Schiavone Francesco Galati Maria A. Zuluaga MQ 19 0 0 23 Jun 2023
MetaML: Automating Customizable Cross-Stage Design-Flow for Deep Learning Acceleration Zhiqiang Que Shuo Liu Markus Rognlien Ce Guo Jose G. F. Coutinho Wayne Luk 18 4 0 14 Jun 2023
A Systematic Literature Review on Hardware Reliability Assessment Methods for Deep Neural Networks Mohammad Hasan Ahmadilivani Mahdi Taheri J. Raik Masoud Daneshtalab M. Jenihhin 35 25 0 09 May 2023
DeepFire2: A Convolutional Spiking Neural Network Accelerator on FPGAs M. Aung Daniel Gerlinghoff Chuping Qu Liwei Yang Tian Huang Rick Siow Mong Goh Tao Luo Weng-Fai Wong 16 9 0 09 May 2023
Dynamically Reconfigurable Variable-precision Sparse-Dense Matrix Acceleration in Tensorflow Lite J. Núñez-Yáñez A. Otero E. D. L. Torre 20 3 0 17 Apr 2023
End-to-end codesign of Hessian-aware quantized neural networks for FPGAs and ASICs Javier Campos Zhen Dong Javier Mauricio Duarte A. Gholami Michael W. Mahoney Jovan Mitrevski Nhan Tran MQ 32 3 0 13 Apr 2023
A Hybrid Approach combining ANN-based and Conventional Demapping in Communication for Efficient FPGA-Implementation Jonas Ney Bilal Hammoud Norbert Wehn 18 2 0 11 Apr 2023
HARFLOW3D: A Latency-Oriented 3D-CNN Accelerator Toolflow for HAR on FPGA Devices Petros Toupas Alexander Montgomerie-Corcoran C. Bouganis Dimitrios Tzovaras 25 8 0 30 Mar 2023
DeepAxe: A Framework for Exploration of Approximation and Reliability Trade-offs in DNN Accelerators Mahdi Taheri M. Riazati Mohammad Hasan Ahmadilivani M. Jenihhin Masoud Daneshtalab J. Raik Mikael Sjödin B. Lisper 52 20 0 14 Mar 2023
Fixed-point quantization aware training for on-device keyword-spotting Sashank Macha Om Oza Alex Escott Francesco Calivá Robert M. Armitano S. Cheekatmalla S. Parthasarathi Yuzong Liu MQ 18 4 0 04 Mar 2023
Fixflow: A Framework to Evaluate Fixed-point Arithmetic in Light-Weight CNN Inference Farhad Taheri Siavash Bayat Sarmadi H. Mosanaei-Boorani Reza Taheri MQ 23 1 0 19 Feb 2023
Moby: Empowering 2D Models for Efficient Point Cloud Analytics on the Edge Jingzong Li Yik Hong Cai Libin Liu Yushun Mao Chun Jason Xue Hongchang Xu 17 3 0 18 Feb 2023