ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.07119
  4. Cited By
FINN: A Framework for Fast, Scalable Binarized Neural Network Inference

FINN: A Framework for Fast, Scalable Binarized Neural Network Inference

1 December 2016
Yaman Umuroglu
Nicholas J. Fraser
Giulio Gambardella
Michaela Blott
Philip H. W. Leong
Magnus Jahre
K. Vissers
    MQ
ArXivPDFHTML

Papers citing "FINN: A Framework for Fast, Scalable Binarized Neural Network Inference"

50 / 222 papers shown
Title
Compact and Efficient Neural Networks for Image Recognition Based on Learned 2D Separable Transform
Compact and Efficient Neural Networks for Image Recognition Based on Learned 2D Separable Transform
Maxim Vashkevich
Egor Krivalcevich
19
0
0
10 May 2025
Dynamic Tsetlin Machine Accelerators for On-Chip Training at the Edge using FPGAs
Dynamic Tsetlin Machine Accelerators for On-Chip Training at the Edge using FPGAs
Gang Mao
Tousif Rahman
Sidharth Maheshwari
Bob Pattison
Zhuang Shao
R. Shafik
Alex Yakovlev
29
0
0
28 Apr 2025
NeuraLUT-Assemble: Hardware-aware Assembling of Sub-Neural Networks for Efficient LUT Inference
NeuraLUT-Assemble: Hardware-aware Assembling of Sub-Neural Networks for Efficient LUT Inference
Marta Andronic
George A. Constantinides
46
0
0
01 Apr 2025
Real-Time Multi-Object Tracking using YOLOv8 and SORT on a SoC FPGA
Real-Time Multi-Object Tracking using YOLOv8 and SORT on a SoC FPGA
Michal Danilowicz
T. Kryjak
VOT
58
0
0
17 Mar 2025
nanoML for Human Activity Recognition
nanoML for Human Activity Recognition
Alan T. L. Bacellar
Mugdha P. Jadhao
Shashank Nag
P. Lima
F. M. G. França
L. John
BDL
29
0
0
13 Feb 2025
TreeLUT: An Efficient Alternative to Deep Neural Networks for Inference Acceleration Using Gradient Boosted Decision Trees
TreeLUT: An Efficient Alternative to Deep Neural Networks for Inference Acceleration Using Gradient Boosted Decision Trees
Alireza Khataei
Kia Bazargan
28
1
0
02 Jan 2025
LUTMUL: Exceed Conventional FPGA Roofline Limit by LUT-based Efficient
  Multiplication for Neural Network Inference
LUTMUL: Exceed Conventional FPGA Roofline Limit by LUT-based Efficient Multiplication for Neural Network Inference
Yanyue Xie
Zhengang Li
Dana Diaconu
Suranga Handagala
M. Leeser
Xue Lin
69
0
0
01 Nov 2024
CGRA4ML: A Framework to Implement Modern Neural Networks for Scientific
  Edge Computing
CGRA4ML: A Framework to Implement Modern Neural Networks for Scientific Edge Computing
G. Abarajithan
Zhenghua Ma
Zepeng Li
Shrideep Koparkar
Ravidu Munasinghe
Francesco Restuccia
Ryan Kastner
22
1
0
28 Aug 2024
H2PIPE: High throughput CNN Inference on FPGAs with High-Bandwidth
  Memory
H2PIPE: High throughput CNN Inference on FPGAs with High-Bandwidth Memory
Mario Doumet
Marius Stan
Mathew Hall
Vaughn Betz
21
1
0
17 Aug 2024
PowerYOLO: Mixed Precision Model for Hardware Efficient Object Detection
  with Event Data
PowerYOLO: Mixed Precision Model for Hardware Efficient Object Detection with Event Data
Dominika Przewlocka-Rus
T. Kryjak
M. Gorgon
29
0
0
11 Jul 2024
Learning Interpretable Differentiable Logic Networks
Learning Interpretable Differentiable Logic Networks
Chang Yue
N. Jha
NAI
AI4CE
29
0
0
04 Jul 2024
Fast, Scalable, Energy-Efficient Non-element-wise Matrix Multiplication
  on FPGA
Fast, Scalable, Energy-Efficient Non-element-wise Matrix Multiplication on FPGA
Xuqi Zhu
Huaizhi Zhang
JunKyu Lee
Jiacheng Zhu
Chandrajit Pal
S. Saha
Klaus D. McDonald-Maier
X. Zhai
21
0
0
02 Jul 2024
PolyLUT-Add: FPGA-based LUT Inference with Wide Inputs
PolyLUT-Add: FPGA-based LUT Inference with Wide Inputs
Binglei Lou
Richard Rademacher
David Boland
Philip H. W. Leong
41
4
0
07 Jun 2024
Fusing Depthwise and Pointwise Convolutions for Efficient Inference on
  GPUs
Fusing Depthwise and Pointwise Convolutions for Efficient Inference on GPUs
Fareed Qararyah
M. Azhar
Mohammad Ali Maleki
Pedro Trancoso
29
1
0
30 Apr 2024
SMOF: Streaming Modern CNNs on FPGAs with Smart Off-Chip Eviction
SMOF: Streaming Modern CNNs on FPGAs with Smart Off-Chip Eviction
Petros Toupas
Zhewen Yu
C. Bouganis
Dimitrios Tzovaras
25
0
0
27 Mar 2024
Architectural Implications of Neural Network Inference for High
  Data-Rate, Low-Latency Scientific Applications
Architectural Implications of Neural Network Inference for High Data-Rate, Low-Latency Scientific Applications
Olivia Weng
Alexander Redding
Nhan Tran
Javier Mauricio Duarte
Ryan Kastner
32
4
0
13 Mar 2024
NASH: Neural Architecture Search for Hardware-Optimized Machine Learning
  Models
NASH: Neural Architecture Search for Hardware-Optimized Machine Learning Models
Mengfei Ji
Yuchun Chang
Baolin Zhang
Zaid Al-Ars
19
0
0
04 Mar 2024
MATADOR: Automated System-on-Chip Tsetlin Machine Design Generation for
  Edge Applications
MATADOR: Automated System-on-Chip Tsetlin Machine Design Generation for Edge Applications
Tousif Rahman
Gang Mao
Sidharth Maheshwari
R. Shafik
Alexandre Yakovlev
14
2
0
03 Mar 2024
NeuraLUT: Hiding Neural Network Density in Boolean Synthesizable
  Functions
NeuraLUT: Hiding Neural Network Density in Boolean Synthesizable Functions
Marta Andronic
George A. Constantinides
26
5
0
29 Feb 2024
Quantised Neural Network Accelerators for Low-Power IDS in Automotive
  Networks
Quantised Neural Network Accelerators for Low-Power IDS in Automotive Networks
Shashwat Khandelwal
Anneliese Walsh
Shanker Shreejith
21
2
0
19 Jan 2024
Exploring Highly Quantised Neural Networks for Intrusion Detection in
  Automotive CAN
Exploring Highly Quantised Neural Networks for Intrusion Detection in Automotive CAN
Shashwat Khandelwal
Shanker Shreejith
18
0
0
19 Jan 2024
A Lightweight FPGA-based IDS-ECU Architecture for Automotive CAN
A Lightweight FPGA-based IDS-ECU Architecture for Automotive CAN
Shashwat Khandelwal
Shanker Shreejith
11
13
0
19 Jan 2024
A Lightweight Multi-Attack CAN Intrusion Detection System on Hybrid
  FPGAs
A Lightweight Multi-Attack CAN Intrusion Detection System on Hybrid FPGAs
Shashwat Khandelwal
Shanker Shreejith
15
11
0
19 Jan 2024
Exploration of Activation Fault Reliability in Quantized Systolic
  Array-Based DNN Accelerators
Exploration of Activation Fault Reliability in Quantized Systolic Array-Based DNN Accelerators
Mahdi Taheri
N. Cherezova
M. S. Ansari
M. Jenihhin
A. Mahani
Masoud Daneshtalab
J. Raik
26
12
0
17 Jan 2024
Understanding the Potential of FPGA-Based Spatial Acceleration for Large
  Language Model Inference
Understanding the Potential of FPGA-Based Spatial Acceleration for Large Language Model Inference
Hongzheng Chen
Jiahao Zhang
Yixiao Du
Shaojie Xiang
Zichao Yue
Niansong Zhang
Yaohui Cai
Zhiru Zhang
55
34
0
23 Dec 2023
An Encoding Framework for Binarized Images using HyperDimensional
  Computing
An Encoding Framework for Binarized Images using HyperDimensional Computing
Laura Smets
W. V. Leekwijck
Ing Jyh Tsang
Steven Latré
16
2
0
01 Dec 2023
When Side-Channel Attacks Break the Black-Box Property of Embedded
  Artificial Intelligence
When Side-Channel Attacks Break the Black-Box Property of Embedded Artificial Intelligence
Benoît Coqueret
Mathieu Carbone
Olivier Sentieys
Gabriel Zaid
58
2
0
23 Nov 2023
Shedding the Bits: Pushing the Boundaries of Quantization with
  Minifloats on FPGAs
Shedding the Bits: Pushing the Boundaries of Quantization with Minifloats on FPGAs
Shivam Aggarwal
Hans Jakob Damsgaard
Alessandro Pappalardo
Giuseppe Franco
Thomas B. Preußer
Michaela Blott
Tulika Mitra
MQ
24
5
0
21 Nov 2023
Quantization-aware Neural Architectural Search for Intrusion Detection
Quantization-aware Neural Architectural Search for Intrusion Detection
R. Acharya
Laurens Le Jeune
N. Mentens
F. Ganji
Domenic Forte
8
0
0
07 Nov 2023
Cost-Driven Hardware-Software Co-Optimization of Machine Learning
  Pipelines
Cost-Driven Hardware-Software Co-Optimization of Machine Learning Pipelines
Ravit Sharma
W. Romaszkan
Feiqian Zhu
Puneet Gupta
Ankur Mehta
27
0
0
11 Oct 2023
Resilience of Deep Learning applications: a systematic literature review
  of analysis and hardening techniques
Resilience of Deep Learning applications: a systematic literature review of analysis and hardening techniques
C. Bolchini
Qiyuan Chen
Xianhao Chen
AAML
15
0
0
27 Sep 2023
PolyLUT: Learning Piecewise Polynomials for Ultra-Low Latency FPGA
  LUT-based Inference
PolyLUT: Learning Piecewise Polynomials for Ultra-Low Latency FPGA LUT-based Inference
Marta Andronic
George A. Constantinides
30
17
0
05 Sep 2023
MST-compression: Compressing and Accelerating Binary Neural Networks
  with Minimum Spanning Tree
MST-compression: Compressing and Accelerating Binary Neural Networks with Minimum Spanning Tree
Quang Hieu Vo
Linh-Tam Tran
Sung-Ho Bae
Lokwon Kim
Choong Seon Hong
MQ
40
1
0
26 Aug 2023
A2Q: Accumulator-Aware Quantization with Guaranteed Overflow Avoidance
A2Q: Accumulator-Aware Quantization with Guaranteed Overflow Avoidance
Ian Colbert
Alessandro Pappalardo
Jakoba Petri-Koenig
MQ
24
9
0
25 Aug 2023
FPGA Resource-aware Structured Pruning for Real-Time Neural Networks
FPGA Resource-aware Structured Pruning for Real-Time Neural Networks
Benjamin Ramhorst
Vladimir Loncar
George A. Constantinides
33
4
0
09 Aug 2023
Mercury: An Automated Remote Side-channel Attack to Nvidia Deep Learning
  Accelerator
Mercury: An Automated Remote Side-channel Attack to Nvidia Deep Learning Accelerator
Xi-ai Yan
Xiaoxuan Lou
Guowen Xu
Han Qiu
Shangwei Guo
Chip Hong Chang
Tianwei Zhang
AAML
19
7
0
02 Aug 2023
Mitigating Memory Wall Effects in CNN Engines with On-the-Fly Weights
  Generation
Mitigating Memory Wall Effects in CNN Engines with On-the-Fly Weights Generation
Stylianos I. Venieris
Javier Fernandez-Marques
Nicholas D. Lane
MQ
27
3
0
25 Jul 2023
A Survey of Spiking Neural Network Accelerator on FPGA
A Survey of Spiking Neural Network Accelerator on FPGA
Murat Isik
18
16
0
08 Jul 2023
Binary domain generalization for sparsifying binary neural networks
Binary domain generalization for sparsifying binary neural networks
Riccardo Schiavone
Francesco Galati
Maria A. Zuluaga
MQ
19
0
0
23 Jun 2023
MetaML: Automating Customizable Cross-Stage Design-Flow for Deep
  Learning Acceleration
MetaML: Automating Customizable Cross-Stage Design-Flow for Deep Learning Acceleration
Zhiqiang Que
Shuo Liu
Markus Rognlien
Ce Guo
Jose G. F. Coutinho
Wayne Luk
18
4
0
14 Jun 2023
A Systematic Literature Review on Hardware Reliability Assessment
  Methods for Deep Neural Networks
A Systematic Literature Review on Hardware Reliability Assessment Methods for Deep Neural Networks
Mohammad Hasan Ahmadilivani
Mahdi Taheri
J. Raik
Masoud Daneshtalab
M. Jenihhin
35
25
0
09 May 2023
DeepFire2: A Convolutional Spiking Neural Network Accelerator on FPGAs
DeepFire2: A Convolutional Spiking Neural Network Accelerator on FPGAs
M. Aung
Daniel Gerlinghoff
Chuping Qu
Liwei Yang
Tian Huang
Rick Siow Mong Goh
Tao Luo
Weng-Fai Wong
16
9
0
09 May 2023
Dynamically Reconfigurable Variable-precision Sparse-Dense Matrix
  Acceleration in Tensorflow Lite
Dynamically Reconfigurable Variable-precision Sparse-Dense Matrix Acceleration in Tensorflow Lite
J. Núñez-Yáñez
A. Otero
E. D. L. Torre
20
3
0
17 Apr 2023
End-to-end codesign of Hessian-aware quantized neural networks for FPGAs
  and ASICs
End-to-end codesign of Hessian-aware quantized neural networks for FPGAs and ASICs
Javier Campos
Zhen Dong
Javier Mauricio Duarte
A. Gholami
Michael W. Mahoney
Jovan Mitrevski
Nhan Tran
MQ
32
3
0
13 Apr 2023
A Hybrid Approach combining ANN-based and Conventional Demapping in
  Communication for Efficient FPGA-Implementation
A Hybrid Approach combining ANN-based and Conventional Demapping in Communication for Efficient FPGA-Implementation
Jonas Ney
Bilal Hammoud
Norbert Wehn
18
2
0
11 Apr 2023
HARFLOW3D: A Latency-Oriented 3D-CNN Accelerator Toolflow for HAR on
  FPGA Devices
HARFLOW3D: A Latency-Oriented 3D-CNN Accelerator Toolflow for HAR on FPGA Devices
Petros Toupas
Alexander Montgomerie-Corcoran
C. Bouganis
Dimitrios Tzovaras
25
8
0
30 Mar 2023
DeepAxe: A Framework for Exploration of Approximation and Reliability
  Trade-offs in DNN Accelerators
DeepAxe: A Framework for Exploration of Approximation and Reliability Trade-offs in DNN Accelerators
Mahdi Taheri
M. Riazati
Mohammad Hasan Ahmadilivani
M. Jenihhin
Masoud Daneshtalab
J. Raik
Mikael Sjödin
B. Lisper
52
20
0
14 Mar 2023
Fixed-point quantization aware training for on-device keyword-spotting
Fixed-point quantization aware training for on-device keyword-spotting
Sashank Macha
Om Oza
Alex Escott
Francesco Calivá
Robert M. Armitano
S. Cheekatmalla
S. Parthasarathi
Yuzong Liu
MQ
18
4
0
04 Mar 2023
Fixflow: A Framework to Evaluate Fixed-point Arithmetic in Light-Weight
  CNN Inference
Fixflow: A Framework to Evaluate Fixed-point Arithmetic in Light-Weight CNN Inference
Farhad Taheri
Siavash Bayat Sarmadi
H. Mosanaei-Boorani
Reza Taheri
MQ
23
1
0
19 Feb 2023
Moby: Empowering 2D Models for Efficient Point Cloud Analytics on the
  Edge
Moby: Empowering 2D Models for Efficient Point Cloud Analytics on the Edge
Jingzong Li
Yik Hong Cai
Libin Liu
Yushun Mao
Chun Jason Xue
Hongchang Xu
17
3
0
18 Feb 2023
12345
Next