Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.00222
Cited By
Ternary Neural Networks for Resource-Efficient AI Applications
1 September 2016
Hande Alemdar
V. Leroy
Adrien Prost-Boucle
F. Pétrot
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Ternary Neural Networks for Resource-Efficient AI Applications"
19 / 19 papers shown
Title
Quality Scalable Quantization Methodology for Deep Learning on Edge
S. Khaliq
Rehan Hafiz
MQ
46
1
0
15 Jul 2024
TerDiT: Ternary Diffusion Models with Transformers
Xudong Lu
Aojun Zhou
Ziyi Lin
Qi Liu
Yuhui Xu
Renrui Zhang
Yafei Wen
Shuai Ren
Peng Gao
Junchi Yan
MQ
55
2
0
23 May 2024
Free Bits: Latency Optimization of Mixed-Precision Quantized Neural Networks on the Edge
Georg Rutishauser
Francesco Conti
Luca Benini
MQ
31
5
0
06 Jul 2023
AutoQNN: An End-to-End Framework for Automatically Quantizing Neural Networks
Cheng Gong
Ye Lu
Surong Dai
Deng Qian
Chenkun Du
Tao Li
MQ
29
0
0
07 Apr 2023
Expressive power of binary and ternary neural networks
A. Beknazaryan
MQ
24
0
0
27 Jun 2022
Why Quantization Improves Generalization: NTK of Binary Weight Neural Networks
Kaiqi Zhang
Ming Yin
Yu-Xiang Wang
MQ
24
4
0
13 Jun 2022
FlexBlock: A Flexible DNN Training Accelerator with Multi-Mode Block Floating Point Support
Seock-Hwan Noh
Jahyun Koo
Seunghyun Lee
Jongse Park
Jaeha Kung
AI4CE
32
17
0
13 Mar 2022
Signing the Supermask: Keep, Hide, Invert
Nils Koster
O. Grothe
Achim Rettinger
31
10
0
31 Jan 2022
3U-EdgeAI: Ultra-Low Memory Training, Ultra-Low BitwidthQuantization, and Ultra-Low Latency Acceleration
Yao Chen
Cole Hawkins
Kaiqi Zhang
Zheng-Wei Zhang
Cong Hao
26
8
0
11 May 2021
Ternary Hashing
Chang Liu
Lixin Fan
Kam Woh Ng
Yilun Jin
Ce Ju
Tianyu Zhang
Chee Seng Chan
Qiang Yang
18
3
0
16 Mar 2021
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Tailin Liang
C. Glossner
Lei Wang
Shaobo Shi
Xiaotong Zhang
MQ
150
675
0
24 Jan 2021
Fast Implementation of 4-bit Convolutional Neural Networks for Mobile Devices
A. Trusov
E. Limonova
Dmitry Slugin
D. Nikolaev
V. Arlazarov
MQ
25
17
0
14 Sep 2020
Density Encoding Enables Resource-Efficient Randomly Connected Neural Networks
Denis Kleyko
Mansour Kheffache
E. P. Frady
U. Wiklund
Evgeny Osipov
24
45
0
19 Sep 2019
TiM-DNN: Ternary in-Memory accelerator for Deep Neural Networks
Shubham Jain
S. Gupta
A. Raghunathan
MQ
30
37
0
15 Sep 2019
Combinatorial Attacks on Binarized Neural Networks
Elias Boutros Khalil
Amrita Gupta
B. Dilkina
AAML
49
40
0
08 Oct 2018
Inference of Quantized Neural Networks on Heterogeneous All-Programmable Devices
Thomas B. Preußer
Giulio Gambardella
Nicholas J. Fraser
Michaela Blott
MQ
32
41
0
21 Jun 2018
Toolflows for Mapping Convolutional Neural Networks on FPGAs: A Survey and Future Directions
Stylianos I. Venieris
Alexandros Kouris
C. Bouganis
19
184
0
15 Mar 2018
FINN: A Framework for Fast, Scalable Binarized Neural Network Inference
Yaman Umuroglu
Nicholas J. Fraser
Giulio Gambardella
Michaela Blott
Philip H. W. Leong
Magnus Jahre
K. Vissers
MQ
53
977
0
01 Dec 2016
Bit-pragmatic Deep Neural Network Computing
Jorge Albericio
Patrick Judd
A. Delmas
Sayeh Sharify
Andreas Moshovos
MQ
32
238
0
20 Oct 2016
1