Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1605.04711
Cited By
Ternary Weight Networks
16 May 2016
Fengfu Li
Bin Liu
Xiaoxing Wang
Bo-Wen Zhang
Junchi Yan
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Ternary Weight Networks"
50 / 207 papers shown
Title
Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression Techniques
Sanjay Surendranath Girija
Shashank Kapoor
Lakshit Arora
Dipen Pradhan
Aman Raj
Ankit Shetgaonkar
54
0
0
05 May 2025
BackSlash: Rate Constrained Optimized Training of Large Language Models
Jun Wu
Jiangtao Wen
Yuxing Han
34
0
0
23 Apr 2025
Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning
Sanghwan Bae
Jiwoo Hong
Min Young Lee
Hanbyul Kim
Jeongyeon Nam
Donghyun Kwak
OffRL
LRM
53
3
0
04 Apr 2025
Cauchy-Schwarz Regularizers
Sueda Taner
Ziyi Wang
Christoph Studer
36
0
0
03 Mar 2025
Forget the Data and Fine-Tuning! Just Fold the Network to Compress
Dong Wang
Haris Šikić
Lothar Thiele
O. Saukh
59
0
0
17 Feb 2025
Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models?
Jacob Nielsen
Peter Schneider-Kamp
Lukas Galke
MQ
61
1
0
17 Feb 2025
BILLNET: A Binarized Conv3D-LSTM Network with Logic-gated residual architecture for hardware-efficient video inference
Van Thien Nguyen
William Guicquero
Gilles Sicard
3DV
MQ
74
2
0
24 Jan 2025
MOGNET: A Mux-residual quantized Network leveraging Online-Generated weights
Van Thien Nguyen
William Guicquero
Gilles Sicard
MQ
75
1
0
17 Jan 2025
Histogram-Equalized Quantization for logic-gated Residual Neural Networks
Van Thien Nguyen
William Guicquero
Gilles Sicard
MQ
41
1
0
10 Jan 2025
Behavior Backdoor for Deep Learning Models
J. T. Wang
Pengfei Zhang
R. Tao
Jian Yang
Hao Liu
X. Liu
Y. X. Wei
Yao Zhao
AAML
75
0
0
02 Dec 2024
Efficient Ternary Weight Embedding Model: Bridging Scalability and Performance
Jiayi Chen
Chen Wu
S. Zhang
Nan Li
L. Zhang
Qi Zhang
69
0
0
23 Nov 2024
Gradient-Free Neural Network Training on the Edge
Dotan Di Castro
O. Joglekar
Shir Kozlovsky
Vladimir Tchuiev
Michal Moshkovitz
MQ
14
0
0
13 Oct 2024
Constraint Guided Model Quantization of Neural Networks
Quinten Van Baelen
P. Karsmakers
MQ
26
0
0
30 Sep 2024
CycleBNN: Cyclic Precision Training in Binary Neural Networks
Federico Fontana
Romeo Lanzino
Anxhelo Diko
G. Foresti
Luigi Cinque
MQ
34
0
0
28 Sep 2024
Twin Network Augmentation: A Novel Training Strategy for Improved Spiking Neural Networks and Efficient Weight Quantization
Lucas Deckers
Benjamin Vandersmissen
Ing Jyh Tsang
W. V. Leekwijck
Steven Latré
MQ
25
2
0
24 Sep 2024
Self-Masking Networks for Unsupervised Adaptation
Alfonso Taboada Warmerdam
Mathilde Caron
Yuki M. Asano
43
1
0
11 Sep 2024
Pessimistic Iterative Planning for Robust POMDPs
Maris F. L. Galesloot
Marnix Suilen
T. D. Simão
Steven Carr
M. Spaan
Ufuk Topcu
Nils Jansen
36
2
0
16 Aug 2024
ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models
Chao Zeng
Songwei Liu
Yusheng Xie
Hong Liu
Xiaojian Wang
Miao Wei
Shu Yang
Fangmin Chen
Xing Mei
MQ
37
6
0
16 Aug 2024
Quality Scalable Quantization Methodology for Deep Learning on Edge
S. Khaliq
Rehan Hafiz
MQ
38
1
0
15 Jul 2024
ISQuant: apply squant to the real deployment
Dezan Zhao
MQ
19
0
0
05 Jul 2024
TernaryLLM: Ternarized Large Language Model
Tianqi Chen
Zhe Li
Weixiang Xu
Zeyu Zhu
Dong Li
Lu Tian
E. Barsoum
Peisong Wang
Jian Cheng
31
7
0
11 Jun 2024
Towards Lightweight Speaker Verification via Adaptive Neural Network Quantization
Bei Liu
Haoyu Wang
Yanmin Qian
MQ
31
1
0
08 Jun 2024
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model
Yang Sui
Yanyu Li
Anil Kag
Yerlan Idelbayev
Junli Cao
Ju Hu
Dhritiman Sagar
Bo Yuan
Sergey Tulyakov
Jian Ren
MQ
39
18
0
06 Jun 2024
MagR: Weight Magnitude Reduction for Enhancing Post-Training Quantization
Aozhong Zhang
Naigang Wang
Yanxia Deng
Xin Li
Zi Yang
Penghang Yin
MQ
37
4
0
02 Jun 2024
TerDiT: Ternary Diffusion Models with Transformers
Xudong Lu
Aojun Zhou
Ziyi Lin
Qi Liu
Yuhui Xu
Renrui Zhang
Yafei Wen
Shuai Ren
Peng Gao
Junchi Yan
MQ
45
2
0
23 May 2024
Two Heads are Better Than One: Neural Networks Quantization with 2D Hilbert Curve-based Output Representation
Mykhail M. Uss
Ruslan Yermolenko
Olena Kolodiazhna
Oleksii Shashko
Ivan Safonov
Volodymyr Savin
Yoonjae Yeo
Seowon Ji
Jaeyun Jeong
MQ
27
0
0
22 May 2024
From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural Networks
Xue Geng
Zhe Wang
Chunyun Chen
Qing Xu
Kaixin Xu
...
Zhenghua Chen
M. Aly
Jie Lin
Min-man Wu
Xiaoli Li
33
1
0
09 May 2024
Layer Ensemble Averaging for Improving Memristor-Based Artificial Neural Network Performance
Osama Yousuf
Brian D. Hoskins
Karthick Ramu
Mitchell Fream
W. A. Borders
...
M. Daniels
A. Dienstfrey
Jabez J. McClelland
Martin Lueker-Boden
Gina Adam
26
1
0
24 Apr 2024
AdaQAT: Adaptive Bit-Width Quantization-Aware Training
Cédric Gernigon
Silviu-Ioan Filip
Olivier Sentieys
Clément Coggiola
Mickael Bruno
23
2
0
22 Apr 2024
AutoDFP: Automatic Data-Free Pruning via Channel Similarity Reconstruction
Siqi Li
Jun Chen
Jingyang Xiang
Chengrui Zhu
Yong-Jin Liu
31
0
0
13 Mar 2024
KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions
Fangyuan Xu
Kyle Lo
Luca Soldaini
Bailey Kuehl
Eunsol Choi
David Wadden
32
6
0
06 Mar 2024
Better Schedules for Low Precision Training of Deep Neural Networks
Cameron R. Wolfe
Anastasios Kyrillidis
45
1
0
04 Mar 2024
Ef-QuantFace: Streamlined Face Recognition with Small Data and Low-Bit Precision
William Gazali
Jocelyn Michelle Kho
Joshua Santoso
Williem
CVBM
MQ
42
0
0
28 Feb 2024
One-Step Forward and Backtrack: Overcoming Zig-Zagging in Loss-Aware Quantization Training
Lianbo Ma
Yuee Zhou
Jianlun Ma
Guo-Ding Yu
Qing Li
MQ
17
1
0
30 Jan 2024
Memory-Efficient Fine-Tuning for Quantized Diffusion Model
Hyogon Ryu
Seohyun Lim
Hyunjung Shim
DiffM
MQ
27
5
0
09 Jan 2024
A foundation for exact binarized morphological neural networks
T. Aouad
Hugues Talbot
14
1
0
08 Jan 2024
EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS
Sharath Girish
Kamal Gupta
Abhinav Shrivastava
3DGS
26
79
0
07 Dec 2023
PLUM: Improving Inference Efficiency By Leveraging Repetition-Sparsity Trade-Off
Sachit Kuhar
Yash Jain
Alexey Tumanov
MQ
54
0
0
04 Dec 2023
Improving the Robustness of Quantized Deep Neural Networks to White-Box Attacks using Stochastic Quantization and Information-Theoretic Ensemble Training
Saurabh Farkya
Aswin Raghavan
Avi Ziskind
14
0
0
30 Nov 2023
Low-Precision Floating-Point for Efficient On-Board Deep Neural Network Processing
Cédric Gernigon
Silviu-Ioan Filip
Olivier Sentieys
Clément Coggiola
Mickael Bruno
MQ
19
7
0
18 Nov 2023
RepQ: Generalizing Quantization-Aware Training for Re-Parametrized Architectures
Anastasiia Prutianova
Alexey Zaytsev
Chung-Kuei Lee
Fengyu Sun
Ivan Koryakovskiy
MQ
13
0
0
09 Nov 2023
AutoFHE: Automated Adaption of CNNs for Efficient Evaluation over FHE
Wei Ao
Vishnu Naresh Boddeti
AAML
25
18
0
12 Oct 2023
Going Beyond Neural Network Feature Similarity: The Network Feature Complexity and Its Interpretation Using Category Theory
Yiting Chen
Zhanpeng Zhou
Junchi Yan
16
9
0
10 Oct 2023
FedAIoT: A Federated Learning Benchmark for Artificial Intelligence of Things
Samiul Alam
Tuo Zhang
Tiantian Feng
Hui Shen
Zhichao Cao
...
JeongGil Ko
Kiran Somasundaram
Shrikanth S. Narayanan
Salman Avestimehr
Mi Zhang
25
11
0
29 Sep 2023
Stochastic Configuration Machines for Industrial Artificial Intelligence
Dianhui Wang
Matthew J. Felicetti
AI4CE
6
9
0
25 Aug 2023
An Estimator for the Sensitivity to Perturbations of Deep Neural Networks
Naman Maheshwari
Nicholas Malaya
Scott A. Moe
J. Kulkarni
S. Gurumurthi
AAML
9
0
0
24 Jul 2023
Learning Discrete Weights and Activations Using the Local Reparameterization Trick
G. Berger
Aviv Navon
Ethan Fetaya
MQ
22
0
0
04 Jul 2023
Data-Free Quantization via Mixed-Precision Compensation without Fine-Tuning
Jun Chen
Shipeng Bai
Tianxin Huang
Mengmeng Wang
Guanzhong Tian
Y. Liu
MQ
34
18
0
02 Jul 2023
Designing strong baselines for ternary neural network quantization through support and mass equalization
Edouard Yvinec
Arnaud Dapogny
Kévin Bailly
MQ
25
0
0
30 Jun 2023
Binary and Ternary Natural Language Generation
Zechun Liu
Barlas Oğuz
Aasish Pappu
Yangyang Shi
Raghuraman Krishnamoorthi
MQ
33
6
0
02 Jun 2023
1
2
3
4
5
Next