Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2411.06360
Cited By
An Efficient Matrix Multiplication Algorithm for Accelerating Inference in Binary and Ternary Neural Networks
10 November 2024
Mohsen Dehghankar
Mahdi Erfanian
Abolfazl Asudeh
Re-assign community
ArXiv
PDF
HTML
Papers citing
"An Efficient Matrix Multiplication Algorithm for Accelerating Inference in Binary and Ternary Neural Networks"
2 / 2 papers shown
Title
Fast Matrix Multiplications for Lookup Table-Quantized LLMs
Han Guo
William Brandon
Radostin Cholakov
Jonathan Ragan-Kelley
Eric P. Xing
Yoon Kim
MQ
119
15
0
20 Jan 2025
Logits of API-Protected LLMs Leak Proprietary Information
Matthew Finlayson
Xiang Ren
Swabha Swayamdipta
PILM
60
23
0
14 Mar 2024
1