2403.20137
Accurate Block Quantization in LLMs with Outliers
29 March 2024
Nikita Trukhanov
I. Soloveychik
MQ
Papers citing "Accurate Block Quantization in LLMs with Outliers"
5 / 5 papers shown
Title
Mixtral of Experts
Albert Q. Jiang
Alexandre Sablayrolles
Antoine Roux
A. Mensch
Blanche Savary
...
Théophile Gervet
Thibaut Lavril
Thomas Wang
Timothée Lacroix
William El Sayed
MoE
LLMAG
67
1,049
0
08 Jan 2024
FP8-LM: Training FP8 Large Language Models
Houwen Peng
Kan Wu
Yixuan Wei
Guoshuai Zhao
Yuxiang Yang
...
Zheng Zhang
Shuguang Liu
Joe Chau
Han Hu
Peng Cheng
MQ
64
42
0
27 Oct 2023
RoFormer: Enhanced Transformer with Rotary Position Embedding
Jianlin Su
Yu Lu
Shengfeng Pan
Ahmed Murtadha
Bo Wen
Yunfeng Liu
92
2,307
0
20 Apr 2021
Bottleneck Transformers for Visual Recognition
A. Srinivas
Nayeon Lee
Niki Parmar
Jonathon Shlens
Pieter Abbeel
Ashish Vaswani
SLR
313
986
0
27 Jan 2021
GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy Efficient Inference
Ali Hadi Zadeh
Isak Edo
Omar Mohamed Awad
Andreas Moshovos
MQ
44
186
0
08 May 2020