Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.10299
Cited By
HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models
16 May 2024
R. Sukthanker
Arber Zela
B. Staffler
Aaron Klein
Lennart Purucker
Jorg K. H. Franke
Frank Hutter
ELM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models"
11 / 11 papers shown
Title
NEAR: A Training-Free Pre-Estimator of Machine Learning Model Performance
Raphael T. Husistein
Markus Reiher
Marco Eckhoff
142
1
0
20 Feb 2025
Structural Pruning of Pre-trained Language Models via Neural Architecture Search
Aaron Klein
Jacek Golebiowski
Xingchen Ma
Valerio Perrone
Cédric Archambeau
27
2
0
03 May 2024
Unsupervised Graph Neural Architecture Search with Disentangled Self-supervision
Zeyang Zhang
Xin Eric Wang
Ziwei Zhang
Guangyao Shen
Shiqi Shen
Wenwu Zhu
42
12
0
08 Mar 2024
Multi-objective Differentiable Neural Architecture Search
R. Sukthanker
Arber Zela
B. Staffler
Samuel Dooley
Josif Grabocka
Frank Hutter
40
1
0
28 Feb 2024
Shortened LLaMA: Depth Pruning for Large Language Models with Comparison of Retraining Methods
Bo-Kyeong Kim
Geonmin Kim
Tae-Ho Kim
Thibault Castells
Shinkook Choi
Junho Shin
Hyoung-Kyu Song
62
30
0
05 Feb 2024
Towards Efficient Post-training Quantization of Pre-trained Language Models
Haoli Bai
Lu Hou
Lifeng Shang
Xin Jiang
Irwin King
M. Lyu
MQ
73
47
0
30 Sep 2021
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Tailin Liang
C. Glossner
Lei Wang
Shaobo Shi
Xiaotong Zhang
MQ
130
673
0
24 Jan 2021
AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data
Nick Erickson
Jonas W. Mueller
Alexander Shirkov
Hang Zhang
Pedro Larroy
Mu Li
Alex Smola
LMTD
92
607
0
13 Mar 2020
NAS-Bench-1Shot1: Benchmarking and Dissecting One-shot Neural Architecture Search
Arber Zela
Julien N. Siems
Frank Hutter
88
147
0
28 Jan 2020
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
245
1,817
0
17 Sep 2019
Neural Architecture Search with Reinforcement Learning
Barret Zoph
Quoc V. Le
269
5,326
0
05 Nov 2016
1