2402.04359
Cited By
Adaptive Inference: Theoretical Limits and Unexplored Opportunities
6 February 2024
S. Hor
Ying Qian
Mert Pilanci
Amin Arbabian
Papers citing
"Adaptive Inference: Theoretical Limits and Unexplored Opportunities"
10 / 10 papers shown
Title
Adaptive Gating in Mixture-of-Experts based Language Models
Jiamin Li
Qiang Su
Yitao Yang
Yimin Jiang
Cong Wang
Hong-Yu Xu
MoE
69
6
0
11 Oct 2023
From Words to Watts: Benchmarking the Energy Costs of Large Language Model Inference
S. Samsi
Dan Zhao
Joseph McDonald
Baolin Li
Adam Michaleas
Michael Jones
William Bergeron
J. Kepner
Devesh Tiwari
V. Gadepally
65
150
0
04 Oct 2023
Finding the SWEET Spot: Analysis and Improvement of Adaptive Inference in Low Resource Settings
Daniel Rotem
Michael Hassid
Jonathan Mamou
Roy Schwartz
52
5
0
04 Jun 2023
A Survey on Model Compression and Acceleration for Pretrained Language Models
Canwen Xu
Julian McAuley
84
60
0
15 Feb 2022
Adaptive Inference through Early-Exit Networks: Design, Challenges and Directions
Stefanos Laskaridis
Alexandros Kouris
Nicholas D. Lane
TPM
105
118
0
09 Jun 2021
Dynamic Neural Networks: A Survey
Yizeng Han
Gao Huang
Shiji Song
Le Yang
Honghui Wang
Yulin Wang
3DH
AI4TS
AI4CE
115
652
0
09 Feb 2021
Carbontracker: Tracking and Predicting the Carbon Footprint of Training Deep Learning Models
Lasse F. Wolff Anthony
Benjamin Kanding
Raghavendra Selvan
HAI
62
313
0
06 Jul 2020
Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing
En Li
Liekang Zeng
Zhi Zhou
Xu Chen
54
630
0
04 Oct 2019
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
Mingxing Tan
Quoc V. Le
3DV
MedIm
153
18,179
0
28 May 2019
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
1.7K
39,595
0
01 Sep 2014