2111.14255
Automated Runtime-Aware Scheduling for Multi-Tenant DNN Inference on GPU
28 November 2021
Fuxun Yu, Shawn Bray, Di Wang, Longfei Shangguan, Xulong Tang, Chenchen Liu, Xiang Chen
Papers citing "Automated Runtime-Aware Scheduling for Multi-Tenant DNN Inference on GPU"
2 / 2 papers shown
Title
Equality Saturation for Tensor Graph Superoptimization
Yichen Yang, Mangpo Phitchaya Phothilimthana, Y. Wang, Max Willsey, Sudip Roy, Jacques Pienaar
38
80
0
05 Jan 2021
NetAdapt: Platform-Aware Neural Network Adaptation for Mobile Applications
Tien-Ju Yang, Andrew G. Howard, Bo Chen, Xiao Zhang, Alec Go, Mark Sandler, Vivienne Sze, Hartwig Adam
90
515
0
09 Apr 2018