ResearchTrend.AI
arXiv: 2111.14255
Automated Runtime-Aware Scheduling for Multi-Tenant DNN Inference on GPU

28 November 2021
Fuxun Yu
Shawn Bray
Di Wang
Longfei Shangguan
Xulong Tang
Chenchen Liu
Xiang Chen

Papers citing "Automated Runtime-Aware Scheduling for Multi-Tenant DNN Inference on GPU"

2 / 2 papers shown
Equality Saturation for Tensor Graph Superoptimization
Yichen Yang
Mangpo Phitchaya Phothilimtha
Y. Wang
Max Willsey
Sudip Roy
Jacques Pienaar
05 Jan 2021
NetAdapt: Platform-Aware Neural Network Adaptation for Mobile Applications
Tien-Ju Yang
Andrew G. Howard
Bo Chen
Xiao Zhang
Alec Go
Mark Sandler
Vivienne Sze
Hartwig Adam
09 Apr 2018