arXiv:2106.11890

Latency-Aware Neural Architecture Search with Multi-Objective Bayesian Optimization

22 June 2021
David Eriksson
P. Chuang
Sam Daulton
Peng Xia
Akshat Shrivastava
Arun Babu
Shicong Zhao
Ahmed Aly
Ganesh Venkatesh
Maximilian Balandat
Abstract

When tuning the architecture and hyperparameters of large machine learning models for on-device deployment, it is desirable to understand the optimal trade-offs between on-device latency and model accuracy. In this work, we leverage recent methodological advances in Bayesian optimization over high-dimensional search spaces and multi-objective Bayesian optimization to efficiently explore these trade-offs for a production-scale on-device natural language understanding model at Facebook.
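The "optimal trade-offs" mentioned in the abstract are usually described as a Pareto front: the set of configurations for which no other configuration is both at least as accurate and at least as fast. The sketch below illustrates only that notion of dominance. It is not the authors' optimization method, and the `Candidate` type, the function names, and the (accuracy, latency) numbers are hypothetical values chosen for illustration, not results from the paper.

```python
from typing import List, Tuple

# A candidate is (accuracy, latency_ms). Hypothetical illustration type.
Candidate = Tuple[float, float]


def dominates(a: Candidate, b: Candidate) -> bool:
    """Return True if a is no worse than b on both objectives and
    strictly better on at least one (maximize accuracy, minimize latency)."""
    acc_a, lat_a = a
    acc_b, lat_b = b
    no_worse = acc_a >= acc_b and lat_a <= lat_b
    strictly_better = acc_a > acc_b or lat_a < lat_b
    return no_worse and strictly_better


def pareto_front(candidates: List[Candidate]) -> List[Candidate]:
    """Keep the candidates that no other candidate dominates,
    sorted by increasing latency."""
    front = [
        c for c in candidates
        if not any(dominates(other, c) for other in candidates)
    ]
    return sorted(front, key=lambda c: c[1])


# Made-up measurements, not data from the paper.
candidates = [
    (0.90, 12.0),
    (0.92, 20.0),
    (0.88, 15.0),  # dominated by (0.90, 12.0)
    (0.92, 25.0),  # dominated by (0.92, 20.0)
    (0.85, 8.0),
]

print(pareto_front(candidates))
# [(0.85, 8.0), (0.9, 12.0), (0.92, 20.0)]
```

In practice each candidate is expensive to evaluate, since it requires training a model and measuring latency on a device. That cost is why the abstract turns to Bayesian optimization rather than exhaustively enumerating candidates as this sketch does.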
