ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.06115
  4. Cited By
PRETZEL: Opening the Black Box of Machine Learning Prediction Serving
  Systems

PRETZEL: Opening the Black Box of Machine Learning Prediction Serving Systems

14 October 2018
Yunseong Lee
Alberto Scolari
Byung-Gon Chun
M. Santambrogio
Markus Weimer
Matteo Interlandi
    VLM
ArXivPDFHTML

Papers citing "PRETZEL: Opening the Black Box of Machine Learning Prediction Serving Systems"

11 / 11 papers shown
Title
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning
Xupeng Miao
Gabriele Oliaro
Xinhao Cheng
Vineeth Kada
Ruohan Gao
...
April Yang
Yingcheng Wang
Mengdi Wu
Colin Unger
Zhihao Jia
MoE
114
10
0
29 Feb 2024
Rafiki: Machine Learning as an Analytics Service System
Rafiki: Machine Learning as an Analytics Service System
Wei Wang
Sheng Wang
Jinyang Gao
Meihui Zhang
Gang Chen
Teck Khim Ng
Beng Chin Ooi
67
112
0
17 Apr 2018
Tensor Comprehensions: Framework-Agnostic High-Performance Machine
  Learning Abstractions
Tensor Comprehensions: Framework-Agnostic High-Performance Machine Learning Abstractions
Nicolas Vasilache
O. Zinenko
Theodoros Theodoridis
Priya Goyal
Zach DeVito
William S. Moses
Sven Verdoolaege
Andrew Adams
Albert Cohen
62
432
0
13 Feb 2018
TensorFlow-Serving: Flexible, High-Performance ML Serving
TensorFlow-Serving: Flexible, High-Performance ML Serving
Christopher Olston
Noah Fiedel
Kiril Gorovoy
Jeremiah Harmsen
Li Lao
Fangwei Li
Vinu Rajashekhar
Sukriti Ramesh
Jordan Soyke
48
305
0
17 Dec 2017
DyNet: The Dynamic Neural Network Toolkit
DyNet: The Dynamic Neural Network Toolkit
Graham Neubig
Chris Dyer
Yoav Goldberg
Austin Matthews
Bridger Waleed Ammar
...
Yusuke Oda
Matthew Richardson
Naomi Saphra
Swabha Swayamdipta
Pengcheng Yin
71
386
0
15 Jan 2017
Clipper: A Low-Latency Online Prediction Serving System
Clipper: A Low-Latency Online Prediction Serving System
D. Crankshaw
Xin Wang
Giulio Zhou
Michael Franklin
Joseph E. Gonzalez
Ion Stoica
50
673
0
09 Dec 2016
An overview of gradient descent optimization algorithms
An overview of gradient descent optimization algorithms
Sebastian Ruder
ODL
177
6,170
0
15 Sep 2016
Ups and Downs: Modeling the Visual Evolution of Fashion Trends with
  One-Class Collaborative Filtering
Ups and Downs: Modeling the Visual Evolution of Fashion Trends with One-Class Collaborative Filtering
Ruining He
Julian McAuley
95
2,048
0
04 Feb 2016
MXNet: A Flexible and Efficient Machine Learning Library for
  Heterogeneous Distributed Systems
MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems
Tianqi Chen
Mu Li
Yutian Li
Min Lin
Naiyan Wang
Minjie Wang
Tianjun Xiao
Bing Xu
Chiyuan Zhang
Zheng Zhang
120
2,243
0
03 Dec 2015
Stochastic Dual Coordinate Ascent Methods for Regularized Loss
  Minimization
Stochastic Dual Coordinate Ascent Methods for Regularized Loss Minimization
Shai Shalev-Shwartz
Tong Zhang
112
1,031
0
10 Sep 2012
HOGWILD!: A Lock-Free Approach to Parallelizing Stochastic Gradient
  Descent
HOGWILD!: A Lock-Free Approach to Parallelizing Stochastic Gradient Descent
Feng Niu
Benjamin Recht
Christopher Ré
Stephen J. Wright
137
2,272
0
28 Jun 2011
1