Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2003.03423
Cited By
Serverless in the Wild: Characterizing and Optimizing the Serverless Workload at a Large Cloud Provider
6 March 2020
Mohammad Shahrad
Rodrigo Fonseca
Íñigo Goiri
G. Chaudhry
Paul Batum
Jason Cooke
Eduardo Laureano
Colby Tresness
M. Russinovich
Ricardo Bianchini
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Serverless in the Wild: Characterizing and Optimizing the Serverless Workload at a Large Cloud Provider"
41 / 41 papers shown
Title
Confidential Serverless Computing
Patrick Sabanic
Masanori Misono
Teofil Bodea
Julian Pritzi
Michael Hackl
Dimitrios Stavrakakis
Pramod Bhatotia
58
0
0
30 Apr 2025
Improving the Serving Performance of Multi-LoRA Large Language Models via Efficient LoRA and KV Cache Management
Hang Zhang
Jiuchen Shi
Yixiao Wang
Quan Chen
Yizhou Shan
Minyi Guo
31
0
0
19 Apr 2025
SkyServe: Serving AI Models across Regions and Clouds with Spot Instances
Ziming Mao
Tian Xia
Zhanghao Wu
Wei-Lin Chiang
Tyler Griggs
Romil Bhardwaj
Zongheng Yang
S. Shenker
Ion Stoica
56
2
0
03 Nov 2024
Is the GPU Half-Empty or Half-Full? Practical Scheduling Techniques for LLMs
Ferdi Kossmann
Bruce Fontaine
Daya Khudia
Michael Cafarella
Samuel Madden
98
2
0
23 Oct 2024
Loki: A System for Serving ML Inference Pipelines with Hardware and Accuracy Scaling
Sohaib Ahmad
Hui Guan
Ramesh K. Sitaraman
42
4
0
04 Jul 2024
LibProf: A Python Profiler for Improving Cold Start Performance in Serverless Applications
Syed Salauddin Mohammad Tariq
Ali Al Zein
Soumya Sripad Vaidya
Arati D. Khanolkar
Probir Roy
OnRL
32
0
0
17 Jun 2024
Building Socially-Equitable Public Models
Yejia Liu
Jianyi Yang
Pengfei Li
Tongxin Li
Shaolei Ren
OffRL
44
0
0
04 Jun 2024
Towards Cloud Efficiency with Large-scale Workload Characterization
Anjaly Parayil
Jue Zhang
Xiaoting Qin
Íñigo Goiri
Lexiang Huang
Timothy Zhu
Chetan Bansal
26
3
0
12 May 2024
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning
Xupeng Miao
Gabriele Oliaro
Xinhao Cheng
Vineeth Kada
Ruohan Gao
...
April Yang
Yingcheng Wang
Mengdi Wu
Colin Unger
Zhihao Jia
MoE
94
9
0
29 Feb 2024
Software Resource Disaggregation for HPC with Serverless Computing
Marcin Copik
Marcin Chrapek
Larissa Schmid
A. Calotoiu
Torsten Hoefler
36
9
0
19 Jan 2024
Shabari: Delayed Decision-Making for Faster and Efficient Serverless Functions
Prasoon Sinha
Kostis Kaffes
N. Yadwadkar
31
1
0
16 Jan 2024
Application-Centric Benchmarking of Distributed FaaS Platforms using BeFaaS
M. Grambow
Tobias Pfandzelter
David Bermbach
35
2
0
16 Nov 2023
A Deep Reinforcement Learning based Algorithm for Time and Cost Optimized Scaling of Serverless Applications
Anupama Mampage
S. Karunasekera
Rajkumar Buyya
33
3
0
22 Aug 2023
Managing Cold-start in The Serverless Cloud with Temporal Convolutional Networks
Tam n. Nguyen
34
6
0
01 Apr 2023
GPU-enabled Function-as-a-Service for Machine Learning Inference
Ming Zhao
Kritshekhar Jha
Sungho Hong
27
7
0
09 Mar 2023
Hydra: Virtualized Multi-Language Runtime for High-Density Serverless Platforms
Serhii Ivanenko
Jovan Stevanovic
V. Jovanovic
Rodrigo Bruno
VLM
27
5
0
20 Dec 2022
Kernel-as-a-Service: A Serverless Interface to GPUs
Nathan Pemberton
Anton Zabreyko
Zhoujie Ding
R. Katz
Joseph E. Gonzalez
24
8
0
15 Dec 2022
Learning-Assisted Algorithm Unrolling for Online Optimization with Budget Constraints
Jianyi Yang
Shaolei Ren
20
2
0
03 Dec 2022
funcX: Federated Function as a Service for Science
Zhuozhao Li
Ryan Chard
Y. Babuji
B. Galewsky
Tyler J. Skluzacek
...
Ben Blaiszik
Josh Bryan
Daniel S. Katz
Ian Foster
Kyle Chard
GNN
LRM
24
5
0
23 Sep 2022
Learnings from an Under the Hood Analysis of an Object Storage Node IO Stack
Pratik Mishra
Rekha Pitchumani
Yang-Suk Kee
23
1
0
05 Jul 2022
Zenix: Efficient Execution of Bulky Serverless Applications
Zhiyuan Guo
Zachary Blanco
Junda Chen
Jinmou Li
Zeru Wei
Bili Dong
Ishaan Pota
Mohammad Shahrad
Harry Xu
Yiying Zhang
17
2
0
27 Jun 2022
The Metaverse Data Deluge: What Can We Do About It?
Beng Chin Ooi
Gang Chen
Mike Zheng Shou
K. Tan
A. Tung
X. Xiao
J. Yip
Meihui Zhang
31
10
0
14 Jun 2022
Let's Trace It: Fine-Grained Serverless Benchmarking using Synchronous and Asynchronous Orchestrated Applications
Joel Scheuner
Simon Eismann
Sacheendra Talluri
Erwin Van Eyk
Cristina L. Abad
Philipp Leitner
Alexandru Iosup
26
14
0
16 May 2022
Virtual Disk Snapshot Management at Scale
Kevin Nguetchouang
Théophile Dubuc
Stella Bitchebe
A. Tchana
Pierre Olivier
17
0
0
13 May 2022
Fusionize: Improving Serverless Application Performance through Feedback-Driven Function Fusion
Trever Schirmer
Joel Scheuner
Tobias Pfandzelter
David Bermbach
26
14
0
25 Apr 2022
No Provisioned Concurrency: Fast RDMA-codesigned Remote Fork for Serverless Computing
Rong Chen
Fangming Lu
Tianxia Wang
Jinyu Gu
Yuh-Wen Yang
Haibo Chen
Haibo Chen
LRM
33
44
0
19 Mar 2022
Performance Modeling of Metric-Based Serverless Computing Platforms
Nima Mahmoudi
Hamzeh Khazaei
36
22
0
23 Feb 2022
Treehouse: A Case For Carbon-Aware Datacenter Software
Thomas Anderson
Adam Belay
Mosharaf Chowdhury
Asaf Cidon
Irene Zhang
32
62
0
06 Jan 2022
SMSE: A Serverless Platform for Multimedia Cloud Systems
Chavit Denninnart
M. Salehi
35
9
0
06 Jan 2022
The Serverless Computing Survey: A Technical Primer for Design Architecture
Zijun Li
Linsong Guo
Jiagan Cheng
Quan Chen
Bingsheng He
M. Guo
31
126
0
24 Dec 2021
Practical Scheduling for Real-World Serverless Computing
Kostis Kaffes
N. Yadwadkar
Christos Kozyrakis
15
13
0
14 Nov 2021
Let's Wait Awhile: How Temporal Workload Shifting Can Reduce Carbon Emissions in the Cloud
Philipp Wiesner
Ilja Behnke
Dominik Scheinert
Kordian Gontarska
L. Thamsen
17
80
0
25 Oct 2021
Accelerating Serverless Computing by Harvesting Idle Resources
Hanfei Yu
Hao Wang
Jian Li
Xuemei Yuan
Seung-Jong Park
17
30
0
28 Aug 2021
A Case Study on the Stability of Performance Tests for Serverless Applications
Simon Eismann
D. Costa
Lizhi Liao
C. Bezemer
Weiyi Shang
A. Hoorn
Samuel Kounev
13
24
0
28 Jul 2021
FaaSNet: Scalable and Fast Provisioning of Custom Serverless Container Runtimes at Alibaba Cloud Function Compute
Ao Wang
Shuai Chang
Huangshi Tian
Hongqi Wang
Haoran Yang
Huiba Li
Rui Du
Yue Cheng
38
104
0
24 May 2021
Data-driven scheduling in serverless computing to reduce response time
Bartłomiej Przybylski
P. Żuk
Krzysztof Rzadca
13
12
0
07 May 2021
LaSS: Running Latency Sensitive Serverless Computations at the Edge
Bin Wang
Ahmed Ali-Eldin
Prashant J. Shenoy
21
60
0
29 Apr 2021
The Hidden cost of the Edge: A Performance Comparison of Edge and Cloud Latencies
Ahmed Ali-Eldin
Bin Wang
Prashant J. Shenoy
22
26
0
29 Apr 2021
Faa
T
:
A
T
r
a
n
s
p
a
r
e
n
t
A
u
t
o
−
S
c
a
l
i
n
g
C
a
c
h
e
f
o
r
S
e
r
v
e
r
l
e
s
s
A
p
p
l
i
c
a
t
i
o
n
s
T: A Transparent Auto-Scaling Cache for Serverless Applications
T
:
A
T
r
an
s
p
a
re
n
t
A
u
t
o
−
S
c
a
l
in
g
C
a
c
h
e
f
or
S
er
v
er
l
ess
A
ppl
i
c
a
t
i
o
n
s
Francisco Romero
G. Chaudhry
Íñigo Goiri
Pragna Gopa
Paul Batum
N. Yadwadkar
Rodrigo Fonseca
Christos Kozyrakis
Ricardo Bianchini
58
111
0
28 Apr 2021
Sizeless: Predicting the optimal size of serverless functions
Simon Eismann
Long Bui
Johannes Grohmann
Cristina L. Abad
N. Herbst
Samuel Kounev
11
83
0
28 Oct 2020
Benchmarking Parallelism in FaaS Platforms
Daniel Barcelona-Pons
Pedro García-López
21
33
0
28 Oct 2020
1