Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2001.02772
Cited By
DeepRecSys: A System for Optimizing End-To-End At-scale Neural Recommendation Inference
8 January 2020
Udit Gupta
Samuel Hsia
V. Saraph
Xiaodong Wang
Brandon Reagen
Gu-Yeon Wei
Hsien-Hsin S. Lee
David Brooks
Carole-Jean Wu
GNN
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DeepRecSys: A System for Optimizing End-To-End At-scale Neural Recommendation Inference"
25 / 75 papers shown
Title
Differentiable NAS Framework and Application to Ads CTR Prediction
Ravi Krishna
Aravind Kalaiah
Bichen Wu
Maxim Naumov
Dheevatsa Mudigere
M. Smelyanskiy
Kurt Keutzer
25
8
0
25 Oct 2021
Understanding Data Storage and Ingestion for Large-Scale Deep Recommendation Model Training
Mark Zhao
Niket Agarwal
Aarti Basant
B. Gedik
Satadru Pan
...
Kevin Wilfong
Harsha Rastogi
Carole-Jean Wu
Christos Kozyrakis
Parikshit Pol
GNN
31
70
0
20 Aug 2021
Cocktail: Leveraging Ensemble Learning for Optimized Model Serving in Public Cloud
Jashwant Raj Gunasekaran
Cyan Subhra Mishra
P. Thinakaran
M. Kandemir
Chita R. Das
8
3
0
09 Jun 2021
Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale
Zhaoxia Deng
Deng
Jongsoo Park
P. T. P. Tang
Haixin Liu
...
S. Nadathur
Changkyu Kim
Maxim Naumov
S. Naghshineh
M. Smelyanskiy
26
11
0
26 May 2021
RecPipe: Co-designing Models and Hardware to Jointly Optimize Recommendation Quality and Performance
Udit Gupta
Samuel Hsia
J. Zhang
Mark Wilkening
Javin Pombra
Hsien-Hsin S. Lee
Gu-Yeon Wei
Carole-Jean Wu
David Brooks
41
32
0
18 May 2021
Alternate Model Growth and Pruning for Efficient Training of Recommendation Systems
Xiaocong Du
Bhargav Bhushanam
Jiecao Yu
Dhruv Choudhary
Tianxiang Gao
Sherman Wong
Louis Feng
Jongsoo Park
Yu Cao
A. Kejariwal
31
5
0
04 May 2021
MAGMA: An Optimization Framework for Mapping Multiple DNNs on Multiple Accelerator Cores
Sheng-Chun Kao
T. Krishna
22
49
0
28 Apr 2021
Faa
T
:
A
T
r
a
n
s
p
a
r
e
n
t
A
u
t
o
−
S
c
a
l
i
n
g
C
a
c
h
e
f
o
r
S
e
r
v
e
r
l
e
s
s
A
p
p
l
i
c
a
t
i
o
n
s
T: A Transparent Auto-Scaling Cache for Serverless Applications
T
:
A
T
r
an
s
p
a
re
n
t
A
u
t
o
−
S
c
a
l
in
g
C
a
c
h
e
f
or
S
er
v
er
l
ess
A
ppl
i
c
a
t
i
o
n
s
Francisco Romero
G. Chaudhry
Íñigo Goiri
Pragna Gopa
Paul Batum
N. Yadwadkar
Rodrigo Fonseca
Christos Kozyrakis
Ricardo Bianchini
60
111
0
28 Apr 2021
Demystifying BERT: Implications for Accelerator Design
Suchita Pati
Shaizeen Aga
Nuwan Jayasena
Matthew D. Sinclair
LLMAG
38
17
0
14 Apr 2021
ECRM: Efficient Fault Tolerance for Recommendation Model Training via Erasure Coding
Kaige Liu
J. Kosaian
K. V. Rashmi
27
4
0
05 Apr 2021
On Estimating Recommendation Evaluation Metrics under Sampling
R. Jin
Dong Li
Benjamin Mudrak
Jing Gao
Zhi Liu
13
14
0
02 Mar 2021
RecSSD: Near Data Processing for Solid State Drive Based Recommendation Inference
Mark Wilkening
Udit Gupta
Samuel Hsia
Caroline Trippel
Carole-Jean Wu
David Brooks
Gu-Yeon Wei
14
114
0
29 Jan 2021
TT-Rec: Tensor Train Compression for Deep Learning Recommendation Models
Chunxing Yin
Bilge Acun
Xing Liu
Carole-Jean Wu
47
102
0
25 Jan 2021
SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
Hanrui Wang
Zhekai Zhang
Song Han
43
377
0
17 Dec 2020
Understanding Training Efficiency of Deep Learning Recommendation Models at Scale
Bilge Acun
Matthew Murphy
Xiaodong Wang
Jade Nie
Carole-Jean Wu
K. Hazelwood
28
109
0
11 Nov 2020
CPR: Understanding and Improving Failure Tolerant Training for Deep Learning Recommendation with Partial Recovery
Kiwan Maeng
Shivam Bharuka
Isabel Gao
M. C. Jeffrey
V. Saraph
...
Caroline Trippel
Jiyan Yang
Michael G. Rabbat
Brandon Lucia
Carole-Jean Wu
OffRL
24
31
0
05 Nov 2020
Understanding Capacity-Driven Scale-Out Neural Recommendation Inference
Michael Lui
Yavuz Yetim
Özgür Özkan
Zhuoran Zhao
Shin-Yeh Tsai
Carole-Jean Wu
Mark Hempstead
GNN
BDL
LRM
22
51
0
04 Nov 2020
Tensor Casting: Co-Designing Algorithm-Architecture for Personalized Recommendation Training
Youngeun Kwon
Yunjae Lee
Minsoo Rhu
24
39
0
25 Oct 2020
MicroRec: Efficient Recommendation Inference by Hardware and Data Structure Solutions
Wenqi Jiang
Zhen He
Shuai Zhang
Thomas B. Preußer
Kai Zeng
...
Tongxuan Liu
Yong Li
Jingren Zhou
Ce Zhang
Gustavo Alonso
39
7
0
12 Oct 2020
Cross-Stack Workload Characterization of Deep Recommendation Systems
Samuel Hsia
Udit Gupta
Mark Wilkening
Carole-Jean Wu
Gu-Yeon Wei
David Brooks
BDL
GNN
HAI
25
32
0
10 Oct 2020
Accelerating Recommender Systems via Hardware "scale-in"
S. Krishna
Ravi Krishna
GNN
LRM
21
6
0
11 Sep 2020
Centaur: A Chiplet-based, Hybrid Sparse-Dense Accelerator for Personalized Recommendations
Ranggi Hwang
Taehun Kim
Youngeun Kwon
Minsoo Rhu
18
103
0
12 May 2020
Optimizing Deep Learning Recommender Systems' Training On CPU Cluster Architectures
Dhiraj D. Kalamkar
E. Georganas
Sudarshan Srinivasan
Jianping Chen
Mikhail Shiryaev
A. Heinecke
50
47
0
10 May 2020
INFaaS: A Model-less and Managed Inference Serving System
Francisco Romero
Qian Li
N. Yadwadkar
Christos Kozyrakis
28
14
0
30 May 2019
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,746
0
26 Sep 2016
Previous
1
2