ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2001.02772
  4. Cited By
DeepRecSys: A System for Optimizing End-To-End At-scale Neural
  Recommendation Inference

DeepRecSys: A System for Optimizing End-To-End At-scale Neural Recommendation Inference

8 January 2020
Udit Gupta
Samuel Hsia
V. Saraph
Xiaodong Wang
Brandon Reagen
Gu-Yeon Wei
Hsien-Hsin S. Lee
David Brooks
Carole-Jean Wu
    GNN
ArXivPDFHTML

Papers citing "DeepRecSys: A System for Optimizing End-To-End At-scale Neural Recommendation Inference"

25 / 75 papers shown
Title
Differentiable NAS Framework and Application to Ads CTR Prediction
Differentiable NAS Framework and Application to Ads CTR Prediction
Ravi Krishna
Aravind Kalaiah
Bichen Wu
Maxim Naumov
Dheevatsa Mudigere
M. Smelyanskiy
Kurt Keutzer
25
8
0
25 Oct 2021
Understanding Data Storage and Ingestion for Large-Scale Deep
  Recommendation Model Training
Understanding Data Storage and Ingestion for Large-Scale Deep Recommendation Model Training
Mark Zhao
Niket Agarwal
Aarti Basant
B. Gedik
Satadru Pan
...
Kevin Wilfong
Harsha Rastogi
Carole-Jean Wu
Christos Kozyrakis
Parikshit Pol
GNN
31
70
0
20 Aug 2021
Cocktail: Leveraging Ensemble Learning for Optimized Model Serving in
  Public Cloud
Cocktail: Leveraging Ensemble Learning for Optimized Model Serving in Public Cloud
Jashwant Raj Gunasekaran
Cyan Subhra Mishra
P. Thinakaran
M. Kandemir
Chita R. Das
8
3
0
09 Jun 2021
Low-Precision Hardware Architectures Meet Recommendation Model Inference
  at Scale
Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale
Zhaoxia Deng
Deng
Jongsoo Park
P. T. P. Tang
Haixin Liu
...
S. Nadathur
Changkyu Kim
Maxim Naumov
S. Naghshineh
M. Smelyanskiy
26
11
0
26 May 2021
RecPipe: Co-designing Models and Hardware to Jointly Optimize
  Recommendation Quality and Performance
RecPipe: Co-designing Models and Hardware to Jointly Optimize Recommendation Quality and Performance
Udit Gupta
Samuel Hsia
J. Zhang
Mark Wilkening
Javin Pombra
Hsien-Hsin S. Lee
Gu-Yeon Wei
Carole-Jean Wu
David Brooks
41
32
0
18 May 2021
Alternate Model Growth and Pruning for Efficient Training of
  Recommendation Systems
Alternate Model Growth and Pruning for Efficient Training of Recommendation Systems
Xiaocong Du
Bhargav Bhushanam
Jiecao Yu
Dhruv Choudhary
Tianxiang Gao
Sherman Wong
Louis Feng
Jongsoo Park
Yu Cao
A. Kejariwal
31
5
0
04 May 2021
MAGMA: An Optimization Framework for Mapping Multiple DNNs on Multiple
  Accelerator Cores
MAGMA: An Optimization Framework for Mapping Multiple DNNs on Multiple Accelerator Cores
Sheng-Chun Kao
T. Krishna
22
49
0
28 Apr 2021
Faa$T: A Transparent Auto-Scaling Cache for Serverless Applications
FaaT:ATransparentAuto−ScalingCacheforServerlessApplicationsT: A Transparent Auto-Scaling Cache for Serverless ApplicationsT:ATransparentAuto−ScalingCacheforServerlessApplications
Francisco Romero
G. Chaudhry
Íñigo Goiri
Pragna Gopa
Paul Batum
N. Yadwadkar
Rodrigo Fonseca
Christos Kozyrakis
Ricardo Bianchini
60
111
0
28 Apr 2021
Demystifying BERT: Implications for Accelerator Design
Demystifying BERT: Implications for Accelerator Design
Suchita Pati
Shaizeen Aga
Nuwan Jayasena
Matthew D. Sinclair
LLMAG
38
17
0
14 Apr 2021
ECRM: Efficient Fault Tolerance for Recommendation Model Training via
  Erasure Coding
ECRM: Efficient Fault Tolerance for Recommendation Model Training via Erasure Coding
Kaige Liu
J. Kosaian
K. V. Rashmi
27
4
0
05 Apr 2021
On Estimating Recommendation Evaluation Metrics under Sampling
On Estimating Recommendation Evaluation Metrics under Sampling
R. Jin
Dong Li
Benjamin Mudrak
Jing Gao
Zhi Liu
13
14
0
02 Mar 2021
RecSSD: Near Data Processing for Solid State Drive Based Recommendation
  Inference
RecSSD: Near Data Processing for Solid State Drive Based Recommendation Inference
Mark Wilkening
Udit Gupta
Samuel Hsia
Caroline Trippel
Carole-Jean Wu
David Brooks
Gu-Yeon Wei
14
114
0
29 Jan 2021
TT-Rec: Tensor Train Compression for Deep Learning Recommendation Models
TT-Rec: Tensor Train Compression for Deep Learning Recommendation Models
Chunxing Yin
Bilge Acun
Xing Liu
Carole-Jean Wu
47
102
0
25 Jan 2021
SpAtten: Efficient Sparse Attention Architecture with Cascade Token and
  Head Pruning
SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
Hanrui Wang
Zhekai Zhang
Song Han
43
377
0
17 Dec 2020
Understanding Training Efficiency of Deep Learning Recommendation Models
  at Scale
Understanding Training Efficiency of Deep Learning Recommendation Models at Scale
Bilge Acun
Matthew Murphy
Xiaodong Wang
Jade Nie
Carole-Jean Wu
K. Hazelwood
28
109
0
11 Nov 2020
CPR: Understanding and Improving Failure Tolerant Training for Deep
  Learning Recommendation with Partial Recovery
CPR: Understanding and Improving Failure Tolerant Training for Deep Learning Recommendation with Partial Recovery
Kiwan Maeng
Shivam Bharuka
Isabel Gao
M. C. Jeffrey
V. Saraph
...
Caroline Trippel
Jiyan Yang
Michael G. Rabbat
Brandon Lucia
Carole-Jean Wu
OffRL
24
31
0
05 Nov 2020
Understanding Capacity-Driven Scale-Out Neural Recommendation Inference
Understanding Capacity-Driven Scale-Out Neural Recommendation Inference
Michael Lui
Yavuz Yetim
Özgür Özkan
Zhuoran Zhao
Shin-Yeh Tsai
Carole-Jean Wu
Mark Hempstead
GNN
BDL
LRM
22
51
0
04 Nov 2020
Tensor Casting: Co-Designing Algorithm-Architecture for Personalized
  Recommendation Training
Tensor Casting: Co-Designing Algorithm-Architecture for Personalized Recommendation Training
Youngeun Kwon
Yunjae Lee
Minsoo Rhu
24
39
0
25 Oct 2020
MicroRec: Efficient Recommendation Inference by Hardware and Data
  Structure Solutions
MicroRec: Efficient Recommendation Inference by Hardware and Data Structure Solutions
Wenqi Jiang
Zhen He
Shuai Zhang
Thomas B. Preußer
Kai Zeng
...
Tongxuan Liu
Yong Li
Jingren Zhou
Ce Zhang
Gustavo Alonso
39
7
0
12 Oct 2020
Cross-Stack Workload Characterization of Deep Recommendation Systems
Cross-Stack Workload Characterization of Deep Recommendation Systems
Samuel Hsia
Udit Gupta
Mark Wilkening
Carole-Jean Wu
Gu-Yeon Wei
David Brooks
BDL
GNN
HAI
25
32
0
10 Oct 2020
Accelerating Recommender Systems via Hardware "scale-in"
Accelerating Recommender Systems via Hardware "scale-in"
S. Krishna
Ravi Krishna
GNN
LRM
21
6
0
11 Sep 2020
Centaur: A Chiplet-based, Hybrid Sparse-Dense Accelerator for
  Personalized Recommendations
Centaur: A Chiplet-based, Hybrid Sparse-Dense Accelerator for Personalized Recommendations
Ranggi Hwang
Taehun Kim
Youngeun Kwon
Minsoo Rhu
18
103
0
12 May 2020
Optimizing Deep Learning Recommender Systems' Training On CPU Cluster
  Architectures
Optimizing Deep Learning Recommender Systems' Training On CPU Cluster Architectures
Dhiraj D. Kalamkar
E. Georganas
Sudarshan Srinivasan
Jianping Chen
Mikhail Shiryaev
A. Heinecke
50
47
0
10 May 2020
INFaaS: A Model-less and Managed Inference Serving System
INFaaS: A Model-less and Managed Inference Serving System
Francisco Romero
Qian Li
N. Yadwadkar
Christos Kozyrakis
28
14
0
30 May 2019
Google's Neural Machine Translation System: Bridging the Gap between
  Human and Machine Translation
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,746
0
26 Sep 2016
Previous
12