Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1912.12953
Cited By
RecNMP: Accelerating Personalized Recommendation with Near-Memory Processing
30 December 2019
Liu Ke
Udit Gupta
Carole-Jean Wu
B. Cho
Mark Hempstead
Brandon Reagen
Xuan Zhang
David Brooks
Vikas Chandra
Utku Diril
A. Firoozshahian
K. Hazelwood
Bill Jia
Hsien-Hsin S. Lee
Meng Li
Bertrand A. Maher
Dheevatsa Mudigere
Maxim Naumov
Martin D. Schatz
M. Smelyanskiy
Xiaodong Wang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"RecNMP: Accelerating Personalized Recommendation with Near-Memory Processing"
25 / 25 papers shown
Title
ElasticRec: A Microservice-based Model Serving Architecture Enabling Elastic Resource Scaling for Recommendation Models
Yujeong Choi
Jiin Kim
Minsoo Rhu
39
1
0
11 Jun 2024
PID-Comm: A Fast and Flexible Collective Communication Framework for Commodity Processing-in-DIMM Devices
Si Ung Noh
Junguk Hong
Chaemin Lim
Seong-Yeol Park
Jeehyun Kim
Hanjun Kim
Youngsok Kim
Jinho Lee
34
6
0
13 Apr 2024
SimplePIM: A Software Framework for Productive and Efficient Processing-in-Memory
Jinfan Chen
Juan Gómez Luna
I. E. Hajj
Yu-Yin Guo
Onur Mutlu
29
18
0
03 Oct 2023
Mem-Rec: Memory Efficient Recommendation System using Alternative Representation
Gopu Krishna Jha
Anthony Thomas
Nilesh Jain
Sameh Gobriel
Tajana Rosing
Ravi Iyer
47
2
0
12 May 2023
On Memory Codelets: Prefetching, Recoding, Moving and Streaming Data
D. Fox
J. M. Diaz
Xiaoming Li
6
2
0
31 Jan 2023
Failure Tolerant Training with Persistent Memory Disaggregation over CXL
Miryeong Kwon
Junhyeok Jang
Hanjin Choi
Sangwon Lee
Myoungsoo Jung
24
8
0
14 Jan 2023
An Experimental Evaluation of Machine Learning Training on a Real Processing-in-Memory System
Juan Gómez Luna
Yu-Yin Guo
Sylvan Brocard
Julien Legriel
Remy Cimadomo
Geraldo F. Oliveira
Gagandeep Singh
O. Mutlu
VLM
33
14
0
16 Jul 2022
Heterogeneous Data-Centric Architectures for Modern Data-Intensive Applications: Case Studies in Machine Learning and Databases
Geraldo F. Oliveira
Amirali Boroumand
Saugata Ghose
Juan Gómez Luna
O. Mutlu
28
7
0
29 May 2022
SmartSAGE: Training Large-scale Graph Neural Networks using In-Storage Processing Architectures
Yunjae Lee
Jin-Won Chung
Minsoo Rhu
GNN
29
48
0
10 May 2022
Training Personalized Recommendation Systems from (GPU) Scratch: Look Forward not Backwards
Youngeun Kwon
Minsoo Rhu
21
27
0
10 May 2022
Hercules: Heterogeneity-Aware Inference Serving for At-Scale Personalized Recommendation
Liu Ke
Udit Gupta
Mark Hempstead
Carole-Jean Wu
Hsien-Hsin S. Lee
Xuan Zhang
24
21
0
14 Mar 2022
BagPipe: Accelerating Deep Recommendation Model Training
Saurabh Agarwal
Chengpo Yan
Ziyi Zhang
Shivaram Venkataraman
29
17
0
24 Feb 2022
Google Neural Network Models for Edge Devices: Analyzing and Mitigating Machine Learning Inference Bottlenecks
Amirali Boroumand
Saugata Ghose
Berkin Akin
Ravi Narayanaswami
Geraldo F. Oliveira
Xiaoyu Ma
Eric Shiu
O. Mutlu
20
81
0
29 Sep 2021
Neuro-Symbolic AI: An Emerging Class of AI Workloads and their Characterization
Zachary Susskind
Bryce Arden
L. John
Patrick A Stockton
E. John
NAI
30
40
0
13 Sep 2021
Accelerating Weather Prediction using Near-Memory Reconfigurable Fabric
Gagandeep Singh
D. Diamantopoulos
Juan Gómez Luna
C. Hagleitner
S. Stuijk
Henk Corporaal
O. Mutlu
43
25
0
19 Jul 2021
DAMOV: A New Methodology and Benchmark Suite for Evaluating Data Movement Bottlenecks
Geraldo F. Oliveira
Juan Gómez Luna
Lois Orosa
Saugata Ghose
Nandita Vijaykumar
Ivan Fernandez
Mohammad Sadrosadati
O. Mutlu
36
82
0
08 May 2021
Mitigating Edge Machine Learning Inference Bottlenecks: An Empirical Study on Accelerating Google Edge Models
Amirali Boroumand
Saugata Ghose
Berkin Akin
Ravi Narayanaswami
Geraldo F. Oliveira
Xiaoyu Ma
Eric Shiu
O. Mutlu
16
28
0
01 Mar 2021
SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
Hanrui Wang
Zhekai Zhang
Song Han
43
374
0
17 Dec 2020
Understanding Training Efficiency of Deep Learning Recommendation Models at Scale
Bilge Acun
Matthew Murphy
Xiaodong Wang
Jade Nie
Carole-Jean Wu
K. Hazelwood
23
109
0
11 Nov 2020
CPR: Understanding and Improving Failure Tolerant Training for Deep Learning Recommendation with Partial Recovery
Kiwan Maeng
Shivam Bharuka
Isabel Gao
M. C. Jeffrey
V. Saraph
...
Caroline Trippel
Jiyan Yang
Michael G. Rabbat
Brandon Lucia
Carole-Jean Wu
OffRL
18
31
0
05 Nov 2020
Understanding Capacity-Driven Scale-Out Neural Recommendation Inference
Michael Lui
Yavuz Yetim
Özgür Özkan
Zhuoran Zhao
Shin-Yeh Tsai
Carole-Jean Wu
Mark Hempstead
GNN
BDL
LRM
22
51
0
04 Nov 2020
Tensor Casting: Co-Designing Algorithm-Architecture for Personalized Recommendation Training
Youngeun Kwon
Yunjae Lee
Minsoo Rhu
19
39
0
25 Oct 2020
Enabling Compute-Communication Overlap in Distributed Deep Learning Training Platforms
Saeed Rashidi
Matthew Denton
Srinivas Sridharan
Sudarshan Srinivasan
Amoghavarsha Suresh
Jade Nie
T. Krishna
26
45
0
30 Jun 2020
The Architectural Implications of Facebook's DNN-based Personalized Recommendation
Udit Gupta
Carole-Jean Wu
Xiaodong Wang
Maxim Naumov
Brandon Reagen
...
Andrey Malevich
Dheevatsa Mudigere
M. Smelyanskiy
Liang Xiong
Xuan Zhang
GNN
35
290
0
06 Jun 2019
DeepFM: An End-to-End Wide & Deep Learning Framework for CTR Prediction
Huifeng Guo
Ruiming Tang
Yunming Ye
Zhenguo Li
Xiuqiang He
Zhenhua Dong
115
64
0
12 Apr 2018
1