Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.05968
Cited By
Centaur: A Chiplet-based, Hybrid Sparse-Dense Accelerator for Personalized Recommendations
12 May 2020
Ranggi Hwang
Taehun Kim
Youngeun Kwon
Minsoo Rhu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Centaur: A Chiplet-based, Hybrid Sparse-Dense Accelerator for Personalized Recommendations"
14 / 14 papers shown
Title
REED: Chiplet-Based Accelerator for Fully Homomorphic Encryption
Aikata Aikata
A. Mert
Sunmin Kwon
M. Deryabin
S. Roy
139
2
0
05 Aug 2023
DeepRecSys: A System for Optimizing End-To-End At-scale Neural Recommendation Inference
Udit Gupta
Samuel Hsia
V. Saraph
Xiaodong Wang
Brandon Reagen
Gu-Yeon Wei
Hsien-Hsin S. Lee
David Brooks
Carole-Jean Wu
GNN
73
189
0
08 Jan 2020
RecNMP: Accelerating Personalized Recommendation with Near-Memory Processing
Liu Ke
Udit Gupta
Carole-Jean Wu
B. Cho
Mark Hempstead
...
Dheevatsa Mudigere
Maxim Naumov
Martin D. Schatz
M. Smelyanskiy
Xiaodong Wang
66
222
0
30 Dec 2019
NeuMMU: Architectural Support for Efficient Address Translations in Neural Processing Units
Bongjoon Hyun
Youngeun Kwon
Yujeong Choi
John Kim
Minsoo Rhu
44
29
0
15 Nov 2019
PREMA: A Predictive Multi-task Scheduling Algorithm For Preemptible Neural Processing Units
Yujeong Choi
Minsoo Rhu
53
132
0
06 Sep 2019
TensorDIMM: A Practical Near-Memory Processing Architecture for Embeddings and Tensor Operations in Deep Learning
Youngeun Kwon
Yunjae Lee
Minsoo Rhu
64
213
0
08 Aug 2019
The Architectural Implications of Facebook's DNN-based Personalized Recommendation
Udit Gupta
Carole-Jean Wu
Xiaodong Wang
Maxim Naumov
Brandon Reagen
...
Andrey Malevich
Dheevatsa Mudigere
M. Smelyanskiy
Liang Xiong
Xuan Zhang
GNN
79
291
0
06 Jun 2019
Deep Learning Recommendation Model for Personalization and Recommendation Systems
Maxim Naumov
Dheevatsa Mudigere
Hao-Jun Michael Shi
Jianyu Huang
Narayanan Sundaraman
...
Wenlin Chen
Vijay Rao
Bill Jia
Liang Xiong
M. Smelyanskiy
91
738
0
31 May 2019
Beyond the Memory Wall: A Case for Memory-centric HPC System for Deep Learning
Youngeun Kwon
Minsoo Rhu
56
58
0
18 Feb 2019
Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications
Jongsoo Park
Maxim Naumov
Protonu Basu
Summer Deng
Aravind Kalaiah
...
Lin Qiao
Vijay Rao
Nadav Rotem
S. Yoo
M. Smelyanskiy
FedML
GNN
BDL
76
188
0
24 Nov 2018
Restructuring Batch Normalization to Accelerate CNN Training
Wonkyung Jung
Daejin Jung
and Byeongho Kim
Sunjung Lee
Wonjong Rhee
Jung Ho Ahn
45
63
0
04 Jul 2018
Billion-scale Commodity Embedding for E-commerce Recommendation in Alibaba
Jizhe Wang
Pipei Huang
Huan Zhao
Zhibo Zhang
Binqiang Zhao
Lee
59
500
0
06 Mar 2018
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks
Minsoo Rhu
Mike O'Connor
Niladrish Chatterjee
Jeff Pool
S. Keckler
65
178
0
03 May 2017
In-Datacenter Performance Analysis of a Tensor Processing Unit
N. Jouppi
C. Young
Nishant Patil
David Patterson
Gaurav Agrawal
...
Vijay Vasudevan
Richard Walter
Walter Wang
Eric Wilcox
Doe Hyun Yoon
237
4,644
0
16 Apr 2017
1