Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.00091
Cited By
Deep Learning Recommendation Model for Personalization and Recommendation Systems
31 May 2019
Maxim Naumov
Dheevatsa Mudigere
Hao-Jun Michael Shi
Jianyu Huang
Narayanan Sundaraman
Jongsoo Park
Xiaodong Wang
Udit Gupta
Carole-Jean Wu
A. Azzolini
Dmytro Dzhulgakov
Andrey Mallevich
I. Cherniavskii
Yinghai Lu
Raghuraman Krishnamoorthi
Ansha Yu
Volodymyr Kondratenko
Stephanie Pereira
Xianjie Chen
Wenlin Chen
Vijay Rao
Bill Jia
Liang Xiong
M. Smelyanskiy
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Learning Recommendation Model for Personalization and Recommendation Systems"
50 / 117 papers shown
Title
LithOS: An Operating System for Efficient Machine Learning on GPUs
Patrick H. Coppock
Brian Zhang
Eliot H. Solomon
Vasilis Kypriotis
Leon Yang
Bikash Sharma
Dan Schatzberg
Todd C. Mowry
Dimitrios Skarlatos
40
0
0
21 Apr 2025
Harmonia: A Multi-Agent Reinforcement Learning Approach to Data Placement and Migration in Hybrid Storage Systems
Rakesh Nadig
Vamanan Arulchelvan
Rahul Bera
Taha Shahroodi
Gagandeep Singh
Mohammad Sadrosadati
Jisung Park
O. Mutlu
Onur Mutlu
68
0
0
26 Mar 2025
External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation
Mingfu Liang
Xi Liu
Rong Jin
B. Liu
Qiuling Suo
...
Bo Long
Wenlin Chen
Rocky Liu
Santanu Kolay
Yiming Li
46
2
0
20 Feb 2025
Beyond Self-Consistency: Loss-Balanced Perturbation-Based Regularization Improves Industrial-Scale Ads Ranking
Ilqar Ramazanli
Hamid Eghbalzadeh
Xiaoyi Liu
Yang Wang
Jiaxiang Fu
Kaushik Rangadurai
Sem Park
Bo Long
Xue Feng
51
0
0
05 Feb 2025
Quantum Cognition-Inspired EEG-based Recommendation via Graph Neural Networks
Jinkun Han
Wei Li
Yong Li
Zhipeng Cai
45
2
0
05 Jan 2025
MLPerf Power: Benchmarking the Energy Efficiency of Machine Learning Systems from Microwatts to Megawatts for Sustainable AI
Arya Tschand
Arun Tejusve Raghunath Rajan
S. Idgunji
Anirban Ghosh
J. Holleman
...
Rowan Taubitz
Sean Zhan
Scott Wasson
David Kanter
Vijay Janapa Reddi
62
3
0
15 Oct 2024
Characterizing and Efficiently Accelerating Multimodal Generation Model Inference
Yejin Lee
Anna Y. Sun
Basil Hosmer
Bilge Acun
Can Balioglu
...
Ram Pasunuru
Scott Yih
Sravya Popuri
Xing Liu
Carole-Jean Wu
57
2
0
30 Sep 2024
CADC: Encoding User-Item Interactions for Compressing Recommendation Model Training Data
Hossein Entezari Zarch
Abdulla Alshabanah
Chaoyi Jiang
Murali Annavaram
25
1
0
11 Jul 2024
FRED: Flexible REduction-Distribution Interconnect and Communication Implementation for Wafer-Scale Distributed Training of DNN Models
Saeed Rashidi
William Won
Sudarshan Srinivasan
Puneet Gupta
Tushar Krishna
30
0
0
28 Jun 2024
Exploring Safety-Utility Trade-Offs in Personalized Language Models
Anvesh Rao Vijjini
Somnath Basu Roy Chowdhury
Snigdha Chaturvedi
53
6
0
17 Jun 2024
ElasticRec: A Microservice-based Model Serving Architecture Enabling Elastic Resource Scaling for Recommendation Models
Yujeong Choi
Jiin Kim
Minsoo Rhu
39
1
0
11 Jun 2024
DREW : Towards Robust Data Provenance by Leveraging Error-Controlled Watermarking
Mehrdad Saberi
Vinu Sankar Sadasivan
Arman Zarei
Hessam Mahdavifar
S. Feizi
43
1
0
05 Jun 2024
Scorch: A Library for Sparse Deep Learning
Bobby Yan
Alexander J. Root
Trevor Gale
David Broman
Fredrik Kjolstad
33
0
0
27 May 2024
Retrieval and Distill: A Temporal Data Shift-Free Paradigm for Online Recommendation System
Lei Zheng
Ning Li
Weinan Zhang
Yong Yu
AI4TS
41
0
0
24 Apr 2024
Bullion: A Column Store for Machine Learning
Gang Liao
Ye Liu
Jianjun Chen
Daniel J. Abadi
37
5
0
13 Apr 2024
PID-Comm: A Fast and Flexible Collective Communication Framework for Commodity Processing-in-DIMM Devices
Si Ung Noh
Junguk Hong
Chaemin Lim
Seong-Yeol Park
Jeehyun Kim
Hanjun Kim
Youngsok Kim
Jinho Lee
34
7
0
13 Apr 2024
Optimization Methods for Personalizing Large Language Models through Retrieval Augmentation
Alireza Salemi
Surya Kallumadi
Hamed Zamani
44
46
0
09 Apr 2024
SequentialAttention++ for Block Sparsification: Differentiable Pruning Meets Combinatorial Optimization
T. Yasuda
Kyriakos Axiotis
Gang Fu
M. Bateni
Vahab Mirrokni
47
0
0
27 Feb 2024
Heterogeneity-aware Cross-school Electives Recommendation: a Hybrid Federated Approach
Chengyi Ju
Jiannong Cao
Yu Yang
Zhen-Qun Yang
Ho Man Lee
18
0
0
19 Feb 2024
Fine-Grained Embedding Dimension Optimization During Training for Recommender Systems
Qinyi Luo
Penghan Wang
Wei Zhang
Fan Lai
Jiachen Mao
...
Jun Song
Wei-Yu Tsai
Shuai Yang
Yuxi Hu
Xuehai Qian
50
0
0
09 Jan 2024
Scaling User Modeling: Large-scale Online User Representations for Ads Personalization in Meta
Wei Zhang
Dai Li
Chen Liang
Fang Zhou
Zhongke Zhang
...
Huayu Li
Yunnan Wu
Zhan Shu
Mindi Yuan
Sri Reddy
35
7
0
16 Nov 2023
DistDNAS: Search Efficient Feature Interactions within 2 Hours
Tunhou Zhang
W. Wen
Igor Fedorov
Xi Liu
Buyun Zhang
...
Wen-Yen Chen
Yiping Han
Feng Yan
Hai Helen Li
Yiran Chen
21
1
0
01 Nov 2023
SimplePIM: A Software Framework for Productive and Efficient Processing-in-Memory
Jinfan Chen
Juan Gómez Luna
I. E. Hajj
Yu-Yin Guo
Onur Mutlu
32
19
0
03 Oct 2023
Enhancing Cross-Category Learning in Recommendation Systems with Multi-Layer Embedding Training
Selim F. Yilmaz
Benjamin Ghaemmaghami
A. Singh
Benjamin Cho
Leo Orshansky
Lei Deng
Michael Orshansky
AI4TS
28
0
0
27 Sep 2023
Ad-Rec: Advanced Feature Interactions to Address Covariate-Shifts in Recommendation Networks
Muhammad Adnan
Yassaman Ebrahimzadeh Maboud
Divyat Mahajan
Prashant J. Nair
40
3
0
28 Aug 2023
Is Meta-Learning the Right Approach for the Cold-Start Problem in Recommender Systems?
Davide Buffelli
Ashish Gupta
Agnieszka Strzalka
Vassilis Plachouras
OffRL
LRM
32
1
0
16 Aug 2023
BHEISR: Nudging from Bias to Balance -- Promoting Belief Harmony by Eliminating Ideological Segregation in Knowledge-based Recommendations
Mengyan Wang
Yuxuan Hu
Zihan Yuan
Chenting Jiang
Weihua Li
Shiqing Wu
Quan-wei Bai
22
0
0
06 Jul 2023
Mem-Rec: Memory Efficient Recommendation System using Alternative Representation
Gopu Krishna Jha
Anthony Thomas
Nilesh Jain
Sameh Gobriel
Tajana Rosing
Ravi Iyer
53
2
0
12 May 2023
TorchBench: Benchmarking PyTorch with High API Surface Coverage
Yueming Hao
Xu Zhao
Bin Bao
David Berard
William Constable
Adnan Aziz
Xu Liu
35
5
0
27 Apr 2023
MTrainS: Improving DLRM training efficiency using heterogeneous memories
H. Kassa
Paul Johnson
Jason B. Akers
Mrinmoy Ghosh
Andrew Tulloch
Dheevatsa Mudigere
Jongsoo Park
Xing Liu
R. Dreslinski
E. K. Ardestani
27
1
0
19 Apr 2023
TransPimLib: A Library for Efficient Transcendental Functions on Processing-in-Memory Systems
Maurus Item
Juan Gómez Luna
Yu-Yin Guo
Geraldo F. Oliveira
Mohammad Sadrosadati
O. Mutlu
40
5
0
03 Apr 2023
mSAM: Micro-Batch-Averaged Sharpness-Aware Minimization
Kayhan Behdin
Qingquan Song
Aman Gupta
S. Keerthi
Ayan Acharya
Borja Ocejo
Gregory Dexter
Rajiv Khanna
D. Durfee
Rahul Mazumder
AAML
18
7
0
19 Feb 2023
VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs
Geonhwa Jeong
S. Damani
Abhimanyu Bambhaniya
Eric Qin
C. Hughes
S. Subramoney
Hyesoon Kim
T. Krishna
MoE
46
24
0
17 Feb 2023
With Shared Microexponents, A Little Shifting Goes a Long Way
Bita Darvish Rouhani
Ritchie Zhao
V. Elango
Rasoul Shafipour
Mathew Hall
...
Eric S. Chung
Zhaoxia Deng
S. Naghshineh
Jongsoo Park
Maxim Naumov
MQ
43
36
0
16 Feb 2023
Open Problems in Applied Deep Learning
M. Raissi
AI4CE
42
2
0
26 Jan 2023
Projective Integral Updates for High-Dimensional Variational Inference
J. Duersch
35
1
0
20 Jan 2023
Learning-Rate-Free Learning by D-Adaptation
Aaron Defazio
Konstantin Mishchenko
30
77
0
18 Jan 2023
Failure Tolerant Training with Persistent Memory Disaggregation over CXL
Miryeong Kwon
Junhyeok Jang
Hanjin Choi
Sangwon Lee
Myoungsoo Jung
32
8
0
14 Jan 2023
Data Distillation: A Survey
Noveen Sachdeva
Julian McAuley
DD
45
73
0
11 Jan 2023
Systems for Parallel and Distributed Large-Model Deep Learning Training
Kabir Nagrecha
GNN
VLM
MoE
26
7
0
06 Jan 2023
Improved Deep Neural Network Generalization Using m-Sharpness-Aware Minimization
Kayhan Behdin
Qingquan Song
Aman Gupta
D. Durfee
Ayan Acharya
S. Keerthi
Rahul Mazumder
AAML
31
5
0
07 Dec 2022
RAMP: A Flat Nanosecond Optical Network and MPI Operations for Distributed Deep Learning Systems
Alessandro Ottino
Joshua L. Benjamin
G. Zervas
30
7
0
28 Nov 2022
RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure
Mark Zhao
Dhruv Choudhary
Devashish Tyagi
A. Somani
Max Kaplan
...
Jongsoo Park
Aarti Basant
Niket Agarwal
Carole-Jean Wu
Christos Kozyrakis
VLM
28
6
0
09 Nov 2022
Merlin HugeCTR: GPU-accelerated Recommender System Training and Inference
Zehuan Wang
Yingcan Wei
Minseok Lee
Matthias Langer
F. Yu
...
Daniel G. Abel
Xu Guo
Jianbing Dong
Ji Shi
Kunlun Li
GNN
LRM
25
32
0
17 Oct 2022
Clustering the Sketch: A Novel Approach to Embedding Table Compression
Henry Ling-Hei Tsang
Thomas Dybdahl Ahle
43
1
0
12 Oct 2022
KAIROS: Building Cost-Efficient Machine Learning Inference Systems with Heterogeneous Cloud Resources
Baolin Li
S. Samsi
V. Gadepally
Devesh Tiwari
27
11
0
12 Oct 2022
DreamShard: Generalizable Embedding Table Placement for Recommender Systems
Daochen Zha
Louis Feng
Qiaoyu Tan
Zirui Liu
Kwei-Herng Lai
Bhargav Bhushanam
Yuandong Tian
A. Kejariwal
Xia Hu
LMTD
OffRL
33
28
0
05 Oct 2022
PARSRec: Explainable Personalized Attention-fused Recurrent Sequential Recommendation Using Session Partial Actions
E. Gholami
Mohammad Motamedi
A. Aravindakshan
51
9
0
16 Sep 2022
An Analysis of Collocation on GPUs for Deep Learning Training
Ties Robroek
Ehsan Yousefzadeh-Asl-Miandoab
Pınar Tözün
20
9
0
13 Sep 2022
HammingMesh: A Network Topology for Large-Scale Deep Learning
Torsten Hoefler
Tommaso Bonato
Daniele De Sensi
Salvatore Di Girolamo
Shigang Li
Marco Heddes
Jon Belk
Deepak Goel
Miguel Castro
Steve Scott
3DH
GNN
AI4CE
32
20
0
03 Sep 2022
1
2
3
Next