ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2108.09373
  4. Cited By
Understanding Data Storage and Ingestion for Large-Scale Deep
  Recommendation Model Training

Understanding Data Storage and Ingestion for Large-Scale Deep Recommendation Model Training

20 August 2021
Mark Zhao
Niket Agarwal
Aarti Basant
B. Gedik
Satadru Pan
Muhammet Mustafa Ozdal
Rakesh Komuravelli
Jerry Y. Pan
Tianshu Bao
Haowei Lu
Sundaram Narayanan
Jack Langman
Kevin Wilfong
Harsha Rastogi
Carole-Jean Wu
Christos Kozyrakis
Parikshit Pol
    GNN
ArXivPDFHTML

Papers citing "Understanding Data Storage and Ingestion for Large-Scale Deep Recommendation Model Training"

19 / 19 papers shown
Title
OVERLORD: Ultimate Scaling of DataLoader for Multi-Source Large Foundation Model Training
OVERLORD: Ultimate Scaling of DataLoader for Multi-Source Large Foundation Model Training
Juntao Zhao
Qi Lu
Wei Jia
Borui Wan
Lei Zuo
...
Size Zheng
H. Lin
Haibin Lin
Xin Liu
Chuan Wu
AI4CE
37
0
0
14 Apr 2025
PREBA: A Hardware/Software Co-Design for Multi-Instance GPU based AI
  Inference Servers
PREBA: A Hardware/Software Co-Design for Multi-Instance GPU based AI Inference Servers
Gwangoo Yeo
Jiin Kim
Yujeong Choi
Minsoo Rhu
79
0
0
28 Nov 2024
TensorSocket: Shared Data Loading for Deep Learning Training
TensorSocket: Shared Data Loading for Deep Learning Training
Ties Robroek
Neil Kim Nielsen
Pınar Tözün
26
2
0
27 Sep 2024
PreSto: An In-Storage Data Preprocessing System for Training
  Recommendation Models
PreSto: An In-Storage Data Preprocessing System for Training Recommendation Models
Yunjae Lee
Hyeseong Kim
Minsoo Rhu
42
3
0
11 Jun 2024
Beyond Efficiency: Scaling AI Sustainably
Beyond Efficiency: Scaling AI Sustainably
Carole-Jean Wu
Bilge Acun
Ramya Raghavendra
Kim Hazelwood
GNN
46
14
0
08 Jun 2024
Bullion: A Column Store for Machine Learning
Bullion: A Column Store for Machine Learning
Gang Liao
Ye Liu
Jianjun Chen
Daniel J. Abadi
32
5
0
13 Apr 2024
Data Acquisition: A New Frontier in Data-centric AI
Data Acquisition: A New Frontier in Data-centric AI
Lingjiao Chen
Bilge Acun
Newsha Ardalani
Yifan Sun
Feiyang Kang
...
Yongchan Kwon
Ruoxi Jia
Carole-Jean Wu
Matei A. Zaharia
James Zou
48
8
0
22 Nov 2023
InTune: Reinforcement Learning-based Data Pipeline Optimization for Deep
  Recommendation Models
InTune: Reinforcement Learning-based Data Pipeline Optimization for Deep Recommendation Models
Kabir Nagrecha
Lingyi Liu
P. Delgado
Prasanna Padmanabhan
OffRL
AI4CE
33
5
0
13 Aug 2023
TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning
  with Hardware Support for Embeddings
TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings
N. Jouppi
George Kurian
Sheng Li
Peter C. Ma
R. Nagarajan
...
Brian Towles
C. Young
Xiaoping Zhou
Zongwei Zhou
David A. Patterson
BDL
VLM
46
336
0
04 Apr 2023
FlexShard: Flexible Sharding for Industry-Scale Sequence Recommendation
  Models
FlexShard: Flexible Sharding for Industry-Scale Sequence Recommendation Models
Geet Sethi
Pallab Bhattacharya
Dhruv Choudhary
Carole-Jean Wu
Christos Kozyrakis
21
5
0
08 Jan 2023
Mystique: Enabling Accurate and Scalable Generation of Production AI
  Benchmarks
Mystique: Enabling Accurate and Scalable Generation of Production AI Benchmarks
Mingyu Liang
Wenyin Fu
Louis Feng
Zhongyi Lin
P. Panakanti
Shengbao Zheng
Srinivas Sridharan
Christina Delimitrou
23
12
0
16 Dec 2022
RecD: Deduplication for End-to-End Deep Learning Recommendation Model
  Training Infrastructure
RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure
Mark Zhao
Dhruv Choudhary
Devashish Tyagi
A. Somani
Max Kaplan
...
Jongsoo Park
Aarti Basant
Niket Agarwal
Carole-Jean Wu
Christos Kozyrakis
VLM
23
6
0
09 Nov 2022
tf.data service: A Case for Disaggregating ML Input Data Processing
tf.data service: A Case for Disaggregating ML Input Data Processing
Andrew Audibert
Yangrui Chen
D. Graur
Ana Klimovic
Jiří Šimša
C. A. Thekkath
42
16
0
26 Oct 2022
Accelerating Transfer Learning with Near-Data Computation on Cloud
  Object Stores
Accelerating Transfer Learning with Near-Data Computation on Cloud Object Stores
Arsany Guirguis
Diana Petrescu
Florin Dinu
D. Quoc
Javier Picorel
R. Guerraoui
40
0
0
16 Oct 2022
Understanding Scaling Laws for Recommendation Models
Understanding Scaling Laws for Recommendation Models
Newsha Ardalani
Carole-Jean Wu
Zeliang Chen
Bhargav Bhushanam
Adnan Aziz
34
28
0
17 Aug 2022
CoVA: Exploiting Compressed-Domain Analysis to Accelerate Video
  Analytics
CoVA: Exploiting Compressed-Domain Analysis to Accelerate Video Analytics
Jinwoo Hwang
Minsu Kim
Daeun Kim
Seungho Nam
Yoonsung Kim
Dohee Kim
Hardik Sharma
Jongse Park
46
14
0
02 Jul 2022
Heterogeneous Acceleration Pipeline for Recommendation System Training
Heterogeneous Acceleration Pipeline for Recommendation System Training
Muhammad Adnan
Yassaman Ebrahimzadeh Maboud
Divyat Mahajan
Prashant J. Nair
28
18
0
11 Apr 2022
RecShard: Statistical Feature-Based Memory Optimization for
  Industry-Scale Neural Recommendation
RecShard: Statistical Feature-Based Memory Optimization for Industry-Scale Neural Recommendation
Geet Sethi
Bilge Acun
Niket Agarwal
Christos Kozyrakis
Caroline Trippel
Carole-Jean Wu
47
66
0
25 Jan 2022
tf.data: A Machine Learning Data Processing Framework
tf.data: A Machine Learning Data Processing Framework
D. Murray
Jiří Šimša
Ana Klimovic
Ihor Indyk
PINN
AI4CE
LMTD
41
87
0
28 Jan 2021
1