Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2206.06022
Cited By
v1
v2 (latest)
Machine Learning Training on a Real Processing-in-Memory System
13 June 2022
Juan Gómez Luna
Yu-Yin Guo
Sylvan Brocard
Julien Legriel
Remy Cimadomo
Geraldo F. Oliveira
Gagandeep Singh
Onur Mutlu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Machine Learning Training on a Real Processing-in-Memory System"
14 / 14 papers shown
Title
High-throughput Pairwise Alignment with the Wavefront Algorithm using Processing-in-Memory
Safaa Diab
Amir Nassereldine
M. Alser
Juan Gómez Luna
O. Mutlu
I. E. Hajj
FedML
88
24
0
05 Apr 2022
GenStore: A High-Performance and Energy-Efficient In-Storage Computing System for Genome Sequence Analysis
Nika Mansouri-Ghiasi
Jisung Park
Harun Mustafa
Jeremie S. Kim
Ataberk Olgun
...
N. Alserr
Rachata Ausavarungnirun
Nandita Vijaykumar
M. Alser
O. Mutlu
65
28
0
21 Feb 2022
Benchmarking Memory-Centric Computing Systems: Analysis of Real Processing-in-Memory Hardware
Juan Gómez Luna
I. E. Hajj
Ivan Fernandez
Christina Giannoula
Geraldo F. Oliveira
O. Mutlu
67
67
0
04 Oct 2021
Google Neural Network Models for Edge Devices: Analyzing and Mitigating Machine Learning Inference Bottlenecks
Amirali Boroumand
Saugata Ghose
Berkin Akin
Ravi Narayanaswami
Geraldo F. Oliveira
Xiaoyu Ma
Eric Shiu
O. Mutlu
68
86
0
29 Sep 2021
Accelerating Weather Prediction using Near-Memory Reconfigurable Fabric
Gagandeep Singh
D. Diamantopoulos
Juan Gómez Luna
C. Hagleitner
S. Stuijk
Henk Corporaal
O. Mutlu
66
25
0
19 Jul 2021
FPGA-Based Near-Memory Acceleration of Modern Data-Intensive Applications
Gagandeep Singh
M. Alser
Damla Senol Cali
D. Diamantopoulos
Juan Gómez Luna
Henk Corporaal
O. Mutlu
51
76
0
11 Jun 2021
QUAC-TRNG: High-Throughput True Random Number Generation Using Quadruple Row Activation in Commodity DRAM Chips
Ataberk Olgun
Minesh Patel
A. G. Yaglikçi
Haocong Luo
Jeremie S. Kim
Nisa Bostanci
Nandita Vijaykumar
Oguz Ergin
O. Mutlu
63
64
0
19 May 2021
Benchmarking a New Paradigm: An Experimental Analysis of a Real Processing-in-Memory Architecture
Juan Gómez Luna
I. E. Hajj
Ivan Fernandez
Christina Giannoula
Geraldo F. Oliveira
O. Mutlu
59
86
0
09 May 2021
DAMOV: A New Methodology and Benchmark Suite for Evaluating Data Movement Bottlenecks
Geraldo F. Oliveira
Juan Gómez Luna
Lois Orosa
Saugata Ghose
Nandita Vijaykumar
Ivan Fernandez
Mohammad Sadrosadati
O. Mutlu
93
84
0
08 May 2021
SISA: Set-Centric Instruction Set Architecture for Graph Mining on Processing-in-Memory Systems
Maciej Besta
Raghavendra Kanakagiri
Grzegorz Kwa'sniewski
Rachata Ausavarungnirun
Jakub Beránek
...
Salvatore Di Girolamo
Marek Konieczny
Nils Blach
O. Mutlu
Torsten Hoefler
41
86
0
15 Apr 2021
SynCron: Efficient Synchronization Support for Near-Data-Processing Architectures
Christina Giannoula
Nandita Vijaykumar
Nikela Papadopoulou
Vasileios Karakostas
Ivan Fernandez
Juan Gómez Luna
Lois Orosa
N. Koziris
G. Goumas
O. Mutlu
154
85
0
19 Jan 2021
In-Datacenter Performance Analysis of a Tensor Processing Unit
N. Jouppi
C. Young
Nishant Patil
David Patterson
Gaurav Agrawal
...
Vijay Vasudevan
Richard Walter
Walter Wang
Eric Wilcox
Doe Hyun Yoon
237
4,644
0
16 Apr 2017
TensorFlow: A system for large-scale machine learning
Martín Abadi
P. Barham
Jianmin Chen
Zhiwen Chen
Andy Davis
...
Vijay Vasudevan
Pete Warden
Martin Wicke
Yuan Yu
Xiaoqiang Zhang
GNN
AI4CE
433
18,361
0
27 May 2016
cuDNN: Efficient Primitives for Deep Learning
Sharan Chetlur
Cliff Woolley
Philippe Vandermersch
Jonathan M. Cohen
J. Tran
Bryan Catanzaro
Evan Shelhamer
137
1,849
0
03 Oct 2014
1