Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.08601
Cited By
Design Principles for Sparse Matrix Multiplication on the GPU
22 March 2018
Carl Yang
A. Buluç
John Douglas Owens
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Design Principles for Sparse Matrix Multiplication on the GPU"
32 / 32 papers shown
Title
Plexus: Taming Billion-edge Graphs with 3D Parallel GNN Training
Aditya K. Ranjan
Siddharth Singh
Cunyang Wei
A. Bhatele
GNN
69
0
0
07 May 2025
HC-SpMM: Accelerating Sparse Matrix-Matrix Multiplication for Graphs with Hybrid GPU Cores
Zhonggen Li
Xiangyu Ke
Yifan Zhu
Yunjun Gao
Yaofeng Tu
116
0
0
12 Dec 2024
GeoT: Tensor Centric Library for Graph Neural Network via Efficient Segment Reduction on GPU
Zhongming Yu
Genghan Zhang
Hanxian Huang
Xin Chen
Jishen Zhao
GNN
34
0
0
03 Apr 2024
JITSPMM: Just-in-Time Instruction Generation for Accelerated Sparse Matrix-Matrix Multiplication
Qiang Fu
Thomas B. Rolinger
H. H. Huang
45
3
0
09 Dec 2023
RDMA-Based Algorithms for Sparse Matrix Multiplication on GPUs
Benjamin Brock
A. Buluç
Katherine Yelick
30
2
0
29 Nov 2023
SUBP: Soft Uniform Block Pruning for 1xN Sparse CNNs Multithreading Acceleration
Jingyang Xiang
Siqi Li
Jun Chen
Shipeng Bai
Yukai Ma
Guang Dai
Yong-Jin Liu
42
1
0
10 Oct 2023
Accel-GCN: High-Performance GPU Accelerator Design for Graph Convolution Networks
Xiaoru Xie
Hongwu Peng
Amit Hasan
Shaoyi Huang
Jiahui Zhao
Haowen Fang
Wei Zhang
Tong Geng
O. Khan
Caiwen Ding
GNN
43
31
0
22 Aug 2023
BitGNN: Unleashing the Performance Potential of Binary Graph Neural Networks on GPUs
Jou-An Chen
Hsin-Hsuan Sung
Xipeng Shen
Sutanay Choudhury
Ang Li
GNN
MQ
44
7
0
04 May 2023
PopSparse: Accelerated block sparse matrix multiplication on IPU
Zhiyi Li
Douglas Orr
V. Ohan
Godfrey Da Costa
Tom Murray
Adam Sanders
D. Beker
Dominic Masters
32
1
0
29 Mar 2023
Exploiting Sparsity in Pruned Neural Networks to Optimize Large Model Training
Siddharth Singh
A. Bhatele
38
9
0
10 Feb 2023
A Programming Model for GPU Load Balancing
Muhammad Osama
Serban D. Porumbescu
John Douglas Owens
35
7
0
12 Jan 2023
Sgap: Towards Efficient Sparse Tensor Algebra Compilation for GPU
Genghan Zhang
Yuetong Zhao
Yanting Tao
Zhongming Yu
Guohao Dai
Sitao Huang
Yuanyuan Wen
Pavlos Petoumenos
Yu Wang
54
4
0
07 Sep 2022
SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning
Zihao Ye
Ruihang Lai
Junru Shao
Tianqi Chen
Luis Ceze
80
93
0
11 Jul 2022
Heuristic Adaptability to Input Dynamics for SpMM on GPUs
Guohao Dai
Guyue Huang
Shang Yang
Zhongming Yu
Hengrui Zhang
Yufei Ding
Yuan Xie
Huazhong Yang
Yu Wang
8
20
0
17 Feb 2022
Blocking Techniques for Sparse Matrix Multiplication on Tensor Accelerators
P. S. Labini
M. Bernaschi
Francesco Silvestri
Flavio Vella
25
3
0
11 Feb 2022
Sextans: A Streaming Accelerator for General-Purpose Sparse-Matrix Dense-Matrix Multiplication
Linghao Song
Yuze Chi
Atefeh Sohrabizadeh
Young-kyu Choi
Jason Lau
Jason Cong
GNN
45
60
0
22 Sep 2021
Accelerating SpMM Kernel with Cache-First Edge Sampling for Graph Neural Networks
Chien-Yu Lin
Liang Luo
Luis Ceze
GNN
79
8
0
21 Apr 2021
Do We Need Anisotropic Graph Neural Networks?
Shyam A. Tailor
Felix L. Opolka
Pietro Lio
Nicholas D. Lane
51
35
0
03 Apr 2021
A High-Performance Sparse Tensor Algebra Compiler in Multi-Level IR
Ruiqin Tian
Luanzheng Guo
Jiajia Li
Bin Ren
Gokcen Kestor
27
17
0
09 Feb 2021
SparseDNN: Fast Sparse Deep Learning Inference on CPUs
Ziheng Wang
MQ
78
19
0
20 Jan 2021
SparseRT: Accelerating Unstructured Sparsity on GPUs for Deep Learning Inference
Ziheng Wang
47
68
0
26 Aug 2020
FeatGraph: A Flexible and Efficient Backend for Graph Neural Network Systems
Yuwei Hu
Zihao Ye
Minjie Wang
Jiali Yu
Da Zheng
Mu Li
Zheng Zhang
Zhiru Zhang
Yida Wang
GNN
51
80
0
26 Aug 2020
GE-SpMM: General-purpose Sparse Matrix-Matrix Multiplication on GPUs for Graph Neural Networks
Guyue Huang
Guohao Dai
Yu Wang
Huazhong Yang
GNN
32
122
0
07 Jul 2020
Sparse GPU Kernels for Deep Learning
Trevor Gale
Matei A. Zaharia
C. Young
Erich Elsen
40
230
0
18 Jun 2020
GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs
Yuke Wang
Boyuan Feng
Gushu Li
Shuangchen Li
Lei Deng
Yuan Xie
Yufei Ding
GNN
26
121
0
11 Jun 2020
Efficient Sparse-Dense Matrix-Matrix Multiplication on GPUs Using the Customized Sparse Storage Format
Shaoshuai Shi
Qiang-qiang Wang
Xiaowen Chu
29
10
0
29 May 2020
Reducing Communication in Graph Neural Network Training
Alok Tripathy
Katherine Yelick
A. Buluç
GNN
35
104
0
07 May 2020
Fast Sparse ConvNets
Erich Elsen
Marat Dukhan
Trevor Gale
Karen Simonyan
26
151
0
21 Nov 2019
GraphBLAST: A High-Performance Linear Algebra-based Graph Framework on the GPU
Carl Yang
A. Buluç
John Douglas Owens
GNN
29
98
0
04 Aug 2019
Optimizing the Linear Fascicle Evaluation Algorithm for Multi-Core and Many-Core Systems
Karan Aggarwal
Uday Bondhugula
16
2
0
14 May 2019
Batched Sparse Matrix Multiplication for Accelerating Graph Convolutional Networks
Yusuke Nagasaka
Akira Nukada
Ryosuke Kojima
Satoshi Matsuoka
GNN
27
10
0
27 Mar 2019
A GraphBLAS Approach for Subgraph Counting
Langshi Chen
Jiayu Li
A. Azad
Lei Jiang
Madhav Marathe
A. Vullikanti
Andrey Nikolaev
Egor Smirnov
R. Israfilov
J. Qiu
GNN
15
4
0
11 Mar 2019
1