ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.08601
  4. Cited By
Design Principles for Sparse Matrix Multiplication on the GPU

Design Principles for Sparse Matrix Multiplication on the GPU

22 March 2018
Carl Yang
A. Buluç
John Douglas Owens
ArXivPDFHTML

Papers citing "Design Principles for Sparse Matrix Multiplication on the GPU"

32 / 32 papers shown
Title
Plexus: Taming Billion-edge Graphs with 3D Parallel GNN Training
Plexus: Taming Billion-edge Graphs with 3D Parallel GNN Training
Aditya K. Ranjan
Siddharth Singh
Cunyang Wei
A. Bhatele
GNN
65
0
0
07 May 2025
HC-SpMM: Accelerating Sparse Matrix-Matrix Multiplication for Graphs
  with Hybrid GPU Cores
HC-SpMM: Accelerating Sparse Matrix-Matrix Multiplication for Graphs with Hybrid GPU Cores
Zhonggen Li
Xiangyu Ke
Yifan Zhu
Yunjun Gao
Yaofeng Tu
113
0
0
12 Dec 2024
GeoT: Tensor Centric Library for Graph Neural Network via Efficient
  Segment Reduction on GPU
GeoT: Tensor Centric Library for Graph Neural Network via Efficient Segment Reduction on GPU
Zhongming Yu
Genghan Zhang
Hanxian Huang
Xin Chen
Jishen Zhao
GNN
34
0
0
03 Apr 2024
JITSPMM: Just-in-Time Instruction Generation for Accelerated Sparse
  Matrix-Matrix Multiplication
JITSPMM: Just-in-Time Instruction Generation for Accelerated Sparse Matrix-Matrix Multiplication
Qiang Fu
Thomas B. Rolinger
H. H. Huang
41
3
0
09 Dec 2023
RDMA-Based Algorithms for Sparse Matrix Multiplication on GPUs
RDMA-Based Algorithms for Sparse Matrix Multiplication on GPUs
Benjamin Brock
A. Buluç
Katherine Yelick
30
2
0
29 Nov 2023
SUBP: Soft Uniform Block Pruning for 1xN Sparse CNNs Multithreading
  Acceleration
SUBP: Soft Uniform Block Pruning for 1xN Sparse CNNs Multithreading Acceleration
Jingyang Xiang
Siqi Li
Jun Chen
Shipeng Bai
Yukai Ma
Guang Dai
Yong-Jin Liu
42
1
0
10 Oct 2023
Accel-GCN: High-Performance GPU Accelerator Design for Graph Convolution
  Networks
Accel-GCN: High-Performance GPU Accelerator Design for Graph Convolution Networks
Xiaoru Xie
Hongwu Peng
Amit Hasan
Shaoyi Huang
Jiahui Zhao
Haowen Fang
Wei Zhang
Tong Geng
O. Khan
Caiwen Ding
GNN
43
31
0
22 Aug 2023
BitGNN: Unleashing the Performance Potential of Binary Graph Neural
  Networks on GPUs
BitGNN: Unleashing the Performance Potential of Binary Graph Neural Networks on GPUs
Jou-An Chen
Hsin-Hsuan Sung
Xipeng Shen
Sutanay Choudhury
Ang Li
GNN
MQ
44
7
0
04 May 2023
PopSparse: Accelerated block sparse matrix multiplication on IPU
PopSparse: Accelerated block sparse matrix multiplication on IPU
Zhiyi Li
Douglas Orr
V. Ohan
Godfrey Da Costa
Tom Murray
Adam Sanders
D. Beker
Dominic Masters
32
1
0
29 Mar 2023
Exploiting Sparsity in Pruned Neural Networks to Optimize Large Model
  Training
Exploiting Sparsity in Pruned Neural Networks to Optimize Large Model Training
Siddharth Singh
A. Bhatele
38
9
0
10 Feb 2023
A Programming Model for GPU Load Balancing
A Programming Model for GPU Load Balancing
Muhammad Osama
Serban D. Porumbescu
John Douglas Owens
35
7
0
12 Jan 2023
Sgap: Towards Efficient Sparse Tensor Algebra Compilation for GPU
Sgap: Towards Efficient Sparse Tensor Algebra Compilation for GPU
Genghan Zhang
Yuetong Zhao
Yanting Tao
Zhongming Yu
Guohao Dai
Sitao Huang
Yuanyuan Wen
Pavlos Petoumenos
Yu Wang
54
4
0
07 Sep 2022
SparseTIR: Composable Abstractions for Sparse Compilation in Deep
  Learning
SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning
Zihao Ye
Ruihang Lai
Junru Shao
Tianqi Chen
Luis Ceze
78
93
0
11 Jul 2022
Heuristic Adaptability to Input Dynamics for SpMM on GPUs
Heuristic Adaptability to Input Dynamics for SpMM on GPUs
Guohao Dai
Guyue Huang
Shang Yang
Zhongming Yu
Hengrui Zhang
Yufei Ding
Yuan Xie
Huazhong Yang
Yu Wang
6
20
0
17 Feb 2022
Blocking Techniques for Sparse Matrix Multiplication on Tensor
  Accelerators
Blocking Techniques for Sparse Matrix Multiplication on Tensor Accelerators
P. S. Labini
M. Bernaschi
Francesco Silvestri
Flavio Vella
25
3
0
11 Feb 2022
Sextans: A Streaming Accelerator for General-Purpose Sparse-Matrix
  Dense-Matrix Multiplication
Sextans: A Streaming Accelerator for General-Purpose Sparse-Matrix Dense-Matrix Multiplication
Linghao Song
Yuze Chi
Atefeh Sohrabizadeh
Young-kyu Choi
Jason Lau
Jason Cong
GNN
16
60
0
22 Sep 2021
Accelerating SpMM Kernel with Cache-First Edge Sampling for Graph Neural
  Networks
Accelerating SpMM Kernel with Cache-First Edge Sampling for Graph Neural Networks
Chien-Yu Lin
Liang Luo
Luis Ceze
GNN
79
8
0
21 Apr 2021
Do We Need Anisotropic Graph Neural Networks?
Do We Need Anisotropic Graph Neural Networks?
Shyam A. Tailor
Felix L. Opolka
Pietro Lio
Nicholas D. Lane
51
35
0
03 Apr 2021
A High-Performance Sparse Tensor Algebra Compiler in Multi-Level IR
A High-Performance Sparse Tensor Algebra Compiler in Multi-Level IR
Ruiqin Tian
Luanzheng Guo
Jiajia Li
Bin Ren
Gokcen Kestor
27
17
0
09 Feb 2021
SparseDNN: Fast Sparse Deep Learning Inference on CPUs
SparseDNN: Fast Sparse Deep Learning Inference on CPUs
Ziheng Wang
MQ
76
19
0
20 Jan 2021
SparseRT: Accelerating Unstructured Sparsity on GPUs for Deep Learning
  Inference
SparseRT: Accelerating Unstructured Sparsity on GPUs for Deep Learning Inference
Ziheng Wang
47
67
0
26 Aug 2020
FeatGraph: A Flexible and Efficient Backend for Graph Neural Network
  Systems
FeatGraph: A Flexible and Efficient Backend for Graph Neural Network Systems
Yuwei Hu
Zihao Ye
Minjie Wang
Jiali Yu
Da Zheng
Mu Li
Zheng Zhang
Zhiru Zhang
Yida Wang
GNN
51
80
0
26 Aug 2020
GE-SpMM: General-purpose Sparse Matrix-Matrix Multiplication on GPUs for
  Graph Neural Networks
GE-SpMM: General-purpose Sparse Matrix-Matrix Multiplication on GPUs for Graph Neural Networks
Guyue Huang
Guohao Dai
Yu Wang
Huazhong Yang
GNN
32
122
0
07 Jul 2020
Sparse GPU Kernels for Deep Learning
Sparse GPU Kernels for Deep Learning
Trevor Gale
Matei A. Zaharia
C. Young
Erich Elsen
22
230
0
18 Jun 2020
GNNAdvisor: An Adaptive and Efficient Runtime System for GNN
  Acceleration on GPUs
GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs
Yuke Wang
Boyuan Feng
Gushu Li
Shuangchen Li
Lei Deng
Yuan Xie
Yufei Ding
GNN
21
121
0
11 Jun 2020
Efficient Sparse-Dense Matrix-Matrix Multiplication on GPUs Using the
  Customized Sparse Storage Format
Efficient Sparse-Dense Matrix-Matrix Multiplication on GPUs Using the Customized Sparse Storage Format
Shaoshuai Shi
Qiang-qiang Wang
Xiaowen Chu
29
10
0
29 May 2020
Reducing Communication in Graph Neural Network Training
Reducing Communication in Graph Neural Network Training
Alok Tripathy
Katherine Yelick
A. Buluç
GNN
35
104
0
07 May 2020
Fast Sparse ConvNets
Fast Sparse ConvNets
Erich Elsen
Marat Dukhan
Trevor Gale
Karen Simonyan
26
151
0
21 Nov 2019
GraphBLAST: A High-Performance Linear Algebra-based Graph Framework on
  the GPU
GraphBLAST: A High-Performance Linear Algebra-based Graph Framework on the GPU
Carl Yang
A. Buluç
John Douglas Owens
GNN
29
98
0
04 Aug 2019
Optimizing the Linear Fascicle Evaluation Algorithm for Multi-Core and
  Many-Core Systems
Optimizing the Linear Fascicle Evaluation Algorithm for Multi-Core and Many-Core Systems
Karan Aggarwal
Uday Bondhugula
16
2
0
14 May 2019
Batched Sparse Matrix Multiplication for Accelerating Graph
  Convolutional Networks
Batched Sparse Matrix Multiplication for Accelerating Graph Convolutional Networks
Yusuke Nagasaka
Akira Nukada
Ryosuke Kojima
Satoshi Matsuoka
GNN
27
10
0
27 Mar 2019
A GraphBLAS Approach for Subgraph Counting
A GraphBLAS Approach for Subgraph Counting
Langshi Chen
Jiayu Li
A. Azad
Lei Jiang
Madhav Marathe
A. Vullikanti
Andrey Nikolaev
Egor Smirnov
R. Israfilov
J. Qiu
GNN
13
4
0
11 Mar 2019
1