Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2108.13191
Cited By
High Performance GPU Code Generation for Matrix-Matrix Multiplication using MLIR: Some Early Results
23 August 2021
Navdeep Katel
Vivek Khandelwal
Uday Bondhugula
Re-assign community
ArXiv
PDF
HTML
Papers citing
"High Performance GPU Code Generation for Matrix-Matrix Multiplication using MLIR: Some Early Results"
2 / 2 papers shown
Title
QiMeng-TensorOp: Automatically Generating High-Performance Tensor Operators with Hardware Primitives
X. Zhang
Shaohui Peng
Qirui Zhou
Yuanbo Wen
Qi Guo
...
Ke Gao
Chen Zhao
Yanjun Wu
Yunji Chen
Ling Li
VLM
39
0
0
08 May 2025
Bridging Control-Centric and Data-Centric Optimization
Tal Ben-Nun
Berke Ates
A. Calotoiu
Torsten Hoefler
36
7
0
01 Jun 2023
1