Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.04296
Cited By
TensorIR: An Abstraction for Automatic Tensorized Program Optimization
9 July 2022
Siyuan Feng
Bohan Hou
Hongyi Jin
Wuwei Lin
Junru Shao
Ruihang Lai
Zihao Ye
Lianmin Zheng
Cody Hao Yu
Yong Yu
Tianqi Chen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"TensorIR: An Abstraction for Automatic Tensorized Program Optimization"
11 / 11 papers shown
Title
QiMeng-TensorOp: Automatically Generating High-Performance Tensor Operators with Hardware Primitives
X. Zhang
Shaohui Peng
Qirui Zhou
Yuanbo Wen
Qi Guo
...
Ke Gao
Chen Zhao
Yanjun Wu
Yunji Chen
Ling Li
VLM
39
0
0
08 May 2025
Tilus: A Virtual Machine for Arbitrary Low-Precision GPGPU Computation in LLM Serving
Yaoyao Ding
Bohan Hou
X. Zhang
Allan Lin
Tianqi Chen
Cody Yu Hao
Yida Wang
Gennady Pekhimenko
50
0
0
17 Apr 2025
Car-GS: Addressing Reflective and Transparent Surface Challenges in 3D Car Reconstruction
Congcong Li
Jin Wang
Xiaomeng Wang
Xingchen Zhou
Wei Wu
Yuzhi Zhang
Tongyi Cao
3DGS
3DV
108
0
0
19 Jan 2025
Tensorized Ant Colony Optimization for GPU Acceleration
Luming Yang
Tao Jiang
Ran Cheng
13
2
0
07 Apr 2024
Allo: A Programming Model for Composable Accelerator Design
Hongzheng Chen
Niansong Zhang
Shaojie Xiang
Zhichen Zeng
Mengjia Dai
Zhiru Zhang
54
14
0
07 Apr 2024
Relax: Composable Abstractions for End-to-End Dynamic Machine Learning
Ruihang Lai
Junru Shao
Siyuan Feng
Steven Lyubomirsky
Bohan Hou
...
Sunghyun Park
Prakalp Srivastava
Jared Roesch
T. Mowry
Tianqi Chen
47
9
0
01 Nov 2023
PowerFusion: A Tensor Compiler with Explicit Data Movement Description and Instruction-level Graph IR
Zixuan Ma
Haojie Wang
Jingze Xing
Liyan Zheng
Chen Zhang
Huanqi Cao
Kezhao Huang
Shizhi Tang
Penghan Wang
Jidong Zhai
GNN
34
1
0
11 Jul 2023
ALT: Boosting Deep Learning Performance by Breaking the Wall between Graph and Operator Level Optimizations
Zhiying Xu
Jiafan Xu
H. Peng
Wei Wang
Xiaoliang Wang
...
Haipeng Dai
Yixu Xu
Hao Cheng
Kun Wang
Guihai Chen
20
0
0
22 Oct 2022
Sgap: Towards Efficient Sparse Tensor Algebra Compilation for GPU
Genghan Zhang
Yuetong Zhao
Yanting Tao
Zhongming Yu
Guohao Dai
Sitao Huang
Yuanyuan Wen
Pavlos Petoumenos
Yu Wang
43
4
0
07 Sep 2022
UNIT: Unifying Tensorized Instruction Compilation
Jian Weng
Animesh Jain
Jie Wang
Leyuan Wang
Yida Wang
Tony Nowatzki
121
30
0
21 Jan 2021
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Andrew G. Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
M. Andreetto
Hartwig Adam
3DH
950
20,567
0
17 Apr 2017
1