Share the Tensor Tea: How Databases can Leverage the Machine Learning Ecosystem
Yuki Asada
Victor Fu
Apurva Gandhi
Advitya Gemawat
Lihao Zhang
Dong He
Vikrant Gupta
Ehimwenma Nosakhare
Dalitso Banda
Rathijit Sen
Matteo Interlandi

Abstract
We demonstrate Tensor Query Processor (TQP): a query processor that automatically compiles relational operators into tensor programs. By leveraging tensor runtimes such as PyTorch, TQP is able to: (1) integrate with ML tools (e.g., Pandas for data ingestion, Tensorboard for visualization); (2) target different hardware (e.g., CPU, GPU) and software (e.g., browser) backends; and (3) end-to-end accelerate queries containing both relational and ML operators. TQP is generic enough to support the TPC-H benchmark, and it provides performance that is comparable to, and often better than, that of specialized CPU and GPU query processors.
View on arXivComments on this paper