25
3

Compositional Nonparametric Prediction: Statistical Efficiency and Greedy Regression Algorithm

Abstract

In this paper, we propose a compositional nonparametric method in which a model is expressed as a labeled binary tree of 2k+12k+1 nodes, where each node is either a summation, a multiplication, or the application of one of the qq basis functions to one of the pp covariates. We show that in order to recover a labeled binary tree from a given dataset, the sufficient number of samples is O(klog(pq)+log(k!))O(k\log(pq)+\log(k!)), and the necessary number of samples is Ω(klog(pq)log(k!))\Omega(k\log (pq)-\log(k!)). We further propose a greedy algorithm for regression, and evaluate its effectiveness through synthetic as well as real-world experiments.

View on arXiv
Comments on this paper