128
0
v1v2 (latest)

On the Trade-off between the Number of Nodes and the Number of Trees in a Random Forest

Abstract

In this paper, we focus on the prediction phase of a random forest and study the problem of representing a bag of decision trees using a smaller bag of decision trees, where we only consider binary decision problems on the binary domain and simple decision trees in which an internal node is limited to querying the Boolean value of a single variable. As a main result, we show that the majority function of nn variables can be represented by a bag of TT (<n< n) decision trees each with polynomial size if nTn-T is a constant, where nn and TT must be odd (in order to avoid the tie break). We also show that a bag of nn decision trees can be represented by a bag of TT decision trees each with polynomial size if nTn-T is a constant and a small classification error is allowed. A related result on the kk-out-of-nn functions is presented too.

View on arXiv
Comments on this paper

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.