ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.03994
40
0
v1v2 (latest)

Fully-Dynamic Approximate Decision Trees With Worst-Case Update Time Guarantees

8 February 2023
Marco Bressan
Mauro Sozio
ArXiv (abs)PDFHTML
Abstract

We give the first algorithm that maintains an approximate decision tree over an arbitrary sequence of insertions and deletions of labeled examples, with strong guarantees on the worst-case running time per update request. For instance, we show how to maintain a decision tree where every vertex has Gini gain within an additive α\alphaα of the optimum by performing O(d (log⁡n)4α3)O\Big(\frac{d\,(\log n)^4}{\alpha^3}\Big)O(α3d(logn)4​) elementary operations per update, where ddd is the number of features and nnn the maximum size of the active set (the net result of the update requests). We give similar bounds for the information gain and the variance gain. In fact, all these bounds are corollaries of a more general result, stated in terms of decision rules -- functions that, given a set SSS of labeled examples, decide whether to split SSS or predict a label. Decision rules give a unified view of greedy decision tree algorithms regardless of the example and label domains, and lead to a general notion of ϵ\epsilonϵ-approximate decision trees that, for natural decision rules such as those used by ID3 or C4.5, implies the gain approximation guarantees above. The heart of our work provides a deterministic algorithm that, given any decision rule and any ϵ>0\epsilon > 0ϵ>0, maintains an ϵ\epsilonϵ-approximate tree using O ⁣(d f(n)npoly⁡hϵ)O\!\left(\frac{d\, f(n)}{n} \operatorname{poly}\frac{h}{\epsilon}\right)O(ndf(n)​polyϵh​) operations per update, where f(n)f(n)f(n) is the complexity of evaluating the rule over a set of nnn examples and hhh is the maximum height of the maintained tree.

View on arXiv
Comments on this paper