ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.06375
24
8

Superpolynomial Lower Bounds for Decision Tree Learning and Testing

12 October 2022
Caleb M. Koch
Carmen Strassle
Li-Yang Tan
ArXivPDFHTML
Abstract

We establish new hardness results for decision tree optimization problems, adding to a line of work that dates back to Hyafil and Rivest in 1976. We prove, under randomized ETH, superpolynomial lower bounds for two basic problems: given an explicit representation of a function fff and a generator for a distribution D\mathcal{D}D, construct a small decision tree approximator for fff under D\mathcal{D}D, and decide if there is a small decision tree approximator for fff under D\mathcal{D}D. Our results imply new lower bounds for distribution-free PAC learning and testing of decision trees, settings in which the algorithm only has restricted access to fff and D\mathcal{D}D. Specifically, we show: nnn-variable size-sss decision trees cannot be properly PAC learned in time nO~(log⁡log⁡s)n^{\tilde{O}(\log\log s)}nO~(loglogs), and depth-ddd decision trees cannot be tested in time exp⁡(d O(1))\exp(d^{\,O(1)})exp(dO(1)). For learning, the previous best lower bound only ruled out poly(n)\text{poly}(n)poly(n)-time algorithms (Alekhnovich, Braverman, Feldman, Klivans, and Pitassi, 2009). For testing, recent work gives similar though incomparable bounds in the setting where fff is random and D\mathcal{D}D is nonexplicit (Blais, Ferreira Pinto Jr., and Harms, 2021). Assuming a plausible conjecture on the hardness of Set-Cover, we show our lower bound for learning decision trees can be improved to nΩ(log⁡s)n^{\Omega(\log s)}nΩ(logs), matching the best known upper bound of nO(log⁡s)n^{O(\log s)}nO(logs) due to Ehrenfeucht and Haussler (1989). We obtain our results within a unified framework that leverages recent progress in two lines of work: the inapproximability of Set-Cover and XOR lemmas for query complexity. Our framework is versatile and yields results for related concept classes such as juntas and DNF formulas.

View on arXiv
Comments on this paper