Fugu-MT 論文翻訳(概要): Foundational theory for optimal decision tree problems. II. Optimal hypersurface decision tree algorithm

論文の概要: Foundational theory for optimal decision tree problems. II. Optimal hypersurface decision tree algorithm

arxiv url: http://arxiv.org/abs/2509.12057v1
Date: Mon, 15 Sep 2025 15:38:44 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-16 17:26:23.375246
Title: Foundational theory for optimal decision tree problems. II. Optimal hypersurface decision tree algorithm
Title（参考訳）: 最適決定木問題の基本理論 II. 最適超曲面決定木アルゴリズム
Authors: Xi He,
Abstract要約: このシリーズのパート1では、4つの公理を通して適切な決定木モデルを厳格に定義した。第2部では,第1次超曲面決定木(HODT)アルゴリズムを導入する。
参考スコア（独自算出の注目度）: 1.972521190983547
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Decision trees are a ubiquitous model for classification and regression tasks due to their interpretability and efficiency. However, solving the optimal decision tree (ODT) problem remains a challenging combinatorial optimization task. Even for the simplest splitting rules--axis-parallel hyperplanes--it is NP-hard to optimize. In Part I of this series, we rigorously defined the proper decision tree model through four axioms and, based on these, introduced four formal definitions of the ODT problem. From these definitions, we derived four generic algorithms capable of solving ODT problems for arbitrary decision trees satisfying the axioms. We also analyzed the combinatorial geometric properties of hypersurfaces, showing that decision trees defined by polynomial hypersurface splitting rules satisfy the proper axioms that we proposed. In this second paper (Part II) of this two-part series, building on the algorithmic and geometric foundations established in Part I, we introduce the first hypersurface decision tree (HODT) algorithm. To the best of our knowledge, existing optimal decision tree methods are, to date, limited to hyperplane splitting rules--a special case of hypersurfaces--and rely on general-purpose solvers. In contrast, our HODT algorithm addresses the general hypersurface decision tree model without requiring external solvers. Using synthetic datasets generated from ground-truth hyperplane decision trees, we vary tree size, data size, dimensionality, and label and feature noise. Results showing that our algorithm recovers the ground truth more accurately than axis-parallel trees and exhibits greater robustness to noise. We also analyzed generalization performance across 30 real-world datasets, showing that HODT can achieve up to 30% higher accuracy than the state-of-the-art optimal axis-parallel decision tree algorithm when tree complexity is properly controlled.
Abstract（参考訳）: 決定木は、その解釈可能性と効率性のために分類および回帰タスクのユビキタスモデルである。しかし、最適決定木(ODT)問題を解くことは、組合せ最適化の課題である。最も単純な分割規則(軸平行超平面)であっても、最適化はNPハードである。本シリーズの第1部では, 4つの公理を用いて適切な決定木モデルを厳密に定義し, これらに基づき, ODT問題の4つの公式定義を導入した。これらの定義から、公理を満たす任意の決定木に対して、ODT問題を解くことができる4つの汎用アルゴリズムを導出した。また、超曲面の組合せ幾何学的性質を解析し、多項式超曲面分割規則で定義される決定木が、提案した適切な公理を満たすことを示した。第1部で確立されたアルゴリズム的および幾何学的基礎に基づいて構築されたこの2部シリーズの第2部(パートII)では,第1次超曲面決定木(HODT)アルゴリズムを紹介する。我々の知る限り、既存の最適決定木法は、これまでは超平面分割規則に限られており、超曲面の特別な場合であり、汎用的な解法に依存している。対照的に、HODTアルゴリズムは外部解法を必要とせず、一般的な超曲面決定木モデルに対処する。地表面の超平面決定木から生成された合成データセットを用いて,木の大きさ,データサイズ,寸法,ラベル,特徴雑音を変化させる。その結果,本アルゴリズムは軸平行木よりも地上の真理を精度良く復元し,騒音に対する頑健性を示すことがわかった。また,30の実世界のデータセットを対象とした一般化性能を解析し,木々の複雑さを適切に制御した場合,HODTは最先端の最適軸並列決定木アルゴリズムよりも最大30%高い精度で達成可能であることを示した。

論文の概要: Foundational theory for optimal decision tree problems. II. Optimal hypersurface decision tree algorithm

関連論文リスト