Fugu-MT 論文翻訳(概要): CART Random Forests as Sequential Allocation over Random Opportunity Sets: A Stochastic-Control Theory of Ensemble Risk

論文の概要: CART Random Forests as Sequential Allocation over Random Opportunity Sets: A Stochastic-Control Theory of Ensemble Risk

arxiv url: http://arxiv.org/abs/2605.26675v1
Date: Tue, 26 May 2026 08:13:22 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-27 17:51:41.743454
Title: CART Random Forests as Sequential Allocation over Random Opportunity Sets: A Stochastic-Control Theory of Ensemble Risk
Title（参考訳）: ランダム・オポチュニティ・セットの逐次配置としてのCARTランダム・フォレスト--アンサンブル・リスクの確率-制御理論-
Authors: Tianxing Mei, Yingying Fan, Mingming Leng, Jinchi Lv,
Abstract要約: 我々は、CARTランダムチャンスセットアロケーション(CART-ROSA)と呼ばれるCARTランダム森林の力学的視点を開発する。しかし、システムレベルでは、森林目標に対して世界規模で最適である可能性がある。この結果から,操作-検索の視点がトラクタブルを理論的ギャップとして利用し難いことを示す。
参考スコア（独自算出の注目度）: 3.258768210783843
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: CART random forests are among the most widely used modern predictive methods, with well-documented empirical success. Yet, at the mechanistic level, the algorithm is often treated as a black box because of its complexity. In this paper, we develop a stochastic-control perspective on feature-subsampled CART random forests, named CART random opportunity-set allocation (CART-ROSA). At each node, the random subset of features is interpreted as a random feasible action set, and the CART split rule as a masked-action allocation policy. This policy induces a controlled stochastic process over informative split-count states, whose terminal law determines both single-tree error and cross-tree interaction terms in the forest mean squared error (MSE). Such representation opens the black box of CART-forests by separating two design levers: the informative-opportunity rate induced by feature subsampling, and the contraction strength from the within-mask split policy. We establish that the CART policy is locally stabilizing: it contracts imbalances in informative split allocations and concentrates terminal tree geometry. At the system level, however, it can be globally suboptimal for the forest objective. Specializing to the linear model, we derive the MSE risk expansion explicitly. Our results show how an operations-research perspective makes tractable a theoretical gap difficult to access from the standard algorithmic description of CART forests.
Abstract（参考訳）: CARTランダム森林は、よく文書化された実証的な成功と共に、最も広く使われている近代的な予測手法の1つである。しかし、力学レベルでは、アルゴリズムはその複雑さのため、しばしばブラックボックスとして扱われる。本稿では,CART-ROSA (CART-ROSA) と呼ばれる,特徴サブサンプリングされたCARTランダム林の確率制御的視点を開発する。各ノードにおいて、特徴のランダムなサブセットはランダムな実行可能なアクションセットとして解釈され、CARTスプリットルールはマスクされたアクション割り当てポリシーとして解釈される。このポリシーは、森林平均二乗誤差(MSE)における単木誤差とクロスツリー相互作用項を終端法で決定する情報分割状態に対する制御確率過程を誘導する。このような表現は、特徴サブサンプリングによって引き起こされる情報・機会率と、マスク内分割ポリシーからの収縮強度という、2つのデザインレバーを分離することで、CARTフォレストのブラックボックスを開く。我々はCARTポリシーが,情報分割割当の不均衡と終端木形状の集中を両立させ,局所安定化を図っていることを証明した。しかし、システムレベルでは、森林目標に対して世界規模で最適である可能性がある。線形モデルに特化して、MSEのリスク拡張を明示的に導き出す。本研究は,CART林の標準的なアルゴリズム記述から,トラクタブルをアクセスし難い理論的ギャップを,オペレーション・リサーチの観点から示すものである。

論文の概要: CART Random Forests as Sequential Allocation over Random Opportunity Sets: A Stochastic-Control Theory of Ensemble Risk

関連論文リスト