Fugu-MT 論文翻訳(概要): Exploring the Design Space of Fair Tree Learning Algorithms

論文の概要: Exploring the Design Space of Fair Tree Learning Algorithms

arxiv url: http://arxiv.org/abs/2509.03204v1
Date: Wed, 03 Sep 2025 10:39:23 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-04 21:40:46.48806
Title: Exploring the Design Space of Fair Tree Learning Algorithms
Title（参考訳）: フェアツリー学習アルゴリズムの設計空間の探索
Authors: Kiara Stempel, Mattia Cerrato, Stefan Kramer,
Abstract要約: 決定木は、異なるグループに対する非差別を確保しながら、予測性能を最大化することを目的としている。本稿では、上記の2つの追加オプションを設計空間から導入し、複数のデータセット上で実験的に特徴付ける。
参考スコア（独自算出の注目度）: 4.147431995720785
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Decision trees have been studied extensively in the context of fairness, aiming to maximize prediction performance while ensuring non-discrimination against different groups. Techniques in this space usually focus on imposing constraints at training time, constraining the search space so that solutions which display unacceptable values of relevant metrics are not considered, discarded, or discouraged. If we assume one target variable y and one sensitive attribute s, the design space of tree learning algorithms can be spanned as follows: (i) One can have one tree T that is built using an objective function that is a function of y, s, and T. For instance, one can build a tree based on the weighted information gain regarding y (maximizing) and s (minimizing). (ii) The second option is to have one tree model T that uses an objective function in y and T and a constraint on s and T. Here, s is no longer part of the objective, but part of a constraint. This can be achieved greedily by aborting a further split as soon as the condition that optimizes the objective in y fails to satisfy the constraint on s. A simple way to explore other splits is to backtrack during tree construction once a fairness constraint is violated. (iii) The third option is to have two trees T_y and T_s, one for y and one for s, such that the tree structure for y and s does not have to be shared. In this way, information regarding y and regarding s can be used independently, without having to constrain the choices in tree construction by the mutual information between the two variables. Quite surprisingly, of the three options, only the first one and the greedy variant of the second have been studied in the literature so far. In this paper, we introduce the above two additional options from that design space and characterize them experimentally on multiple datasets.
Abstract（参考訳）: 決定木は、異なるグループに対する非差別性を確保しつつ、予測性能を最大化することを目的として、フェアネスの文脈で広範囲に研究されてきた。この領域のテクニックは通常、トレーニング時に制約を課すことに重点を置いており、検索空間を制約し、関連するメトリクスの許容できない値を示すソリューションが考慮され、破棄され、または無効になるようにしている。 1つの変数 y と 1つの属性 s を仮定すると、ツリー学習アルゴリズムの設計空間は以下のようになる。 i) y, s, T の関数である目的関数を用いて構築される1つのツリー T を持つことができる。例えば、y (最大化) および s (最小化) に関する重み付き情報ゲインに基づいてツリーを構築することができる。 (ii) 第二の選択肢は、y と T の客観的関数と s と T の制約を使用する1つのツリーモデル T を持つことである。これは、y の目的を最適化する条件が s 上の制約を満たすのに失敗すると、さらに分割を中止することで、厳密に達成できる。他の分割を探索する簡単な方法は、公正な制約が破られると、ツリー構築中にバックトラックすることです。 (iii)第3の選択肢は、y と s のツリー構造を共有する必要がないように、y のツリー T_y と T_s を2本、y と s のツリー構造を1本持つことである。このようにして、y に関する情報と s に関する情報は、2つの変数間の相互情報によってツリー構築における選択を制約することなく、独立に使用できる。 3つの選択肢のうち、第1の選択肢と第2のグレディ変種のみが、これまでの文献で研究されている。本稿では、上記の2つの追加オプションを設計空間から導入し、複数のデータセット上で実験的に特徴付ける。

論文の概要: Exploring the Design Space of Fair Tree Learning Algorithms

関連論文リスト