Fugu-MT Paper Translation (Abstract): FairLogue: A Toolkit for Intersectional Fairness Analysis in Clinical Machine Learning Models

Paper summary: FairLogue: A Toolkit for Intersectional Fairness Analysis in Clinical Machine Learning Models

arxiv url: http://arxiv.org/abs/2604.04858v1
Date: Mon, 06 Apr 2026 17:03:03 GMT
Status: Translation complete
Last updated in system: 2026-04-07 15:49:19.298921
Title: FairLogue: A Toolkit for Intersectional Fairness Analysis in Clinical Machine Learning Models
Authors: Nick Souligne, Vignesh Subbian
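Abstract summary: Algorithmic fairness is essential for equitable and trustworthy machine learning in healthcare. This study introduces Fairlogue, a toolkit for operationalizing intersectional fairness assessment in observational and counterfactual contexts.
Reference score (attention score computed independently by this site): 0.5951287048890108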
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Objective: Algorithmic fairness is essential for equitable and trustworthy machine learning in healthcare. Most fairness tools emphasize single-axis demographic comparisons and may miss compounded disparities affecting intersectional populations. This study introduces Fairlogue, a toolkit designed to operationalize intersectional fairness assessment in observational and counterfactual contexts within clinical settings. Methods: Fairlogue is a Python-based toolkit composed of three components: 1) an observational framework extending demographic parity, equalized odds, and equal opportunity difference to intersectional populations; 2) a counterfactual framework evaluating fairness under treatment-based contexts; and 3) a generalized counterfactual framework assessing fairness under interventions on intersectional group membership. The toolkit was evaluated using electronic health record data from the All of Us Controlled Tier V8 dataset in a glaucoma surgery prediction task using logistic regression with race and gender as protected attributes. Results: Observational analysis identified substantial intersectional disparities despite moderate model performance (AUROC = 0.709; accuracy = 0.651). Intersectional evaluation revealed larger fairness gaps than single-axis analyses, including demographic parity differences of 0.20 and equalized odds true positive and false positive rate gaps of 0.33 and 0.15, respectively. Counterfactual analysis using permutation-based null distributions produced unfairness ("u-value") estimates near zero, suggesting observed disparities were consistent with chance after conditioning on covariates. Conclusion: Fairlogue provides a modular toolkit integrating observational and counterfactual methods for quantifying and evaluating intersectional bias in clinical machine learning workflows.
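The observational component described in the abstract extends demographic parity and equalized odds to intersectional groups. The following is a minimal sketch of that idea in plain Python, not Fairlogue's actual API: the function names `group_rates` and `gap`, the toy labels, and the choice of "max minus min across groups" as the gap definition are all assumptions made for illustration. It computes per-group selection rate, true positive rate, and false positive rate for every combination of the protected attributes, then compares the intersectional gap with a single-axis gap.

```python
from collections import defaultdict


def group_rates(y_true, y_pred, *attrs):
    """Selection rate, TPR, and FPR for each combination of the given attributes.

    Hypothetical helper for illustration; not part of any published toolkit.
    """
    counts = defaultdict(
        lambda: {"n": 0, "sel": 0, "pos": 0, "tp": 0, "neg": 0, "fp": 0}
    )
    for yt, yp, *group in zip(y_true, y_pred, *attrs):
        c = counts[tuple(group)]
        c["n"] += 1
        c["sel"] += yp          # predicted positives
        if yt == 1:
            c["pos"] += 1
            c["tp"] += yp       # true positives
        else:
            c["neg"] += 1
            c["fp"] += yp       # false positives
    rates = {}
    for group, c in counts.items():
        rates[group] = {
            "selection_rate": c["sel"] / c["n"],
            # Rates are undefined when a group has no positives/negatives.
            "tpr": c["tp"] / c["pos"] if c["pos"] else None,
            "fpr": c["fp"] / c["neg"] if c["neg"] else None,
        }
    return rates


def gap(rates, metric):
    """Largest minus smallest value of a metric across groups (assumed gap definition)."""
    values = [r[metric] for r in rates.values() if r[metric] is not None]
    return max(values) - min(values)


# Toy data (fabricated for illustration only; unrelated to the All of Us results).
race = ["A", "A", "A", "A", "B", "B", "B", "B"]
gender = ["F", "F", "M", "M", "F", "F", "M", "M"]
y_true = [1, 0, 1, 0, 1, 0, 1, 0]
y_pred = [1, 0, 1, 1, 0, 0, 1, 0]

inter = group_rates(y_true, y_pred, race, gender)   # race x gender groups
by_race = group_rates(y_true, y_pred, race)         # single-axis groups

dp_inter = gap(inter, "selection_rate")   # demographic parity difference
dp_race = gap(by_race, "selection_rate")
tpr_gap = gap(inter, "tpr")               # equalized odds, TPR component
fpr_gap = gap(inter, "fpr")               # equalized odds, FPR component
```

On this toy data the intersectional demographic parity difference (`dp_inter`) is larger than the single-axis one (`dp_race`), which mirrors the qualitative pattern the abstract reports. In practice, small intersectional groups make these rates noisy, so a real analysis would likely add confidence intervals or minimum group-size checks.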

Related papers

Understanding challenges to the interpretation of disaggregated evaluations of algorithmic fairness [49.35494016290887]
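We show that equal performance across subgroups is an unreliable measure of fairness when the data are representative of the relevant populations but reflect real-world disparities. Our framework proposes complementing disaggregated evaluations with explicit causal assumptions and analysis to control for confounding and distribution shift.
Paper reference translation (metadata) (2025-06-04T17:40:31Z)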
Evaluating Fair Feature Selection in Machine Learning for Healthcare [0.9222623206734782]
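We explore algorithmic fairness from the perspective of feature selection. We evaluate a fair feature selection method that gives equal importance to all demographic groups. We tested our approach on three publicly available healthcare datasets.
Paper reference translation (metadata) (2024-03-28T06:24:04Z)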
Looking Beyond What You See: An Empirical Analysis on Subgroup Intersectional Fairness for Multi-label Chest X-ray Classification Using Social Determinants of Racial Health Inequities [4.351859373879489]
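Inherited biases in deep learning models can lead to disparities in prediction accuracy across protected groups. This paper proposes a framework for achieving accurate diagnostic outcomes while ensuring fairness across intersectional groups.
Paper reference translation (metadata) (2024-03-27T02:13:20Z)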
A structured regression approach for evaluating model performance across intersectional subgroups [53.91682617836498]
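Disaggregated evaluation is a central task in AI fairness assessment; it aims to measure an AI system's performance across different subgroups. We show that reliable system performance estimates can be obtained even for very small subgroups.
Paper reference translation (metadata) (2024-01-26T14:21:45Z)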
Auditing ICU Readmission Rates in an Clinical Database: An Analysis of Risk Factors and Clinical Outcomes [0.0]
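This study proposes a machine learning pipeline for clinical data classification in the 30-day readmission problem. A fairness audit reveals disparities in the equal opportunity, predictive parity, false positive rate parity, and false negative rate parity criteria. The study suggests the need for collaborative efforts among researchers, policymakers, and practitioners to address bias and fairness in artificial intelligence (AI) systems.
Paper reference translation (metadata) (2023-04-12T17:09:38Z)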
Fair Machine Learning in Healthcare: A Review [90.22219142430146]
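We analyze the intersection of fairness in machine learning and healthcare disparities. We critically review relevant fairness metrics from a machine learning perspective. We propose new research directions that hold promise for developing ethical and equitable ML applications in healthcare.
Paper reference translation (metadata) (2022-06-29T04:32:10Z)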
Measuring Fairness of Text Classifiers via Prediction Sensitivity [63.56554964580627]
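Accumulated prediction sensitivity measures the fairness of a machine learning model based on the sensitivity of its predictions to perturbations of the input features. We show that this metric can be theoretically linked to a specific notion of group fairness (statistical parity) and to individual fairness.
Paper reference translation (metadata) (2022-03-16T15:00:33Z)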
Estimating and Improving Fairness with Adversarial Learning [65.99330614802388]
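We propose a multi-task training strategy aimed at simultaneously mitigating and detecting bias in deep learning-based medical image analysis systems. Specifically, we propose adding a discrimination module against bias and a critical module that predicts unfairness in the base classification model. We evaluate our framework on a large, publicly available skin lesion dataset.
Paper reference translation (metadata) (2021-03-07T03:10:32Z)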
An Empirical Characterization of Fair Machine Learning For Clinical Risk Prediction [7.945729033499554]
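Using machine learning to guide clinical decision-making can exacerbate existing health disparities. Several recent studies frame this problem as one of algorithmic fairness. We empirically evaluate the effect of penalizing group fairness violations on model performance and on a range of group fairness measures.
Paper reference translation (metadata) (2020-07-20T17:46:31Z)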

The related papers list is generated automatically from the titles and abstracts of papers on this site.

The operator of this site does not guarantee the quality of this site (including all information and translations) and accepts no responsibility for any results arising from its use.