Fugu-MT 論文翻訳(概要): flowengineR: A Modular and Extensible Framework for Fair and Reproducible Workflow Design in R

論文の概要: flowengineR: A Modular and Extensible Framework for Fair and Reproducible Workflow Design in R

arxiv url: http://arxiv.org/abs/2511.00079v1
Date: Wed, 29 Oct 2025 17:59:19 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-05 16:37:26.568821
Title: flowengineR: A Modular and Extensible Framework for Fair and Reproducible Workflow Design in R
Title（参考訳）: flowengineR: Rで公正かつ再現可能なワークフロー設計のためのモジュール的で拡張可能なフレームワーク
Authors: Maximilian Willer, Peter Ruckdeschel,
Abstract要約: flowengineRは、再現可能なアルゴリズムパイプラインを構築するためのモジュールとフレームワークを提供するように設計されたRパッケージである。 flowengineRは、データ分割、実行、前処理、トレーニング、インプロセッシング、後処理、評価、レポートのための標準化されたエンジンの統一アーキテクチャを導入することでこの問題に対処する。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: flowengineR is an R package designed to provide a modular and extensible framework for building reproducible algorithmic workflows for general-purpose machine learning pipelines. It is motivated by the rapidly evolving field of algorithmic fairness where new metrics, mitigation strategies, and machine learning methods continuously emerge. A central challenge in fairness, but also far beyond, is that existing toolkits either focus narrowly on single interventions or treat reproducibility and extensibility as secondary considerations rather than core design principles. flowengineR addresses this by introducing a unified architecture of standardized engines for data splitting, execution, preprocessing, training, inprocessing, postprocessing, evaluation, and reporting. Each engine encapsulates one methodological task yet communicates via a lightweight interface, ensuring workflows remain transparent, auditable, and easily extensible. Although implemented in R, flowengineR builds on ideas from workflow languages (CWL, YAWL), graph-oriented visual programming languages (KNIME), and R frameworks (BatchJobs, batchtools). Its emphasis, however, is less on orchestrating engines for resilient parallel execution but rather on the straightforward setup and management of distinct engines as data structures. This orthogonalization enables distributed responsibilities, independent development, and streamlined integration. In fairness context, by structuring fairness methods as interchangeable engines, flowengineR lets researchers integrate, compare, and evaluate interventions across the modeling pipeline. At the same time, the architecture generalizes to explainability, robustness, and compliance metrics without core modifications. While motivated by fairness, it ultimately provides a general infrastructure for any workflow context where reproducibility, transparency, and extensibility are essential.
Abstract（参考訳）: flowengineRは、汎用機械学習パイプラインのための再現可能なアルゴリズムワークフローを構築するためのモジュラーで拡張可能なフレームワークを提供するように設計されたRパッケージである。新しいメトリクス、緩和戦略、機械学習メソッドが継続的に出現するアルゴリズムフェアネスの急速に発展する分野に動機づけられている。公正性における中心的な課題は、既存のツールキットが単一の介入に狭くフォーカスするか、再現性と拡張性をコア設計原則よりも二次的考慮として扱うことである。 flowengineRは、データ分割、実行、前処理、トレーニング、インプロセッシング、後処理、評価、レポートのための標準化されたエンジンの統一アーキテクチャを導入することでこの問題に対処する。各エンジンは1つの方法論的タスクをカプセル化しますが、軽量インターフェースを介して通信し、ワークフローが透明で監査可能で容易に拡張可能であることを保証します。 Rで実装されているが、flowengineRはワークフロー言語(CWL、YAWL)、グラフ指向ビジュアルプログラミング言語(KNIME)、Rフレームワーク(バッチジョブ、バッチツール)のアイデアに基づいている。しかし、その重点は、弾力性のある並列実行のためのエンジンのオーケストレーションではなく、データ構造として異なるエンジンのセットアップと管理に向けられている。この直交化は、分散責任、独立した開発、合理化された統合を可能にする。フェアネスの文脈では、フェアネスメソッドを交換可能なエンジンとして構成することで、フローエンジンRは研究者がモデリングパイプライン全体にわたる介入を統合し、比較し、評価することができる。同時に、アーキテクチャは、コア修正なしで説明可能性、堅牢性、コンプライアンスのメトリクスを一般化します。公正さを動機とするが、最終的には再現性、透明性、拡張性が不可欠であるワークフローコンテキストに対して、一般的なインフラストラクチャを提供する。

論文の概要: flowengineR: A Modular and Extensible Framework for Fair and Reproducible Workflow Design in R

関連論文リスト