Fugu-MT paper translation (abstract): FoundCause: Causal Discovery with Latent Confounders from Observational Data

Paper overview: FoundCause: Causal Discovery with Latent Confounders from Observational Data

arxiv url: http://arxiv.org/abs/2606.17516v1
Date: Tue, 16 Jun 2026 04:50:01 GMT
Status: Translation complete
System update date: 2026-06-17 17:15:32.274142
Title: FoundCause: Causal Discovery with Latent Confounders from Observational Data
Title (reference translation): FoundCause: Causal Discovery with Latent Confounders from Observational Data
Authors: Patrick Blöbaum, Krishnakumar Balasubramanian, Shiva Prasad Kasiviswanathan
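Abstract summary: FoundCause is an amortized causal discovery model trained entirely on synthetic data. It captures transferable statistical patterns that generalize beyond individual datasets. FoundCause outperforms 11 classical non-amortized methods on 15 real-world datasets.
Reference score (independently computed attention score): 16.98165241701198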
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Causal discovery from observational data remains challenging due to the need to recover directed structure and latent confounding without interventions. We propose FoundCause, an amortized causal discovery model trained entirely on synthetic data that maps datasets directly to causal graphs in a single forward pass. By learning from large collections of simulated structural causal models, FoundCause captures transferable statistical patterns that generalize beyond individual datasets. The architecture incorporates several key inductive biases for causal discovery. It uses a permutation-invariant transformer encoder with alternating attention over samples and variables to jointly model cross-variable dependence and per-variable distributions. Pairwise statistical features derived from classical asymmetry measures are injected through statistics-conditioned attention, guiding the model toward known causal signals. A factorized decoder separates edge existence from direction, while a triangular refinement module enables reasoning over higher-order causal motifs such as chains and colliders. In addition, a dedicated confounder module based on learnable latent tokens explicitly models hidden common causes, and the model explicitly handles missing data via its masked input representation. To our knowledge, FoundCause is the first amortized causal discovery approach to explicitly model latent confounding. FoundCause outperforms 11 classical non-amortized methods (e.g., PC, GES, NOTEARS-style optimization) and 4 amortized causal discovery methods on 15 real-world datasets, achieving +9.6% improvement in $F_1$, +1.2% in AUROC, and an 18.9% reduction in structural Hamming distance relative to the strongest non-amortized methods, while performing inference in a single forward pass.
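The abstract describes a factorized decoder that separates edge existence from edge direction. A minimal sketch of that idea follows, assuming the decoder emits two matrices of pairwise logits. The function name `factorized_edge_probs`, the symmetrization of the existence logits, and the antisymmetrization of the direction logits are illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def sigmoid(x):
    """Elementwise logistic function."""
    return 1.0 / (1.0 + np.exp(-x))


def factorized_edge_probs(exist_logits, dir_logits):
    """Combine existence and direction logits into directed edge probabilities.

    Hypothetical helper: returns P[i, j] = P(edge between i and j) * P(i -> j | edge).
    """
    exist_logits = np.asarray(exist_logits, dtype=float)
    dir_logits = np.asarray(dir_logits, dtype=float)

    # Existence is a property of the unordered pair {i, j}, so symmetrize it.
    exist = sigmoid(0.5 * (exist_logits + exist_logits.T))

    # Direction is antisymmetric: sigmoid(a) + sigmoid(-a) = 1, so the two
    # orientations of an edge share the existence probability between them.
    direction = sigmoid(0.5 * (dir_logits - dir_logits.T))

    probs = exist * direction
    np.fill_diagonal(probs, 0.0)  # no self-loops
    return probs


# Usage with random logits for 4 variables (assumed input, for illustration only).
rng = np.random.default_rng(0)
example_probs = factorized_edge_probs(rng.normal(size=(4, 4)), rng.normal(size=(4, 4)))
```

Under this factorization, `probs[i, j] + probs[j, i]` equals the existence probability of the pair, so uncertainty about orientation never inflates the probability that an edge exists at all.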
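Abstract (reference translation): Causal discovery from observational data remains challenging because directed structure and latent confounding must be recovered without interventions. We propose FoundCause, an amortized causal discovery model trained on synthetic data that maps datasets directly to causal graphs in a single forward pass. By learning from large collections of simulated structural causal models, FoundCause captures transferable statistical patterns that generalize beyond individual datasets. The architecture incorporates several key inductive biases for causal discovery. It uses a permutation-invariant transformer encoder with alternating attention over samples and variables to jointly model cross-variable dependence and per-variable distributions. Pairwise statistical features derived from classical asymmetry measures are injected through statistics-conditioned attention, guiding the model toward known causal signals. A factorized decoder separates edge existence from direction, and a triangular refinement module enables reasoning over higher-order causal motifs such as chains and colliders. In addition, a dedicated confounder module based on learnable latent tokens explicitly models hidden common causes, and the model explicitly handles missing data through its masked input representation. To our knowledge, FoundCause is the first amortized causal discovery approach to explicitly model latent confounding. FoundCause outperforms 11 classical non-amortized methods (e.g., PC, GES, NOTEARS-style optimization) and 4 amortized causal discovery methods on 15 real-world datasets, achieving a +9.6% improvement in $F_1$, +1.2% in AUROC, and an 18.9% reduction in structural Hamming distance relative to the strongest non-amortized methods.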
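The reported gains are stated in $F_1$ and structural Hamming distance (SHD). As a rough illustration of how these two metrics are commonly computed on directed adjacency matrices, here is a minimal sketch. The abstract does not specify the exact evaluation conventions, so the choices here are assumptions: `adj[i, j] = 1` means an edge from variable i to variable j, $F_1$ is computed over directed edges, and a reversed edge counts as a single SHD error.

```python
import numpy as np


def f1_directed(true_adj, pred_adj):
    """F1 over directed edges; an edge counts as correct only if its direction matches."""
    t = np.asarray(true_adj, dtype=bool)
    p = np.asarray(pred_adj, dtype=bool)
    tp = np.sum(t & p)
    fp = np.sum(~t & p)
    fn = np.sum(t & ~p)
    denom = 2 * tp + fp + fn
    return 0.0 if denom == 0 else float(2 * tp / denom)


def shd(true_adj, pred_adj):
    """Structural Hamming distance: number of variable pairs whose edge status differs.

    Missing, extra, and reversed edges each count once (an assumed convention).
    """
    t = np.asarray(true_adj, dtype=bool)
    p = np.asarray(pred_adj, dtype=bool)
    diff = t != p
    # Merge both orientations so each unordered pair is counted at most once.
    pair_diff = diff | diff.T
    return int(np.sum(np.triu(pair_diff, k=1)))


# True graph: chain 0 -> 1 -> 2.
true_graph = np.array([[0, 1, 0],
                       [0, 0, 1],
                       [0, 0, 0]])
# Prediction: keeps 0 -> 1, reverses 1 -> 2, and adds a spurious 0 -> 2.
pred_graph = np.array([[0, 1, 1],
                       [0, 0, 0],
                       [0, 1, 0]])

print(f1_directed(true_graph, pred_graph))  # 0.4
print(shd(true_graph, pred_graph))          # 2
```

In this example one of three predicted edges is correct and one of two true edges is recovered, giving $F_1 = 0.4$; the reversed edge and the spurious edge each contribute one SHD error.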

Related papers