Fugu-MT 論文翻訳(概要): How Useful is Causal Invariance for Domain Adaptation in Finite-Sample Settings?

論文の概要: How Useful is Causal Invariance for Domain Adaptation in Finite-Sample Settings?

arxiv url: http://arxiv.org/abs/2606.12680v1
Date: Wed, 10 Jun 2026 21:07:49 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-12 15:55:27.461865
Title: How Useful is Causal Invariance for Domain Adaptation in Finite-Sample Settings?
Title（参考訳）: 有限サンプル設定における領域適応の因果不均一性はどの程度有用か?
Authors: Julia Kostin, Kasra Jalaldoust, Elias Bareinboim, Samory Kpotufe, Fanny Yang,
Abstract要約: 機械学習モデルは、トレーニングされたソースディストリビューションとは異なるターゲットディストリビューションにデプロイされると、しばしば劣化する。因果関係に基づく領域一般化における最近の研究は、共用因果構造が不変な予測因子を誘導する方法を示している。本稿では,完全あるいは部分的な因果知識が,教師付きドメイン適応を確実に改善できるかどうかについて検討する。
参考スコア（独自算出の注目度）: 58.740078141879984
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Machine learning models often degrade when they are deployed on a target distribution that differs from the source distributions they were trained on. Recent work in causality-based domain generalization has shown how shared causal structure between domains can induce invariant predictors, e.g., models on a subset of features which have stable risk across structured domain shifts. However, the extent to which such population-level causal invariances can lead to gains in finite-sample settings remains underexplored. In particular, in practice we often have access to a few labeled target samples, a setting called supervised domain adaptation (sDA). In this paper, we explore when (full or partial) causal knowledge can provably improve supervised domain adaptation. As a first step, we study linear regression, where full or partial causal knowledge specifies a collection of invariant or possibly invariant feature subsets, each yielding a source-trained candidate predictor. We derive matching upper and lower bounds showing that finite-sample gains are governed by the target-risk margins separating the candidates, together with the finite-source estimation error. When these margins are sufficiently large relative to $n_Q$, an adaptive aggregation procedure can match the best candidate predictor while avoiding negative transfer relative to target-only learning. On the other hand, when the margins are too small, no algorithm can reliably exploit the candidate collection to obtain faster finite-sample rates. We further connect these margins to structural shift magnitude in linear SCMs and validate the theory on real-world causal benchmarks.
Abstract（参考訳）: 機械学習モデルは、トレーニングされたソースディストリビューションとは異なるターゲットディストリビューションにデプロイされると、しばしば劣化する。因果関係に基づく領域一般化における最近の研究は、ドメイン間の共用因果構造が、構造された領域シフトの安定なリスクを持つ機能のサブセット上で、不変な予測子を誘導する方法を示している。しかし、そのような集団レベルの因果不均一が有限サンプル設定で得られる程度は、まだ未解明のままである。特に、実際には、監視ドメイン適応(sDA)と呼ばれるいくつかのラベル付きターゲットサンプルにアクセスすることがよくあります。本稿では,(完全あるいは部分的な)因果的知識が,教師付きドメイン適応を確実に向上させる方法について検討する。最初のステップとして、線形回帰について検討し、完全な因果知識または部分的因果知識が不変あるいはおそらく不変な特徴部分集合の集合を規定し、それぞれがソース学習された候補予測子を生成する。我々は、有限サンプルゲインが、候補を分離するターゲットリスクマージンと、有限ソース推定誤差によって支配されていることを示す上限と下限のマッチングを導出する。これらのマージンが$n_Q$に対して十分に大きい場合、アダプティブアグリゲーション手順は、ターゲットのみの学習に対して負の転送を回避しつつ、最適な候補予測子と一致させることができる。一方、マージンが小さすぎる場合、アルゴリズムはより高速な有限サンプルレートを得るために、候補コレクションを確実に活用することができない。さらに、これらのマージンを線形SCMにおける構造シフトマグニチュードに結合し、実世界の因果ベンチマークに関する理論を検証する。

論文の概要: How Useful is Causal Invariance for Domain Adaptation in Finite-Sample Settings?

関連論文リスト