Fugu-MT 論文翻訳(概要): FAME: Failure-Aware Mixture-of-Experts for Message-Level Log Anomaly Detection

論文の概要: FAME: Failure-Aware Mixture-of-Experts for Message-Level Log Anomaly Detection

arxiv url: http://arxiv.org/abs/2605.22779v1
Date: Thu, 21 May 2026 17:34:53 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-22 16:35:42.382698
Title: FAME: Failure-Aware Mixture-of-Experts for Message-Level Log Anomaly Detection
Title（参考訳）: FAME:メッセージレベルログ異常検出のためのフェール・アウェア・ミックス・オブ・エクササイズ
Authors: Huanchi Wang, Zihang Huang, Yifang Tian, Kristina Dzeparoska, Hans-Arno Jacobsen, Alberto Leon-Garcia,
Abstract要約: FAME(Failure-Aware Mixture-of-Experts)は,ラベル効率の高いメッセージレベルのミックス・オブ・エキスパートフレームワークである。 FAMEは軽量ルータと、オンプレミスで実行し、異常予測を出力するドメインエキスパートを訓練する。 BGLでは、F1 = 98.16をK = 100で達成し、76倍のアノテーションの労力を削減し、目に見えないEventIDから86.3%の異常を検出する。
参考スコア（独自算出の注目度）: 6.975728318257755
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Production systems generate millions of log lines daily, yet most anomaly detectors operate at the session or window-level, flagging groups of lines rather than identifying the specific message responsible. This coarse granularity forces operators to inspect many routine lines per alert. Message-level detection offers finer granularity, but remains challenging. A single event template may correspond to both normal and anomalous messages, failures arise from heterogeneous subsystems, and line-level labeling at scale is impractical. Although large language models (LLMs) can reason over log semantics, applying them to every line is too costly for continuous monitoring. We present FAME (Failure-Aware Mixture-of-Experts), a label-efficient message-level mixture-of-experts framework that uses an LLM only once offline. We annotate at most K labeled lines per template to derive binary normal/anomaly indicators and representative examples. The LLM proposes a partition of templates into failure domains, and a certification step validates the proposal before training. FAME trains a lightweight router and domain experts that run on-premise and output anomaly predictions and failure-domain labels. On BGL, FAME achieves F1 = 98.16 at K = 100 reducing annotation effort by 76x and detects 86.3% of anomalies from unseen EventIDs. On Thunderbird, FAME reaches F1 = 99.95 with perfect recall.
Abstract（参考訳）: プロダクションシステムは毎日数百万のログ行を生成するが、ほとんどの異常検知器はセッションやウィンドウレベルで動作し、特定のメッセージを特定するのではなく、ラインのグループにフラグを付ける。この粗い粒度は、オペレーターに警告ごとに多くのルーチンを検査させる。メッセージレベルの検出は、より細かい粒度を提供するが、依然として困難である。単一のイベントテンプレートは、通常のメッセージと異常なメッセージの両方に対応し、不均一なサブシステムからエラーが発生し、スケールでのラインレベルのラベリングは実用的ではない。大規模言語モデル(LLM)はログのセマンティクスを解析できるが、連続的な監視にはコストがかかりすぎる。 FAME(Failure-Aware Mixture-of-Experts)はラベル効率のよいメッセージレベルのMix-of-Expertsフレームワークである。テンプレートあたりのほとんどのKラベル付き行にアノテートして、バイナリ正規/アノマリーインジケータと代表例を導出する。 LLMは、テンプレートを障害ドメインに分割することを提案し、認定ステップは、トレーニング前に提案を検証する。 FAMEは軽量ルータとドメインエキスパートをトレーニングし、オンプレミスで動作し、異常予測と障害ドメインラベルを出力する。 BGLでは、F1 = 98.16をK = 100で達成し、76倍のアノテーションの労力を削減し、目に見えないEventIDから86.3%の異常を検出する。 Thunderbirdでは、FAMEは完全リコールでF1 = 99.95に達する。

論文の概要: FAME: Failure-Aware Mixture-of-Experts for Message-Level Log Anomaly Detection

関連論文リスト