Fugu-MT 論文翻訳(概要): RationAnomaly: Log Anomaly Detection with Rationality via Chain-of-Thought and Reinforcement Learning

論文の概要: RationAnomaly: Log Anomaly Detection with Rationality via Chain-of-Thought and Reinforcement Learning

arxiv url: http://arxiv.org/abs/2509.14693v2
Date: Mon, 22 Sep 2025 02:54:39 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-23 14:36:45.644005
Title: RationAnomaly: Log Anomaly Detection with Rationality via Chain-of-Thought and Reinforcement Learning
Title（参考訳）: Ration Anomaly:Chain-of-ThoughtとReinforcement Learningによるログ異常検出
Authors: Song Xu, Yilun Liu, Minggui He, Mingchen Dai, Ziang Chen, Chunguang Zhao, Jingzhou Du, Shimin Tao, Weibin Meng, Shenglin Zhang, Yongqian Sun, Boxing Chen, Daimeng Wei,
Abstract要約: RationAnomalyは、Chain-of-Thoughtファインチューニングと強化学習を相乗化することにより、ログの異常検出を強化する新しいフレームワークである。コードとデータセットを含む、対応するリソースをリリースしました。
参考スコア（独自算出の注目度）: 27.235259453535537
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Logs constitute a form of evidence signaling the operational status of software systems. Automated log anomaly detection is crucial for ensuring the reliability of modern software systems. However, existing approaches face significant limitations: traditional deep learning models lack interpretability and generalization, while methods leveraging Large Language Models are often hindered by unreliability and factual inaccuracies. To address these issues, we propose RationAnomaly, a novel framework that enhances log anomaly detection by synergizing Chain-of-Thought (CoT) fine-tuning with reinforcement learning. Our approach first instills expert-like reasoning patterns using CoT-guided supervised fine-tuning, grounded in a high-quality dataset corrected through a rigorous expert-driven process. Subsequently, a reinforcement learning phase with a multi-faceted reward function optimizes for accuracy and logical consistency, effectively mitigating hallucinations. Experimentally, RationAnomaly outperforms state-of-the-art baselines, achieving superior F1-scores on key benchmarks while providing transparent, step-by-step analytical outputs. We have released the corresponding resources, including code and datasets.
Abstract（参考訳）: ログは、ソフトウェアシステムの運用状態を示す証拠の一形態である。ログの自動異常検出は、現代のソフトウェアシステムの信頼性を確保するために不可欠である。しかし、従来のディープラーニングモデルは解釈可能性や一般化を欠いているのに対し、大規模言語モデルを利用した手法は信頼性の低い事実的不正確さを妨げていることが多い。これらの問題に対処するために、強化学習と協調してCoT(Chain-of-Thought)ファインチューニングを行うことによりログ異常検出を強化する新しいフレームワークであるRationAnomalyを提案する。当社のアプローチはまず,厳格な専門家駆動プロセスによって修正された高品質なデータセットに基づいて,CoT誘導による微調整によるエキスパートライクな推論パターンを具体化する。その後、多面的な報酬関数を持つ強化学習フェーズは、精度と論理的一貫性を最適化し、幻覚を効果的に緩和する。実験的に、RationAnomalyは最先端のベースラインより優れ、キーベンチマーク上で優れたF1スコアを達成し、透過的でステップバイステップの分析出力を提供する。コードとデータセットを含む、対応するリソースをリリースしました。

論文の概要: RationAnomaly: Log Anomaly Detection with Rationality via Chain-of-Thought and Reinforcement Learning

関連論文リスト