Fugu-MT 論文翻訳(概要): Holmes: Multimodal Agentic Diagnosis for Mixed-Language Mobile Crashes at Industrial Scale

論文の概要: Holmes: Multimodal Agentic Diagnosis for Mixed-Language Mobile Crashes at Industrial Scale

arxiv url: http://arxiv.org/abs/2606.21963v1
Date: Sat, 20 Jun 2026 09:31:26 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-25 23:30:16.266603
Title: Holmes: Multimodal Agentic Diagnosis for Mixed-Language Mobile Crashes at Industrial Scale
Title（参考訳）: ホームズ:複合言語移動クラッシュの産業規模におけるマルチモーダルエージェント診断
Authors: Jia Li, Wenyuan Ma, Ting Peng, Haibin Zheng, Yuetang Deng,
Abstract要約: 本稿では,実行時信号(スタックトレース,ログ,スレッド状態)を合成して根本原因分析を自動化するマルチエージェントシステムであるHolmesについて述べる。ホームズは関数レベルの断層定位において87.6%の精度を達成し、平均調査時間を98%以上短縮する。
参考スコア（独自算出の注目度）: 10.627336348624226
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Diagnosing mobile crashes in ultra-large-scale industrial applications is a formidable challenge due to the sheer volume of code, the complexity of mixed-language environments, and the inability to reproduce failures locally. Traditional static analysis struggles with scalability, while existing LLM-based agents often rely on reproducible environments unavailable in post-mortem scenarios. We present Holmes, a multi-agent system that automates root cause analysis by synthesizing multimodal runtime signals--stack traces, logs, and thread states--to reconstruct failure contexts without reproduction. Holmes introduces a hierarchical Retrieve-Explore-Reason architecture that leverages low-level artifacts (e.g., registers, assembly) to bridge the semantic gap between open-source business logic and closed-source system frameworks. By dynamically compressing the search space using runtime clues, Holmes precisely navigates 70-million-line codebases to identify non-local defects. Evaluated on real-world crashes from WeChat, Holmes achieves 87.6% accuracy in function-level fault localization and reduces average investigation time by over 98% (to ~77 seconds), demonstrating its effectiveness in transforming labor-intensive debugging into an efficient verification workflow.
Abstract（参考訳）: 超大規模産業アプリケーションにおけるモバイルクラッシュの診断は、コード量の多さ、混合言語環境の複雑さ、ローカルで障害を再現できないことなど、非常に難しい課題である。従来の静的解析はスケーラビリティに苦慮するが、既存のLCMベースのエージェントは反省会後のシナリオでは利用できない再現可能な環境に依存していることが多い。マルチモーダルランタイム信号 - スタックトレース,ログ,スレッド状態 - を合成して根本原因分析を自動化するマルチエージェントシステムであるHolmesを提案する。 Holmes氏は階層的なRetrieve-Explore-Reasonアーキテクチャを導入し、低レベルのアーティファクト(レジスタ、アセンブリなど)を活用して、オープンソースビジネスロジックとクローズドソースシステムフレームワーク間のセマンティックギャップを埋める。ランタイムのヒントを使って検索スペースを動的に圧縮することで、Holmesは7000万行のコードベースを正確にナビゲートし、非ローカルな欠陥を特定する。 WeChatの実際のクラッシュを評価したところ、ホームズは関数レベルの障害ローカライゼーションにおいて87.6%の精度を達成し、平均調査時間を98%以上(約77秒)削減し、労働集約デバッグを効率的な検証ワークフローに変換する効果を実証した。

論文の概要: Holmes: Multimodal Agentic Diagnosis for Mixed-Language Mobile Crashes at Industrial Scale

関連論文リスト