Fugu-MT 論文翻訳(概要): SkinGPT-X: A Self-Evolving Collaborative Multi-Agent System for Transparent and Trustworthy Dermatological Diagnosis

論文の概要: SkinGPT-X: A Self-Evolving Collaborative Multi-Agent System for Transparent and Trustworthy Dermatological Diagnosis

arxiv url: http://arxiv.org/abs/2603.26122v1
Date: Fri, 27 Mar 2026 07:14:05 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-30 21:49:48.381013
Title: SkinGPT-X: A Self-Evolving Collaborative Multi-Agent System for Transparent and Trustworthy Dermatological Diagnosis
Title（参考訳）: SkinGPT-X : 透明で信頼性の高い皮膚科診断のための自己進化型多エージェントシステム
Authors: Zhangtianyi Chen, Yuhao Shen, Florensia Widjaja, Yan Xu, Liyuan Sun, Zijian Wang, Hongyi Chen, Wufei Dai, Juexiao Zhou,
Abstract要約: SkinGPT-Xは,自己進化型皮膚科記憶機構と統合された皮膚科診断のための協調的マルチエージェントシステムである。皮膚科医の診断ワークフローをシミュレートすることで、SkinGPT-Xは、複雑で稀な皮膚科の症例を管理するための透明で信頼できる診断を提供する。
参考スコア（独自算出の注目度）: 8.619027630628366
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: While recent advancements in Large Language Models have significantly advanced dermatological diagnosis, monolithic LLMs frequently struggle with fine-grained, large-scale multi-class diagnostic tasks and rare skin disease diagnosis owing to training data sparsity, while also lacking the interpretability and traceability essential for clinical reasoning. Although multi-agent systems can offer more transparent and explainable diagnostics, existing frameworks are primarily concentrated on Visual Question Answering and conversational tasks, and their heavy reliance on static knowledge bases restricts adaptability in complex real-world clinical settings. Here, we present SkinGPT-X, a multimodal collaborative multi-agent system for dermatological diagnosis integrated with a self-evolving dermatological memory mechanism. By simulating the diagnostic workflow of dermatologists and enabling continuous memory evolution, SkinGPT-X delivers transparent and trustworthy diagnostics for the management of complex and rare dermatological cases. To validate the robustness of SkinGPT-X, we design a three-tier comparative experiment. First, we benchmark SkinGPT-X against four state-of-the-art LLMs across four public datasets, demonstrating its state-of-the-art performance with a +9.6% accuracy improvement on DDI31 and +13% weighted F1 gain on Dermnet over the state-of-the-art model. Second, we construct a large-scale multi-class dataset covering 498 distinct dermatological categories to evaluate its fine-grained classification capabilities. Finally, we curate the rare skin disease dataset, the first benchmark to address the scarcity of clinical rare skin diseases which contains 564 clinical samples with eight rare dermatological diseases. On this dataset, SkinGPT-X achieves a +9.8% accuracy improvement, a +7.1% weighted F1 improvement, a +10% Cohen's Kappa improvement.
Abstract（参考訳）: 近年の大規模言語モデルの進歩は皮膚学的な診断が著しく進んでいるが、モノリシックLLMは、データ空間の訓練による細粒度で大規模なマルチクラス診断やまれな皮膚疾患の診断にしばしば苦労する一方で、臨床的推論に不可欠な解釈性とトレーサビリティが欠如している。マルチエージェントシステムはより透明で説明可能な診断を提供するが、既存のフレームワークは主にビジュアル質問回答と会話タスクに集中しており、静的知識ベースに依存しているため、複雑な実世界の臨床環境における適応性を制限する。 SkinGPT-Xは, 自己進化型皮膚科記憶機構と統合された皮膚診断のためのマルチモーダル協調型マルチエージェントシステムである。皮膚科医の診断ワークフローをシミュレートし、継続的な記憶の進化を可能にすることにより、SkinGPT-Xは、複雑で稀な皮膚科の症例を管理するための透明で信頼できる診断を提供する。 SkinGPT-Xのロバスト性を検証するため,3層比較実験を設計した。まず、SkinGPT-Xを4つの公開データセットにわたる4つの最先端LCMに対してベンチマークし、DDI31の精度が+9.6%向上し、Dermnetの重み付きF1ゲインが+13%向上していることを示す。第2に、498の異なる皮膚科領域をカバーする大規模マルチクラスデータセットを構築し、その粒度分類能力を評価する。最後に,8例の皮膚疾患を有する564例の臨床サンプルを含む臨床レア皮膚疾患の重症度に関する最初のベンチマークである,レア皮膚疾患データセットをキュレートする。このデータセットでは、SkinGPT-Xは+9.8%の精度向上、+7.1%の重み付きF1の改善、+10%のコーエンのKappaの改善を実現している。

論文の概要: SkinGPT-X: A Self-Evolving Collaborative Multi-Agent System for Transparent and Trustworthy Dermatological Diagnosis

関連論文リスト