Fugu-MT Paper Translation (Abstract): DermAgent: A Self-Reflective Agentic System for Dermatological Image Analysis with Multi-Tool Reasoning and Traceable Decision-Making

Paper overview: DermAgent: A Self-Reflective Agentic System for Dermatological Image Analysis with Multi-Tool Reasoning and Traceable Decision-Making

arxiv url: http://arxiv.org/abs/2605.14403v1
Date: Thu, 14 May 2026 05:41:11 GMT
Status: Translation complete
Last updated in system: 2026-05-15 21:45:34.643933
Title: DermAgent: A Self-Reflective Agentic System for Dermatological Image Analysis with Multi-Tool Reasoning and Traceable Decision-Making
Title (reference translation): DermAgent: A Self-Reflective Agentic System for Dermatological Image Analysis with Multi-Tool Reasoning and Traceable Decision-Making
Authors: Yize Liu, Siyuan Yan, Ming Hu, Lie Ju, Xieji Li, Feilong Tang, Wei Feng, Zongyuan Ge
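Abstract summary: DermAgent orchestrates seven specialized vision and language modules within a Plan-Execute-Reflect framework. It uses complementary visual perception tools for comprehensive morphological description, dermoscopic concept annotation, and disease diagnosis. It consistently outperforms state-of-the-art MLLMs and medical agent baselines on zero-shot fine-grained disease diagnosis, concept annotation, and clinical captioning tasks.
Reference score (attention score computed independently by this site): 29.77553689478667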
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Dermatological diagnosis requires integrating fine-grained visual perception with expert clinical knowledge. Although Multimodal Large Language Models (MLLMs) facilitate interactive medical image analysis, their application in dermatology is hindered by insufficient domain-specific grounding and hallucinations. To address these issues, we propose DermAgent, a collaborative multi-tool agent that orchestrates seven specialized vision and language modules within a Plan-Execute-Reflect framework. DermAgent delivers stepwise, traceable diagnostic reasoning through three core components. First, it employs complementary visual perception tools for comprehensive morphological description, dermoscopic concept annotation, and disease diagnosis. Second, to overcome the lack of domain prior, a dual-modality retrieval module anchors every prediction in external evidence by cross-referencing 413,210 diagnosed image cases and 3,199 clinical guideline chunks. To further mitigate hallucinations, a deterministic critic module conducts strict post-hoc auditing via confidence, coverage, and conflict gates, automatically detecting inter-source disagreements to trigger targeted self-correction. Extensive experiments on five dermatology benchmarks demonstrate that DermAgent consistently outperforms state-of-the-art MLLMs and medical agent baselines across zero-shot fine-grained disease diagnosis, concept annotation, and clinical captioning tasks, exceeding GPT-4o by 17.6% in skin disease diagnostic accuracy and 3.15% in captioning ROUGE-L. Our code is available at https://github.com/YizeezLiu/DermAgent.
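Abstract (reference translation): Dermatological diagnosis requires integrating fine-grained visual perception with expert clinical knowledge. Multimodal Large Language Models (MLLMs) facilitate interactive medical image analysis, but their application to dermatology is hindered by insufficient domain-specific grounding and by hallucinations. To address these issues, we propose DermAgent, a collaborative multi-tool agent that orchestrates seven specialized vision and language modules within a Plan-Execute-Reflect framework. DermAgent provides stepwise, traceable diagnostic reasoning through three core components. First, it uses complementary visual perception tools for comprehensive morphological description, dermoscopic concept annotation, and disease diagnosis. Second, to overcome the lack of domain priors, a dual-modality retrieval module anchors every prediction in external evidence by cross-referencing 413,210 diagnosed image cases and 3,199 clinical guideline chunks. Third, to further mitigate hallucinations, a deterministic critic module performs strict post-hoc auditing through confidence, coverage, and conflict gates, automatically detecting inter-source disagreements and triggering targeted self-correction. Across five dermatology benchmarks, DermAgent consistently outperforms state-of-the-art MLLMs and medical agent baselines on zero-shot fine-grained disease diagnosis, concept annotation, and clinical captioning tasks, exceeding GPT-4o by 17.6% in skin disease diagnostic accuracy and 3.15% in captioning ROUGE-L. The code is available at https://github.com/YizeezLiu/DermAgent.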
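The abstract describes a deterministic critic that audits tool outputs through confidence, coverage, and conflict gates. The following is a minimal sketch of what such gating could look like, not the authors' implementation: the `Evidence` and `Verdict` types, the `critic` function, the source names, and the 0.5 threshold are all hypothetical, and the self-correction step that a failed gate would trigger is left out.

```python
from dataclasses import dataclass, field


@dataclass
class Evidence:
    """One tool's output. Field names are illustrative, not from the paper."""
    source: str        # hypothetical source name, e.g. "classifier"
    diagnosis: str     # label proposed by this source
    confidence: float  # score assumed to lie in [0, 1]


@dataclass
class Verdict:
    passed: bool
    failed_gates: list = field(default_factory=list)


def critic(evidence, required_sources, min_confidence=0.5):
    """Rule-based audit: the same input always yields the same verdict."""
    failed = []

    # Confidence gate (assumed rule): every source must clear the threshold.
    if any(e.confidence < min_confidence for e in evidence):
        failed.append("confidence")

    # Coverage gate (assumed rule): every required source must have reported.
    seen = {e.source for e in evidence}
    if not set(required_sources) <= seen:
        failed.append("coverage")

    # Conflict gate (assumed rule): all sources must agree on one diagnosis.
    if len({e.diagnosis for e in evidence}) > 1:
        failed.append("conflict")

    return Verdict(passed=not failed, failed_gates=failed)


evidence = [
    Evidence("classifier", "melanoma", 0.82),
    Evidence("image_retrieval", "nevus", 0.64),
]
verdict = critic(
    evidence,
    required_sources=["classifier", "image_retrieval", "guideline_retrieval"],
)
print(verdict.failed_gates)  # ['coverage', 'conflict']
```

In this example the guideline source never reported and the two remaining sources disagree, so the coverage and conflict gates fail. In a Plan-Execute-Reflect loop, the list of failed gates is the kind of signal that could decide which tool to re-run.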

Related papers

M3CoTBench: Benchmark Chain-of-Thought of MLLMs in Medical Image Understanding [66.78251988482222]
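Chain-of-Thought (CoT) reasoning has proven effective at strengthening large language models by encouraging step-by-step intermediate reasoning. Current benchmarks for medical image understanding focus on the final answer and ignore the reasoning path. M3CoTBench aims to foster the development of transparent, trustworthy, and diagnostically accurate medical AI systems.
Paper reference translation (metadata) (2026-01-13T17:42:27Z)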
DermoGPT: Open Weights and Open Data for Morphology-Grounded Dermatological Reasoning MLLMs [54.8829900010621]
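Multimodal Large Language Models (MLLMs) show promise for medical applications, but progress in dermatology lags because of limited training data, narrow task coverage, and a lack of clinically grounded supervision. We present a comprehensive framework to address these gaps. First, we introduce Dermo Instruct, a large-scale morphology-anchored instruction corpus comprising 211,243 images and 72,675 trajectories across five task formats. Second, DermoBench is a rigorous benchmark that evaluates 11 tasks across four clinical axes (morphology, diagnosis, reasoning, and fairness) and includes a challenging subset of 3,600.
Paper reference translation (metadata) (2026-01-05T07:55:36Z)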
Sim4Seg: Boosting Multimodal Multi-disease Medical Diagnosis Segmentation with Region-Aware Vision-Language Similarity Masks [54.00822479127598]
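We introduce a medical vision-language task called Medical Diagnosis Segmentation (MDS). MDS aims to understand clinical queries about medical images and to generate the corresponding segmentation masks and diagnostic results. We propose Sim4Seg, a novel framework that improves diagnosis segmentation performance.
Paper reference translation (metadata) (2025-11-10T03:22:42Z)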
RAD: Towards Trustworthy Retrieval-Augmented Multi-modal Clinical Diagnosis [56.373297358647655]
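Retrieval-Augmented Diagnosis (RAD) is a novel framework that injects external knowledge directly into multimodal models on downstream tasks. RAD works through three main mechanisms: retrieval and refinement of disease-centered knowledge from multiple medical sources, a guideline-enhanced contrastive loss, and a dual transformer decoder.
Paper reference translation (metadata) (2025-09-24T10:36:14Z)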
AURA: A Multi-Modal Medical Agent for Understanding, Reasoning & Annotation [0.8397730500554048]
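AURA is the first visual-linguistic explainability agent designed specifically for comprehensive analysis, explanation, and evaluation of medical images. AURA represents a significant advance toward more transparent, adaptable, and clinically aligned AI systems.
Paper reference translation (metadata) (2025-07-22T18:24:18Z)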
Architecting Clinical Collaboration: Multi-Agent Reasoning Systems for Multimodal Medical VQA [1.2744523252873352]
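Dermatological care delivered through telemedicine often lacks the rich context of in-person visits. This study examines vision-language models for medical visual question answering across six configurations.
Paper reference translation (metadata) (2025-07-07T22:31:56Z)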
MedChat: A Multi-Agent Framework for Multimodal Diagnosis with Large Language Models [9.411749481805355]
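Integrating glaucoma detection with large language models (LLMs) offers an automated strategy to mitigate the shortage of ophthalmologists. Applying general-purpose LLMs to medical imaging remains challenging because of hallucinations, limited interpretability, and insufficient domain-specific medical knowledge. We propose MedChat, a multi-agent diagnostic framework and platform that combines specialized vision models with multiple role-specific LLM agents.
Paper reference translation (metadata) (2025-06-09T03:51:18Z)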
MvKeTR: Chest CT Report Generation with Multi-View Perception and Knowledge Enhancement [1.6355783973385114]
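The paper proposes the Multi-view perception Knowledge-enhanced TransfoRmer (MvKeTR). A view-aware MVPA is proposed to effectively synthesize diagnostic information from multiple anatomical views. A Cross-Modal Knowledge Enhancer (CMKE) is devised to retrieve the most similar reports based on the query volume.
Paper reference translation (metadata) (2024-11-27T12:58:23Z)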
SkinGEN: an Explainable Dermatology Diagnosis-to-Generation Framework with Interactive Vision-Language Models [54.32264601568605]
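SkinGEN is a diagnosis-to-generation framework that generates reference demonstrations from the diagnosis results provided by a VLM. A user study with 32 participants was conducted to evaluate both system performance and explainability. The results show that SkinGEN significantly improves users' understanding of VLM predictions and increases trust in the diagnostic process.
Paper reference translation (metadata) (2024-04-23T05:36:33Z)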

The related papers list is generated automatically from the titles and abstracts of papers on this site.

The operator of this site does not guarantee the quality of this site (including all information and translations) and accepts no responsibility for any results arising from its use.