Fugu-MT 論文翻訳(概要): M3Retrieve: Benchmarking Multimodal Retrieval for Medicine

論文の概要: M3Retrieve: Benchmarking Multimodal Retrieval for Medicine

arxiv url: http://arxiv.org/abs/2510.06888v1
Date: Wed, 08 Oct 2025 11:08:47 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-09 16:41:20.450925
Title: M3Retrieve: Benchmarking Multimodal Retrieval for Medicine
Title（参考訳）: M3Retrieve: マルチモーダル検索のベンチマーク
Authors: Arkadeep Acharya, Akash Ghosh, Pradeepika Verma, Kitsuchart Pasupa, Sriparna Saha, Priti Singh,
Abstract要約: マルチモーダル医療検索モデルのベンチマークであるM3Retrieveをリリースする。 M3Retrieveは5つのドメイン、16の医療分野、4つの異なるタスクにまたがる。本ベンチマークでは,様々な医療分野に特有な課題を探るため,主要なマルチモーダル検索モデルの評価を行った。
参考スコア（独自算出の注目度）: 20.495948250806325
License: http://creativecommons.org/licenses/by/4.0/
Abstract: With the increasing use of RetrievalAugmented Generation (RAG), strong retrieval models have become more important than ever. In healthcare, multimodal retrieval models that combine information from both text and images offer major advantages for many downstream tasks such as question answering, cross-modal retrieval, and multimodal summarization, since medical data often includes both formats. However, there is currently no standard benchmark to evaluate how well these models perform in medical settings. To address this gap, we introduce M3Retrieve, a Multimodal Medical Retrieval Benchmark. M3Retrieve, spans 5 domains,16 medical fields, and 4 distinct tasks, with over 1.2 Million text documents and 164K multimodal queries, all collected under approved licenses. We evaluate leading multimodal retrieval models on this benchmark to explore the challenges specific to different medical specialities and to understand their impact on retrieval performance. By releasing M3Retrieve, we aim to enable systematic evaluation, foster model innovation, and accelerate research toward building more capable and reliable multimodal retrieval systems for medical applications. The dataset and the baselines code are available in this github page https://github.com/AkashGhosh/M3Retrieve.
Abstract（参考訳）: RetrievalAugmented Generation (RAG)の使用の増加に伴い、強力な検索モデルがこれまで以上に重要になっている。医療において、テキストと画像の両方からの情報を組み合わせたマルチモーダル検索モデルは、質問応答、クロスモーダル検索、マルチモーダル要約といった多くの下流タスクに大きな利点をもたらす。しかしながら、これらのモデルが医療的環境でどれだけうまく機能するかを評価するための標準ベンチマークは今のところ存在しない。このギャップに対処するために、マルチモーダル医療検索ベンチマークであるM3Retrieveを紹介する。 M3Retrieveは5つのドメイン、16の医療分野、4つの異なるタスクにまたがっており、120万以上のテキストドキュメントと164Kのマルチモーダルクエリが承認されたライセンスの下で収集されている。本ベンチマークでは,様々な医療分野に特有な課題を探求し,検索性能への影響を明らかにするため,主要なマルチモーダル検索モデルの評価を行った。 M3Retrieveをリリースすることで、系統的な評価を可能にし、モデル革新を奨励し、医療応用のためのより有能で信頼性の高いマルチモーダル検索システムの構築に向けた研究を加速することを目指している。データセットとベースラインコードは、このgithubページ https://github.com/AkashGhosh/M3Retrieveで公開されている。

論文の概要: M3Retrieve: Benchmarking Multimodal Retrieval for Medicine

関連論文リスト