Fugu-MT 論文翻訳(概要): Assessing Foundation Models for Mold Colony Detection with Limited Training Data

論文の概要: Assessing Foundation Models for Mold Colony Detection with Limited Training Data

arxiv url: http://arxiv.org/abs/2510.00561v1
Date: Wed, 01 Oct 2025 06:25:45 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-03 16:59:20.417406
Title: Assessing Foundation Models for Mold Colony Detection with Limited Training Data
Title（参考訳）: 限定訓練データを用いた金型コロニー検出の基礎モデルの評価
Authors: Henrik Pichler, Janis Keuper, Matthew Copping,
Abstract要約: 5000ペトリの食器画像にバウンディングボックスをアノテートした代表的データセットを構築した。従来のタスク固有のメトリクスのベースラインに対して、3つのビジョンファウンデーションモデルをベンチマークする。 MaskDinoは150枚の画像のみを微調整しながら、広範囲に訓練されたYoloV9モデルでほぼ完成している。
参考スコア（独自算出の注目度）: 13.811543290168684
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The process of quantifying mold colonies on Petri dish samples is of critical importance for the assessment of indoor air quality, as high colony counts can indicate potential health risks and deficiencies in ventilation systems. Conventionally the automation of such a labor-intensive process, as well as other tasks in microbiology, relies on the manual annotation of large datasets and the subsequent extensive training of models like YoloV9. To demonstrate that exhaustive annotation is not a prerequisite anymore when tackling a new vision task, we compile a representative dataset of 5000 Petri dish images annotated with bounding boxes, simulating both a traditional data collection approach as well as few-shot and low-shot scenarios with well curated subsets with instance level masks. We benchmark three vision foundation models against traditional baselines on task specific metrics, reflecting realistic real-world requirements. Notably, MaskDINO attains near-parity with an extensively trained YoloV9 model while finetuned only on 150 images, retaining competitive performance with as few as 25 images, still being reliable on $\approx$ 70% of the samples. Our results show that data-efficient foundation models can match traditional approaches with only a fraction of the required data, enabling earlier development and faster iterative improvement of automated microbiological systems with a superior upper-bound performance than traditional models would achieve.
Abstract（参考訳）: ペトリ皿のカビコロニーの定量化は, 室内空気質の評価において重要な役割を担っている。従来、このような労働集約的なプロセスの自動化は、微生物学における他のタスクと同様に、大規模なデータセットのマニュアルアノテーションと、YoloV9のようなモデルの拡張的なトレーニングに依存していた。新しいビジョンタスクに取り組む際には、包括的なアノテーションがもはや必須条件ではないことを示すために、従来のデータ収集アプローチと、インスタンスレベルのマスクを備えた十分にキュレートされたサブセットを備えた、少数ショットと低ショットのシナリオの両方をシミュレートして、5000ペトリの皿イメージの代表的データセットをコンパイルする。実世界の現実的な要件を反映した3つのビジョン基盤モデルを,タスク固有の指標に基づく従来のベースラインに対してベンチマークする。特に、MaskDinoは、広範囲に訓練されたYoloV9モデルで、150枚の画像のみを微調整し、25枚の画像で競争性能を保ちながら、サンプルの70%$\approx$で信頼性を保っている。以上の結果から,データ効率のよい基礎モデルは,必要なデータのごく一部で従来のアプローチと一致し,従来のモデルよりも優れた上界性能を有する自動化微生物システムの早期開発と迅速な反復的改良が可能であることが示唆された。

論文の概要: Assessing Foundation Models for Mold Colony Detection with Limited Training Data

関連論文リスト