Fugu-MT 論文翻訳(概要): Automatic Image-Level Morphological Trait Annotation for Organismal Images

論文の概要: Automatic Image-Level Morphological Trait Annotation for Organismal Images

arxiv url: http://arxiv.org/abs/2604.01619v1
Date: Thu, 02 Apr 2026 04:55:52 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-03 14:21:10.364416
Title: Automatic Image-Level Morphological Trait Annotation for Organismal Images
Title（参考訳）: 臓器画像に対する画像レベルの自動トリオアノテーション
Authors: Vardaan Pahuja, Samuel Stevens, Alyson East, Sydne Record, Yu Su,
Abstract要約: 本稿では,正規分布領域をローカライズし,視覚言語プロンプトを用いて解釈可能な特徴記述を生成する特徴アノテーションパイプラインを提案する。我々は,BioSCAN-5Mから19Kの昆虫画像にまたがる80K特性アノテーションのデータセットであるBioscan-Traitsを構築した。
参考スコア（独自算出の注目度）: 11.844505008466884
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Morphological traits are physical characteristics of biological organisms that provide vital clues on how organisms interact with their environment. Yet extracting these traits remains a slow, expert-driven process, limiting their use in large-scale ecological studies. A major bottleneck is the absence of high-quality datasets linking biological images to trait-level annotations. In this work, we demonstrate that sparse autoencoders trained on foundation-model features yield monosemantic, spatially grounded neurons that consistently activate on meaningful morphological parts. Leveraging this property, we introduce a trait annotation pipeline that localizes salient regions and uses vision-language prompting to generate interpretable trait descriptions. Using this approach, we construct Bioscan-Traits, a dataset of 80K trait annotations spanning 19K insect images from BIOSCAN-5M. Human evaluation confirms the biological plausibility of the generated morphological descriptions. We assess design sensitivity through a comprehensive ablation study, systematically varying key design choices and measuring their impact on the quality of the resulting trait descriptions. By annotating traits with a modular pipeline rather than prohibitively expensive manual efforts, we offer a scalable way to inject biologically meaningful supervision into foundation models, enable large-scale morphological analyses, and bridge the gap between ecological relevance and machine-learning practicality.
Abstract（参考訳）: 形態学的特徴は、生物が環境とどのように相互作用するかの重要な手がかりを提供する生物の物理的特徴である。しかし、これらの特徴を抽出することは、専門家主導のプロセスが遅いままであり、大規模な生態学的研究での使用を制限している。大きなボトルネックは、生物学的イメージと特性レベルのアノテーションをリンクする高品質なデータセットがないことである。本研究では, 基礎モデルの特徴を訓練したスパースオートエンコーダが, 意味のある形態的部分に対して一貫した活性化を示す単意味的, 空間的に接地されたニューロンを生み出すことを実証した。この特性を生かした特徴アノテーションパイプラインを導入し、局所的な領域をローカライズし、視覚言語プロンプトを用いて解釈可能な特徴記述を生成する。この手法を用いて,BioSCAN-5Mの19Kの昆虫画像にまたがる80K特性アノテーションのデータセットであるBioscan-Traitsを構築した。人間による評価は、生成された形態学的記述の生物学的妥当性を確認する。設計感度を総合的アブレーション研究により評価し, 設計選択を体系的に変化させ, 特性記述の質に及ぼす影響を評価する。コストのかかる手作業ではなく,モジュール型パイプラインで特徴を注釈付けすることで,基盤モデルに生物学的に意味のある監督を注入し,大規模形態解析を可能にし,生態的妥当性と機械学習の実践性のギャップを埋める,スケーラブルな方法を提供する。

論文の概要: Automatic Image-Level Morphological Trait Annotation for Organismal Images

関連論文リスト