Fugu-MT Paper Translation (Abstract): Dataset Diversity Metrics and Impact on Classification Models

Paper summary: Dataset Diversity Metrics and Impact on Classification Models

arxiv url: http://arxiv.org/abs/2603.15276v1
Date: Mon, 16 Mar 2026 13:41:12 GMT
Status: Translation complete
Last updated in system: 2026-03-17 18:28:58.389767
Title: Dataset Diversity Metrics and Impact on Classification Models
Authors: Théo Sourget, Niclas Claßen, Jack Junchi Xu, Rob van der Goot, Veronika Cheplygina
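Abstract summary: Using MorphoMNIST and PadChest, the authors study the behaviour of multiple dataset diversity metrics for image, text and metadata. They find limited correlations between the AUC and image or metadata reference-free diversity metrics, but higher correlations with the FID and the semantic diversity metrics.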
Reference score (attention score computed by this site): 11.059756667205603
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Diversity in training datasets is usually regarded as important for obtaining a robust model. However, diversity is often left undefined or is defined differently across papers, and although some metrics exist, quantifying this diversity is often overlooked when developing new algorithms. In this work, we study the behaviour of multiple dataset diversity metrics for image, text and metadata using MorphoMNIST, a toy dataset with controlled perturbations, and PadChest, a publicly available chest X-ray dataset. We evaluate whether these metrics correlate with each other and with the intuition of a clinical expert. We also assess whether they correlate with downstream-task performance and how they affect the training dynamics of the models. We find limited correlations between the AUC and image or metadata reference-free diversity metrics, but higher correlations with the FID and the semantic diversity metrics. Finally, the clinical expert indicates that scanners are the main source of diversity in practice. However, we find that adding another scanner to the training set leads to shortcut learning. The code used in this study is available at https://github.com/TheoSourget/dataset_diversity_evaluation
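
The abstract reports correlations between diversity metrics and AUC without saying which correlation coefficient is used. As a minimal sketch, assuming Spearman's rank correlation, the comparison could be computed as below. The function names and the example values are hypothetical placeholders for illustration, not the authors' code or results; their implementation is in the repository linked above.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if norm_x == 0 or norm_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (norm_x * norm_y)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the average ranks."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    return pearson(average_ranks(x), average_ranks(y))


# Placeholder values only: one diversity score and one test AUC per
# training subset. These numbers are invented for the example.
diversity_scores = [0.10, 0.25, 0.40, 0.55, 0.70]
auc_values = [0.71, 0.74, 0.73, 0.78, 0.80]
print(spearman(diversity_scores, auc_values))
```

A rank-based coefficient is used in this sketch because it only assumes a monotonic relationship between the metric and the AUC, which suits metrics on very different scales.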
