Fugu-MT 論文翻訳(概要): Multimodal Deep Generative Model for Semi-Supervised Learning under Class Imbalance

論文の概要: Multimodal Deep Generative Model for Semi-Supervised Learning under Class Imbalance

arxiv url: http://arxiv.org/abs/2605.06289v1
Date: Thu, 07 May 2026 13:56:45 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-08 22:27:11.868727
Title: Multimodal Deep Generative Model for Semi-Supervised Learning under Class Imbalance
Title（参考訳）: クラス不均衡下での半教師付き学習のためのマルチモーダル深層生成モデル
Authors: Heegeon Yoon, Heeyoung Kim,
Abstract要約: クラス不均衡下での半教師付き学習のための多モード深層生成モデルを提案する。提案手法では,各モダリティに対して別個のエンコーダを用い,各モダリティ間で潜伏変数を共有するとともに,積オブエキスパート法による連成後続計算を簡略化する。ラベル付きデータとラベルなしデータの両方で、$$$-powerの分散を用いて提案モデルをトレーニングするための新しい目的関数を導出する。
参考スコア（独自算出の注目度）: 9.797319790710711
License: http://creativecommons.org/licenses/by/4.0/
Abstract: When modeling class-imbalanced data, it is crucial to address the imbalance, as models trained on such data tend to be biased towards the majority classes. This problem is amplified under partial supervision, where pseudo-labels for unlabeled data are predicted based on imbalanced labeled data, propagating the bias. While recent semi-supervised models address class imbalance, they typically assume single-modal input data. However, with the growing availability of multimodal data, it is essential to leverage complementary modalities. In this article, we propose a multimodal deep generative model for semi-supervised learning under class imbalance. Our approach uses separate encoders for each modality, sharing latent variables across modalities, and simplifies joint posterior computation with a product-of-experts method. To further address class imbalance, we replace typical Gaussian distributions with Student's t-distributions for the prior, encoder, and decoder, better capturing the heavy-tailed latent distributions in imbalanced data. We derive a new objective function for training the proposed model on both labeled and unlabeled data using $γ$-power divergence. Empirical results on benchmark and real-world datasets demonstrate that our model outperforms baseline methods in generalization, achieving superior classification performance for partially labeled multimodal data with imbalanced class distributions.
Abstract（参考訳）: クラス不均衡データをモデル化する場合、そのようなデータで訓練されたモデルは多数派に偏りがちであるため、不均衡に対処することが不可欠である。この問題は部分的な監督の下で増幅され、不均衡なラベル付きデータに基づいてラベルなしデータの擬似ラベルが予測され、バイアスが伝播する。最近の半教師付きモデルはクラス不均衡に対処するが、通常は単一モードの入力データを仮定する。しかし、マルチモーダルデータの可用性が高まるにつれて、相補的なモダリティを活用することが不可欠である。本稿では,クラス不均衡下での半教師付き学習のためのマルチモーダル深層生成モデルを提案する。提案手法では,各モダリティに対して別個のエンコーダを用い,各モダリティ間で潜伏変数を共有するとともに,積オブエキスパート法による連成後続計算を簡略化する。さらに、クラス不均衡に対処するため、従来のガウス分布を前者、エンコーダ、デコーダの生徒のt分布に置き換え、不均衡データにおける重み付き潜在分布をよりよくキャプチャする。ラベル付きデータとラベルなしデータの両方で、$γ$-power divergenceを用いて提案モデルをトレーニングするための新しい目的関数を導出する。ベンチマークおよび実世界のデータセットを用いた実験結果から,本モデルは一般化におけるベースライン法よりも優れ,不均衡なクラス分布を持つ部分ラベル付きマルチモーダルデータに対して,優れた分類性能が得られることが示された。

論文の概要: Multimodal Deep Generative Model for Semi-Supervised Learning under Class Imbalance

関連論文リスト