Fugu-MT 論文翻訳(概要): Enhancing Self-Supervised Learning with Semantic Pairs A New Dataset and Empirical Study

論文の概要: Enhancing Self-Supervised Learning with Semantic Pairs A New Dataset and Empirical Study

arxiv url: http://arxiv.org/abs/2510.08722v2
Date: Mon, 13 Oct 2025 09:09:06 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-14 13:29:55.707593
Title: Enhancing Self-Supervised Learning with Semantic Pairs A New Dataset and Empirical Study
Title（参考訳）: セマンティックペアによる自己指導型学習の促進 : 新たなデータセットと実証的研究
Authors: Mohammad Alkhalefi, Georgios Leontidis, Mingjun Zhong,
Abstract要約: インスタンス識別は、データセット内の個々のインスタンスを別々のクラスとして扱う、自己教師付き表現学習パラダイムである。モデル表現の一般化性を高めるためにセマンティックペアを利用するための技術基盤を提供する。
参考スコア（独自算出の注目度）: 2.4405762029252465
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Instance discrimination is a self-supervised representation learning paradigm wherein individual instances within a dataset are treated as distinct classes. This is typically achieved by generating two disparate views of each instance by applying stochastic transformations, encouraging the model to learn representations invariant to the common underlying object across these views. While this approach facilitates the acquisition of invariant representations for dataset instances under various handcrafted transformations (e.g., random cropping, colour jittering), an exclusive reliance on such data transformations for achieving invariance may inherently limit the model's generalizability to unseen datasets and diverse downstream tasks. The inherent limitation stems from the fact that the finite set of transformations within the data processing pipeline is unable to encompass the full spectrum of potential data variations. In this study, we provide the technical foundation for leveraging semantic pairs to enhance the generalizability of the model's representation and empirically demonstrate that incorporating semantic pairs mitigates the issue of limited transformation coverage. Specifically, we propose that by exposing the model to semantic pairs (i.e., two instances belonging to the same semantic category), we introduce varied real-world scene contexts, thereby fostering the development of more generalizable object representations. To validate this hypothesis, we constructed and released a novel dataset comprising curated semantic pairs and conducted extensive experimentation to empirically establish that their inclusion enables the model to learn more general representations, ultimately leading to improved performance across diverse downstream tasks.
Abstract（参考訳）: インスタンス識別は、データセット内の個々のインスタンスを別々のクラスとして扱う、自己教師付き表現学習パラダイムである。これは典型的には、確率変換を適用して各インスタンスの2つの異なるビューを生成し、モデルがこれらのビューにまたがる共通の基礎となるオブジェクトに不変な表現を学習するように促すことによって達成される。このアプローチは、さまざまな手作業による変換(例えば、ランダムなトリミング、カラージッタリング)の下で、データセットインスタンスの不変表現の取得を容易にするが、不変性を達成するためのデータ変換への排他的依存は、本質的には、データセットの一般化性や、さまざまな下流タスクに制限する可能性がある。固有の制限は、データ処理パイプライン内の有限個の変換セットが、潜在的なデータバリエーションの完全なスペクトルを包含できないという事実に起因している。本研究では,モデル表現の一般化性を高めるためにセマンティックペアを利用する技術基盤を提供し,セマンティックペアを組み込むことによって,限られた変換カバレッジの問題を軽減することを実証的に実証する。具体的には、モデルをセマンティックペア(すなわち、同じセマンティックカテゴリに属する2つのインスタンス)に公開することにより、様々な現実世界のシーンコンテキストを導入し、より一般化可能なオブジェクト表現の開発を促進することを提案する。この仮説を検証するために、我々は、キュレートされたセマンティックペアからなる新しいデータセットを構築し、広範囲な実験を行い、モデルがより汎用的な表現を学習できるようにすることを実証的に証明し、最終的には様々な下流タスクのパフォーマンス向上につながった。

論文の概要: Enhancing Self-Supervised Learning with Semantic Pairs A New Dataset and Empirical Study

関連論文リスト