Fugu-MT 論文翻訳(概要): IBiT: Utilizing Inductive Biases to Create a More Data Efficient Attention Mechanism

論文の概要: IBiT: Utilizing Inductive Biases to Create a More Data Efficient Attention Mechanism

arxiv url: http://arxiv.org/abs/2509.22719v1
Date: Wed, 24 Sep 2025 17:19:23 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-30 22:32:18.820417
Title: IBiT: Utilizing Inductive Biases to Create a More Data Efficient Attention Mechanism
Title（参考訳）: IBiT: インダクティブビアーゼを使ってよりデータ効率の良いアテンションメカニズムを作る
Authors: Adithya Giri,
Abstract要約: 近年,トランスフォーマーベースのアーキテクチャがコンピュータビジョンアプリケーションの主要な手法となっている。 Transformerは、データセットのサイズに合わせて説明可能でスケール可能であるが、畳み込みニューラルネットワークの帰納バイアスは欠如している。これらの帰納バイアスを学習マスクを通じて導入することで、視覚変換器は知識蒸留なしではるかに小さなデータセットで学習できることを示す。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: In recent years, Transformer-based architectures have become the dominant method for Computer Vision applications. While Transformers are explainable and scale well with dataset size, they lack the inductive biases of Convolutional Neural Networks. While these biases may be learned on large datasets, we show that introducing these inductive biases through learned masks allow Vision Transformers to learn on much smaller datasets without Knowledge Distillation. These Transformers, which we call Inductively Biased Image Transformers (IBiT), are significantly more accurate on small datasets, while retaining the explainability Transformers.
Abstract（参考訳）: 近年,トランスフォーマーベースのアーキテクチャがコンピュータビジョンアプリケーションの主要な手法となっている。 Transformerは、データセットのサイズに合わせて説明可能でスケール可能であるが、畳み込みニューラルネットワークの帰納バイアスは欠如している。これらのバイアスは大きなデータセットで学習されるかもしれないが、学習マスクを通じて誘導バイアスを導入することで、視覚変換者が知識蒸留なしではるかに小さなデータセットで学習できることが示される。 Inductively Biased Image Transformers (IBiT)と呼ぶこれらのトランスフォーマーは、説明可能性のあるトランスフォーマーを維持しながら、小さなデータセットよりもはるかに正確である。

論文の概要: IBiT: Utilizing Inductive Biases to Create a More Data Efficient Attention Mechanism

関連論文リスト