Fugu-MT 論文翻訳(概要): Harnessing Lightweight Transformer with Contextual Synergic Enhancement for Efficient 3D Medical Image Segmentation

論文の概要: Harnessing Lightweight Transformer with Contextual Synergic Enhancement for Efficient 3D Medical Image Segmentation

arxiv url: http://arxiv.org/abs/2603.23390v1
Date: Tue, 24 Mar 2026 16:24:19 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-25 19:53:37.583103
Title: Harnessing Lightweight Transformer with Contextual Synergic Enhancement for Efficient 3D Medical Image Segmentation
Title（参考訳）: 高能率3次元医用画像分割のための文脈交感神経刺激を用いた高調波型軽量変圧器
Authors: Xinyu Liu, Zhen Chen, Wuyang Li, Chenxin Li, Yixuan Yuan,
Abstract要約: モデル効率を実現するための軽量トランスであるLight-UNETRを提案する。 Light-UNETRは軽量次元減少(LIDR)モジュールを備えており、空間次元とチャネル次元を減少させる。また,変換器のデータ効率向上を目的としたCSE学習戦略も導入した。
参考スコア（独自算出の注目度）: 59.711744386226194
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Transformers have shown remarkable performance in 3D medical image segmentation, but their high computational requirements and need for large amounts of labeled data limit their applicability. To address these challenges, we consider two crucial aspects: model efficiency and data efficiency. Specifically, we propose Light-UNETR, a lightweight transformer designed to achieve model efficiency. Light-UNETR features a Lightweight Dimension Reductive Attention (LIDR) module, which reduces spatial and channel dimensions while capturing both global and local features via multi-branch attention. Additionally, we introduce a Compact Gated Linear Unit (CGLU) to selectively control channel interaction with minimal parameters. Furthermore, we introduce a Contextual Synergic Enhancement (CSE) learning strategy, which aims to boost the data efficiency of Transformers. It first leverages the extrinsic contextual information to support the learning of unlabeled data with Attention-Guided Replacement, then applies Spatial Masking Consistency that utilizes intrinsic contextual information to enhance the spatial context reasoning for unlabeled data. Extensive experiments on various benchmarks demonstrate the superiority of our approach in both performance and efficiency. For example, with only 10% labeled data on the Left Atrial Segmentation dataset, our method surpasses BCP by 1.43% Jaccard while drastically reducing the FLOPs by 90.8% and parameters by 85.8%. Code is released at https://github.com/CUHK-AIM-Group/Light-UNETR.
Abstract（参考訳）: トランスフォーマーは3次元医用画像セグメンテーションにおいて顕著な性能を示したが、高い計算要求と大量のラベル付きデータを必要とするため、適用性が制限された。これらの課題に対処するために、モデル効率とデータ効率の2つの重要な側面を考えます。具体的には、モデル効率を実現するために設計された軽量トランスであるLight-UNETRを提案する。 Light-UNETRは軽量次元減少注意(LIDR)モジュールを備えており、空間とチャネルの次元を減らし、グローバルとローカルの両方の特徴をマルチブランチで捉えている。さらに,最小パラメータとのチャネル相互作用を選択的に制御するCGLU(Compact Gated Linear Unit)を導入する。さらに,変換器のデータ効率を向上させることを目的とした,文脈交感神経性増強(CSE)学習戦略を導入する。まず,非ラベルデータの学習を支援するために外在的文脈情報を活用する。次に,非ラベルデータに対する空間的文脈推論を強化するために,内在的文脈情報を利用する空間的マスキング一貫性を適用する。様々なベンチマークに関する大規模な実験は、性能と効率の両方において、我々のアプローチの優位性を実証している。例えば、左心房セグメンテーションデータセットのラベル付きデータは10%に過ぎず、我々の手法はBCPを1.43%上回り、FLOPを90.8%、パラメータを85.8%減らした。コードはhttps://github.com/CUHK-AIM-Group/Light-UNETRで公開されている。

論文の概要: Harnessing Lightweight Transformer with Contextual Synergic Enhancement for Efficient 3D Medical Image Segmentation

関連論文リスト