Fugu-MT 論文翻訳(概要): InterFormer: Real-time Interactive Image Segmentation

論文の概要: InterFormer: Real-time Interactive Image Segmentation

arxiv url: http://arxiv.org/abs/2304.02942v2
Date: Wed, 9 Aug 2023 08:41:39 GMT
ステータス: 翻訳完了
システム内更新日: 2023-08-10 17:37:13.341034
Title: InterFormer: Real-time Interactive Image Segmentation
Title（参考訳）: InterFormer:リアルタイムインタラクティブイメージセグメンテーション
Authors: You Huang, Hao Yang, Ke Sun, Shengchuan Zhang, Liujuan Cao, Guannan Jiang, Rongrong Ji
Abstract要約: インタラクティブなイメージセグメンテーションにより、アノテータはセグメンテーションタスクのためのピクセルレベルのアノテーションを効率的に実行することができる。既存のインタラクティブセグメンテーションパイプラインは、インタラクティブモデルの非効率な計算に悩まされている。これらの問題に対処するための新しいパイプラインに従うInterFormerという手法を提案する。
参考スコア（独自算出の注目度）: 80.45763765116175
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Interactive image segmentation enables annotators to efficiently perform pixel-level annotation for segmentation tasks. However, the existing interactive segmentation pipeline suffers from inefficient computations of interactive models because of the following two issues. First, annotators' later click is based on models' feedback of annotators' former click. This serial interaction is unable to utilize model's parallelism capabilities. Second, in each interaction step, the model handles the invariant image along with the sparse variable clicks, resulting in a process that's highly repetitive and redundant. For efficient computations, we propose a method named InterFormer that follows a new pipeline to address these issues. InterFormer extracts and preprocesses the computationally time-consuming part i.e. image processing from the existing process. Specifically, InterFormer employs a large vision transformer (ViT) on high-performance devices to preprocess images in parallel, and then uses a lightweight module called interactive multi-head self attention (I-MSA) for interactive segmentation. Furthermore, the I-MSA module's deployment on low-power devices extends the practical application of interactive segmentation. The I-MSA module utilizes the preprocessed features to efficiently response to the annotator inputs in real-time. The experiments on several datasets demonstrate the effectiveness of InterFormer, which outperforms previous interactive segmentation models in terms of computational efficiency and segmentation quality, achieve real-time high-quality interactive segmentation on CPU-only devices. The code is available at https://github.com/YouHuang67/InterFormer.
Abstract（参考訳）: インタラクティブな画像セグメンテーションにより、アノテーションはセグメンテーションタスクのピクセルレベルのアノテーションを効率的に実行することができる。しかし、既存のインタラクティブセグメンテーションパイプラインは、以下の2つの問題により、インタラクティブモデルの非効率な計算に苦しむ。第一に、アノテーションの後のクリックは、アノテーションの前のクリックに対するモデルのフィードバックに基づいている。このシリアル相互作用はモデルの並列性を利用できない。次に、各インタラクションステップにおいて、モデルはスパース変数クリックとともに不変画像を処理し、非常に反復的で冗長なプロセスをもたらす。そこで我々は,これらの問題に対処する新しいパイプラインに従うInterFormerという手法を提案する。インターフォーマは、計算時間消費部、すなわち既存のプロセスから画像処理を抽出・前処理する。具体的には、InterFormerは高速デバイスに大きな視覚変換器(ViT)を並列に前処理し、インタラクティブなセグメンテーションのためにインタラクティブなマルチヘッド自己注意(I-MSA)と呼ばれる軽量モジュールを使用する。さらに、I-MSAモジュールの低消費電力デバイスへの展開は、インタラクティブセグメンテーションの実践的応用を拡張している。 I-MSAモジュールは、前処理した機能を利用して、リアルタイムでアノテータ入力に効率的に応答する。複数のデータセットにおける実験は、計算効率とセグメンテーション品質の観点から従来の対話型セグメンテーションモデルを上回るinterformerの有効性を示し、cpuのみのデバイスでリアルタイムな高品質なインタラクティブセグメンテーションを実現する。コードはhttps://github.com/youhuang67/interformerで入手できる。

論文の概要: InterFormer: Real-time Interactive Image Segmentation

関連論文リスト