Fugu-MT 論文翻訳(概要): Segmentation method of U-net sheet metal engineering drawing based on CBAM attention mechanism

論文の概要: Segmentation method of U-net sheet metal engineering drawing based on CBAM attention mechanism

arxiv url: http://arxiv.org/abs/2209.14102v2
Date: Thu, 27 Apr 2023 06:19:51 GMT
ステータス: 翻訳完了
システム内更新日: 2023-04-28 17:14:34.754138
Title: Segmentation method of U-net sheet metal engineering drawing based on CBAM attention mechanism
Title（参考訳）: cbamアテンション機構に基づくu-net鋼板設計図面のセグメンテーション法
Authors: Zhiwei Song, Hui Yao
Abstract要約: 本稿では, 溶接工法における特定ユニットの分割抽出のためのU-net法を提案する。バックボーンネットワークとしてvgg16を用いて、溶接エンジニアリングのデータセットセグメンテーションタスクにおける我々のモデルのIoU、mAP、Accuはそれぞれ84.72%、86.84%、99.42%であることを確認した。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In the manufacturing process of heavy industrial equipment, the specific unit in the welding diagram is first manually redrawn and then the corresponding sheet metal parts are cut, which is inefficient. To this end, this paper proposes a U-net-based method for the segmentation and extraction of specific units in welding engineering drawings. This method enables the cutting device to automatically segment specific graphic units according to visual information and automatically cut out sheet metal parts of corresponding shapes according to the segmentation results. This process is more efficient than traditional human-assisted cutting. Two weaknesses in the U-net network will lead to a decrease in segmentation performance: first, the focus on global semantic feature information is weak, and second, there is a large dimensional difference between shallow encoder features and deep decoder features. Based on the CBAM (Convolutional Block Attention Module) attention mechanism, this paper proposes a U-net jump structure model with an attention mechanism to improve the network's global semantic feature extraction ability. In addition, a U-net attention mechanism model with dual pooling convolution fusion is designed, the deep encoder's maximum pooling + convolution features and the shallow encoder's average pooling + convolution features are fused vertically to reduce the dimension difference between the shallow encoder and deep decoder. The dual-pool convolutional attention jump structure replaces the traditional U-net jump structure, which can effectively improve the specific unit segmentation performance of the welding engineering drawing. Using vgg16 as the backbone network, experiments have verified that the IoU, mAP, and Accu of our model in the welding engineering drawing dataset segmentation task are 84.72%, 86.84%, and 99.42%, respectively.
Abstract（参考訳）: 重工業機器の製造工程において、溶接図中の特定単位を手動で再描画した後、対応する板状金属部品を切断して非効率にする。そこで本研究では,溶接工学図面における特定ユニットの分割抽出のためのU-net方式を提案する。カット装置は、視覚情報に応じて特定のグラフィック単位を自動的に分割し、セグメント化結果に応じて対応する形状のシート金属部品を自動的に切断することができる。このプロセスは従来の人手による切削よりも効率的である。 U-netネットワークの2つの弱点はセグメンテーション性能の低下につながる: まず、グローバルな意味的特徴情報へのフォーカスが弱く、次に、浅いエンコーダ特徴と深いデコーダ特徴の間に大きな次元差がある。本稿では,cbam(convolutional block attention module)の注意機構に基づき,ネットワークのグローバルセマンティック特徴抽出能力を向上させるための注意機構を備えたu-netジャンプ構造モデルを提案する。さらに、二重プーリング畳み込み融合を用いたU-netアテンション機構モデルを設計し、深部エンコーダの最大プーリング+畳み込み特性と浅部エンコーダの平均プーリング+畳み込み特性を垂直に融合させ、浅部エンコーダと深部デコーダの寸法差を低減する。デュアルプール畳み込み型アテンションジャンプ構造は、従来のu-netジャンプ構造を置き換えるもので、溶接エンジニアリングドローイングの特定のユニットセグメンテーション性能を効果的に改善することができる。バックボーンネットワークとしてvgg16を用いて、溶接エンジニアリングのデータセットセグメンテーションタスクにおける我々のモデルのIoU、mAP、Accuはそれぞれ84.72%、86.84%、99.42%であることを確認した。

関連論文リスト

DSU-Net:An Improved U-Net Model Based on DINOv2 and SAM2 with Multi-scale Cross-model Feature Enhancement [7.9006143460465355]
本稿では,DINOv2によるSAM2用マルチスケール機能協調フレームワークを提案する。コストのかかるトレーニングプロセスを必要とせず、カモフラージュ目標検出や有能なオブジェクト検出といった下流タスクにおいて、既存の最先端のメソオードを超越している。
論文参考訳（メタデータ） (2025-03-27T06:08:24Z)
Prior-guided Hierarchical Harmonization Network for Efficient Image Dehazing [50.92820394852817]
画像復調のためのtextitPrior-textitguided textitHarmonization Network (PGH$2$Net) を提案する。 PGH$2$Netは、2つのモジュールタイプからなる効率的なエンコーダとデコーダを備えたUNetのようなアーキテクチャ上に構築されている。
論文参考訳（メタデータ） (2025-03-03T03:36:30Z)
CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction [77.8576094863446]
本稿では,新しいdetextbfCoupled dutextbfAl-interactive lineatextbfR atttextbfEntion (CARE) 機構を提案する。まず,非対称な特徴分離戦略を提案し,非対称的に学習プロセスを局所帰納バイアスと長距離依存に分解する。分離学習方式を採用し,特徴間の相補性を完全に活用することにより,高い効率性と精度を両立させることができる。
論文参考訳（メタデータ） (2024-11-25T07:56:13Z)
Prototype Learning Guided Hybrid Network for Breast Tumor Segmentation in DCE-MRI [58.809276442508256]
本稿では,畳み込みニューラルネットワーク(CNN)とトランスフォーマー層を組み合わせたハイブリッドネットワークを提案する。プライベートおよびパブリックなDCE-MRIデータセットの実験結果から,提案したハイブリッドネットワークは最先端の手法よりも優れた性能を示した。
論文参考訳（メタデータ） (2024-08-11T15:46:00Z)
Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation [67.85309547416155]
ユニバーサルセグメンテーションのための強力なアーキテクチャは、マルチスケールの画像特徴を符号化し、オブジェクトクエリをマスク予測にデコードするトランスフォーマーに依存している。このようなモデルのスケーリングには効率性が優先されるため、最先端のMask2Formerでは、変換器エンコーダのみに計算の50%を使用しています。これは、エンコーダ層ごとにすべてのバックボーン機能スケールのトークンレベルの完全な表現が保持されているためである。
論文参考訳（メタデータ） (2024-04-23T01:34:20Z)
Bilateral Network with Residual U-blocks and Dual-Guided Attention for Real-time Semantic Segmentation [18.393208069320362]
注意計算によって導かれる2分岐アーキテクチャのための新しい融合機構を設計する。正確には、DGA(Dual-Guided Attention)モジュールを使用して、いくつかのマルチスケール変換を置き換えることを提案した。 Cityscapes と CamVid のデータセットを用いた実験により,本手法の有効性が示された。
論文参考訳（メタデータ） (2023-10-31T09:20:59Z)
Unite-Divide-Unite: Joint Boosting Trunk and Structure for High-accuracy Dichotomous Image Segmentation [48.995367430746086]
Dichotomous Image rendering (DIS) は、自然の風景からカテゴリーに依存しない前景の物体をピンポイントすることを目的としている。本稿では, トランクと構造同定の有効性を高めるために, 相補的特徴を再構成し, 分割的に配置する, UDUN (Unite-Divide-Unite Network) を提案する。 1024*1024入力を用いて、ResNet-18で65.3fpsのリアルタイム推論を可能にする。
論文参考訳（メタデータ） (2023-07-26T09:04:35Z)
Joint Channel Estimation and Feedback with Masked Token Transformers in Massive MIMO Systems [74.52117784544758]
本稿では,CSI行列内の固有周波数領域相関を明らかにするエンコーダデコーダに基づくネットワークを提案する。エンコーダ・デコーダネットワーク全体がチャネル圧縮に使用される。提案手法は,共同作業における現状のチャネル推定およびフィードバック技術より優れる。
論文参考訳（メタデータ） (2023-06-08T06:15:17Z)
EAA-Net: Rethinking the Autoencoder Architecture with Intra-class Features for Medical Image Segmentation [4.777011444412729]
We propose a light-weight end-to-end segmentation framework based on multi-task learning, called Edge Attention autoencoder Network (EAA-Net)。提案手法は,クラス間特徴の抽出にセグメンテーションネットワークを利用するだけでなく,フォアグラウンド内でのクラス内特徴の抽出にも再構成ネットワークを適用する。実験結果から,医用画像分割作業において,本手法が良好に機能することが確認された。
論文参考訳（メタデータ） (2022-08-19T07:42:55Z)
MGAE: Masked Autoencoders for Self-Supervised Learning on Graphs [55.66953093401889]
Masked Graph Autoencoder (MGAE) フレームワークは、グラフ構造データの効果的な学習を行う。自己指導型学習から洞察を得て、私たちはランダムに大量のエッジを隠蔽し、トレーニング中に欠落したエッジを再構築しようとします。
論文参考訳（メタデータ） (2022-01-07T16:48:07Z)
GridDehazeNet+: An Enhanced Multi-Scale Network with Intra-Task Knowledge Transfer for Single Image Dehazing [12.982905875008214]
GridDehazeNet+と呼ばれる強化されたマルチスケールネットワークを提案します。プリプロセス、バックボーン、後処理の3つのモジュールで構成されている。
論文参考訳（メタデータ） (2021-03-25T17:35:36Z)
Multi-stage Attention ResU-Net for Semantic Segmentation of Fine-Resolution Remote Sensing Images [9.398340832493457]
この問題に対処するための線形注意機構(LAM)を提案する。 LAMは、計算効率の高いドット積アテンションとほぼ同値である。微細なリモートセンシング画像からのセマンティックセグメンテーションのためのマルチステージアテンションResU-Netを設計する。
論文参考訳（メタデータ） (2020-11-29T07:24:21Z)
Efficient Medical Image Segmentation with Intermediate Supervision Mechanism [48.244918515770514]
U-Netの拡張経路は小目標の特性を無視する可能性があるため、中間的な監視機構が提案される。中間監督機構はセグメンテーション精度を向上させるが、余分な入力と複数の損失関数のためにトレーニング時間は長すぎる。モデルの冗長性を低減するため,共有重み付きデコーダモジュールと結合重み付きデコーダモジュールを組み合わせる。
論文参考訳（メタデータ） (2020-11-15T13:46:00Z)
Multi-scale Attention U-Net (MsAUNet): A Modified U-Net Architecture for Scene Segmentation [1.713291434132985]
画像からコンテキスト情報を用いたシーンセグメンテーションのためのマルチスケールアテンションネットワークを提案する。このネットワークは、局所的な特徴をグローバルな特徴にマップし、精度を向上し、識別画像領域を強調する。我々はPascalVOC2012とADE20kという2つの標準データセットでモデルを評価した。
論文参考訳（メタデータ） (2020-09-15T08:03:41Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。