Fugu-MT 論文翻訳(概要): Group channel pruning and spatial attention distilling for object detection

論文の概要: Group channel pruning and spatial attention distilling for object detection

arxiv url: http://arxiv.org/abs/2306.01526v1
Date: Fri, 2 Jun 2023 13:26:23 GMT
ステータス: 翻訳完了
システム内更新日: 2023-06-05 14:50:11.628350
Title: Group channel pruning and spatial attention distilling for object detection
Title（参考訳）: 物体検出のためのグループチャネルプルーニングと空間的注意蒸留
Authors: Yun Chu, Pu Li, Yong Bai, Zhuhua Hu, Yongqing Chen and Jiafeng Lu
Abstract要約: 動的スパーストレーニング,グループチャネルプルーニング,空間アテンション蒸留という3段階モデル圧縮手法を提案する。本手法は,モデルのパラメータを64.7%削減し,計算量を34.9%削減する。
参考スコア（独自算出の注目度）: 2.8675002818821542
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Due to the over-parameterization of neural networks, many model compression methods based on pruning and quantization have emerged. They are remarkable in reducing the size, parameter number, and computational complexity of the model. However, most of the models compressed by such methods need the support of special hardware and software, which increases the deployment cost. Moreover, these methods are mainly used in classification tasks, and rarely directly used in detection tasks. To address these issues, for the object detection network we introduce a three-stage model compression method: dynamic sparse training, group channel pruning, and spatial attention distilling. Firstly, to select out the unimportant channels in the network and maintain a good balance between sparsity and accuracy, we put forward a dynamic sparse training method, which introduces a variable sparse rate, and the sparse rate will change with the training process of the network. Secondly, to reduce the effect of pruning on network accuracy, we propose a novel pruning method called group channel pruning. In particular, we divide the network into multiple groups according to the scales of the feature layer and the similarity of module structure in the network, and then we use different pruning thresholds to prune the channels in each group. Finally, to recover the accuracy of the pruned network, we use an improved knowledge distillation method for the pruned network. Especially, we extract spatial attention information from the feature maps of specific scales in each group as knowledge for distillation. In the experiments, we use YOLOv4 as the object detection network and PASCAL VOC as the training dataset. Our method reduces the parameters of the model by 64.7 % and the calculation by 34.9%.
Abstract（参考訳）: ニューラルネットワークの過度パラメータ化により、プルーニングと量子化に基づくモデル圧縮法が数多く出現している。それらはモデルのサイズ、パラメータ数、計算の複雑さを減少させるのに顕著である。しかし、そのような方法で圧縮されたモデルのほとんどは特別なハードウェアとソフトウェアをサポートする必要があり、デプロイメントコストが増大する。また、これらの手法は主に分類タスクで使われ、検出タスクで直接使用されることは稀である。これらの問題に対処するため,オブジェクト検出ネットワークでは,動的スパーストレーニング,グループチャネルプルーニング,空間注意蒸留という3段階モデル圧縮手法を導入する。まず、ネットワーク内の重要でないチャネルを選択し、空間性と精度のバランスをよく保ちながら、可変スパース率を導入し、ネットワークのトレーニングプロセスでスパース率を変化させるダイナミックスパーストレーニング手法を提案する。次に,ネットワーク精度への影響を低減するため,グループチャネルプルーニングと呼ばれる新しいプルーニング手法を提案する。特に,特徴層のスケールとモジュール構造の類似性に応じて,ネットワークを複数のグループに分割し,異なるプルーニングしきい値を用いて各グループ内のチャネルをプルークする。最後に,prunedネットワークの精度を回復するために,prunedネットワークの知識蒸留法を改良した。特に,蒸留の知識として,各グループの特定スケールの特徴マップから空間的注意情報を抽出する。実験では,オブジェクト検出ネットワークとしてYOLOv4,トレーニングデータセットとしてPASCALVOCを用いた。本手法はモデルのパラメータを64.7%削減し,計算量を34.9%削減する。

論文の概要: Group channel pruning and spatial attention distilling for object detection

関連論文リスト