Fugu-MT 論文翻訳(概要): LegoNet: Memory Footprint Reduction Through Block Weight Clustering

論文の概要: LegoNet: Memory Footprint Reduction Through Block Weight Clustering

arxiv url: http://arxiv.org/abs/2603.06606v1
Date: Wed, 18 Feb 2026 12:09:29 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-15 16:38:22.424538
Title: LegoNet: Memory Footprint Reduction Through Block Weight Clustering
Title（参考訳）: LegoNet:ブロック重みクラスタリングによるメモリフットプリント削減
Authors: Joseph Bingham, Noah Green, Saman Zonouz,
Abstract要約: textbfLegoNetは,モデル全体の重みのブロックをレイヤタイプやクラスタによらず構築する圧縮手法である。 Cifar-10とImageNetでトレーニングされたResNet-50を324x4ブロックで圧縮することができ、メモリフットプリントを textbf64x で圧縮できた。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: As the need for neural network-based applications to become more accurate and powerful grows, so too does their size and memory footprint. With embedded devices, whose cache and RAM are limited, this growth hinders their ability to leverage state-of-the-art neural network architectures. In this work, we propose \textbf{LegoNet}, a compression technique that \textbf{constructs blocks of weights of the entire model regardless of layer type} and clusters these induced blocks. Using blocks instead of individual values to cluster the weights, we were able to compress ResNet-50 trained for Cifar-10 and ImageNet with only 32 4x4 blocks, compressing the memory footprint by over a factor of \textbf{64x without having to remove any weights} or changing the architecture and \textbf{no loss to accuracy}, nor retraining or any data, and show how to find an arrangement of 16 4x4 blocks that gives a compression ratio of \textbf{128x with less than 3\% accuracy loss}. This was all achieved with \textbf{no need for (re)training or fine-tuning}.
Abstract（参考訳）: ニューラルネットワークベースのアプリケーションがより正確でパワフルになる必要性が高まるにつれて、サイズやメモリフットプリントも大きくなる。キャッシュとRAMが限られている組み込みデバイスでは、この成長は最先端のニューラルネットワークアーキテクチャを活用する能力を妨げている。本研究では, モデル全体の重みのブロックを層の種類に関係なく構築する圧縮手法である \textbf{LegoNet} を提案し, これら誘導ブロックをクラスタ化する。 Cifar-10 と ImageNet でトレーニングされた ResNet-50 を 32 4x4 ブロックで圧縮し、メモリフットプリントを \textbf{64x の係数で圧縮したり、アーキテクチャや \textbf{no loss to accuracy} を変更したり、あるいはデータを再トレーニングしたり、あるいは、16 4x4 ブロックのアライメントを見つけ、その圧縮比率が 3\% 未満の精度損失で \textbf{128x となることを示すことができた。これはすべて、(再)トレーニングや微調整が必要な \textbf{no で達成された。

論文の概要: LegoNet: Memory Footprint Reduction Through Block Weight Clustering

関連論文リスト