Fugu-MT 論文翻訳(概要): Efficient Semantic Image Communication for Traffic Monitoring at the Edge

論文の概要: Efficient Semantic Image Communication for Traffic Monitoring at the Edge

arxiv url: http://arxiv.org/abs/2604.12622v1
Date: Tue, 14 Apr 2026 11:51:17 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-15 19:11:32.417843
Title: Efficient Semantic Image Communication for Traffic Monitoring at the Edge
Title（参考訳）: エッジにおける交通監視のための効率的なセマンティック画像通信
Authors: Damir Assylbek, Nurmukhammed Aitymbetov, Marko Ristin, Dimitrios Zorbas,
Abstract要約: 本稿では,交通監視のための2つのセマンティック画像通信パイプラインであるMMSDとSAMRについて述べる。実験の結果, MMSDは99%, SAMRは99.1%, 平均送信データ削減率は99%であった。
参考スコア（独自算出の注目度）: 0.6999740786886536
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Many visual monitoring systems operate under strict communication constraints, where transmitting full-resolution images is impractical and often unnecessary. In such settings, visual data is often used for object presence, spatial relationships, and scene context rather than exact pixel fidelity. This paper presents two semantic image communication pipelines for traffic monitoring, MMSD and SAMR, that reduce transmission cost while preserving meaningful visual information. MMSD (Multi-Modal Semantic Decomposition) targets very high compression together with data confidentiality, since sensitive pixel content is not transmitted. It replaces the original image with compact semantic representations, namely segmentation maps, edge maps, and textual descriptions, and reconstructs the scene at the receiver using a diffusion-based generative model. SAMR (Semantic-Aware Masking Reconstruction) targets higher visual quality while maintaining strong compression. It selectively suppresses non-critical image regions according to semantic importance before standard JPEG encoding and restores the missing content at the receiver through generative inpainting. Both designs follow an asymmetric sender-receiver architecture, where lightweight processing is performed at the edge and computationally intensive reconstruction is offloaded to the server. On a Raspberry Pi~5, the edge-side processing time is about 15s for MMSD and 9s for SAMR. Experimental results show average transmitted-data reductions of 99% for MMSD and 99.1% for SAMR. In addition, MMSD achieves lower payload size than the recent SPIC baseline while preserving strong semantic consistency, whereas SAMR provides a better quality-compression trade-off than standard JPEG and SQ-GAN under comparable operating conditions.
Abstract（参考訳）: 多くの視覚監視システムは厳密な通信制約の下で動作し、フル解像度画像の送信は非現実的であり、しばしば不要である。このような設定では、ビジュアルデータは、正確なピクセルの忠実さよりも、オブジェクトの存在、空間的関係、シーンコンテキストによく使用される。本稿では,交通監視のための2つのセマンティック画像通信パイプラインであるMMSDとSAMRについて述べる。 MMSD (Multi-Modal Semantic Decomposition) は、機密画素が送信されないため、データ機密性とともに非常に高い圧縮を目標としている。元のイメージをコンパクトな意味表現、すなわちセグメンテーションマップ、エッジマップ、テキスト記述に置き換え、拡散ベースの生成モデルを用いてレシーバーのシーンを再構築する。 SAMR(Semantic-Aware Masking Reconstruction)は、強い圧縮を維持しながら、より高い視覚品質を目標とする。 JPEG符号化前の意味的重要性に応じて、非クリティカル画像領域を選択的に抑制し、生成的インペイントにより受信側で欠落したコンテンツを復元する。どちらの設計も非対称な送信受信アーキテクチャに従っており、エッジで軽量な処理が行われ、計算集約的な再構築がサーバにオフロードされる。 Raspberry Pi~5では、エッジサイドの処理時間はMMSDで約15秒、SAMRで9秒である。実験の結果, MMSDは99%, SAMRは99.1%, 平均送信データ削減率は99%であった。さらに、MMSDは最近のSPICベースラインよりも低いペイロードサイズを実現し、強いセマンティック一貫性を維持しているのに対し、SAMRは同等の操作条件下でのJPEGやSQ-GANよりも高品質な圧縮トレードオフを提供する。

論文の概要: Efficient Semantic Image Communication for Traffic Monitoring at the Edge

関連論文リスト