Spectral Complex Autoencoder Pruning: A Fidelity-Guided Criterion for Extreme Structured Channel Compression
- URL: http://arxiv.org/abs/2601.09352v1
- Date: Wed, 14 Jan 2026 10:34:18 GMT
- Title: Spectral Complex Autoencoder Pruning: A Fidelity-Guided Criterion for Extreme Structured Channel Compression
- Authors: Wei Liu, Xing Deng, Haijian Shao, Yingtao Jiang,
- Abstract summary: We propose a reconstruction-based criterion that measures functional redundancy at the level of individual output channels.<n>We transform a complex interaction field to the frequency domain and train a low-capacity autoencoder to reconstruct normalized spectra.<n>We find that spectral reconstruction fidelity of complex interaction fields is an effective proxy for channel-level redundancy under aggressive compression.
- Score: 7.913101398893967
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We propose Spectral Complex Autoencoder Pruning (SCAP), a reconstruction-based criterion that measures functional redundancy at the level of individual output channels. For each convolutional layer, we construct a complex interaction field by pairing the full multi-channel input activation as the real part with a single output-channel activation (spatially aligned and broadcast across input channels) as the imaginary part. We transform this complex field to the frequency domain and train a low-capacity autoencoder to reconstruct normalized spectra. Channels whose spectra are reconstructed with high fidelity are interpreted as lying close to a low-dimensional manifold captured by the autoencoder and are therefore more compressible; conversely, channels with low fidelity are retained as they encode information that cannot be compactly represented by the learned manifold. This yields an importance score (optionally fused with the filter L1 norm) that supports simple threshold-based pruning and produces a structurally consistent pruned network. On VGG16 trained on CIFAR-10, at a fixed threshold of 0.6, we obtain 90.11% FLOP reduction and 96.30% parameter reduction with an absolute Top-1 accuracy drop of 1.67% from a 93.44% baseline after fine-tuning, demonstrating that spectral reconstruction fidelity of complex interaction fields is an effective proxy for channel-level redundancy under aggressive compression.
Related papers
- EqDeepRx: Learning a Scalable MIMO Receiver [6.732584013520367]
This paper presents EqDeepRx, a practical deep-learning-aided multiple-input multiple-output (MIMO) receiver.<n>At the core of the receiver model is a shared-weight DetectorNN that operates independently on each spatial stream or layer.<n>5G/6G-compliant end-to-end simulations across multiple channel scenarios, pilot patterns, and inter-cell interference conditions show improved error rate and spectral efficiency.
arXiv Detail & Related papers (2026-02-12T11:22:30Z) - Prosody-Guided Harmonic Attention for Phase-Coherent Neural Vocoding in the Complex Spectrum [1.3066182802188198]
We introduce prosody-guided harmonic attention to enhance voiced segment encoding and directly predict complex spectral components for waveform synthesis via inverse STFT.<n>Experiments on benchmark datasets demonstrate consistent gains over HiFi-GAN and AutoVocoder: F0 RMSE reduced by 22 percent, voiced/unvoiced error lowered by 18 percent, and MOS scores improved by 0.15.<n>These results show that prosody-guided attention combined with direct complex spectrum modeling yields more natural, pitch-accurate, and robust synthetic speech, setting a strong foundation for expressive neural vocoding.
arXiv Detail & Related papers (2026-01-20T20:53:24Z) - HSCP: A Two-Stage Spectral Clustering Framework for Resource-Constrained UAV Identification [16.880773405024126]
This paper introduces HSCP, a Hierarchical Spectral Clustering Pruning framework.<n>It combines layer pruning with channel pruning to achieve extreme compression, high performance, and efficient inference.<n>Experiments on the UAV-M100 benchmark demonstrate that HSCP outperforms existing channel and layer pruning methods.
arXiv Detail & Related papers (2025-12-05T16:03:53Z) - FLaTEC: Frequency-Disentangled Latent Triplanes for Efficient Compression of LiDAR Point Clouds [52.997038111673966]
FLaTEC is a frequency-aware compression model that enables the compression of a full scan with high compression ratios.<n>We convert voxelized embeddings into triplane representations to reduce sparsity, computational cost, and storage requirements.<n>Our method achieves state-of-the-art rate-distortion performance and outperforms the standard codecs by 78% and 94% in BD-rate on both datasets.
arXiv Detail & Related papers (2025-11-25T08:37:49Z) - Semantic Channel Equalization Strategies for Deep Joint Source-Channel Coding [8.967618587731694]
Deep joint source-channel coding (DeepJSCC) has emerged as a powerful paradigm for end-to-end semantic communications.<n>Existing DeepJSCC schemes assume a shared latent space at transmitter (TX) and receiver (RX)<n>This mismatch introduces "semantic noise", degrading reconstruction quality and downstream task performance.
arXiv Detail & Related papers (2025-10-06T10:29:07Z) - High-Fidelity Prediction of Perturbed Optical Fields using Fourier Feature Networks [0.0]
We present a novel data-efficient machine learning framework that learns the perturbation-dependent transmission matrix of a multimode fibre.<n>On experimental data from a compressed fibre, our model predicts the output field with a 0.995 complex correlation to the ground truth.<n>This approach provides a general tool for modelling complex optical systems from sparse measurements.
arXiv Detail & Related papers (2025-08-27T10:25:57Z) - Joint Channel Estimation and Feedback with Masked Token Transformers in
Massive MIMO Systems [74.52117784544758]
This paper proposes an encoder-decoder based network that unveils the intrinsic frequency-domain correlation within the CSI matrix.
The entire encoder-decoder network is utilized for channel compression.
Our method outperforms state-of-the-art channel estimation and feedback techniques in joint tasks.
arXiv Detail & Related papers (2023-06-08T06:15:17Z) - Structured Sparsity Learning for Efficient Video Super-Resolution [99.1632164448236]
We develop a structured pruning scheme called Structured Sparsity Learning (SSL) according to the properties of video super-resolution (VSR) models.
In SSL, we design pruning schemes for several key components in VSR models, including residual blocks, recurrent networks, and upsampling networks.
arXiv Detail & Related papers (2022-06-15T17:36:04Z) - Distortion-Aware Loop Filtering of Intra 360^o Video Coding with
Equirectangular Projection [81.63407194858854]
We propose a distortion-aware loop filtering model to improve the performance of intra coding for 360$o$ videos projected via equirectangular projection (ERP) format.
Our proposed module analyzes content characteristics based on a coding unit (CU) partition mask and processes them through partial convolution to activate the specified area.
arXiv Detail & Related papers (2022-02-20T12:00:18Z) - Deep Learning Based Antenna-time Domain Channel Extrapolation for Hybrid
mmWave Massive MIMO [30.201881862681972]
We design a latent ordinary differential equation (ODE)-based network to learn the mapping function from the partial uplink channels to the full downlink ones at the base station.
Simulation results show that the designed network can efficiently infer the full downlink channels from the partial uplink ones.
arXiv Detail & Related papers (2021-08-09T11:12:46Z) - Model-Driven Deep Learning Based Channel Estimation and Feedback for
Millimeter-Wave Massive Hybrid MIMO Systems [61.78590389147475]
This paper proposes a model-driven deep learning (MDDL)-based channel estimation and feedback scheme for millimeter-wave (mmWave) systems.
To reduce the uplink pilot overhead for estimating the high-dimensional channels from a limited number of radio frequency (RF) chains, we propose to jointly train the phase shift network and the channel estimator as an auto-encoder.
Numerical results show that the proposed MDDL-based channel estimation and feedback scheme outperforms the state-of-the-art approaches.
arXiv Detail & Related papers (2021-04-22T13:34:53Z) - Unfolding Neural Networks for Compressive Multichannel Blind
Deconvolution [71.29848468762789]
We propose a learned-structured unfolding neural network for the problem of compressive sparse multichannel blind-deconvolution.
In this problem, each channel's measurements are given as convolution of a common source signal and sparse filter.
We demonstrate that our method is superior to classical structured compressive sparse multichannel blind-deconvolution methods in terms of accuracy and speed of sparse filter recovery.
arXiv Detail & Related papers (2020-10-22T02:34:33Z) - Deep Denoising Neural Network Assisted Compressive Channel Estimation
for mmWave Intelligent Reflecting Surfaces [99.34306447202546]
This paper proposes a deep denoising neural network assisted compressive channel estimation for mmWave IRS systems.
We first introduce a hybrid passive/active IRS architecture, where very few receive chains are employed to estimate the uplink user-to-IRS channels.
The complete channel matrix can be reconstructed from the limited measurements based on compressive sensing.
arXiv Detail & Related papers (2020-06-03T12:18:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.