Learned Point Cloud Compression for Classification
- URL: http://arxiv.org/abs/2308.05959v1
- Date: Fri, 11 Aug 2023 06:28:19 GMT
- Title: Learned Point Cloud Compression for Classification
- Authors: Mateen Ulhaq, Ivan V. Baji\'c
- Abstract summary: Deep learning is increasingly being used to perform machine vision tasks such as classification, object detection, and segmentation on 3D point cloud data.
We present a novel point cloud that is highly specialized for the machine task of classification.
In particular, it achieves a 93% reduction in BD-bitrate over non-specialized codecs on the ModelNet40 dataset.
- Score: 35.103437828235826
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Deep learning is increasingly being used to perform machine vision tasks such
as classification, object detection, and segmentation on 3D point cloud data.
However, deep learning inference is computationally expensive. The limited
computational capabilities of end devices thus necessitate a codec for
transmitting point cloud data over the network for server-side processing. Such
a codec must be lightweight and capable of achieving high compression ratios
without sacrificing accuracy. Motivated by this, we present a novel point cloud
codec that is highly specialized for the machine task of classification. Our
codec, based on PointNet, achieves a significantly better rate-accuracy
trade-off in comparison to alternative methods. In particular, it achieves a
94% reduction in BD-bitrate over non-specialized codecs on the ModelNet40
dataset. For low-resource end devices, we also propose two lightweight
configurations of our encoder that achieve similar BD-bitrate reductions of 93%
and 92% with 3% and 5% drops in top-1 accuracy, while consuming only 0.470 and
0.048 encoder-side kMACs/point, respectively. Our codec demonstrates the
potential of specialized codecs for machine analysis of point clouds, and
provides a basis for extension to more complex tasks and datasets in the
future.
Related papers
- LCM: Locally Constrained Compact Point Cloud Model for Masked Point Modeling [47.94285833315427]
We propose a Locally constrained Compact point cloud Model (LCM) consisting of a locally constrained compact encoder and a locally constrained Mamba-based decoder.
Our encoder replaces self-attention with our local aggregation layers to achieve an elegant balance between performance and efficiency.
This decoder ensures linear complexity while maximizing the perception of point cloud geometry information from unmasked patches with higher information density.
arXiv Detail & Related papers (2024-05-27T13:19:23Z) - Scalable Human-Machine Point Cloud Compression [29.044369073873465]
In this paper, we present a scalable for point-cloud data that is specialized for the machine task of classification, while also providing a mechanism for human viewing.
In the proposed scalable, the "base" bitstream supports the machine task, and an "enhancement" bitstream may be used for better input reconstruction performance for human viewing.
arXiv Detail & Related papers (2024-02-19T20:43:10Z) - SeRP: Self-Supervised Representation Learning Using Perturbed Point
Clouds [6.29475963948119]
SeRP consists of encoder-decoder architecture that takes perturbed or corrupted point clouds as inputs.
We have used Transformers and PointNet-based Autoencoders.
arXiv Detail & Related papers (2022-09-13T15:22:36Z) - SoftPool++: An Encoder-Decoder Network for Point Cloud Completion [93.54286830844134]
We propose a novel convolutional operator for the task of point cloud completion.
The proposed operator does not require any max-pooling or voxelization operation.
We show that our approach achieves state-of-the-art performance in shape completion at low and high resolutions.
arXiv Detail & Related papers (2022-05-08T15:31:36Z) - Variable Rate Compression for Raw 3D Point Clouds [5.107705550575662]
We propose a novel variable rate deep compression architecture that operates on raw 3D point cloud data.
Our network is capable of explicitly processing point clouds and generating a compressed description.
arXiv Detail & Related papers (2022-02-28T15:15:39Z) - StolenEncoder: Stealing Pre-trained Encoders [62.02156378126672]
We propose the first attack called StolenEncoder to steal pre-trained image encoders.
Our results show that the encoders stolen by StolenEncoder have similar functionality with the target encoders.
arXiv Detail & Related papers (2022-01-15T17:04:38Z) - Dynamic Neural Representational Decoders for High-Resolution Semantic
Segmentation [98.05643473345474]
We propose a novel decoder, termed dynamic neural representational decoder (NRD)
As each location on the encoder's output corresponds to a local patch of the semantic labels, in this work, we represent these local patches of labels with compact neural networks.
This neural representation enables our decoder to leverage the smoothness prior in the semantic label space, and thus makes our decoder more efficient.
arXiv Detail & Related papers (2021-07-30T04:50:56Z) - Lightweight Compression of Intermediate Neural Network Features for
Collaborative Intelligence [32.03465747357384]
In collaborative intelligence applications, part of a deep neural network (DNN) is deployed on a lightweight device such as a mobile phone or edge device.
This paper presents a novel lightweight compression technique designed specifically to quantize and compress the features output by the intermediate layer of a split DNN.
arXiv Detail & Related papers (2021-05-15T00:10:12Z) - Lightweight compression of neural network feature tensors for
collaborative intelligence [32.03465747357384]
In collaborative intelligence applications, part of a deep neural network (DNN) is deployed on a relatively low-complexity device such as a mobile phone or edge device.
This paper presents a novel lightweight compression technique designed specifically to code the activations of a split DNN layer.
arXiv Detail & Related papers (2021-05-12T23:41:35Z) - OctSqueeze: Octree-Structured Entropy Model for LiDAR Compression [77.8842824702423]
We present a novel deep compression algorithm to reduce the memory footprint of LiDAR point clouds.
Our method exploits the sparsity and structural redundancy between points to reduce the memory footprint.
Our algorithm can be used to reduce the onboard and offboard storage of LiDAR points for applications such as self-driving cars.
arXiv Detail & Related papers (2020-05-14T17:48:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.