RIDDLE: Lidar Data Compression with Range Image Deep Delta Encoding
- URL: http://arxiv.org/abs/2206.01738v1
- Date: Thu, 2 Jun 2022 21:53:43 GMT
- Title: RIDDLE: Lidar Data Compression with Range Image Deep Delta Encoding
- Authors: Xuanyu Zhou, Charles R. Qi, Yin Zhou, Dragomir Anguelov
- Abstract summary: Lidars are depth measuring sensors widely used in autonomous driving and augmented reality.
Large volume of data produced by lidars can lead to high costs in data storage and transmission.
We propose a novel data-driven range image compression algorithm, named RIDDLE (Range Image Deep DeLta.
- Score: 21.70770383279559
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Lidars are depth measuring sensors widely used in autonomous driving and
augmented reality. However, the large volume of data produced by lidars can
lead to high costs in data storage and transmission. While lidar data can be
represented as two interchangeable representations: 3D point clouds and range
images, most previous work focus on compressing the generic 3D point clouds. In
this work, we show that directly compressing the range images can leverage the
lidar scanning pattern, compared to compressing the unprojected point clouds.
We propose a novel data-driven range image compression algorithm, named RIDDLE
(Range Image Deep DeLta Encoding). At its core is a deep model that predicts
the next pixel value in a raster scanning order, based on contextual laser
shots from both the current and past scans (represented as a 4D point cloud of
spherical coordinates and time). The deltas between predictions and original
values can then be compressed by entropy encoding. Evaluated on the Waymo Open
Dataset and KITTI, our method demonstrates significant improvement in the
compression rate (under the same distortion) compared to widely used point
cloud and range image compression algorithms as well as recent deep methods.
Related papers
- PointRegGPT: Boosting 3D Point Cloud Registration using Generative Point-Cloud Pairs for Training [90.06520673092702]
We present PointRegGPT, boosting 3D point cloud registration using generative point-cloud pairs for training.
To our knowledge, this is the first generative approach that explores realistic data generation for indoor point cloud registration.
arXiv Detail & Related papers (2024-07-19T06:29:57Z) - 3D Point Cloud Compression with Recurrent Neural Network and Image
Compression Methods [0.0]
Storing and transmitting LiDAR point cloud data is essential for many AV applications.
Due to the sparsity and unordered structure of the data, it is difficult to compress point cloud data to a low volume.
We propose a new 3D-to-2D transformation which allows compression algorithms to efficiently exploit spatial correlations.
arXiv Detail & Related papers (2024-02-18T19:08:19Z) - 3D Compression Using Neural Fields [90.24458390334203]
We propose a novel NF-based compression algorithm for 3D data.
We demonstrate that our method excels at geometry compression on 3D point clouds as well as meshes.
It is straightforward to extend our compression algorithm to compress both geometry and attribute (e.g. color) of 3D data.
arXiv Detail & Related papers (2023-11-21T21:36:09Z) - Converting Depth Images and Point Clouds for Feature-based Pose
Estimation [0.0]
This paper presents a method of converting depth data into images capable of visualizing spatial details that are basically hidden in traditional depth images.
Compared to Bearing Angle images, our method yields brighter, higher-contrast images with more visible contours and more details.
arXiv Detail & Related papers (2023-10-23T13:29:42Z) - Poster: Making Edge-assisted LiDAR Perceptions Robust to Lossy Point
Cloud Compression [0.0]
We present an algorithm improving the quality of a LiDAR point cloud to mitigate the perception performance loss due to lossy compression.
Compared to existing image algorithms, our algorithm shows a better qualitative result when the point cloud is reconstructed from the range image.
arXiv Detail & Related papers (2023-09-08T18:34:48Z) - TriVol: Point Cloud Rendering via Triple Volumes [57.305748806545026]
We present a dense while lightweight 3D representation, named TriVol, that can be combined with NeRF to render photo-realistic images from point clouds.
Our framework has excellent generalization ability to render a category of scenes/objects without fine-tuning.
arXiv Detail & Related papers (2023-03-29T06:34:12Z) - Point2Pix: Photo-Realistic Point Cloud Rendering via Neural Radiance
Fields [63.21420081888606]
Recent Radiance Fields and extensions are proposed to synthesize realistic images from 2D input.
We present Point2Pix as a novel point to link the 3D sparse point clouds with 2D dense image pixels.
arXiv Detail & Related papers (2023-03-29T06:26:55Z) - Crowd Counting on Heavily Compressed Images with Curriculum Pre-Training [90.76576712433595]
Applying lossy compression on images processed by deep neural networks can lead to significant accuracy degradation.
Inspired by the curriculum learning paradigm, we present a novel training approach called curriculum pre-training (CPT) for crowd counting on compressed images.
arXiv Detail & Related papers (2022-08-15T08:43:21Z) - Accelerated Deep Lossless Image Coding with Unified Paralleleized GPU
Coding Architecture [1.2124289787900182]
Our algorithm is based on a neural network combined with an entropy encoder.
The neural network performs a density estimation on each pixel of the source image.
The density estimation is then used to code the target pixel, beating FLIF in terms of compression rate.
arXiv Detail & Related papers (2022-07-11T19:34:56Z) - Variable Rate Compression for Raw 3D Point Clouds [5.107705550575662]
We propose a novel variable rate deep compression architecture that operates on raw 3D point cloud data.
Our network is capable of explicitly processing point clouds and generating a compressed description.
arXiv Detail & Related papers (2022-02-28T15:15:39Z) - VPFNet: Improving 3D Object Detection with Virtual Point based LiDAR and
Stereo Data Fusion [62.24001258298076]
VPFNet is a new architecture that cleverly aligns and aggregates the point cloud and image data at the virtual' points.
Our VPFNet achieves 83.21% moderate 3D AP and 91.86% moderate BEV AP on the KITTI test set, ranking the 1st since May 21th, 2021.
arXiv Detail & Related papers (2021-11-29T08:51:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.