Related papers: Image-Pointcloud Fusion based Anomaly Detection using PD-REAL Dataset

Image-Pointcloud Fusion based Anomaly Detection using PD-REAL Dataset

URL: http://arxiv.org/abs/2311.04095v1
Date: Tue, 7 Nov 2023 16:05:27 GMT
Title: Image-Pointcloud Fusion based Anomaly Detection using PD-REAL Dataset
Authors: Jianjian Qin, Chunzhi Gu, Jun Yu, Chao Zhang
Abstract summary: We present PD-REAL, a novel large-scale dataset for unsupervised anomaly detection (AD) in the 3D domain. It is motivated by the fact that 2D-only representations in the AD task may fail to capture the geometric structures of anomalies due to uncertainty in lighting conditions or shooting angles. PD-REAL consists entirely of Play-Doh models for 15 object categories and focuses on the analysis of potential benefits from 3D information in a controlled environment.
Score: 15.212031255539022
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We present PD-REAL, a novel large-scale dataset for unsupervised anomaly detection (AD) in the 3D domain. It is motivated by the fact that 2D-only representations in the AD task may fail to capture the geometric structures of anomalies due to uncertainty in lighting conditions or shooting angles. PD-REAL consists entirely of Play-Doh models for 15 object categories and focuses on the analysis of potential benefits from 3D information in a controlled environment. Specifically, objects are first created with six types of anomalies, such as dent, crack, or perforation, and then photographed under different lighting conditions to mimic real-world inspection scenarios. To demonstrate the usefulness of 3D information, we use a commercially available RealSense camera to capture RGB and depth images. Compared to the existing 3D dataset for AD tasks, the data acquisition of PD-REAL is significantly cheaper, easily scalable and easier to control variables. Extensive evaluations with state-of-the-art AD algorithms on our dataset demonstrate the benefits as well as challenges of using 3D information. Our dataset can be downloaded from https://github.com/Andy-cs008/PD-REAL

Related papers

Robust 3D Object Detection using Probabilistic Point Clouds from Single-Photon LiDARs [29.92823252627008]
LiDAR-based 3D sensors provide point clouds, a canonical 3D representation used in various scene understanding tasks.<n>Modern LiDARs face challenges in several real-world scenarios, such as long-distance or low-albedo objects, producing sparse or erroneous point clouds.<n>We propose Probabilistic Point Clouds (PPC), a novel 3D scene representation where each point is augmented with a probability attribute.
arXiv Detail & Related papers (2025-07-31T21:32:21Z)
DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation [49.32104127246474]
DriveGEN is a training-free controllable Text-to-Image Diffusion Generation. It consistently preserves objects with precise 3D geometry across diverse Out-of-Distribution generations.
arXiv Detail & Related papers (2025-03-14T06:35:38Z)
PointAD: Comprehending 3D Anomalies from Points and Pixels for Zero-shot 3D Anomaly Detection [13.60524473223155]
This paper introduces PointAD, a novel approach that transfers the strong generalization capabilities of CLIP for recognizing 3D anomalies on unseen objects. PointAD renders 3D anomalies into multiple 2D renderings and projects them back into 3D space. Our model can directly integrate RGB information, further enhancing the understanding of 3D anomalies in a plug-and-play manner.
arXiv Detail & Related papers (2024-10-01T01:40:22Z)
VFMM3D: Releasing the Potential of Image by Vision Foundation Model for Monocular 3D Object Detection [80.62052650370416]
monocular 3D object detection holds significant importance across various applications, including autonomous driving and robotics. In this paper, we present VFMM3D, an innovative framework that leverages the capabilities of Vision Foundation Models (VFMs) to accurately transform single-view images into LiDAR point cloud representations.
arXiv Detail & Related papers (2024-04-15T03:12:12Z)
3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features [70.50665869806188]
3DiffTection is a state-of-the-art method for 3D object detection from single images. We fine-tune a diffusion model to perform novel view synthesis conditioned on a single image. We further train the model on target data with detection supervision.
arXiv Detail & Related papers (2023-11-07T23:46:41Z)
Real3D-AD: A Dataset of Point Cloud Anomaly Detection [75.56719157477661]
We introduce Real3D-AD, a challenging high-precision point cloud anomaly detection dataset. With 1,254 high-resolution 3D items from forty thousand to millions of points for each item, Real3D-AD is the largest dataset for high-precision 3D industrial anomaly detection. We present a comprehensive benchmark for Real3D-AD, revealing the absence of baseline methods for high-precision point cloud anomaly detection.
arXiv Detail & Related papers (2023-09-23T00:43:38Z)
Digital Twin Tracking Dataset (DTTD): A New RGB+Depth 3D Dataset for Longer-Range Object Tracking Applications [3.9776693020673677]
Digital twin is a problem of augmenting real objects with their digital counterparts. A critical component in a good digital-twin system is real-time, accurate 3D object tracking. In this work, we create a novel RGB-D dataset, called Digital Twin Tracking dataset (DTTD)
arXiv Detail & Related papers (2023-02-12T20:06:07Z)
Bridged Transformer for Vision and Point Cloud 3D Object Detection [92.86856146086316]
Bridged Transformer (BrT) is an end-to-end architecture for 3D object detection. BrT learns to identify 3D and 2D object bounding boxes from both points and image patches. We experimentally show that BrT surpasses state-of-the-art methods on SUN RGB-D and ScanNetV2 datasets.
arXiv Detail & Related papers (2022-10-04T05:44:22Z)
3D-VField: Learning to Adversarially Deform Point Clouds for Robust 3D Object Detection [111.32054128362427]
In safety-critical settings, robustness on out-of-distribution and long-tail samples is fundamental to circumvent dangerous issues. We substantially improve the generalization of 3D object detectors to out-of-domain data by taking into account deformed point clouds during training. We propose and share open source CrashD: a synthetic dataset of realistic damaged and rare cars.
arXiv Detail & Related papers (2021-12-09T08:50:54Z)
SF-UDA$^{3D}$: Source-Free Unsupervised Domain Adaptation for LiDAR-Based 3D Object Detection [66.63707940938012]
3D object detectors based only on LiDAR point clouds hold the state-of-the-art on modern street-view benchmarks. This paper proposes SF-UDA$3D$ to domain-adapt the state-of-the-art PointRCNN 3D detector to target domains for which we have no annotations.
arXiv Detail & Related papers (2020-10-16T08:44:49Z)

This list is automatically generated from the titles and abstracts of the papers in this site.