Unsupervised 3D out-of-distribution detection with latent diffusion models
- URL: http://arxiv.org/abs/2307.03777v1
- Date: Fri, 7 Jul 2023 18:00:38 GMT
- Title: Unsupervised 3D out-of-distribution detection with latent diffusion
  models
- Authors: Mark S. Graham, Walter Hugo Lopez Pinaya, Paul Wright, Petru-Daniel
  Tudosiu, Yee H. Mah, James T. Teo, H. Rolf Jäger, David Werring, Parashkev
  Nachev, Sebastien Ourselin, and M. Jorge Cardoso
- Abstract summary: We propose to use Latent Diffusion Models (LDMs) to enable the scaling of DDPMs to high-resolution 3D medical data.
Not only does the proposed LDM-based approach achieve statistically significantly better performance, it also shows less sensitivity to the underlying latent representation.
- Score: 1.7587591581995812
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Methods for out-of-distribution (OOD) detection that scale to 3D data are
crucial components of any real-world clinical deep learning system. Classic
denoising diffusion probabilistic models (DDPMs) have been recently proposed as
a robust way to perform reconstruction-based OOD detection on 2D datasets, but
do not trivially scale to 3D data. In this work, we propose to use Latent
Diffusion Models (LDMs), which enable the scaling of DDPMs to high-resolution
3D medical data. We validate the proposed approach on near- and far-OOD
datasets and compare it to a recently proposed, 3D-enabled approach using
Latent Transformer Models (LTMs). Not only does the proposed LDM-based approach
achieve statistically significantly better performance, it also shows less
sensitivity to the underlying latent representation, scales more favourably in
memory, and produces better spatial anomaly maps. Code is available at
https://github.com/marksgraham/ddpm-ood
 
      
Related papers
- Boosting 3D Liver Shape Datasets with Diffusion Models and Implicit Neural Representations [0.7106122418396085]
 We propose a solution using diffusion models combined with implicit neural representations (INRs) to augment and expand existing datasets.
Our approach utilizes the generative capabilities of diffusion models to create realistic, diverse 3D liver shapes.
 arXiv  Detail & Related papers  (2025-04-28T00:56:18Z)
- Introducing 3D Representation for Medical Image Volume-to-Volume Translation via Score Fusion [3.3559609260669303]
 We present Score-Fusion, a novel volumetric translation model that effectively learns 3D representations by ensembling perpendicularly trained 2D diffusion models in score function space.
We show that Score-Fusion achieves superior accuracy and volumetric fidelity in 3D medical image super-resolution and modality translation.
 arXiv  Detail & Related papers  (2025-01-13T15:54:21Z)
- DSplats: 3D Generation by Denoising Splats-Based Multiview Diffusion Models [67.50989119438508]
 We introduce DSplats, a novel method that directly denoises multiview images using Gaussian-based Reconstructors to produce realistic 3D assets.
Our experiments demonstrate that DSplats not only produces high-quality, spatially consistent outputs, but also sets a new standard in single-image to 3D reconstruction.
 arXiv  Detail & Related papers  (2024-12-11T07:32:17Z)
- A Lesson in Splats: Teacher-Guided Diffusion for 3D Gaussian Splats Generation with 2D Supervision [65.33043028101471]
 We introduce a diffusion model for Gaussian Splats, SplatDiffusion, to enable generation of three-dimensional structures from single images.
Existing methods rely on deterministic, feed-forward predictions, which limit their ability to handle the inherent ambiguity of 3D inference from 2D data.
 arXiv  Detail & Related papers  (2024-12-01T00:29:57Z)
- R3D-AD: Reconstruction via Diffusion for 3D Anomaly Detection [12.207437451118036]
 3D anomaly detection plays a crucial role in monitoring parts for localized inherent defects in precision manufacturing.
 Embedding-based and reconstruction-based approaches are among the most popular and successful methods.
We propose R3D-AD, reconstructing anomalous point clouds by diffusion model for precise 3D anomaly detection.
 arXiv  Detail & Related papers  (2024-07-15T16:10:58Z)
- DM3D: Distortion-Minimized Weight Pruning for Lossless 3D Object Detection [42.07920565812081]
 We propose a novel post-training weight pruning scheme for 3D object detection.
It determines redundant parameters in the pretrained model that lead to minimal distortion in both locality and confidence.
This framework aims to minimize detection distortion of network output to maximally maintain detection precision.
 arXiv  Detail & Related papers  (2024-07-02T09:33:32Z)
- DeepSet SimCLR: Self-supervised deep sets for improved pathology
  representation learning [4.40560654491339]
 We aim to improve standard 2D SSL algorithms by modelling the inherent 3D nature of these datasets implicitly.
We propose two variants that build upon a strong baseline model and show that both of these variants often outperform the baseline in a variety of downstream tasks.
 arXiv  Detail & Related papers  (2024-02-23T20:37:59Z)
- Volumetric Semantically Consistent 3D Panoptic Mapping [77.13446499924977]
 We introduce an online 2D-to-3D semantic instance mapping algorithm aimed at generating semantic 3D maps suitable for autonomous agents in unstructured environments.
It introduces novel ways of integrating semantic prediction confidence during mapping, producing semantic and instance-consistent 3D regions.
The proposed method achieves accuracy superior to the state of the art on public large-scale datasets, improving on a number of widely used metrics.
 arXiv  Detail & Related papers  (2023-09-26T08:03:10Z)
- Diffusion-based 3D Object Detection with Random Boxes [58.43022365393569]
 Existing anchor-based 3D detection methods rely on empirically set anchors, which makes the algorithms lack elegance.
Our proposed Diff3Det migrates the diffusion model to proposal generation for 3D object detection by considering the detection boxes as generative targets.
In the inference stage, the model progressively refines a set of random boxes to the prediction results.
 arXiv  Detail & Related papers  (2023-09-05T08:49:53Z)
- Improving 3D Imaging with Pre-Trained Perpendicular 2D Diffusion Models [52.529394863331326]
 We propose a novel approach using two perpendicular pre-trained 2D diffusion models to solve the 3D inverse problem.
Our method is highly effective for 3D medical image reconstruction tasks, including MRI Z-axis super-resolution, compressed sensing MRI, and sparse-view CT.
 arXiv  Detail & Related papers  (2023-03-15T08:28:06Z)
- Solving Sample-Level Out-of-Distribution Detection on 3D Medical Images [0.06117371161379209]
 Out-of-distribution (OOD) detection helps to identify data samples that differ from the training distribution, increasing the model's reliability.
Recent works have developed DL-based OOD detection that achieves promising results on 2D medical images.
However, scaling most of these approaches to 3D images is computationally intractable.
We propose a histogram-based method that requires no DL and achieves almost perfect results in this domain.
 arXiv  Detail & Related papers  (2022-12-13T11:42:23Z)
- Automated Model Design and Benchmarking of 3D Deep Learning Models for
  COVID-19 Detection with Chest CT Scans [72.04652116817238]
 We propose a differentiable neural architecture search (DNAS) framework to automatically search for 3D DL models for 3D chest CT scan classification.
We also apply the Class Activation Mapping (CAM) technique to our models to make the results interpretable.
 arXiv  Detail & Related papers  (2021-01-14T03:45:01Z)
- Revisiting 3D Context Modeling with Supervised Pre-training for
  Universal Lesion Detection in CT Slices [48.85784310158493]
 We propose a Modified Pseudo-3D Feature Pyramid Network (MP3D FPN) to efficiently extract 3D context enhanced 2D features for universal lesion detection in CT slices.
With the novel pre-training method, the proposed MP3D FPN achieves state-of-the-art detection performance on the DeepLesion dataset.
The proposed 3D pre-trained weights can potentially be used to boost the performance of other 3D medical image analysis tasks.
 arXiv  Detail & Related papers  (2020-12-16T07:11:16Z)
- Reinforced Axial Refinement Network for Monocular 3D Object Detection [160.34246529816085]
 Monocular 3D object detection aims to extract the 3D position and properties of objects from a 2D input image.
 Conventional approaches sample 3D bounding boxes from the space and infer the relationship between the target object and each of them; however, the probability of effective samples is relatively small in 3D space.
We propose to start with an initial prediction and refine it gradually towards the ground truth, with only one 3D parameter changed in each step.
This requires designing a policy which gets a reward after several steps, and thus we adopt reinforcement learning to optimize it.
 arXiv  Detail & Related papers  (2020-08-31T17:10:48Z)
This list is automatically generated from the titles and abstracts of the papers on this site.
       
     