Related papers: RibPull: Implicit Occupancy Fields and Medial Axis Extraction for CT Ribcage Scans

RibPull: Implicit Occupancy Fields and Medial Axis Extraction for CT Ribcage Scans

URL: http://arxiv.org/abs/2509.01402v1
Date: Mon, 01 Sep 2025 11:54:50 GMT
Title: RibPull: Implicit Occupancy Fields and Medial Axis Extraction for CT Ribcage Scans
Authors: Emmanouil Nikolakakis, Amine Ouasfi, Julie Digne, Razvan Marinescu,
Abstract summary: Implicit 3D representations use continuous functions that handle sparse and noisy data more effectively than discrete methods.<n>We evaluate our methodology on 20 medical scans from the RibSeg dataset, which is itself an extension of the RibFrac dataset.
Score: 10.8145995157397
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We present RibPull, a methodology that utilizes implicit occupancy fields to bridge computational geometry and medical imaging. Implicit 3D representations use continuous functions that handle sparse and noisy data more effectively than discrete methods. While voxel grids are standard for medical imaging, they suffer from resolution limitations, topological information loss, and inefficient handling of sparsity. Coordinate functions preserve complex geometrical information and represent a better solution for sparse data representation, while allowing for further morphological operations. Implicit scene representations enable neural networks to encode entire 3D scenes within their weights. The result is a continuous function that can implicitly compesate for sparse signals and infer further information about the 3D scene by passing any combination of 3D coordinates as input to the model. In this work, we use neural occupancy fields that predict whether a 3D point lies inside or outside an object to represent CT-scanned ribcages. We also apply a Laplacian-based contraction to extract the medial axis of the ribcage, thus demonstrating a geometrical operation that benefits greatly from continuous coordinate-based 3D scene representations versus voxel-based representations. We evaluate our methodology on 20 medical scans from the RibSeg dataset, which is itself an extension of the RibFrac dataset. We will release our code upon publication.

Related papers

TomoGraphView: 3D Medical Image Classification with Omnidirectional Slice Representations and Graph Neural Networks [2.2906925991630085]
3D medical image classification remains a challenging task due to the complex spatial relationships and long-range dependencies inherent in accessible data.<n>Recent studies have highlighted the potential of 2D vision foundation models, originally trained on natural images, as powerful feature extractors for medical image analysis.<n>We propose TomoGraphView, a novel framework that integrates omnidirectional volume slicing with spherical graph-based feature aggregation.
arXiv Detail & Related papers (2025-11-12T16:30:34Z)
End-to-End Learning of Multi-Organ Implicit Surfaces from 3D Medical Imaging Data [8.279683600959418]
ImplMORe is an end-to-end deep learning method using implicit surface representations for multi-organ reconstruction from 3D medical images.<n>By leveraging the continuous nature of occupancy functions, our approach outperforms the explicit representation based surface reconstruction approaches.
arXiv Detail & Related papers (2025-09-15T15:52:20Z)
NUDF: Neural Unsigned Distance Fields for high resolution 3D medical image segmentation [0.13431733228151765]
We propose to learn a Neural Unsigned Distance Field (NUDF) directly from the image.<n>We evaluate our method on the task of left atrial appendage (LAA) segmentation from Computed Tomography (CT) images.<n>We are able to predict 3D mesh models that capture the details of the LAA and achieve accuracy in the order of the voxel spacing in the CT images.
arXiv Detail & Related papers (2025-04-25T13:32:16Z)
GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction [70.65250036489128]
3D semantic occupancy prediction aims to obtain 3D fine-grained geometry and semantics of the surrounding scene. We propose an object-centric representation to describe 3D scenes with sparse 3D semantic Gaussians. GaussianFormer achieves comparable performance with state-of-the-art methods with only 17.8% - 24.8% of their memory consumption.
arXiv Detail & Related papers (2024-05-27T17:59:51Z)
N-BVH: Neural ray queries with bounding volume hierarchies [51.430495562430565]
In 3D computer graphics, the bulk of a scene's memory usage is due to polygons and textures. We devise N-BVH, a neural compression architecture designed to answer arbitrary ray queries in 3D. Our method provides faithful approximations of visibility, depth, and appearance attributes.
arXiv Detail & Related papers (2024-05-25T13:54:34Z)
ToNNO: Tomographic Reconstruction of a Neural Network's Output for Weakly Supervised Segmentation of 3D Medical Images [6.035125735474387]
ToNNO is based on the Tomographic reconstruction of a Neural Network's Output. It extracts stacks of slices with different angles from the input 3D volume, feeds these slices to a 2D encoder, and applies the inverse Radon transform in order to reconstruct a 3D heatmap of the encoder's predictions. We apply it to weakly supervised medical image segmentation by training the 2D encoder to output high values for slices containing the regions of interest.
arXiv Detail & Related papers (2024-04-19T11:27:56Z)
Oriented-grid Encoder for 3D Implicit Representations [10.02138130221506]
This paper is the first to exploit 3D characteristics in 3D geometric encoders explicitly. Our method gets state-of-the-art results when compared to the prior techniques.
arXiv Detail & Related papers (2024-02-09T19:28:13Z)
Two-Stream Graph Convolutional Network for Intra-oral Scanner Image Segmentation [133.02190910009384]
We propose a two-stream graph convolutional network (i.e., TSGCN) to handle inter-view confusion between different raw attributes. Our TSGCN significantly outperforms state-of-the-art methods in 3D tooth (surface) segmentation.
arXiv Detail & Related papers (2022-04-19T10:41:09Z)
Using the Order of Tomographic Slices as a Prior for Neural Networks Pre-Training [1.1470070927586016]
We propose a pre-training method SortingLoss on slices instead of volumes. It performs pre-training on slices instead of volumes, so that a model could be fine-tuned on a sparse set of slices. We show that the proposed method performs on par with SimCLR, while working 2x faster and requiring 1.5x less memory.
arXiv Detail & Related papers (2022-03-17T14:58:15Z)
Cherry-Picking Gradients: Learning Low-Rank Embeddings of Visual Data via Differentiable Cross-Approximation [53.95297550117153]
We propose an end-to-end trainable framework that processes large-scale visual data tensors by looking emphat a fraction of their entries only. The proposed approach is particularly useful for large-scale multidimensional grid data, and for tasks that require context over a large receptive field.
arXiv Detail & Related papers (2021-05-29T08:39:57Z)
CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation [95.51455777713092]
Convolutional neural networks (CNNs) have been the de facto standard for nowadays 3D medical image segmentation. We propose a novel framework that efficiently bridges a bf Convolutional neural network and a bf Transformer bf (CoTr) for accurate 3D medical image segmentation.
arXiv Detail & Related papers (2021-03-04T13:34:22Z)
Learning Hybrid Representations for Automatic 3D Vessel Centerline Extraction [57.74609918453932]
Automatic blood vessel extraction from 3D medical images is crucial for vascular disease diagnoses. Existing methods may suffer from discontinuities of extracted vessels when segmenting such thin tubular structures from 3D images. We argue that preserving the continuity of extracted vessels requires to take into account the global geometry. We propose a hybrid representation learning approach to address this challenge.
arXiv Detail & Related papers (2020-12-14T05:22:49Z)
Exploring Deep 3D Spatial Encodings for Large-Scale 3D Scene Understanding [19.134536179555102]
We propose an alternative approach to overcome the limitations of CNN based approaches by encoding the spatial features of raw 3D point clouds into undirected graph models. The proposed method achieves on par state-of-the-art accuracy with improved training time and model stability thus indicating strong potential for further research.
arXiv Detail & Related papers (2020-11-29T12:56:19Z)

This list is automatically generated from the titles and abstracts of the papers in this site.