Related papers: End-to-End Learning of Multi-Organ Implicit Surfaces from 3D Medical Imaging Data

End-to-End Learning of Multi-Organ Implicit Surfaces from 3D Medical Imaging Data

URL: http://arxiv.org/abs/2509.12068v1
Date: Mon, 15 Sep 2025 15:52:20 GMT
Title: End-to-End Learning of Multi-Organ Implicit Surfaces from 3D Medical Imaging Data
Authors: Farahdiba Zarin, Nicolas Padoy, Jérémy Dana, Vinkle Srivastav,
Abstract summary: ImplMORe is an end-to-end deep learning method using implicit surface representations for multi-organ reconstruction from 3D medical images.<n>By leveraging the continuous nature of occupancy functions, our approach outperforms the explicit representation based surface reconstruction approaches.
Score: 8.279683600959418
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: The fine-grained surface reconstruction of different organs from 3D medical imaging can provide advanced diagnostic support and improved surgical planning. However, the representation of the organs is often limited by the resolution, with a detailed higher resolution requiring more memory and computing footprint. Implicit representations of objects have been proposed to alleviate this problem in general computer vision by providing compact and differentiable functions to represent the 3D object shapes. However, architectural and data-related differences prevent the direct application of these methods to medical images. This work introduces ImplMORe, an end-to-end deep learning method using implicit surface representations for multi-organ reconstruction from 3D medical images. ImplMORe incorporates local features using a 3D CNN encoder and performs multi-scale interpolation to learn the features in the continuous domain using occupancy functions. We apply our method for single and multiple organ reconstructions using the totalsegmentator dataset. By leveraging the continuous nature of occupancy functions, our approach outperforms the discrete explicit representation based surface reconstruction approaches, providing fine-grained surface details of the organ at a resolution higher than the given input image. The source code will be made publicly available at: https://github.com/CAMMA-public/ImplMORe

Related papers

Object-X: Learning to Reconstruct Multi-Modal 3D Object Representations [114.57192386025373]
Object-X is a versatile multi-modal 3D representation framework.<n>It can encoding rich object embeddings and decoding them back into geometric and visual reconstructions.<n>It supports a range of downstream tasks, including scene alignment, single-image 3D object reconstruction, and localization.
arXiv Detail & Related papers (2025-06-05T09:14:42Z)
Volumetric Reconstruction of Prostatectomy Specimens from Histology [0.0]
Surgical treatment for prostate cancer often involves organ removal, i.e., prostatectomy.<n>The diagnostic process generates extensive and complex information that is difficult to represent in reports.<n>Existing approaches in this area have proven labor-intensive and challenging to integrate into clinical imaging modalities.<n>3D-SLIVER provides a simplified solution, implemented as an open-source 3DSlicer extension.
arXiv Detail & Related papers (2024-11-29T22:33:49Z)
Large Spatial Model: End-to-end Unposed Images to Semantic 3D [79.94479633598102]
Large Spatial Model (LSM) processes unposed RGB images directly into semantic radiance fields. LSM simultaneously estimates geometry, appearance, and semantics in a single feed-forward operation. It can generate versatile label maps by interacting with language at novel viewpoints.
arXiv Detail & Related papers (2024-10-24T17:54:42Z)
μ-Net: A Deep Learning-Based Architecture for μ-CT Segmentation [2.012378666405002]
X-ray computed microtomography (mu-CT) is a non-destructive technique that can generate high-resolution 3D images of the internal anatomy of medical and biological samples. extracting relevant information from 3D images requires semantic segmentation of the regions of interest. We propose a novel framework that uses a convolutional neural network (CNN) to automatically segment the full morphology of the heart of Carassius auratus.
arXiv Detail & Related papers (2024-06-24T15:29:08Z)
Disruptive Autoencoders: Leveraging Low-level features for 3D Medical Image Pre-training [51.16994853817024]
This work focuses on designing an effective pre-training framework for 3D radiology images. We introduce Disruptive Autoencoders, a pre-training framework that attempts to reconstruct the original image from disruptions created by a combination of local masking and low-level perturbations. The proposed pre-training framework is tested across multiple downstream tasks and achieves state-of-the-art performance.
arXiv Detail & Related papers (2023-07-31T17:59:42Z)
Multi-View Vertebra Localization and Identification from CT Images [57.56509107412658]
We propose a multi-view vertebra localization and identification from CT images. We convert the 3D problem into a 2D localization and identification task on different views. Our method can learn the multi-view global information naturally.
arXiv Detail & Related papers (2023-07-24T14:43:07Z)
A Point Cloud Generative Model via Tree-Structured Graph Convolutions for 3D Brain Shape Reconstruction [31.436531681473753]
It is almost impossible to obtain the intraoperative 3D shape information by using physical methods such as sensor scanning. In this paper, a general generative adversarial network (GAN) architecture is proposed to reconstruct the 3D point clouds (PCs) of brains by using one single 2D image.
arXiv Detail & Related papers (2021-07-21T07:57:37Z)
A Geometry-Informed Deep Learning Framework for Ultra-Sparse 3D Tomographic Image Reconstruction [13.44786774177579]
We establish a geometry-informed deep learning framework for ultra-sparse 3D tomographic image reconstruction. We demonstrate that the seamless inclusion of known priors is essential to enhance the performance of 3D volumetric computed tomography imaging.
arXiv Detail & Related papers (2021-05-25T06:20:03Z)
CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation [95.51455777713092]
Convolutional neural networks (CNNs) have been the de facto standard for nowadays 3D medical image segmentation. We propose a novel framework that efficiently bridges a bf Convolutional neural network and a bf Transformer bf (CoTr) for accurate 3D medical image segmentation.
arXiv Detail & Related papers (2021-03-04T13:34:22Z)
Detailed 3D Human Body Reconstruction from Multi-view Images Combining Voxel Super-Resolution and Learned Implicit Representation [12.459968574683625]
We propose a coarse-to-fine method to reconstruct a detailed 3D human body from multi-view images. The coarse 3D models are estimated by learning an implicit representation based on multi-scale features. The refined detailed 3D human body models can be produced by the voxel super-resolution which can preserve the details.
arXiv Detail & Related papers (2020-12-11T08:07:39Z)
Hierarchical Amortized Training for Memory-efficient High Resolution 3D GAN [52.851990439671475]
We propose a novel end-to-end GAN architecture that can generate high-resolution 3D images. We achieve this goal by using different configurations between training and inference. Experiments on 3D thorax CT and brain MRI demonstrate that our approach outperforms state of the art in image generation.
arXiv Detail & Related papers (2020-08-05T02:33:04Z)

This list is automatically generated from the titles and abstracts of the papers in this site.