Representing Topological Self-Similarity Using Fractal Feature Maps for Accurate Segmentation of Tubular Structures
- URL: http://arxiv.org/abs/2407.14754v1
- Date: Sat, 20 Jul 2024 05:22:59 GMT
- Title: Representing Topological Self-Similarity Using Fractal Feature Maps for Accurate Segmentation of Tubular Structures
- Authors: Jiaxing Huang, Yanfeng Zhou, Yaoru Luo, Guole Liu, Heng Guo, Ge Yang,
- Abstract summary: In this study, we incorporate fractal features into a deep learning model by extending FD to the pixel-level using a sliding window technique.
The resulting fractal feature maps (FFMs) are then incorporated as additional input to the model and additional weight in the loss function.
Experiments on five tubular structure datasets validate the effectiveness and robustness of our approach.
- Score: 12.038095281876071
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurate segmentation of long and thin tubular structures is required in a wide variety of areas such as biology, medicine, and remote sensing. The complex topology and geometry of such structures often pose significant technical challenges. A fundamental property of such structures is their topological self-similarity, which can be quantified by fractal features such as fractal dimension (FD). In this study, we incorporate fractal features into a deep learning model by extending FD to the pixel-level using a sliding window technique. The resulting fractal feature maps (FFMs) are then incorporated as additional input to the model and additional weight in the loss function to enhance segmentation performance by utilizing the topological self-similarity. Moreover, we extend the U-Net architecture by incorporating an edge decoder and a skeleton decoder to improve boundary accuracy and skeletal continuity of segmentation, respectively. Extensive experiments on five tubular structure datasets validate the effectiveness and robustness of our approach. Furthermore, the integration of FFMs with other popular segmentation models such as HR-Net also yields performance enhancement, suggesting FFM can be incorporated as a plug-in module with different model architectures. Code and data are openly accessible at https://github.com/cbmi-group/FFM-Multi-Decoder-Network.
Related papers
- SACNet: A Spatially Adaptive Convolution Network for 2D Multi-organ Medical Segmentation [7.897088081928714]
Multi-organ segmentation in medical image analysis is crucial for diagnosis and treatment planning.
In this paper, we utilize the knowledge of Deformable Convolution V3 to optimize our Spatially Adaptive Convolution Network (SACNet)
Experiments on 3D slice datasets from ACDC and Synapse demonstrate that SACNet delivers superior segmentation performance compared to several existing methods.
arXiv Detail & Related papers (2024-07-14T10:58:09Z) - CFPFormer: Feature-pyramid like Transformer Decoder for Segmentation and Detection [1.837431956557716]
Feature pyramids have been widely adopted in convolutional neural networks (CNNs) and transformers for tasks like medical image segmentation and object detection.
We propose a novel decoder block that integrates feature pyramids and transformers.
Our model achieves superior performance in detecting small objects compared to existing methods.
arXiv Detail & Related papers (2024-04-23T18:46:07Z) - On The Potential of The Fractal Geometry and The CNNs Ability to Encode
it [1.7311053765541484]
The fractal dimension provides a statistical index of object complexity.
Although useful in several classification tasks, the fractal dimension is under-explored in deep learning applications.
We show that training a shallow network on fractal features achieves performance comparable to that of deep networks trained on raw data.
arXiv Detail & Related papers (2024-01-07T15:22:56Z) - Dynamic Snake Convolution based on Topological Geometric Constraints for
Tubular Structure Segmentation [12.081234339680456]
We use this knowledge to guide our DSCNet to simultaneously enhance perception in three stages: feature extraction, feature fusion, and loss constraint.
Experiments on 2D and 3D datasets show that our DSCNet provides better accuracy and continuity on the tubular structure segmentation task compared with several methods.
arXiv Detail & Related papers (2023-07-17T10:55:58Z) - T1: Scaling Diffusion Probabilistic Fields to High-Resolution on Unified
Visual Modalities [69.16656086708291]
Diffusion Probabilistic Field (DPF) models the distribution of continuous functions defined over metric spaces.
We propose a new model comprising of a view-wise sampling algorithm to focus on local structure learning.
The model can be scaled to generate high-resolution data while unifying multiple modalities.
arXiv Detail & Related papers (2023-05-24T03:32:03Z) - SpatioTemporal Focus for Skeleton-based Action Recognition [66.8571926307011]
Graph convolutional networks (GCNs) are widely adopted in skeleton-based action recognition.
We argue that the performance of recent proposed skeleton-based action recognition methods is limited by the following factors.
Inspired by the recent attention mechanism, we propose a multi-grain contextual focus module, termed MCF, to capture the action associated relation information.
arXiv Detail & Related papers (2022-03-31T02:45:24Z) - PnP-DETR: Towards Efficient Visual Analysis with Transformers [146.55679348493587]
Recently, DETR pioneered the solution vision tasks with transformers, it directly translates the image feature map into the object result.
Recent transformer-based image recognition model andTT show consistent efficiency gain.
arXiv Detail & Related papers (2021-09-15T01:10:30Z) - Manifold Topology Divergence: a Framework for Comparing Data Manifolds [109.0784952256104]
We develop a framework for comparing data manifold, aimed at the evaluation of deep generative models.
Based on the Cross-Barcode, we introduce the Manifold Topology Divergence score (MTop-Divergence)
We demonstrate that the MTop-Divergence accurately detects various degrees of mode-dropping, intra-mode collapse, mode invention, and image disturbance.
arXiv Detail & Related papers (2021-06-08T00:30:43Z) - Spatio-Temporal Inception Graph Convolutional Networks for
Skeleton-Based Action Recognition [126.51241919472356]
We design a simple and highly modularized graph convolutional network architecture for skeleton-based action recognition.
Our network is constructed by repeating a building block that aggregates multi-granularity information from both the spatial and temporal paths.
arXiv Detail & Related papers (2020-11-26T14:43:04Z) - Feature Pyramid Grids [140.11116687047058]
We present Feature Pyramid Grids (FPG), a deep multi-pathway feature pyramid.
FPG can improve single-pathway feature pyramid networks by significantly increasing its performance at similar computation cost.
arXiv Detail & Related papers (2020-04-07T17:59:52Z) - Multi-organ Segmentation over Partially Labeled Datasets with
Multi-scale Feature Abstraction [14.92032083210668]
Shortage of fully annotated datasets has been a limiting factor in developing deep learning based image segmentation algorithms.
We propose a unified training strategy that enables a novel multi-scale deep neural network to be trained on multiple partially labeled datasets.
arXiv Detail & Related papers (2020-01-01T13:51:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.