Integrating Multi-scale and Multi-filtration Topological Features for Medical Image Classification
- URL: http://arxiv.org/abs/2512.07190v1
- Date: Mon, 08 Dec 2025 06:02:02 GMT
- Title: Integrating Multi-scale and Multi-filtration Topological Features for Medical Image Classification
- Authors: Pengfei Gu, Huimin Li, Haoteng Tang, Dongkuan, Xu, Erik Enriquez, DongChul Kim, Bin Fu, Danny Z. Chen,
- Abstract summary: Deep neural networks have shown remarkable performance in medical image classification.<n>We propose a new topology-guided classification framework that extracts multi-scale and multi-filtration persistent topological features.<n>Our approach enhances the model's capacity to recognize complex anatomical structures.
- Score: 20.820287362872975
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Modern deep neural networks have shown remarkable performance in medical image classification. However, such networks either emphasize pixel-intensity features instead of fundamental anatomical structures (e.g., those encoded by topological invariants), or they capture only simple topological features via single-parameter persistence. In this paper, we propose a new topology-guided classification framework that extracts multi-scale and multi-filtration persistent topological features and integrates them into vision classification backbones. For an input image, we first compute cubical persistence diagrams (PDs) across multiple image resolutions/scales. We then develop a ``vineyard'' algorithm that consolidates these PDs into a single, stable diagram capturing signatures at varying granularities, from global anatomy to subtle local irregularities that may indicate early-stage disease. To further exploit richer topological representations produced by multiple filtrations, we design a cross-attention-based neural network that directly processes the consolidated final PDs. The resulting topological embeddings are fused with feature maps from CNNs or Transformers. By integrating multi-scale and multi-filtration topologies into an end-to-end architecture, our approach enhances the model's capacity to recognize complex anatomical structures. Evaluations on three public datasets show consistent, considerable improvements over strong baselines and state-of-the-art methods, demonstrating the value of our comprehensive topological perspective for robust and interpretable medical image classification.
Related papers
- A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis [82.01597026329158]
We introduce a Correlation-Regulated Alignment Framework for Tissue Synthesis (CRAFTS) for pathology-specific text-to-image synthesis.<n>CRAFTS incorporates a novel alignment mechanism that suppresses semantic drift to ensure biological accuracy.<n>This model generates diverse pathological images spanning 30 cancer types, with quality rigorously validated by objective metrics and pathologist evaluations.
arXiv Detail & Related papers (2025-12-15T10:22:43Z) - Large Language Model Evaluated Stand-alone Attention-Assisted Graph Neural Network with Spatial and Structural Information Interaction for Precise Endoscopic Image Segmentation [16.773882069530426]
We propose FOCUS-Med, which stands for Fusion of spatial and structural graph with attentional context-aware polyp segmentation.<n> FOCUS-Med integrates a Dual Graph Convolutional Network (Dual-GCN) module to capture contextual spatial and topological structural dependencies.<n>Experiments on public benchmarks demonstrate that FOCUS-Med achieves state-of-the-art performance across five key metrics.
arXiv Detail & Related papers (2025-08-09T15:53:19Z) - A Graph-Based Framework for Interpretable Whole Slide Image Analysis [86.37618055724441]
We develop a framework that transforms whole-slide images into biologically-informed graph representations.<n>Our approach builds graph nodes from tissue regions that respect natural structures, not arbitrary grids.<n>We demonstrate strong performance on challenging cancer staging and survival prediction tasks.
arXiv Detail & Related papers (2025-03-14T20:15:04Z) - PHG-Net: Persistent Homology Guided Medical Image Classification [14.450329809640422]
We propose a persistent homology guided approach (PHG-Net) that explores topological features of objects for medical image classification.
For an input image, we first compute its cubical persistence diagram and extract topological features into a vector representation.
The extracted topological features are then incorporated into the feature map generated by CNN or Transformer for feature fusion.
arXiv Detail & Related papers (2023-11-28T21:34:06Z) - Dual-channel Prototype Network for few-shot Classification of
Pathological Images [0.7562219957261347]
We introduce the Dual-channel Prototype Network (DCPN) to tackle the challenge of classifying pathological images with limited samples.
DCPN augments the Pyramid Vision Transformer framework for few-shot classification via self-supervised learning and integrates it with convolutional neural networks.
This combination forms a dual-channel architecture that extracts multi-scale, highly precise pathological features.
arXiv Detail & Related papers (2023-11-14T03:03:21Z) - Asymmetric Co-Training with Explainable Cell Graph Ensembling for
Histopathological Image Classification [28.949527817202984]
We propose an asymmetric co-training framework combining a deep graph convolutional network and a convolutional neural network.
We build a 14-layer deep graph convolutional network to handle cell graph data.
We evaluate our approach on the private LUAD7C and public colorectal cancer datasets.
arXiv Detail & Related papers (2023-08-24T12:27:03Z) - NexToU: Efficient Topology-Aware U-Net for Medical Image Segmentation [3.8336080345323227]
CNN and Transformer variants have emerged as the leading medical image segmentation backbones.
We propose NexToU, a novel hybrid architecture for medical image segmentation.
Our method consistently outperforms other state-of-the-art (SOTA) architectures.
arXiv Detail & Related papers (2023-05-25T10:18:57Z) - Affinity Feature Strengthening for Accurate, Complete and Robust Vessel
Segmentation [48.638327652506284]
Vessel segmentation is crucial in many medical image applications, such as detecting coronary stenoses, retinal vessel diseases and brain aneurysms.
We present a novel approach, the affinity feature strengthening network (AFN), which jointly models geometry and refines pixel-wise segmentation features using a contrast-insensitive, multiscale affinity approach.
arXiv Detail & Related papers (2022-11-12T05:39:17Z) - Two-Stream Graph Convolutional Network for Intra-oral Scanner Image
Segmentation [133.02190910009384]
We propose a two-stream graph convolutional network (i.e., TSGCN) to handle inter-view confusion between different raw attributes.
Our TSGCN significantly outperforms state-of-the-art methods in 3D tooth (surface) segmentation.
arXiv Detail & Related papers (2022-04-19T10:41:09Z) - Dist2Cycle: A Simplicial Neural Network for Homology Localization [66.15805004725809]
Simplicial complexes can be viewed as high dimensional generalizations of graphs that explicitly encode multi-way ordered relations.
We propose a graph convolutional model for learning functions parametrized by the $k$-homological features of simplicial complexes.
arXiv Detail & Related papers (2021-10-28T14:59:41Z) - Few-shot Medical Image Segmentation using a Global Correlation Network
with Discriminative Embedding [60.89561661441736]
We propose a novel method for few-shot medical image segmentation.
We construct our few-shot image segmentor using a deep convolutional network trained episodically.
We enhance discriminability of deep embedding to encourage clustering of the feature domains of the same class.
arXiv Detail & Related papers (2020-12-10T04:01:07Z) - Pathological Retinal Region Segmentation From OCT Images Using Geometric
Relation Based Augmentation [84.7571086566595]
We propose improvements over previous GAN-based medical image synthesis methods by jointly encoding the intrinsic relationship of geometry and shape.
The proposed method outperforms state-of-the-art segmentation methods on the public RETOUCH dataset having images captured from different acquisition procedures.
arXiv Detail & Related papers (2020-03-31T11:50:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.