SAMConvex: Fast Discrete Optimization for CT Registration using
Self-supervised Anatomical Embedding and Correlation Pyramid
- URL: http://arxiv.org/abs/2307.09727v1
- Date: Wed, 19 Jul 2023 02:28:41 GMT
- Title: SAMConvex: Fast Discrete Optimization for CT Registration using
Self-supervised Anatomical Embedding and Correlation Pyramid
- Authors: Zi Li and Lin Tian and Tony C. W. Mok and Xiaoyu Bai and Puyang Wang
and Jia Ge and Jingren Zhou and Le Lu and Xianghua Ye and Ke Yan and Dakai
Jin
- Abstract summary: Estimating displacement vector field via a cost volume computed in the feature space has shown great success in image registration.
Existing feature descriptors only extract local features incapable of representing the global semantic information.
We propose SAMConvex, a fast coarse-to-fine discrete optimization method for CT registration.
- Score: 32.424451941998484
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Estimating displacement vector field via a cost volume computed in the
feature space has shown great success in image registration, but it suffers
excessive computation burdens. Moreover, existing feature descriptors only
extract local features incapable of representing the global semantic
information, which is especially important for solving large transformations.
To address the discussed issues, we propose SAMConvex, a fast coarse-to-fine
discrete optimization method for CT registration that includes a decoupled
convex optimization procedure to obtain deformation fields based on a
self-supervised anatomical embedding (SAM) feature extractor that captures both
local and global information. To be specific, SAMConvex extracts per-voxel
features and builds 6D correlation volumes based on SAM features, and
iteratively updates a flow field by performing lookups on the correlation
volumes with a coarse-to-fine scheme. SAMConvex outperforms the
state-of-the-art learning-based methods and optimization-based methods over two
inter-patient registration datasets (Abdomen CT and HeadNeck CT) and one
intra-patient registration dataset (Lung CT). Moreover, as an
optimization-based method, SAMConvex only takes $\sim2$s ($\sim5s$ with
instance optimization) for one paired images.
Related papers
- MsaMIL-Net: An End-to-End Multi-Scale Aware Multiple Instance Learning Network for Efficient Whole Slide Image Classification [0.7510165488300369]
Bag-based Multiple Instance Learning (MIL) approaches have emerged as the mainstream methodology for Whole Slide Image (WSI) classification.
This paper proposes an end-to-end multi-scale WSI classification framework that integrates multi-scale feature extraction with multiple instance learning.
arXiv Detail & Related papers (2025-03-11T16:16:44Z) - Efficient Knowledge Distillation of SAM for Medical Image Segmentation [0.04672991859386895]
The Segment Anything Model (SAM) has set a new standard in interactive image segmentation, offering robust performance across various tasks.
We propose a novel knowledge distillation approach, KD SAM, which incorporates both encoder and decoder optimization through a combination of Mean Squared Error (MSE) and Perceptual Loss.
KD SAM effectively balances segmentation accuracy and computational efficiency, making it well-suited for real-time medical image segmentation applications in resource-constrained environments.
arXiv Detail & Related papers (2025-01-28T06:33:30Z) - ASPS: Augmented Segment Anything Model for Polyp Segmentation [77.25557224490075]
The Segment Anything Model (SAM) has introduced unprecedented potential for polyp segmentation.
SAM's Transformer-based structure prioritizes global and low-frequency information.
CFA integrates a trainable CNN encoder branch with a frozen ViT encoder, enabling the integration of domain-specific knowledge.
arXiv Detail & Related papers (2024-06-30T14:55:32Z) - Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation [63.15257949821558]
Referring Remote Sensing Image (RRSIS) is a new challenge that combines computer vision and natural language processing.
Traditional Referring Image (RIS) approaches have been impeded by the complex spatial scales and orientations found in aerial imagery.
We introduce the Rotated Multi-Scale Interaction Network (RMSIN), an innovative approach designed for the unique demands of RRSIS.
arXiv Detail & Related papers (2023-12-19T08:14:14Z) - SAME++: A Self-supervised Anatomical eMbeddings Enhanced medical image
registration framework using stable sampling and regularized transformation [19.683682147655496]
In this work, we introduce a fast and accurate method for unsupervised 3D medical image registration building on top of a Self-supervised Anatomical eMbedding algorithm.
We name our approach SAM-Enhanced registration (SAME++), which decomposes image registration into four steps: affine transformation, coarse deformation, deep non-parametric transformation, and instance optimization.
As a complete registration framework, SAME++ markedly outperforms leading methods by $4.2%$ - $8.2%$ in terms of Dice score.
arXiv Detail & Related papers (2023-11-25T10:11:04Z) - Self-Supervised Dataset Distillation for Transfer Learning [77.4714995131992]
We propose a novel problem of distilling an unlabeled dataset into a set of small synthetic samples for efficient self-supervised learning (SSL)
We first prove that a gradient of synthetic samples with respect to a SSL objective in naive bilevel optimization is textitbiased due to randomness originating from data augmentations or masking.
We empirically validate the effectiveness of our method on various applications involving transfer learning.
arXiv Detail & Related papers (2023-10-10T10:48:52Z) - Consensus-Adaptive RANSAC [104.87576373187426]
We propose a new RANSAC framework that learns to explore the parameter space by considering the residuals seen so far via a novel attention layer.
The attention mechanism operates on a batch of point-to-model residuals, and updates a per-point estimation state to take into account the consensus found through a lightweight one-step transformer.
arXiv Detail & Related papers (2023-07-26T08:25:46Z) - Matching in the Wild: Learning Anatomical Embeddings for Multi-Modality
Images [28.221419419614183]
Radiotherapists require accurate registration of MR/CT images to effectively use information from both modalities.
Recent learning-based methods have shown promising results in the rigid/affine step.
We propose a new approach called Cross-SAM to enable cross-modality matching.
arXiv Detail & Related papers (2023-07-07T11:49:06Z) - Voxelmorph++ Going beyond the cranial vault with keypoint supervision
and multi-channel instance optimisation [8.88841928746097]
Recent Learn2Reg benchmark shows single-scale U-Net architectures fall short of state-of-the-art performance for abdominal or intra-patient lung registration.
Here, we propose two straightforward steps that greatly reduce this gap in accuracy.
First, we employ keypoint self-supervision with a novel network head that predicts a discretised heatmap.
Second, we replace multiple learned fine-tuning steps by a single instance with hand-crafted features and the Adam optimiser.
arXiv Detail & Related papers (2022-02-28T19:23:29Z) - Real-Time Scene Text Detection with Differentiable Binarization and
Adaptive Scale Fusion [62.269219152425556]
segmentation-based scene text detection methods have drawn extensive attention in the scene text detection field.
We propose a Differentiable Binarization (DB) module that integrates the binarization process into a segmentation network.
An efficient Adaptive Scale Fusion (ASF) module is proposed to improve the scale robustness by fusing features of different scales adaptively.
arXiv Detail & Related papers (2022-02-21T15:30:14Z) - Shared Space Transfer Learning for analyzing multi-site fMRI data [83.41324371491774]
Multi-voxel pattern analysis (MVPA) learns predictive models from task-based functional magnetic resonance imaging (fMRI) data.
MVPA works best with a well-designed feature set and an adequate sample size.
Most fMRI datasets are noisy, high-dimensional, expensive to collect, and with small sample sizes.
This paper proposes the Shared Space Transfer Learning (SSTL) as a novel transfer learning approach.
arXiv Detail & Related papers (2020-10-24T08:50:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.