D-Net: Siamese based Network with Mutual Attention for Volume Alignment
- URL: http://arxiv.org/abs/2101.10248v1
- Date: Mon, 25 Jan 2021 17:24:16 GMT
- Title: D-Net: Siamese based Network with Mutual Attention for Volume Alignment
- Authors: Jian-Qing Zheng, Ngee Han Lim, Bartlomiej W. Papiez
- Abstract summary: D-net is an extension to the branched Siamese encoder-decoder structure connected by new mutual non-local links.
We present a novel network, D-net, to estimate arbitrary rotation and translation between 3D CT scans.
Results show a significant improvement in the estimation of CT alignment, outperforming the current comparable methods.
- Score: 3.00948372643855
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Alignment of contrast and non-contrast-enhanced imaging is essential for the
quantification of changes in several biomedical applications. In particular,
the extraction of cartilage shape from contrast-enhanced Computed Tomography
(CT) of tibiae requires accurate alignment of the bone, currently performed
manually. Existing deep learning-based methods for alignment require a common
template or are limited in rotation range. Therefore, we present a novel
network, D-net, to estimate arbitrary rotation and translation between 3D CT
scans that additionally does not require a prior standard template. D-net is an
extension to the branched Siamese encoder-decoder structure connected by new
mutual non-local links, which efficiently capture long-range connections of
similar features between two branches. The 3D supervised network is trained and
validated using preclinical CT scans of mouse tibiae with and without contrast
enhancement in cartilage. The presented results show a significant improvement
in the estimation of CT alignment, outperforming the current comparable
methods.
Related papers
- Preoperative-to-intraoperative Liver Registration for Laparoscopic Surgery via Latent-Grounded Correspondence Constraints [51.7011449975586]
Land-Reg is a deformable registration framework that learns latent-grounded 2D-3D landmark correspondences.<n>For rigid registration, Land-Reg embraces a Cross-modal Latent Alignment module.<n>An Uncertainty-enhanced Overlap Landmark Detector with similarity matching is proposed to robustly estimate explicit 2D-3D landmark correspondences.
arXiv Detail & Related papers (2026-03-02T10:44:03Z) - Precise Liver Tumor Segmentation in CT Using a Hybrid Deep Learning-Radiomics Framework [5.499971648188868]
We propose a hybrid framework that couples an attention-enhanced cascaded U-Net with handcrafted radiomics and voxel-wise 3D CNN refinement.<n>A 2.5D two-stage network with a densely connected encoder, sub-pixel convolution decoders and multi-scale attention gates produces initial liver and tumor probability maps.<n>A compact 3D patch-based CNN derived from AlexNet operates in a narrow band around the tumor boundary to perform voxel-level relabelling and contour smoothing.
arXiv Detail & Related papers (2025-12-08T14:09:21Z) - Deep Learning-Based Cross-Anatomy CT Synthesis Using Adapted nnResU-Net with Anatomical Feature Prioritized Loss [3.583510367091671]
We present a patch-based 3D nnUNet adaptation for MR to CT and CBCT to CT image translation using the multicenter SynthRAD2025 dataset.
arXiv Detail & Related papers (2025-09-26T14:22:15Z) - BridgeSplat: Bidirectionally Coupled CT and Non-Rigid Gaussian Splatting for Deformable Intraoperative Surgical Navigation [69.14180476971602]
We introduce BridgeSplat, a novel approach for deformable surgical navigation.<n>Our method rigs 3D Gaussians to a CT mesh, enabling joint optimization of Gaussian parameters and mesh deformation.<n>We demonstrate BridgeSplat's effectiveness on visceral pig surgeries and synthetic data of a human liver under simulation.
arXiv Detail & Related papers (2025-09-23T01:09:36Z) - TRACE: Temporally Reliable Anatomically-Conditioned 3D CT Generation with Enhanced Efficiency [40.82927972746919]
TRACE is a framework that generates 3D medical images with temporal alignment.<n>An overlapping-frame frame pairs pairs into a flexible length sequence, reconstructed into atemporally and anatomically aligned 3D volume.
arXiv Detail & Related papers (2025-07-01T14:35:39Z) - Deep Conditional Shape Models for 3D cardiac image segmentation [1.4042211166197214]
We introduce a novel segmentation algorithm that uses Deep Shape models (DCSMs) Conditional as a core component.
To fit the generated shape to the image, the shape model is conditioned on anatomic landmarks that can be automatically detected or provided by the user.
We demonstrate that the automatic DCSM outperforms the baseline for non-contrasted CT without the local refinement, and with the refinement for contrasted CT and 3DE, especially with significant improvement in the Hausdorff distance.
arXiv Detail & Related papers (2023-10-16T18:38:26Z) - Two-Stream Graph Convolutional Network for Intra-oral Scanner Image
Segmentation [133.02190910009384]
We propose a two-stream graph convolutional network (i.e., TSGCN) to handle inter-view confusion between different raw attributes.
Our TSGCN significantly outperforms state-of-the-art methods in 3D tooth (surface) segmentation.
arXiv Detail & Related papers (2022-04-19T10:41:09Z) - CyTran: A Cycle-Consistent Transformer with Multi-Level Consistency for
Non-Contrast to Contrast CT Translation [56.622832383316215]
We propose a novel approach to translate unpaired contrast computed tomography (CT) scans to non-contrast CT scans.
Our approach is based on cycle-consistent generative adversarial convolutional transformers, for short, CyTran.
Our empirical results show that CyTran outperforms all competing methods.
arXiv Detail & Related papers (2021-10-12T23:25:03Z) - The entire network structure of Crossmodal Transformer [4.605531191013731]
The proposed approach first deep learns skeletal features from 2D X-ray and 3D CT images.
As a result, the well-trained network can directly predict the spatial correspondence between arbitrary 2D X-ray and 3D CT.
arXiv Detail & Related papers (2021-04-29T11:47:31Z) - SCPM-Net: An Anchor-free 3D Lung Nodule Detection Network using Sphere
Representation and Center Points Matching [47.79483848496141]
We propose a 3D sphere representation-based center-points matching detection network (SCPM-Net)
It is anchor-free and automatically predicts the position, radius, and offset of nodules without the manual design of nodule/anchor parameters.
We show that our proposed SCPM-Net framework achieves superior performance compared with existing used anchor-based and anchor-free methods for lung nodule detection.
arXiv Detail & Related papers (2021-04-12T05:51:29Z) - TSGCNet: Discriminative Geometric Feature Learning with Two-Stream
GraphConvolutional Network for 3D Dental Model Segmentation [141.2690520327948]
We propose a two-stream graph convolutional network (TSGCNet) to learn multi-view information from different geometric attributes.
We evaluate our proposed TSGCNet on a real-patient dataset of dental models acquired by 3D intraoral scanners.
arXiv Detail & Related papers (2020-12-26T08:02:56Z) - Revisiting 3D Context Modeling with Supervised Pre-training for
Universal Lesion Detection in CT Slices [48.85784310158493]
We propose a Modified Pseudo-3D Feature Pyramid Network (MP3D FPN) to efficiently extract 3D context enhanced 2D features for universal lesion detection in CT slices.
With the novel pre-training method, the proposed MP3D FPN achieves state-of-the-art detection performance on the DeepLesion dataset.
The proposed 3D pre-trained weights can potentially be used to boost the performance of other 3D medical image analysis tasks.
arXiv Detail & Related papers (2020-12-16T07:11:16Z) - Context-Aware Refinement Network Incorporating Structural Connectivity
Prior for Brain Midline Delineation [50.868845400939314]
We propose a context-aware refinement network (CAR-Net) to refine and integrate the feature pyramid representation generated by the UNet.
For keeping the structural connectivity of the brain midline, we introduce a novel connectivity regular loss.
The proposed method requires fewer parameters and outperforms three state-of-the-art methods in terms of four evaluation metrics.
arXiv Detail & Related papers (2020-07-10T14:01:20Z) - A lateral semicircular canal segmentation based geometric calibration
for human temporal bone CT Image [13.829782834867192]
We propose an automatic calibration algorithm for temporal bone CT images.
The lateral semicircular canals (LSCs) are segmented as anchors at first. Then, we define a standard 3D coordinate system.
We design a novel 3D LSC segmentation encoder-decoder network, which introduces a 3D dilated convolution and a multi-pooling scheme for feature fusion in the encoding stage.
arXiv Detail & Related papers (2020-06-28T12:36:08Z) - HRINet: Alternative Supervision Network for High-resolution CT image
Interpolation [3.7966959476339035]
We propose a novel network, High Resolution Interpolation Network (HRINet), aiming at producing high-resolution CT images.
We combine the idea of ACAI and GANs, and propose a novel idea of alternative supervision method by applying supervised and unsupervised training.
Our experiments show the great improvement on 256 2 and 5122 images quantitatively and qualitatively.
arXiv Detail & Related papers (2020-02-11T15:09:42Z) - A$^3$DSegNet: Anatomy-aware artifact disentanglement and segmentation
network for unpaired segmentation, artifact reduction, and modality
translation [18.500206499468902]
CBCT images are of low-quality and artifact-laden due to noise, poor tissue contrast, and the presence of metallic objects.
There exists a wealth of artifact-free, high quality CT images with vertebra annotations.
This motivates us to build a CBCT vertebra segmentation model using unpaired CT images with annotations.
arXiv Detail & Related papers (2020-01-02T06:37:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.