Transfer Learning from an Artificial Radiograph-landmark Dataset for
Registration of the Anatomic Skull Model to Dual Fluoroscopic X-ray Images
- URL: http://arxiv.org/abs/2108.06466v1
- Date: Sat, 14 Aug 2021 04:49:36 GMT
- Title: Transfer Learning from an Artificial Radiograph-landmark Dataset for
Registration of the Anatomic Skull Model to Dual Fluoroscopic X-ray Images
- Authors: Chaochao Zhou, Thomas Cha, Yun Peng, Guoan Li
- Abstract summary: We propose a transfer learning strategy for 3D-to-2D registration using deep neural networks trained from an artificial dataset.
Digitally reconstructed radiographs (DRRs) and radiographic skull landmarks were automatically created from craniocervical CT data of a female subject.
They were used to train a residual network (ResNet) for landmark detection and a cycle generative adversarial network (GAN) to eliminate the style difference between DRRs and actual X-rays.
The methodology to strategically augment artificial training data can tackle the complicated skull registration scenario, and has potentials to extend to widespread registration scenarios.
- Score: 0.4205692673448206
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Registration of 3D anatomic structures to their 2D dual fluoroscopic X-ray
images is a widely used motion tracking technique. However, deep learning
implementation is often impeded by a paucity of medical images and ground
truths. In this study, we proposed a transfer learning strategy for 3D-to-2D
registration using deep neural networks trained from an artificial dataset.
Digitally reconstructed radiographs (DRRs) and radiographic skull landmarks
were automatically created from craniocervical CT data of a female subject.
They were used to train a residual network (ResNet) for landmark detection and
a cycle generative adversarial network (GAN) to eliminate the style difference
between DRRs and actual X-rays. Landmarks on the X-rays experiencing GAN style
translation were detected by the ResNet, and were used in triangulation
optimization for 3D-to-2D registration of the skull in actual dual-fluoroscope
images (with a non-orthogonal setup, point X-ray sources, image distortions,
and partially captured skull regions). The registration accuracy was evaluated
in multiple scenarios of craniocervical motions. In walking, learning-based
registration for the skull had angular/position errors of 3.9 +- 2.1 deg / 4.6
+- 2.2 mm. However, the accuracy was lower during functional neck activity, due
to overly small skull regions imaged on the dual fluoroscopic images at
end-range positions. The methodology to strategically augment artificial
training data can tackle the complicated skull registration scenario, and has
potentials to extend to widespread registration scenarios.
Related papers
- RadRotator: 3D Rotation of Radiographs with Diffusion Models [0.0]
We introduce a diffusion model-based technology that can rotate the anatomical content of any input radiograph in 3D space.
Similar to previous studies, we used CT volumes to create Digitally Reconstructed Radiographs (DRRs) as the training data for our model.
arXiv Detail & Related papers (2024-04-19T16:55:12Z) - Intraoperative 2D/3D Image Registration via Differentiable X-ray Rendering [5.617649111108429]
We present DiffPose, a self-supervised approach that leverages patient-specific simulation and differentiable physics-based rendering to achieve accurate 2D/3D registration without relying on manually labeled data.
DiffPose achieves sub-millimeter accuracy across surgical datasets at intraoperative speeds, improving upon existing unsupervised methods by an order of magnitude and even outperforming supervised baselines.
arXiv Detail & Related papers (2023-12-11T13:05:54Z) - On the Localization of Ultrasound Image Slices within Point Distribution
Models [84.27083443424408]
Thyroid disorders are most commonly diagnosed using high-resolution Ultrasound (US)
Longitudinal tracking is a pivotal diagnostic protocol for monitoring changes in pathological thyroid morphology.
We present a framework for automated US image slice localization within a 3D shape representation.
arXiv Detail & Related papers (2023-09-01T10:10:46Z) - Multi-View Vertebra Localization and Identification from CT Images [57.56509107412658]
We propose a multi-view vertebra localization and identification from CT images.
We convert the 3D problem into a 2D localization and identification task on different views.
Our method can learn the multi-view global information naturally.
arXiv Detail & Related papers (2023-07-24T14:43:07Z) - Deep learning network to correct axial and coronal eye motion in 3D OCT
retinal imaging [65.47834983591957]
We propose deep learning based neural networks to correct axial and coronal motion artifacts in OCT based on a single scan.
The experimental result shows that the proposed method can effectively correct motion artifacts and achieve smaller error than other methods.
arXiv Detail & Related papers (2023-05-27T03:55:19Z) - Geometry-Aware Attenuation Learning for Sparse-View CBCT Reconstruction [53.93674177236367]
Cone Beam Computed Tomography (CBCT) plays a vital role in clinical imaging.
Traditional methods typically require hundreds of 2D X-ray projections to reconstruct a high-quality 3D CBCT image.
This has led to a growing interest in sparse-view CBCT reconstruction to reduce radiation doses.
We introduce a novel geometry-aware encoder-decoder framework to solve this problem.
arXiv Detail & Related papers (2023-03-26T14:38:42Z) - CNN-based real-time 2D-3D deformable registration from a single X-ray
projection [2.1198879079315573]
This paper presents a method for real-time 2D-3D non-rigid registration using a single fluoroscopic image.
A dataset composed of displacement fields and 2D projections of the anatomy is generated from a preoperative scan.
A neural network is trained to recover the unknown 3D displacement field from a single projection image.
arXiv Detail & Related papers (2022-12-15T09:57:19Z) - Revisiting 3D Context Modeling with Supervised Pre-training for
Universal Lesion Detection in CT Slices [48.85784310158493]
We propose a Modified Pseudo-3D Feature Pyramid Network (MP3D FPN) to efficiently extract 3D context enhanced 2D features for universal lesion detection in CT slices.
With the novel pre-training method, the proposed MP3D FPN achieves state-of-the-art detection performance on the DeepLesion dataset.
The proposed 3D pre-trained weights can potentially be used to boost the performance of other 3D medical image analysis tasks.
arXiv Detail & Related papers (2020-12-16T07:11:16Z) - XraySyn: Realistic View Synthesis From a Single Radiograph Through CT
Priors [118.27130593216096]
A radiograph visualizes the internal anatomy of a patient through the use of X-ray, which projects 3D information onto a 2D plane.
To the best of our knowledge, this is the first work on radiograph view synthesis.
We show that by gaining an understanding of radiography in 3D space, our method can be applied to radiograph bone extraction and suppression without groundtruth bone labels.
arXiv Detail & Related papers (2020-12-04T05:08:53Z) - 3D Probabilistic Segmentation and Volumetry from 2D projection images [10.32519161805588]
X-Ray imaging is quick, cheap and useful for front-line care assessment and intra-operative real-time imaging.
It suffers from projective information loss and lacks vital information on which many essential diagnostic biomarkers are based.
In this paper we explore probabilistic methods to reconstruct 3D volumetric images from 2D imaging modalities.
arXiv Detail & Related papers (2020-06-23T08:00:51Z) - End-To-End Convolutional Neural Network for 3D Reconstruction of Knee
Bones From Bi-Planar X-Ray Images [6.645111950779666]
We present an end-to-end Convolutional Neural Network (CNN) approach for 3D reconstruction of knee bones directly from two bi-planar X-ray images.
arXiv Detail & Related papers (2020-04-02T08:37:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.