Volumetric Attention for 3D Medical Image Segmentation and Detection
- URL: http://arxiv.org/abs/2004.01997v1
- Date: Sat, 4 Apr 2020 18:55:06 GMT
- Title: Volumetric Attention for 3D Medical Image Segmentation and Detection
- Authors: Xudong Wang, Shizhong Han, Yunqiang Chen, Dashan Gao, and Nuno
Vasconcelos
- Abstract summary: A volumetric attention(VA) module for 3D medical image segmentation and detection is proposed.
VA attention is inspired by recent advances in video processing, enables 2.5D networks to leverage context information along the z direction.
Its integration in the Mask R-CNN is shown to enable state-of-the-art performance on the Liver Tumor (LiTS) Challenge.
- Score: 53.041572035020344
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A volumetric attention(VA) module for 3D medical image segmentation and
detection is proposed. VA attention is inspired by recent advances in video
processing, enables 2.5D networks to leverage context information along the z
direction, and allows the use of pretrained 2D detection models when training
data is limited, as is often the case for medical applications. Its integration
in the Mask R-CNN is shown to enable state-of-the-art performance on the Liver
Tumor Segmentation (LiTS) Challenge, outperforming the previous challenge
winner by 3.9 points and achieving top performance on the LiTS leader board at
the time of paper submission. Detection experiments on the DeepLesion dataset
also show that the addition of VA to existing object detectors enables a 69.1
sensitivity at 0.5 false positive per image, outperforming the best published
results by 6.6 points.
Related papers
- Merlin: A Vision Language Foundation Model for 3D Computed Tomography [23.553846980246302]
Over 85 million computed tomography (CT) scans are performed annually in the US, of which approximately one quarter focus on the abdomen.
There is a large impetus to use artificial intelligence to alleviate the burden of interpreting these complex imaging studies.
We introduce Merlin - a 3D VLM that we train using paired CT scans, EHR diagnosis codes, and radiology reports.
arXiv Detail & Related papers (2024-06-10T17:53:01Z) - Weakly supervised segmentation of intracranial aneurysms using a novel 3D focal modulation UNet [0.5106162890866905]
We propose FocalSegNet, a novel 3D focal modulation UNet, to detect an aneurysm and offer an initial, coarse segmentation of it from time-of-flight MRA image patches.
We trained and evaluated our model on a public dataset, and in terms of UIA detection, our model showed a low false-positive rate of 0.21 and a high sensitivity of 0.80.
arXiv Detail & Related papers (2023-08-06T03:28:08Z) - Solving Sample-Level Out-of-Distribution Detection on 3D Medical Images [0.06117371161379209]
Out-of-distribution (OOD) detection helps to identify data samples, increasing the model's reliability.
Recent works have developed DL-based OOD detection that achieves promising results on 2D medical images.
However, scaling most of these approaches on 3D images is computationally intractable.
We propose a histogram-based method that requires no DL and achieves almost perfect results in this domain.
arXiv Detail & Related papers (2022-12-13T11:42:23Z) - SALISA: Saliency-based Input Sampling for Efficient Video Object
Detection [58.22508131162269]
We propose SALISA, a novel non-uniform SALiency-based Input SAmpling technique for video object detection.
We show that SALISA significantly improves the detection of small objects.
arXiv Detail & Related papers (2022-04-05T17:59:51Z) - HMD-EgoPose: Head-Mounted Display-Based Egocentric Marker-Less Tool and
Hand Pose Estimation for Augmented Surgical Guidance [0.0]
We present HMD-EgoPose, a single-shot learning-based approach to hand and object pose estimation.
We demonstrate state-of-the-art performance on a benchmark dataset for marker-less hand and surgical instrument pose tracking.
arXiv Detail & Related papers (2022-02-24T04:07:34Z) - Systematic Clinical Evaluation of A Deep Learning Method for Medical
Image Segmentation: Radiosurgery Application [48.89674088331313]
We systematically evaluate a Deep Learning (DL) method in a 3D medical image segmentation task.
Our method is integrated into the radiosurgery treatment process and directly impacts the clinical workflow.
arXiv Detail & Related papers (2021-08-21T16:15:40Z) - Automated Model Design and Benchmarking of 3D Deep Learning Models for
COVID-19 Detection with Chest CT Scans [72.04652116817238]
We propose a differentiable neural architecture search (DNAS) framework to automatically search for the 3D DL models for 3D chest CT scans classification.
We also exploit the Class Activation Mapping (CAM) technique on our models to provide the interpretability of the results.
arXiv Detail & Related papers (2021-01-14T03:45:01Z) - Revisiting 3D Context Modeling with Supervised Pre-training for
Universal Lesion Detection in CT Slices [48.85784310158493]
We propose a Modified Pseudo-3D Feature Pyramid Network (MP3D FPN) to efficiently extract 3D context enhanced 2D features for universal lesion detection in CT slices.
With the novel pre-training method, the proposed MP3D FPN achieves state-of-the-art detection performance on the DeepLesion dataset.
The proposed 3D pre-trained weights can potentially be used to boost the performance of other 3D medical image analysis tasks.
arXiv Detail & Related papers (2020-12-16T07:11:16Z) - Weakly-supervised Learning For Catheter Segmentation in 3D Frustum
Ultrasound [74.22397862400177]
We propose a novel Frustum ultrasound based catheter segmentation method.
The proposed method achieved the state-of-the-art performance with an efficiency of 0.25 second per volume.
arXiv Detail & Related papers (2020-10-19T13:56:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.