Feature Aggregation and Refinement Network for 2D AnatomicalLandmark
Detection
- URL: http://arxiv.org/abs/2111.00659v1
- Date: Mon, 1 Nov 2021 02:16:13 GMT
- Title: Feature Aggregation and Refinement Network for 2D AnatomicalLandmark
Detection
- Authors: Yueyuan Ao and Hong Wu
- Abstract summary: We propose a novel deep network, named feature aggregation and refinement network (FARNet) for the automatic detection of anatomical landmarks.
Our network has been evaluated on three publicly available anatomical landmark detection datasets.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Localization of anatomical landmarks is essential for clinical diagnosis,
treatment planning, and research. In this paper, we propose a novel deep
network, named feature aggregation and refinement network (FARNet), for the
automatic detection of anatomical landmarks. To alleviate the problem of
limited training data in the medical domain, our network adopts a deep network
pre-trained on natural images as the backbone network and several popular
networks have been compared. Our FARNet also includes a multi-scale feature
aggregation module for multi-scale feature fusion and a feature refinement
module for high-resolution heatmap regression. Coarse-to-fine supervisions are
applied to the two modules to facilitate the end-to-end training. We further
propose a novel loss function named Exponential Weighted Center loss for
accurate heatmap regression, which focuses on the losses from the pixels near
landmarks and suppresses the ones from far away. Our network has been evaluated
on three publicly available anatomical landmark detection datasets, including
cephalometric radiographs, hand radiographs, and spine radiographs, and
achieves state-of-art performances on all three datasets. Code is available at:
\url{https://github.com/JuvenileInWind/FARNet}
Related papers
- Unleashing the Power of Depth and Pose Estimation Neural Networks by
Designing Compatible Endoscopic Images [12.412060445862842]
We conduct a detail analysis of the properties of endoscopic images and improve the compatibility of images and neural networks.
First, we introcude the Mask Image Modelling (MIM) module, which inputs partial image information instead of complete image information.
Second, we propose a lightweight neural network to enhance the endoscopic images, to explicitly improve the compatibility between images and neural networks.
arXiv Detail & Related papers (2023-09-14T02:19:38Z) - AttResDU-Net: Medical Image Segmentation Using Attention-based Residual
Double U-Net [0.0]
This paper proposes an attention-based residual Double U-Net architecture (AttResDU-Net) that improves on the existing medical image segmentation networks.
We conducted experiments on three datasets: CVC Clinic-DB, ISIC 2018, and the 2018 Data Science Bowl datasets and achieved Dice Coefficient scores of 94.35%, 91.68%, and 92.45% respectively.
arXiv Detail & Related papers (2023-06-25T14:28:08Z) - 3D Medical Point Transformer: Introducing Convolution to Attention
Networks for Medical Point Cloud Analysis [21.934221178688116]
We propose an attention-based model specifically for medical point clouds, namely 3D medical point Transformer (3DMedPT)
By augmenting contextual information and summarizing local responses at query, our attention module can capture both local context and global content feature interactions.
Experiments conducted on IntrA dataset proves the superiority of 3DMedPT, where we achieve the best classification and segmentation results.
arXiv Detail & Related papers (2021-12-09T12:31:28Z) - Topological obstructions in neural networks learning [67.8848058842671]
We study global properties of the loss gradient function flow.
We use topological data analysis of the loss function and its Morse complex to relate local behavior along gradient trajectories with global properties of the loss surface.
arXiv Detail & Related papers (2020-12-31T18:53:25Z) - PC-RGNN: Point Cloud Completion and Graph Neural Network for 3D Object
Detection [57.49788100647103]
LiDAR-based 3D object detection is an important task for autonomous driving.
Current approaches suffer from sparse and partial point clouds of distant and occluded objects.
In this paper, we propose a novel two-stage approach, namely PC-RGNN, dealing with such challenges by two specific solutions.
arXiv Detail & Related papers (2020-12-18T18:06:43Z) - An Uncertainty-Driven GCN Refinement Strategy for Organ Segmentation [53.425900196763756]
We propose a segmentation refinement method based on uncertainty analysis and graph convolutional networks.
We employ the uncertainty levels of the convolutional network in a particular input volume to formulate a semi-supervised graph learning problem.
We show that our method outperforms the state-of-the-art CRF refinement method by improving the dice score by 1% for the pancreas and 2% for spleen.
arXiv Detail & Related papers (2020-12-06T18:55:07Z) - Ventral-Dorsal Neural Networks: Object Detection via Selective Attention [51.79577908317031]
We propose a new framework called Ventral-Dorsal Networks (VDNets)
Inspired by the structure of the human visual system, we propose the integration of a "Ventral Network" and a "Dorsal Network"
Our experimental results reveal that the proposed method outperforms state-of-the-art object detection approaches.
arXiv Detail & Related papers (2020-05-15T23:57:36Z) - AutoHR: A Strong End-to-end Baseline for Remote Heart Rate Measurement
with Neural Searching [76.4844593082362]
We investigate the reason why existing end-to-end networks perform poorly in challenging conditions and establish a strong baseline for remote HR measurement with architecture search (NAS)
Comprehensive experiments are performed on three benchmark datasets on both intra-temporal and cross-dataset testing.
arXiv Detail & Related papers (2020-04-26T05:43:21Z) - Structured Landmark Detection via Topology-Adapting Deep Graph Learning [75.20602712947016]
We present a new topology-adapting deep graph learning approach for accurate anatomical facial and medical landmark detection.
The proposed method constructs graph signals leveraging both local image features and global shape features.
Experiments are conducted on three public facial image datasets (WFLW, 300W, and COFW-68) as well as three real-world X-ray medical datasets (Cephalometric (public), Hand and Pelvis)
arXiv Detail & Related papers (2020-04-17T11:55:03Z) - Volumetric landmark detection with a multi-scale shift equivariant
neural network [16.114319747246334]
We propose a multi-scale, end-to-end deep learning method that achieves fast and memory-efficient landmark detection in 3D images.
We evaluate our method for carotid artery bifurcations detection on 263 CT volumes and achieve a better than state-of-the-art accuracy with mean Euclidean distance error of 2.81mm.
arXiv Detail & Related papers (2020-03-03T17:06:19Z) - Residual Block-based Multi-Label Classification and Localization Network
with Integral Regression for Vertebrae Labeling [4.867669606257232]
Existing methods are mainly based on the integration of multiple neural networks, and most of them use the Gaussian heat map to locate the vertebrae's centroid.
For end-to-end differential training of vertebra coordinates on CT scans, a robust and accurate automatic vertebral labeling algorithm is proposed in this study.
The proposed method is evaluated on a challenging dataset and the results are significantly better than the state-of-the-art methods.
arXiv Detail & Related papers (2020-01-01T09:16:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.