Region Proposal Network with Graph Prior and IoU-Balance Loss for
Landmark Detection in 3D Ultrasound
- URL: http://arxiv.org/abs/2004.00207v1
- Date: Wed, 1 Apr 2020 03:00:03 GMT
- Title: Region Proposal Network with Graph Prior and IoU-Balance Loss for
Landmark Detection in 3D Ultrasound
- Authors: Chaoyu Chen, Xin Yang, Ruobing Huang, Wenlong Shi, Shengfeng Liu,
Mingrong Lin, Yuhao Huang, Yong Yang, Yuanji Zhang, Huanjia Luo, Yankai
Huang, Yi Xiong, Dong Ni
- Abstract summary: 3D ultrasound (US) can facilitate detailed prenatal examinations for fetal growth monitoring.
To analyze a 3D US volume, it is fundamental to identify anatomical landmarks accurately.
We exploit an object detection framework to detect landmarks in 3D fetal facial US volumes.
- Score: 16.523977092204813
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: 3D ultrasound (US) can facilitate detailed prenatal examinations for fetal
growth monitoring. To analyze a 3D US volume, it is fundamental to identify
anatomical landmarks of the evaluated organs accurately. Typical deep learning
methods usually regress the coordinates directly or involve heatmap-matching.
However, these methods struggle to deal with volumes with large sizes and the
highly-varying positions and orientations of fetuses. In this work, we exploit
an object detection framework to detect landmarks in 3D fetal facial US
volumes. By regressing multiple parameters of the landmark-centered bounding
box (B-box) with a strict criteria, the proposed model is able to pinpoint the
exact location of the targeted landmarks. Specifically, the model uses a 3D
region proposal network (RPN) to generate 3D candidate regions, followed by
several 3D classification branches to select the best candidate. It also adopts
an IoU-balance loss to improve communications between branches that benefits
the learning process. Furthermore, it leverages a distance-based graph prior to
regularize the training and helps to reduce false positive predictions. The
performance of the proposed framework is evaluated on a 3D US dataset to detect
five key fetal facial landmarks. Results showed the proposed method outperforms
some of the state-of-the-art methods in efficacy and efficiency.
Related papers
- CatFree3D: Category-agnostic 3D Object Detection with Diffusion [63.75470913278591]
We introduce a novel pipeline that decouples 3D detection from 2D detection and depth prediction.
We also introduce the Normalised Hungarian Distance (NHD) metric for an accurate evaluation of 3D detection results.
arXiv Detail & Related papers (2024-08-22T22:05:57Z) - 3D Object Detection from Point Cloud via Voting Step Diffusion [52.9966883689137]
existing voting-based methods often receive votes from the partial surfaces of individual objects together with severe noises, leading to sub-optimal detection performance.
We propose a new method to move random 3D points toward the high-density region of the distribution by estimating the score function of the distribution with a noise conditioned score network.
Experiments on two large scale indoor 3D scene datasets, SUN RGB-D and ScanNet V2, demonstrate the superiority of our proposed method.
arXiv Detail & Related papers (2024-03-21T05:04:52Z) - FetusMapV2: Enhanced Fetal Pose Estimation in 3D Ultrasound [28.408626329596668]
We propose a novel 3D fetal pose estimation framework (called FetusMapV2) to overcome the above challenges.
First, we propose a scheme that explores the complementary network structure-unconstrained and activation-unreserved GPU memory management approaches.
Second, we design a novel Pair Loss to mitigate confusion caused by symmetrical and similar anatomical structures.
Third, we propose a shape priors-based self-supervised learning by selecting the relatively stable landmarks to refine the pose online.
arXiv Detail & Related papers (2023-10-30T06:18:47Z) - Volumetric Semantically Consistent 3D Panoptic Mapping [77.13446499924977]
We introduce an online 2D-to-3D semantic instance mapping algorithm aimed at generating semantic 3D maps suitable for autonomous agents in unstructured environments.
It introduces novel ways of integrating semantic prediction confidence during mapping, producing semantic and instance-consistent 3D regions.
The proposed method achieves accuracy superior to the state of the art on public large-scale datasets, improving on a number of widely used metrics.
arXiv Detail & Related papers (2023-09-26T08:03:10Z) - Progressive Coordinate Transforms for Monocular 3D Object Detection [52.00071336733109]
We propose a novel and lightweight approach, dubbed em Progressive Coordinate Transforms (PCT) to facilitate learning coordinate representations.
In this paper, we propose a novel and lightweight approach, dubbed em Progressive Coordinate Transforms (PCT) to facilitate learning coordinate representations.
arXiv Detail & Related papers (2021-08-12T15:22:33Z) - Delving into Localization Errors for Monocular 3D Object Detection [85.77319416168362]
Estimating 3D bounding boxes from monocular images is an essential component in autonomous driving.
In this work, we quantify the impact introduced by each sub-task and find the localization error' is the vital factor in restricting monocular 3D detection.
arXiv Detail & Related papers (2021-03-30T10:38:01Z) - Automated 3D cephalometric landmark identification using computerized
tomography [1.4349468613117398]
Identification of 3D cephalometric landmarks that serve as proxy to the shape of human skull is the fundamental step in cephalometric analysis.
Recently, automatic landmarking of 2D cephalograms using deep learning (DL) has achieved great success, but 3D landmarking for more than 80 landmarks has not yet reached a satisfactory level.
This paper presents a semi-supervised DL method for 3D landmarking that takes advantage of anonymized landmark dataset with paired CT data being removed.
arXiv Detail & Related papers (2020-12-16T07:29:32Z) - Revisiting 3D Context Modeling with Supervised Pre-training for
Universal Lesion Detection in CT Slices [48.85784310158493]
We propose a Modified Pseudo-3D Feature Pyramid Network (MP3D FPN) to efficiently extract 3D context enhanced 2D features for universal lesion detection in CT slices.
With the novel pre-training method, the proposed MP3D FPN achieves state-of-the-art detection performance on the DeepLesion dataset.
The proposed 3D pre-trained weights can potentially be used to boost the performance of other 3D medical image analysis tasks.
arXiv Detail & Related papers (2020-12-16T07:11:16Z) - SMOKE: Single-Stage Monocular 3D Object Detection via Keypoint
Estimation [3.1542695050861544]
Estimating 3D orientation and translation of objects is essential for infrastructure-less autonomous navigation and driving.
We propose a novel 3D object detection method, named SMOKE, that combines a single keypoint estimate with regressed 3D variables.
Despite of its structural simplicity, our proposed SMOKE network outperforms all existing monocular 3D detection methods on the KITTI dataset.
arXiv Detail & Related papers (2020-02-24T08:15:36Z) - FetusMap: Fetal Pose Estimation in 3D Ultrasound [42.59502360552173]
We propose to estimate the 3D pose of fetus in US volumes to facilitate its quantitative analyses.
This is the first work about 3D pose estimation of fetus in the literature.
We propose a self-supervised learning (SSL) framework to finetune the deep network to form visually plausible pose predictions.
arXiv Detail & Related papers (2019-10-11T01:45:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.