One-Shot Medical Landmark Detection
- URL: http://arxiv.org/abs/2103.04527v1
- Date: Mon, 8 Mar 2021 03:16:53 GMT
- Title: One-Shot Medical Landmark Detection
- Authors: Qingsong Yao, Quan Quan, Li Xiao, S. Kevin Zhou
- Abstract summary: We propose a novel framework named Cascade Comparing to Detect (CC2D) for one-shot landmark detection.
CC2D consists of two stages: 1) Self-supervised learning (CC2D-SSL) and 2) Training with pseudo-labels (CC2D-TPL)
The effectiveness of CC2D is evaluated on a widely-used public dataset of cephalometric landmark detection, which achieves a competitive detection accuracy of 81.01% within 4.0mm.
- Score: 11.213814977894314
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The success of deep learning methods relies on the availability of a large
number of datasets with annotations; however, curating such datasets is
burdensome, especially for medical images. To relieve such a burden for a
landmark detection task, we explore the feasibility of using only a single
annotated image and propose a novel framework named Cascade Comparing to Detect
(CC2D) for one-shot landmark detection. CC2D consists of two stages: 1)
Self-supervised learning (CC2D-SSL) and 2) Training with pseudo-labels
(CC2D-TPL). CC2D-SSL captures the consistent anatomical information in a
coarse-to-fine fashion by comparing the cascade feature representations and
generates predictions on the training set. CC2D-TPL further improves the
performance by training a new landmark detector with those predictions. The
effectiveness of CC2D is evaluated on a widely-used public dataset of
cephalometric landmark detection, which achieves a competitive detection
accuracy of 81.01\% within 4.0mm, comparable to the state-of-the-art
fully-supervised methods using a lot more than one training image.
Related papers
- FM-OSD: Foundation Model-Enabled One-Shot Detection of Anatomical Landmarks [44.54301473673582]
We propose the first foundation model-enabled one-shot landmark detection (FM-OSD) framework for accurate landmark detection in medical images.
By using solely a single template image, our method demonstrates significant superiority over strong state-of-the-art one-shot landmark detection methods.
arXiv Detail & Related papers (2024-07-07T15:37:02Z) - A Closer Look at Spatial-Slice Features Learning for COVID-19 Detection [8.215897530386343]
We introduce an enhanced Spatial-Slice Feature Learning (SSFL++) framework specifically designed for CT scan.
It aim to filter out a OOD data within whole CT scan, enabling our to select crucial spatial-slice for analysis by reducing 70% redundancy totally.
Experiments demonstrate the promising performance of our model using a simple EfficientNet-2D (E2D) model, even with only 1% of the training data.
arXiv Detail & Related papers (2024-04-02T05:19:27Z) - Simple 2D Convolutional Neural Network-based Approach for COVID-19 Detection [8.215897530386343]
This study explores the use of deep learning techniques for analyzing lung Computed Tomography (CT) images.
We propose an advanced Spatial-Slice Feature Learning (SSFL++) framework specifically tailored for CT scans.
It aims to filter out out out-of-distribution (OOD) data within the entire CT scan, allowing us to select essential spatial-slice features for analysis by reducing data redundancy by 70%.
arXiv Detail & Related papers (2024-03-17T14:34:51Z) - CSP: Self-Supervised Contrastive Spatial Pre-Training for
Geospatial-Visual Representations [90.50864830038202]
We present Contrastive Spatial Pre-Training (CSP), a self-supervised learning framework for geo-tagged images.
We use a dual-encoder to separately encode the images and their corresponding geo-locations, and use contrastive objectives to learn effective location representations from images.
CSP significantly boosts the model performance with 10-34% relative improvement with various labeled training data sampling ratios.
arXiv Detail & Related papers (2023-05-01T23:11:18Z) - Relative distance matters for one-shot landmark detection [16.032695993409853]
We upgrade cascade comparing to detect (CC2D) to version II by incorporating a simple-yet-effective relative distance bias in the training stage.
CC2Dv2 is less possible to detect a wrong point far from the correct landmark.
We present an open-source, landmark-labeled dataset for the measurement of biomechanical parameters of the lower extremity.
arXiv Detail & Related papers (2022-03-03T12:50:56Z) - Dense Label Encoding for Boundary Discontinuity Free Rotation Detection [69.75559390700887]
This paper explores a relatively less-studied methodology based on classification.
We propose new techniques to push its frontier in two aspects.
Experiments and visual analysis on large-scale public datasets for aerial images show the effectiveness of our approach.
arXiv Detail & Related papers (2020-11-19T05:42:02Z) - Deep Q-Network-Driven Catheter Segmentation in 3D US by Hybrid
Constrained Semi-Supervised Learning and Dual-UNet [74.22397862400177]
We propose a novel catheter segmentation approach, which requests fewer annotations than the supervised learning method.
Our scheme considers a deep Q learning as the pre-localization step, which avoids voxel-level annotation.
With the detected catheter, patch-based Dual-UNet is applied to segment the catheter in 3D volumetric data.
arXiv Detail & Related papers (2020-06-25T21:10:04Z) - One-Shot Object Detection without Fine-Tuning [62.39210447209698]
We introduce a two-stage model consisting of a first stage Matching-FCOS network and a second stage Structure-Aware Relation Module.
We also propose novel training strategies that effectively improve detection performance.
Our method exceeds the state-of-the-art one-shot performance consistently on multiple datasets.
arXiv Detail & Related papers (2020-05-08T01:59:23Z) - Gradient-Induced Co-Saliency Detection [81.54194063218216]
Co-saliency detection (Co-SOD) aims to segment the common salient foreground in a group of relevant images.
In this paper, inspired by human behavior, we propose a gradient-induced co-saliency detection method.
arXiv Detail & Related papers (2020-04-28T08:40:55Z) - BiDet: An Efficient Binarized Object Detector [96.19708396510894]
We propose a binarized neural network learning method called BiDet for efficient object detection.
Our BiDet fully utilizes the representational capacity of the binary neural networks for object detection by redundancy removal.
Our method outperforms the state-of-the-art binary neural networks by a sizable margin.
arXiv Detail & Related papers (2020-03-09T08:16:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.