Revisiting Cephalometric Landmark Detection from the view of Human Pose
Estimation with Lightweight Super-Resolution Head
- URL: http://arxiv.org/abs/2309.17143v1
- Date: Fri, 29 Sep 2023 11:15:39 GMT
- Title: Revisiting Cephalometric Landmark Detection from the view of Human Pose
Estimation with Lightweight Super-Resolution Head
- Authors: Qian Wu and Si Yong Yeo and Yufei Chen and Jun Liu
- Abstract summary: We develop a benchmark based on the well-established human pose estimation (HPE) known as MMPose.
We introduce an upscaling design within the framework to further enhance performance.
In the MICCAI CLDetection2023 challenge, our method achieves 1st place ranking on three metrics and 3rd place on the remaining one.
- Score: 11.40242574405714
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurate localization of cephalometric landmarks holds great importance in
the fields of orthodontics and orthognathics due to its potential for
automating key point labeling. In the context of landmark detection,
particularly in cephalometrics, it has been observed that existing methods
often lack standardized pipelines and well-designed bias reduction processes,
which significantly impact their performance. In this paper, we revisit a
related task, human pose estimation (HPE), which shares numerous similarities
with cephalometric landmark detection (CLD), and emphasize the potential for
transferring techniques from the former field to benefit the latter. Motivated
by this insight, we have developed a robust and adaptable benchmark based on
the well-established HPE codebase known as MMPose. This benchmark can serve as
a dependable baseline for achieving exceptional CLD performance. Furthermore,
we introduce an upscaling design within the framework to further enhance
performance. This enhancement involves the incorporation of a lightweight and
efficient super-resolution module, which generates heatmap predictions on
high-resolution features and leads to further performance refinement,
benefiting from its ability to reduce quantization bias. In the MICCAI
CLDetection2023 challenge, our method achieves 1st place ranking on three
metrics and 3rd place on the remaining one. The code for our method is
available at https://github.com/5k5000/CLdetection2023.
Related papers
- Small Object Detection via Coarse-to-fine Proposal Generation and
Imitation Learning [52.06176253457522]
We propose a two-stage framework tailored for small object detection based on the Coarse-to-fine pipeline and Feature Imitation learning.
CFINet achieves state-of-the-art performance on the large-scale small object detection benchmarks, SODA-D and SODA-A.
arXiv Detail & Related papers (2023-08-18T13:13:09Z) - High-level Feature Guided Decoding for Semantic Segmentation [54.424062794490254]
We propose to use powerful pre-trained high-level features as guidance (HFG) for the upsampler to produce robust results.
Specifically, the high-level features from the backbone are used to train the class tokens, which are then reused by the upsampler for classification.
To push the upper limit of HFG, we introduce a context augmentation encoder (CAE) that can efficiently and effectively operate on the low-resolution high-level feature.
arXiv Detail & Related papers (2023-03-15T14:23:07Z) - Detecting Rotated Objects as Gaussian Distributions and Its 3-D
Generalization [81.29406957201458]
Existing detection methods commonly use a parameterized bounding box (BBox) to model and detect (horizontal) objects.
We argue that such a mechanism has fundamental limitations in building an effective regression loss for rotation detection.
We propose to model the rotated objects as Gaussian distributions.
We extend our approach from 2-D to 3-D with a tailored algorithm design to handle the heading estimation.
arXiv Detail & Related papers (2022-09-22T07:50:48Z) - The KFIoU Loss for Rotated Object Detection [115.334070064346]
In this paper, we argue that one effective alternative is to devise an approximate loss who can achieve trend-level alignment with SkewIoU loss.
Specifically, we model the objects as Gaussian distribution and adopt Kalman filter to inherently mimic the mechanism of SkewIoU.
The resulting new loss called KFIoU is easier to implement and works better compared with exact SkewIoU.
arXiv Detail & Related papers (2022-01-29T10:54:57Z) - Dynamic Iterative Refinement for Efficient 3D Hand Pose Estimation [87.54604263202941]
We propose a tiny deep neural network of which partial layers are iteratively exploited for refining its previous estimations.
We employ learned gating criteria to decide whether to exit from the weight-sharing loop, allowing per-sample adaptation in our model.
Our method consistently outperforms state-of-the-art 2D/3D hand pose estimation approaches in terms of both accuracy and efficiency for widely used benchmarks.
arXiv Detail & Related papers (2021-11-11T23:31:34Z) - Occlusion-Robust Object Pose Estimation with Holistic Representation [42.27081423489484]
State-of-the-art (SOTA) object pose estimators take a two-stage approach.
We develop a novel occlude-and-blackout batch augmentation technique.
We also develop a multi-precision supervision architecture to encourage holistic pose representation learning.
arXiv Detail & Related papers (2021-10-22T08:00:26Z) - Learn Fine-grained Adaptive Loss for Multiple Anatomical Landmark
Detection in Medical Images [15.7026400415269]
We propose a novel learning-to-learn framework for landmark detection.
Our proposed framework is general and shows the potential to improve the efficiency of anatomical landmark detection.
arXiv Detail & Related papers (2021-05-19T13:39:18Z) - Lite-FPN for Keypoint-based Monocular 3D Object Detection [18.03406686769539]
Keypoint-based monocular 3D object detection has made tremendous progress and achieved great speed-accuracy trade-off.
We propose a sort of lightweight feature pyramid network called Lite-FPN to achieve multi-scale feature fusion.
Our proposed method achieves significantly higher accuracy and frame rate at the same time.
arXiv Detail & Related papers (2021-05-01T14:44:31Z) - Towards High Performance Human Keypoint Detection [87.1034745775229]
We find that context information plays an important role in reasoning human body configuration and invisible keypoints.
Inspired by this, we propose a cascaded context mixer ( CCM) which efficiently integrates spatial and channel context information.
To maximize CCM's representation capability, we develop a hard-negative person detection mining strategy and a joint-training strategy.
We present several sub-pixel refinement techniques for postprocessing keypoint predictions to improve detection accuracy.
arXiv Detail & Related papers (2020-02-03T02:24:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.