You Only Learn Once: Universal Anatomical Landmark Detection
- URL: http://arxiv.org/abs/2103.04657v1
- Date: Mon, 8 Mar 2021 10:38:52 GMT
- Title: You Only Learn Once: Universal Anatomical Landmark Detection
- Authors: Heqin Zhu, Qingsong Yao, Li Xiao, S. Kevin Zhou
- Abstract summary: We develop a universal anatomical landmark detection model to realize multiple landmark detection tasks.
The model consists of a local network and a global network.
We evaluate our YOLO model on three X-ray datasets of 1,588 images on the head, hand, and chest, collectively contributing 62 landmarks.
- Score: 8.116895827446088
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Detecting anatomical landmarks in medical images plays an essential role in
understanding the anatomy and planning automated processing. In recent years, a
variety of deep neural network methods have been developed to detect landmarks
automatically. However, all of those methods are unary in the sense that a
highly specialized network is trained for a single task say associated with a
particular anatomical region. In this work, for the first time, we investigate
the idea of "You Only Learn Once (YOLO)" and develop a universal anatomical
landmark detection model to realize multiple landmark detection tasks with
end-to-end training based on mixed datasets. The model consists of a local
network and a global network: The local network is built upon the idea of
universal UNet to learn multi-domain local features and the global network is a
parallelly-duplicated sequential of dilated convolutions that extract global
features to further disambiguate the landmark locations. It is worth mentioning
that the new model design requires fewer parameters than models with standard
convolutions to train. We evaluate our YOLO model on three X-ray datasets of
1,588 images on the head, hand, and chest, collectively contributing 62
landmarks. The experimental results show that our proposed universal model
behaves largely better than any previous models trained on multiple datasets.
It even beats the performance of the model that is trained separately for every
single dataset.
Related papers
- Predicting Infant Brain Connectivity with Federated Multi-Trajectory
GNNs using Scarce Data [54.55126643084341]
Existing deep learning solutions suffer from three major limitations.
We introduce FedGmTE-Net++, a federated graph-based multi-trajectory evolution network.
Using the power of federation, we aggregate local learnings among diverse hospitals with limited datasets.
arXiv Detail & Related papers (2024-01-01T10:20:01Z) - Deception Detection from Linguistic and Physiological Data Streams Using Bimodal Convolutional Neural Networks [19.639533220155965]
This paper explores the application of convolutional neural networks for the purpose of multimodal deception detection.
We use a dataset built by interviewing 104 subjects about two topics, with one truthful and one falsified response from each subject about each topic.
arXiv Detail & Related papers (2023-11-18T02:44:33Z) - A Domain Generalization Approach for Out-Of-Distribution 12-lead ECG
Classification with Convolutional Neural Networks [3.1372269816123994]
We propose a method that classifies abnormalities on 12-lead Electrocardiograms (ECGs) by leveraging information extracted across the architecture of a Deep Neural Network.
We adopt publicly available ECG datasets from four sources and handle them as separate domains.
To simulate the distributional shift present in real-world settings, we train our model on a subset of the domains and leave-out the remaining ones.
arXiv Detail & Related papers (2022-08-20T10:58:31Z) - Learning Signal-Agnostic Manifolds of Neural Fields [50.066449953522685]
We leverage neural fields to capture the underlying structure in image, shape, audio and cross-modal audiovisual domains.
We show that by walking across the underlying manifold of GEM, we may generate new samples in our signal domains.
arXiv Detail & Related papers (2021-11-11T18:57:40Z) - Feature Aggregation and Refinement Network for 2D AnatomicalLandmark
Detection [0.0]
We propose a novel deep network, named feature aggregation and refinement network (FARNet) for the automatic detection of anatomical landmarks.
Our network has been evaluated on three publicly available anatomical landmark detection datasets.
arXiv Detail & Related papers (2021-11-01T02:16:13Z) - Multi-Branch Deep Radial Basis Function Networks for Facial Emotion
Recognition [80.35852245488043]
We propose a CNN based architecture enhanced with multiple branches formed by radial basis function (RBF) units.
RBF units capture local patterns shared by similar instances using an intermediate representation.
We show it is the incorporation of local information what makes the proposed model competitive.
arXiv Detail & Related papers (2021-09-07T21:05:56Z) - Universal Undersampled MRI Reconstruction [12.731566667990315]
We propose a framework to learn a universal deep neural network for undersampled MRI reconstruction.
Specifically, anatomy-specific instance normalization is proposed to compensate for statistical shift and allow easy generalization to new datasets.
Experimental results show the proposed universal model can reconstruct both brain and knee images with high image quality.
arXiv Detail & Related papers (2021-03-09T04:25:22Z) - Anomaly Detection on Attributed Networks via Contrastive Self-Supervised
Learning [50.24174211654775]
We present a novel contrastive self-supervised learning framework for anomaly detection on attributed networks.
Our framework fully exploits the local information from network data by sampling a novel type of contrastive instance pair.
A graph neural network-based contrastive learning model is proposed to learn informative embedding from high-dimensional attributes and local structure.
arXiv Detail & Related papers (2021-02-27T03:17:20Z) - An Ensemble of Knowledge Sharing Models for Dynamic Hand Gesture
Recognition [1.7158296436650335]
The focus of this paper is dynamic gesture recognition in the context of the interaction between humans and machines.
We propose a model consisting of two sub-networks, a transformer and an ordered-neuron long-short-term-memory based recurrent neural network (RNN)
The proposed ensemble of knowledge-sharing models exhibits an overall accuracy of 86.11% using only skeleton information.
arXiv Detail & Related papers (2020-08-13T07:37:27Z) - Structured Landmark Detection via Topology-Adapting Deep Graph Learning [75.20602712947016]
We present a new topology-adapting deep graph learning approach for accurate anatomical facial and medical landmark detection.
The proposed method constructs graph signals leveraging both local image features and global shape features.
Experiments are conducted on three public facial image datasets (WFLW, 300W, and COFW-68) as well as three real-world X-ray medical datasets (Cephalometric (public), Hand and Pelvis)
arXiv Detail & Related papers (2020-04-17T11:55:03Z) - MS-Net: Multi-Site Network for Improving Prostate Segmentation with
Heterogeneous MRI Data [75.73881040581767]
We propose a novel multi-site network (MS-Net) for improving prostate segmentation by learning robust representations.
Our MS-Net improves the performance across all datasets consistently, and outperforms state-of-the-art methods for multi-site learning.
arXiv Detail & Related papers (2020-02-09T14:11:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.