Hybrid Efficient Unsupervised Anomaly Detection for Early Pandemic Case Identification
- URL: http://arxiv.org/abs/2408.05347v1
- Date: Fri, 9 Aug 2024 21:31:39 GMT
- Title: Hybrid Efficient Unsupervised Anomaly Detection for Early Pandemic Case Identification
- Authors: Ghazal Ghajari, Mithun Kumar PK, Fathi Amsaad,
- Abstract summary: Unsupervised anomaly detection is a promising technique for identifying unusual patterns in data without the need for labeled training examples.
This research introduces a novel hybrid method for anomaly detection that combines distance and density measures.
Our method is especially relevant in pandemic situations, as demonstrated during the COVID-19 crisis.
- Score: 0.26217304977339473
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Unsupervised anomaly detection is a promising technique for identifying unusual patterns in data without the need for labeled training examples. This approach is particularly valuable for early case detection in epidemic management, especially when early-stage data are scarce. This research introduces a novel hybrid method for anomaly detection that combines distance and density measures, enhancing its applicability across various infectious diseases. Our method is especially relevant in pandemic situations, as demonstrated during the COVID-19 crisis, where traditional supervised classification methods fall short due to limited data. The efficacy of our method is evaluated using COVID-19 chest X-ray data, where it significantly outperforms established unsupervised techniques. It achieves an average AUC of 77.43%, surpassing the AUC of Isolation Forest at 73.66% and KNN at 52.93%. These results highlight the potential of our hybrid anomaly detection method to improve early detection capabilities in diverse epidemic scenarios, thereby facilitating more effective and timely responses.
Related papers
- Murmur2Vec: A Hashing Based Solution For Embedding Generation Of COVID-19 Spike Sequences [4.970277730082774]
Early detection and characterization of coronavirus disease (COVID-19), caused by SARS-CoV-2, remain critical for effective clinical response and public-health planning.<n>Existing approaches face notable limitations. Phylogenetic tree-based methods are computationally intensive and do not scale efficiently to today's multi-million-sequence datasets.<n>In this study, we focus on the most prevalent SARS-CoV-2 lineages associated with the spike protein region and introduce a scalable embedding method that leverages hashing to generate compact, low-dimensional representations of spike sequences.
arXiv Detail & Related papers (2025-12-10T23:03:10Z) - Early Detection of Forest Calamities in Homogeneous Stands -- Deep Learning Applied to Bark-Beetle Outbreaks [0.0]
This study investigates the potential of a Deep Learning algorithm based on a Long Short Term Memory (LSTM) Autoencoder for the detection of anomalies in forest health.
In this study, we monitored pure stands of spruce in Thuringia, Germany, over a 7-year period from 2018 to the end of 2024.
Our best model achieved a detection accuracy of 87% on test data and was able to detect 61% of all anomalies at a very early stage.
arXiv Detail & Related papers (2025-03-17T07:28:15Z) - Cardiovascular Disease Detection By Leveraging Semi-Supervised Learning [0.815557531820863]
Cardiovascular disease (CVD) persists as a primary cause of death on a global scale.
Traditional supervised learning approaches for CVD detection rely heavily on large-labeled datasets.
This paper employs semi-supervised learning models to boost efficiency and accuracy of CVD detection when there are few labeled samples.
arXiv Detail & Related papers (2024-12-13T21:15:38Z) - SepsisLab: Early Sepsis Prediction with Uncertainty Quantification and Active Sensing [67.8991481023825]
Sepsis is the leading cause of in-hospital mortality in the USA.
Existing predictive models are usually trained on high-quality data with few missing information.
For the potential high-risk patients with low confidence due to limited observations, we propose a robust active sensing algorithm.
arXiv Detail & Related papers (2024-07-24T04:47:36Z) - Alzheimer's disease detection in PSG signals [2.8691549050152965]
Alzheimers disease (AD) and sleep disorders exhibit a close association, where disruptions in sleep patterns often precede the onset of early-stage AD.
This study delves into the potential of utilizing sleep-related electroencephalography (EEG) signals for the early detection of AD.
Our primary focus is on exploring semi-supervised Deep Learning techniques for the classification of EEG signals due to the limited data availability.
arXiv Detail & Related papers (2024-04-04T15:56:23Z) - Automatic diagnosis of knee osteoarthritis severity using Swin
transformer [55.01037422579516]
Knee osteoarthritis (KOA) is a widespread condition that can cause chronic pain and stiffness in the knee joint.
We propose an automated approach that employs the Swin Transformer to predict the severity of KOA.
arXiv Detail & Related papers (2023-07-10T09:49:30Z) - Precursor-of-Anomaly Detection for Irregular Time Series [31.73234935455713]
We present a novel type of anomaly detection, called Precursor-of-Anomaly (PoA) detection.
To solve both problems at the same time, we present a neural controlled differential equation-based neural network and its multi-task learning algorithm.
arXiv Detail & Related papers (2023-06-27T14:10:09Z) - Unsupervised deep learning techniques for powdery mildew recognition
based on multispectral imaging [63.62764375279861]
This paper presents a deep learning approach to automatically recognize powdery mildew on cucumber leaves.
We focus on unsupervised deep learning techniques applied to multispectral imaging data.
We propose the use of autoencoder architectures to investigate two strategies for disease detection.
arXiv Detail & Related papers (2021-12-20T13:29:13Z) - Uncertainty-Aware Semi-supervised Method using Large Unlabelled and
Limited Labeled COVID-19 Data [14.530328267425638]
We propose a Semi-supervised Classification using Limited Labelled Data (SCLLD) for automated COVID-19 detection.
The proposed system is trained using 10,000 CT scans collected from Omid hospital.
Our method significantly outperforms the supervised training of Convolutional Neural Network (CNN) in case labelled training data is scarce.
arXiv Detail & Related papers (2021-02-12T08:20:20Z) - Bayesian prognostic covariate adjustment [59.75318183140857]
Historical data about disease outcomes can be integrated into the analysis of clinical trials in many ways.
We build on existing literature that uses prognostic scores from a predictive model to increase the efficiency of treatment effect estimates.
arXiv Detail & Related papers (2020-12-24T05:19:03Z) - STELAR: Spatio-temporal Tensor Factorization with Latent Epidemiological
Regularization [76.57716281104938]
We develop a tensor method to predict the evolution of epidemic trends for many regions simultaneously.
STELAR enables long-term prediction by incorporating latent temporal regularization through a system of discrete-time difference equations.
We conduct experiments using both county- and state-level COVID-19 data and show that our model can identify interesting latent patterns of the epidemic.
arXiv Detail & Related papers (2020-12-08T21:21:47Z) - Detecting Parkinsonian Tremor from IMU Data Collected In-The-Wild using
Deep Multiple-Instance Learning [59.74684475991192]
Parkinson's Disease (PD) is a slowly evolving neuro-logical disease that affects about 1% of the population above 60 years old.
PD symptoms include tremor, rigidity and braykinesia.
We present a method for automatically identifying tremorous episodes related to PD, based on IMU signals captured via a smartphone device.
arXiv Detail & Related papers (2020-05-06T09:02:30Z) - Deep Recurrent Model for Individualized Prediction of Alzheimer's
Disease Progression [4.034948808542701]
Alzheimer's disease (AD) is one of the major causes of dementia and is characterized by slow progression over several years.
We propose a novel computational framework that can predict the phenotypic measurements of MRI biomarkers and trajectories of clinical status.
arXiv Detail & Related papers (2020-05-06T08:08:00Z) - Anomaly Detection in Univariate Time-series: A Survey on the
State-of-the-Art [0.0]
Anomaly detection for time-series data has been an important research field for a long time.
Recent years an increasing number of machine learning algorithms have been developed to detect anomalies on time-series.
Researchers tried to improve these techniques using (deep) neural networks.
arXiv Detail & Related papers (2020-04-01T13:22:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.