Outlier Detection Using a Novel method: Quantum Clustering
- URL: http://arxiv.org/abs/2006.04760v1
- Date: Mon, 8 Jun 2020 17:19:41 GMT
- Title: Outlier Detection Using a Novel method: Quantum Clustering
- Authors: Ding Liu, Hui Li
- Abstract summary: We propose a new assumption in outlier detection: Normal data instances are commonly located in the area that there is hardly any fluctuation on data density.
We apply a novel density-based approach to unsupervised outlier detection.
- Score: 24.11904406960212
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We propose a new assumption in outlier detection: Normal data instances are
commonly located in the area that there is hardly any fluctuation on data
density, while outliers are often appeared in the area that there is violent
fluctuation on data density. And based on this hypothesis, we apply a novel
density-based approach to unsupervised outlier detection. This approach, called
Quantum Clustering (QC), deals with unlabeled data processing and constructs a
potential function to find the centroids of clusters and the outliers. The
experiments show that the potential function could clearly find the hidden
outliers in data points effectively. Besides, by using QC, we could find more
subtle outliers by adjusting the parameter $\sigma$. Moreover, our approach is
also evaluated on two datasets (Air Quality Detection and Darwin Correspondence
Project) from two different research areas, and the results show the wide
applicability of our method.
Related papers
- Fuzzy Granule Density-Based Outlier Detection with Multi-Scale Granular Balls [65.44462297594308]
Outlier detection refers to the identification of anomalous samples that deviate significantly from the distribution of normal data.
Most unsupervised outlier detection methods are carefully designed to detect specified outliers.
We propose a fuzzy rough sets-based multi-scale outlier detection method to identify various types of outliers.
arXiv Detail & Related papers (2025-01-06T12:35:51Z) - Incremental Gaussian Mixture Clustering for Data Streams [0.08192907805418582]
We present and demonstrate effective working of an algorithm to find clusters and anomalous data points in a streaming datasets.
As the clusters are formed we also identify anomalous datapoints that show up far away from all known clusters.
arXiv Detail & Related papers (2024-12-10T06:15:14Z) - Anomaly Detection by Context Contrasting [57.695202846009714]
Anomaly detection focuses on identifying samples that deviate from the norm.
Recent advances in self-supervised learning have shown great promise in this regard.
We propose Con$$, which learns through context augmentations.
arXiv Detail & Related papers (2024-05-29T07:59:06Z) - HLoOP -- Hyperbolic 2-space Local Outlier Probabilities [4.030910640265943]
This paper introduces a simple framework to detect local outliers for datasets grounded in hyperbolic 2-spaces.
The developed HLoOP combines the idea of finding nearest neighbors, density-based outlier scoring with a probabilistic, statistically oriented approach.
The HLoOP algorithm is tested on the WordNet dataset yielding promising results.
arXiv Detail & Related papers (2023-12-06T20:38:39Z) - Beyond Labels: Advancing Cluster Analysis with the Entropy of Distance
Distribution (EDD) [0.0]
Entropy of Distance Distribution (EDD) is a paradigm shift in label-free clustering analysis.
Our method employs the Shannon information entropy to quantify the 'peakedness' or 'flatness' of distance distributions in a data set.
EDD's potential extends beyond conventional clustering analysis, offering a robust, scalable tool for unraveling complex data structures.
arXiv Detail & Related papers (2023-11-28T09:22:17Z) - Rethinking k-means from manifold learning perspective [122.38667613245151]
We present a new clustering algorithm which directly detects clusters of data without mean estimation.
Specifically, we construct distance matrix between data points by Butterworth filter.
To well exploit the complementary information embedded in different views, we leverage the tensor Schatten p-norm regularization.
arXiv Detail & Related papers (2023-05-12T03:01:41Z) - Anomaly Clustering: Grouping Images into Coherent Clusters of Anomaly
Types [60.45942774425782]
We introduce anomaly clustering, whose goal is to group data into coherent clusters of anomaly types.
This is different from anomaly detection, whose goal is to divide anomalies from normal data.
We present a simple yet effective clustering framework using a patch-based pretrained deep embeddings and off-the-shelf clustering methods.
arXiv Detail & Related papers (2021-12-21T23:11:33Z) - DAAIN: Detection of Anomalous and Adversarial Input using Normalizing
Flows [52.31831255787147]
We introduce a novel technique, DAAIN, to detect out-of-distribution (OOD) inputs and adversarial attacks (AA)
Our approach monitors the inner workings of a neural network and learns a density estimator of the activation distribution.
Our model can be trained on a single GPU making it compute efficient and deployable without requiring specialized accelerators.
arXiv Detail & Related papers (2021-05-30T22:07:13Z) - Am I Rare? An Intelligent Summarization Approach for Identifying Hidden
Anomalies [0.0]
In this paper, we propose an INtelligent Summarization approach for IDENTifying hidden anomalies, called INSIDENT.
Our approach is a clustering-based algorithm that dynamically maps original feature space to a new feature space by locally weighting features in each cluster. Besides, selecting representatives based on cluster size keeps the same distribution as the original data in summarized data.
arXiv Detail & Related papers (2020-12-24T23:22:57Z) - Classify and Generate Reciprocally: Simultaneous Positive-Unlabelled
Learning and Conditional Generation with Extra Data [77.31213472792088]
The scarcity of class-labeled data is a ubiquitous bottleneck in many machine learning problems.
We address this problem by leveraging Positive-Unlabeled(PU) classification and the conditional generation with extra unlabeled data.
We present a novel training framework to jointly target both PU classification and conditional generation when exposed to extra data.
arXiv Detail & Related papers (2020-06-14T08:27:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.