Review of Clustering Methods for Functional Data
- URL: http://arxiv.org/abs/2210.00847v1
- Date: Mon, 3 Oct 2022 12:15:23 GMT
- Title: Review of Clustering Methods for Functional Data
- Authors: Mimi Zhang and Andrew Parnell
- Abstract summary: The review aims to bridge the gap between the functional data analysis community and the clustering community.
We propose a systematic taxonomy that explores the connections and differences among the existing functional data clustering methods.
- Score: 1.827510863075184
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Functional data clustering is to identify heterogeneous morphological
patterns in the continuous functions underlying the discrete
measurements/observations. Application of functional data clustering has
appeared in many publications across various fields of sciences, including but
not limited to biology, (bio)chemistry, engineering, environmental science,
medical science, psychology, social science, etc. The phenomenal growth of the
application of functional data clustering indicates the urgent need for a
systematic approach to develop efficient clustering methods and scalable
algorithmic implementations. On the other hand, there is abundant literature on
the cluster analysis of time series, trajectory data, spatio-temporal data,
etc., which are all related to functional data. Therefore, an overarching
structure of existing functional data clustering methods will enable the
cross-pollination of ideas across various research fields. We here conduct a
comprehensive review of original clustering methods for functional data. We
propose a systematic taxonomy that explores the connections and differences
among the existing functional data clustering methods and relates them to the
conventional multivariate clustering methods. The structure of the taxonomy is
built on three main attributes of a functional data clustering method and
therefore is more reliable than existing categorizations. The review aims to
bridge the gap between the functional data analysis community and the
clustering community and to generate new principles for functional data
clustering.
Related papers
- UniCell: Universal Cell Nucleus Classification via Prompt Learning [76.11864242047074]
We propose a universal cell nucleus classification framework (UniCell)
It employs a novel prompt learning mechanism to uniformly predict the corresponding categories of pathological images from different dataset domains.
In particular, our framework adopts an end-to-end architecture for nuclei detection and classification, and utilizes flexible prediction heads for adapting various datasets.
arXiv Detail & Related papers (2024-02-20T11:50:27Z) - Generating Hierarchical Structures for Improved Time Series
Classification Using Stochastic Splitting Functions [0.0]
This study introduces a novel hierarchical divisive clustering approach with splitting functions (SSFs) to enhance classification performance in multi-class datasets through hierarchical classification (HC)
The method has the unique capability of generating hierarchy without requiring explicit information, making it suitable for datasets lacking prior knowledge of hierarchy.
arXiv Detail & Related papers (2023-09-21T10:34:50Z) - funLOCI: a local clustering algorithm for functional data [0.0]
funLOCI is a three-step algorithm based on divisive hierarchical clustering.
To deal with the large quantity of local clusters, an extra step is implemented to reduce the number of results to the minimum.
arXiv Detail & Related papers (2023-05-22T12:51:58Z) - Unified Multi-View Orthonormal Non-Negative Graph Based Clustering
Framework [74.25493157757943]
We formulate a novel clustering model, which exploits the non-negative feature property and incorporates the multi-view information into a unified joint learning framework.
We also explore, for the first time, the multi-model non-negative graph-based approach to clustering data based on deep features.
arXiv Detail & Related papers (2022-11-03T08:18:27Z) - Deep Clustering: A Comprehensive Survey [53.387957674512585]
Clustering analysis plays an indispensable role in machine learning and data mining.
Deep clustering, which can learn clustering-friendly representations using deep neural networks, has been broadly applied in a wide range of clustering tasks.
Existing surveys for deep clustering mainly focus on the single-view fields and the network architectures, ignoring the complex application scenarios of clustering.
arXiv Detail & Related papers (2022-10-09T02:31:32Z) - Detection and Evaluation of Clusters within Sequential Data [58.720142291102135]
Clustering algorithms for Block Markov Chains possess theoretical optimality guarantees.
In particular, our sequential data is derived from human DNA, written text, animal movement data and financial markets.
It is found that the Block Markov Chain model assumption can indeed produce meaningful insights in exploratory data analyses.
arXiv Detail & Related papers (2022-10-04T15:22:39Z) - Clustering Optimisation Method for Highly Connected Biological Data [0.0]
We show how a simple metric for connectivity clustering evaluation leads to an optimised segmentation of biological data.
The novelty of the work resides in the creation of a simple optimisation method for clustering crowded data.
arXiv Detail & Related papers (2022-08-08T17:33:32Z) - Enhancing cluster analysis via topological manifold learning [0.3823356975862006]
We show that inferring the topological structure of a dataset before clustering can considerably enhance cluster detection.
We combine manifold learning method UMAP for inferring the topological structure with density-based clustering method DBSCAN.
arXiv Detail & Related papers (2022-07-01T15:53:39Z) - Scalable Hierarchical Agglomerative Clustering [65.66407726145619]
Existing scalable hierarchical clustering methods sacrifice quality for speed.
We present a scalable, agglomerative method for hierarchical clustering that does not sacrifice quality and scales to billions of data points.
arXiv Detail & Related papers (2020-10-22T15:58:35Z) - Structured Graph Learning for Clustering and Semi-supervised
Classification [74.35376212789132]
We propose a graph learning framework to preserve both the local and global structure of data.
Our method uses the self-expressiveness of samples to capture the global structure and adaptive neighbor approach to respect the local structure.
Our model is equivalent to a combination of kernel k-means and k-means methods under certain condition.
arXiv Detail & Related papers (2020-08-31T08:41:20Z) - Heterogeneous Transfer Learning in Ensemble Clustering [0.0]
We consider a clustering problem in which "similar" labeled data are available.
The method is based on constructing meta-features which describe structural characteristics of data.
An experimental study of the method using Monte Carlo modeling has confirmed its efficiency.
arXiv Detail & Related papers (2020-01-20T16:03:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.