A Novel Granular-Based Bi-Clustering Method of Deep Mining the
Co-Expressed Genes
- URL: http://arxiv.org/abs/2005.05519v1
- Date: Tue, 12 May 2020 02:04:40 GMT
- Title: A Novel Granular-Based Bi-Clustering Method of Deep Mining the
Co-Expressed Genes
- Authors: Kaijie Xu, Witold Pedrycz, Zhiwu Li, Yinghui Quan, and Weike Nie
- Abstract summary: Bi-clustering methods are used to mine bi-clusters whose subsets of samples (genes) are co-regulated under their test conditions.
Unfortunately, traditional bi-clustering methods are not fully effective in discovering such bi-clusters.
We propose a novel bi-clustering method by involving here the theory of Granular Computing.
- Score: 76.84066556597342
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Traditional clustering methods are limited when dealing with huge and
heterogeneous groups of gene expression data, which motivates the development
of bi-clustering methods. Bi-clustering methods are used to mine bi-clusters
whose subsets of samples (genes) are co-regulated under their test conditions.
Studies show that mining bi-clusters of consistent trends and trends with
similar degrees of fluctuations from the gene expression data is essential in
bioinformatics research. Unfortunately, traditional bi-clustering methods are
not fully effective in discovering such bi-clusters. Therefore, we propose a
novel bi-clustering method by involving here the theory of Granular Computing.
In the proposed scheme, the gene data matrix, considered as a group of time
series, is transformed into a series of ordered information granules. With the
information granules we build a characteristic matrix of the gene data to
capture the fluctuation trend of the expression value between consecutive
conditions to mine the ideal bi-clusters. The experimental results are in
agreement with the theoretical analysis, and show the excellent performance of
the proposed method.
Related papers
- Robust Multi-view Co-expression Network Inference [8.697303234009528]
Inferring gene co-expression networks from transcriptome data presents many challenges.
We introduce a robust method for high-dimensional graph inference from multiple independent studies.
arXiv Detail & Related papers (2024-09-30T06:30:09Z) - HBIC: A Biclustering Algorithm for Heterogeneous Datasets [0.0]
Biclustering is an unsupervised machine-learning approach aiming to cluster rows and columns simultaneously in a data matrix.
We introduce a biclustering approach called HBIC, capable of discovering meaningful biclusters in complex heterogeneous data.
arXiv Detail & Related papers (2024-08-23T16:48:10Z) - HeNCler: Node Clustering in Heterophilous Graphs through Learned Asymmetric Similarity [55.27586970082595]
HeNCler is a novel approach for Heterophilous Node Clustering.
We show that HeNCler significantly enhances performance in node clustering tasks within heterophilous graph contexts.
arXiv Detail & Related papers (2024-05-27T11:04:05Z) - Single-Cell Deep Clustering Method Assisted by Exogenous Gene
Information: A Novel Approach to Identifying Cell Types [50.55583697209676]
We develop an attention-enhanced graph autoencoder, which is designed to efficiently capture the topological features between cells.
During the clustering process, we integrated both sets of information and reconstructed the features of both cells and genes to generate a discriminative representation.
This research offers enhanced insights into the characteristics and distribution of cells, thereby laying the groundwork for early diagnosis and treatment of diseases.
arXiv Detail & Related papers (2023-11-28T09:14:55Z) - A Novel Fuzzy Bi-Clustering Algorithm with AFS for Identification of
Co-Regulated Genes [0.799536002595393]
This paper proposes a novel fuzzy bi-clustering algorithm for identification of co-regulated genes.
The proposed algorithm can effectively detect the co-regulated genes without any prior knowledge of the gene expression data.
arXiv Detail & Related papers (2023-02-03T08:35:49Z) - Uniform tensor clustering by jointly exploring sample affinities of
various orders [37.11798745294855]
We propose a unified tensor clustering method (UTC) that characterizes sample proximity using multiple samples' affinity.
UTC is affirmed to enhance clustering by exploiting different order affinities when processing high-dimensional data.
arXiv Detail & Related papers (2023-02-03T06:43:08Z) - Clustering Ensemble Meets Low-rank Tensor Approximation [50.21581880045667]
This paper explores the problem of clustering ensemble, which aims to combine multiple base clusterings to produce better performance than that of the individual one.
We propose a novel low-rank tensor approximation-based method to solve the problem from a global perspective.
Experimental results over 7 benchmark data sets show that the proposed model achieves a breakthrough in clustering performance, compared with 12 state-of-the-art methods.
arXiv Detail & Related papers (2020-12-16T13:01:37Z) - Conjoined Dirichlet Process [63.89763375457853]
We develop a novel, non-parametric probabilistic biclustering method based on Dirichlet processes to identify biclusters with strong co-occurrence in both rows and columns.
We apply our method to two different applications, text mining and gene expression analysis, and demonstrate that our method improves bicluster extraction in many settings compared to existing approaches.
arXiv Detail & Related papers (2020-02-08T19:41:23Z) - Clustering Binary Data by Application of Combinatorial Optimization
Heuristics [52.77024349608834]
We study clustering methods for binary data, first defining aggregation criteria that measure the compactness of clusters.
Five new and original methods are introduced, using neighborhoods and population behavior optimization metaheuristics.
From a set of 16 data tables generated by a quasi-Monte Carlo experiment, a comparison is performed for one of the aggregations using L1 dissimilarity, with hierarchical clustering, and a version of k-means: partitioning around medoids or PAM.
arXiv Detail & Related papers (2020-01-06T23:33:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.