Fast Data Driven Estimation of Cluster Number in Multiplex Images using
Embedded Density Outliers
- URL: http://arxiv.org/abs/2207.10469v1
- Date: Thu, 21 Jul 2022 13:33:40 GMT
- Title: Fast Data Driven Estimation of Cluster Number in Multiplex Images using
Embedded Density Outliers
- Authors: Spencer A. Thomas
- Abstract summary: The usage of chemical imaging technologies is becoming a routine accompaniment to traditional methods in pathology.
Techniques such as imaging mass provide labelled multidimensional (multiplex) images of specific components used in conjunction with digital pathology techniques.
We propose a methodology to estimate the number of clusters in an automatic data-driven manner using a deep sparse autoencoder to embed the data into a lower dimensional space.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The usage of chemical imaging technologies is becoming a routine
accompaniment to traditional methods in pathology. Significant technological
advances have developed these next generation techniques to provide rich,
spatially resolved, multidimensional chemical images. The rise of digital
pathology has significantly enhanced the synergy of these imaging modalities
with optical microscopy and immunohistochemistry, enhancing our understanding
of the biological mechanisms and progression of diseases. Techniques such as
imaging mass cytometry provide labelled multidimensional (multiplex) images of
specific components used in conjunction with digital pathology techniques.
These powerful techniques generate a wealth of high dimensional data that
create significant challenges in data analysis. Unsupervised methods such as
clustering are an attractive way to analyse these data, however, they require
the selection of parameters such as the number of clusters. Here we propose a
methodology to estimate the number of clusters in an automatic data-driven
manner using a deep sparse autoencoder to embed the data into a lower
dimensional space. We compute the density of regions in the embedded space, the
majority of which are empty, enabling the high density regions to be detected
as outliers and provide an estimate for the number of clusters. This framework
provides a fully unsupervised and data-driven method to analyse
multidimensional data. In this work we demonstrate our method using 45
multiplex imaging mass cytometry datasets. Moreover, our model is trained using
only one of the datasets and the learned embedding is applied to the remaining
44 images providing an efficient process for data analysis. Finally, we
demonstrate the high computational efficiency of our method which is two orders
of magnitude faster than estimating via computing the sum squared distances as
a function of cluster number.
Related papers
- Multiplex Imaging Analysis in Pathology: a Comprehensive Review on Analytical Approaches and Digital Toolkits [0.7968706282619793]
Multi multiplexed imaging allows for simultaneous visualization of multiple biomarkers in a single section.
Data from multiplexed imaging requires sophisticated computational methods for preprocessing, segmentation, feature extraction, and spatial analysis.
PathML is an AI-powered platform that streamlines image analysis, making complex interpretation accessible for clinical and research settings.
arXiv Detail & Related papers (2024-11-01T18:02:41Z) - Private, Efficient and Scalable Kernel Learning for Medical Image Analysis [1.7999333451993955]
OKRA (Orthonormal K-fRAmes) is a novel randomized encoding-based approach for kernel-based machine learning.
It significantly enhances scalability and speed compared to current state-of-the-art solutions.
arXiv Detail & Related papers (2024-10-21T10:03:03Z) - Discriminative Hamiltonian Variational Autoencoder for Accurate Tumor Segmentation in Data-Scarce Regimes [2.8498944632323755]
We propose an end-to-end hybrid architecture for medical image segmentation.
We use Hamiltonian Variational Autoencoders (HVAE) and a discriminative regularization to improve the quality of generated images.
Our architecture operates on a slice-by-slice basis to segment 3D volumes, capitilizing on the richly augmented dataset.
arXiv Detail & Related papers (2024-06-17T15:42:08Z) - Rapid hyperspectral photothermal mid-infrared spectroscopic imaging from
sparse data for gynecologic cancer tissue subtyping [3.550171634694342]
Mid-infrared (MIR) hyperspectral photothermal imaging is a label-free, biochemically quantitative technology.
This work presents a novel approach to MIR photothermal imaging that enhances its speed by an order of magnitude.
arXiv Detail & Related papers (2024-02-28T00:57:35Z) - Datacube segmentation via Deep Spectral Clustering [76.48544221010424]
Extended Vision techniques often pose a challenge in their interpretation.
The huge dimensionality of data cube spectra poses a complex task in its statistical interpretation.
In this paper, we explore the possibility of applying unsupervised clustering methods in encoded space.
A statistical dimensional reduction is performed by an ad hoc trained (Variational) AutoEncoder, while the clustering process is performed by a (learnable) iterative K-Means clustering algorithm.
arXiv Detail & Related papers (2024-01-31T09:31:28Z) - Building RadiologyNET: Unsupervised annotation of a large-scale
multimodal medical database [0.4915744683251151]
The usage of machine learning in medical diagnosis and treatment has witnessed significant growth in recent years.
However, the availability of large annotated image datasets remains a major obstacle since the process of annotation is time-consuming and costly.
This paper explores how to automatically annotate a database of medical radiology images with regard to their semantic similarity.
arXiv Detail & Related papers (2023-07-27T13:00:33Z) - Two Approaches to Supervised Image Segmentation [55.616364225463066]
The present work develops comparison experiments between deep learning and multiset neurons approaches.
The deep learning approach confirmed its potential for performing image segmentation.
The alternative multiset methodology allowed for enhanced accuracy while requiring little computational resources.
arXiv Detail & Related papers (2023-07-19T16:42:52Z) - AMIGO: Sparse Multi-Modal Graph Transformer with Shared-Context
Processing for Representation Learning of Giga-pixel Images [53.29794593104923]
We present a novel concept of shared-context processing for whole slide histopathology images.
AMIGO uses the celluar graph within the tissue to provide a single representation for a patient.
We show that our model is strongly robust to missing information to an extent that it can achieve the same performance with as low as 20% of the data.
arXiv Detail & Related papers (2023-03-01T23:37:45Z) - Incorporating Texture Information into Dimensionality Reduction for
High-Dimensional Images [65.74185962364211]
We present a method for incorporating neighborhood information into distance-based dimensionality reduction methods.
Based on a classification of different methods for comparing image patches, we explore a number of different approaches.
arXiv Detail & Related papers (2022-02-18T13:17:43Z) - A parameter refinement method for Ptychography based on Deep Learning
concepts [55.41644538483948]
coarse parametrisation in propagation distance, position errors and partial coherence frequently menaces the experiment viability.
A modern Deep Learning framework is used to correct autonomously the setup incoherences, thus improving the quality of a ptychography reconstruction.
We tested our system on both synthetic datasets and also on real data acquired at the TwinMic beamline of the Elettra synchrotron facility.
arXiv Detail & Related papers (2021-05-18T10:15:17Z) - Pathological Retinal Region Segmentation From OCT Images Using Geometric
Relation Based Augmentation [84.7571086566595]
We propose improvements over previous GAN-based medical image synthesis methods by jointly encoding the intrinsic relationship of geometry and shape.
The proposed method outperforms state-of-the-art segmentation methods on the public RETOUCH dataset having images captured from different acquisition procedures.
arXiv Detail & Related papers (2020-03-31T11:50:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.