Immunocto: a massive immune cell database auto-generated for histopathology
- URL: http://arxiv.org/abs/2406.02618v1
- Date: Mon, 3 Jun 2024 17:03:58 GMT
- Title: Immunocto: a massive immune cell database auto-generated for histopathology
- Authors: Mikaƫl Simard, Zhuoyan Shen, Maria A. Hawkins, Charles-Antoine Collins-Fekete,
- Abstract summary: We introduce Immunocto, a massive, multi-million automatically generated database of 6,848,454 human cells.
We provide a 64$times$64 pixels H&E image at $mathbf40times$ magnification, along with a binary mask of the nucleus and a label.
For each cell, we provide a 64$times$64 pixels H&E image at $mathbf40times$ magnification, along with a binary mask of the nucleus and a label.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: With the advent of novel cancer treatment options such as immunotherapy, studying the tumour immune micro-environment is crucial to inform on prognosis and understand response to therapeutic agents. A key approach to characterising the tumour immune micro-environment may be through combining (1) digitised microscopic high-resolution optical images of hematoxylin and eosin (H&E) stained tissue sections obtained in routine histopathology examinations with (2) automated immune cell detection and classification methods. However, current individual immune cell classification models for digital pathology present relatively poor performance. This is mainly due to the limited size of currently available datasets of individual immune cells, a consequence of the time-consuming and difficult problem of manually annotating immune cells on digitised H&E whole slide images. In that context, we introduce Immunocto, a massive, multi-million automatically generated database of 6,848,454 human cells, including 2,282,818 immune cells distributed across 4 subtypes: CD4$^+$ T cell lymphocytes, CD8$^+$ T cell lymphocytes, B cell lymphocytes, and macrophages. For each cell, we provide a 64$\times$64 pixels H&E image at $\mathbf{40}\times$ magnification, along with a binary mask of the nucleus and a label. To create Immunocto, we combined open-source models and data to automatically generate the majority of contours and labels. The cells are obtained from a matched H&E and immunofluorescence colorectal dataset from the Orion platform, while contours are obtained using the Segment Anything Model. A classifier trained on H&E images from Immunocto produces an average F1 score of 0.74 to differentiate the 4 immune cell subtypes and other cells. Immunocto can be downloaded at: https://zenodo.org/uploads/11073373.
Related papers
- Semi-supervised variational autoencoder for cell feature extraction in multiplexed immunofluorescence images [40.234346302444536]
We propose a deep learning-based cell feature extraction model using a variational autoencoder with supervision.
We perform cell phenotype classification using a cohort of more than 44,000 multiplexed immunofluorescence cell image patches extracted across 1,093 tissue microarray cores of breast cancer patients.
arXiv Detail & Related papers (2024-06-22T04:32:50Z) - MMIL: A novel algorithm for disease associated cell type discovery [58.044870442206914]
Single-cell datasets often lack individual cell labels, making it challenging to identify cells associated with disease.
We introduce Mixture Modeling for Multiple Learning Instance (MMIL), an expectation method that enables the training and calibration of cell-level classifiers.
arXiv Detail & Related papers (2024-06-12T15:22:56Z) - Single-Cell Deep Clustering Method Assisted by Exogenous Gene
Information: A Novel Approach to Identifying Cell Types [50.55583697209676]
We develop an attention-enhanced graph autoencoder, which is designed to efficiently capture the topological features between cells.
During the clustering process, we integrated both sets of information and reconstructed the features of both cells and genes to generate a discriminative representation.
This research offers enhanced insights into the characteristics and distribution of cells, thereby laying the groundwork for early diagnosis and treatment of diseases.
arXiv Detail & Related papers (2023-11-28T09:14:55Z) - Immunohistochemistry guided segmentation of benign epithelial cells, in situ lesions, and invasive epithelial cells in breast cancer slides [0.3251634769699391]
We developed an AI model for segmentation of epithelial cells in sections from breast cancer.
Quantitative evaluation, a mean Dice score of 0.70, 0.79, and 0.75 for invasive epithelial cells, benign epithelial cells, and in situ lesions, respectively, were achieved.
arXiv Detail & Related papers (2023-11-22T09:25:08Z) - An AI-Ready Multiplex Staining Dataset for Reproducible and Accurate
Characterization of Tumor Immune Microenvironment [7.595983383242494]
We introduce a new AI-ready computational pathology dataset containing restained and co-registered digitized images from eight head-and-neck squamous cell carcinoma patients.
The same tumor sections were stained with the expensive multiplex immunofluorescence (mIF) assay first and then restained with cheaper multiplexchemistry (mIHC)
This is a first public dataset that demonstrates the equivalence of these two staining methods which in turn allows several use cases.
arXiv Detail & Related papers (2023-05-25T20:42:23Z) - Artificial-intelligence-based molecular classification of diffuse
gliomas using rapid, label-free optical imaging [59.79875531898648]
DeepGlioma is an artificial-intelligence-based diagnostic screening system.
DeepGlioma can predict the molecular alterations used by the World Health Organization to define the adult-type diffuse glioma taxonomy.
arXiv Detail & Related papers (2023-03-23T18:50:18Z) - Generating counterfactual explanations of tumor spatial proteomes to
discover effective strategies for enhancing immune infiltration [44.99833362998488]
The tumor microenvironment (TME) significantly impacts cancer prognosis due to its immune composition.
Here, we formulate T-cell infiltration prediction as a self-supervised machine learning problem.
We apply our framework to melanoma, colorectal cancer liver metastases, and breast tumor data, discovering perturbations predicted to support T-cell infiltration.
arXiv Detail & Related papers (2022-11-08T05:46:02Z) - Employing Feature Selection Algorithms to Determine the Immune State of
a Mouse Model of Rheumatoid Arthritis [43.410962336636224]
The immune response is a dynamic process by which the body determines whether an antigen is self or nonself.
The goal of immunotherapy as applied to, e.g. Rheumatoid Arthritis (RA), is to bias the immune state in favor of the regulatory actors.
arXiv Detail & Related papers (2022-07-12T23:08:47Z) - Federated Learning Enables Big Data for Rare Cancer Boundary Detection [98.5549882883963]
We present findings from the largest Federated ML study to-date, involving data from 71 healthcare institutions across 6 continents.
We generate an automatic tumor boundary detector for the rare disease of glioblastoma.
We demonstrate a 33% improvement over a publicly trained model to delineate the surgically targetable tumor, and 23% improvement over the tumor's entire extent.
arXiv Detail & Related papers (2022-04-22T17:27:00Z) - A Pragmatic Machine Learning Approach to Quantify Tumor Infiltrating
Lymphocytes in Whole Slide Images [0.0]
Increased levels of tumor infiltrating lymphocytes (TILs) in cancer tissue indicate favourable outcomes in many types of cancer.
Our aim is to leverage a computational solution to automatically quantify TILs in whole slide images (WSIs) of standard diagnostic haematoxylin and eosin stained sections (H&E slides) from lung cancer patients.
arXiv Detail & Related papers (2022-02-14T10:22:10Z) - Hoechst Is All You Need: LymphocyteClassification with Deep Learning [15.530447606593238]
Hoechst staining is much cheaper and easier to perform, but is not typically used in this case as it binds to DNA rather than to the proteins targeted by immunofluorescent techniques.
It was not previously thought possible to differentiate cells expressing these proteins based only on DNA morphology.
In this work we show otherwise, training a deep convolutional neural network to identify cells expressing three proteins with greater than 90% precision and recall.
arXiv Detail & Related papers (2021-07-09T12:33:22Z) - A Multi-Scale Conditional Deep Model for Tumor Cell Ratio Counting [4.164451715899639]
We propose a method to accurately obtain the ratio of tumor cells over an entire histological slide.
We use deep fully convolutional neural network models trained to detect and classify cells on images of H&E-stained tissue sections.
We show that combining two models, each working at a different magnification allows the system to capture both cell-level details and surrounding context.
arXiv Detail & Related papers (2021-01-27T22:40:33Z) - Towards an Automatic Analysis of CHO-K1 Suspension Growth in
Microfluidic Single-cell Cultivation [63.94623495501023]
We propose a novel Machine Learning architecture, which allows us to infuse a neural deep network with human-powered abstraction on the level of data.
Specifically, we train a generative model simultaneously on natural and synthetic data, so that it learns a shared representation, from which a target variable, such as the cell count, can be reliably estimated.
arXiv Detail & Related papers (2020-10-20T08:36:51Z) - Automated Phenotyping via Cell Auto Training (CAT) on the Cell DIVE
Platform [0.5599792629509229]
We present a method for automatic cell classification in tissue samples using an automated training set from multiplexed immunofluorescence images.
The method utilizes multiple markers stained in situ on a single tissue section on a robust hyperplex immunofluorescence platform.
arXiv Detail & Related papers (2020-07-18T16:45:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.