Lizard: A Large-Scale Dataset for Colonic Nuclear Instance Segmentation
and Classification
- URL: http://arxiv.org/abs/2108.11195v1
- Date: Wed, 25 Aug 2021 11:58:52 GMT
- Title: Lizard: A Large-Scale Dataset for Colonic Nuclear Instance Segmentation
and Classification
- Authors: Simon Graham, Mostafa Jahanifar, Ayesha Azam, Mohammed Nimir, Yee-Wah
Tsang, Katherine Dodd, Emily Hero, Harvir Sahota, Atisha Tank, Ksenija Benes,
Noorul Wahab, Fayyaz Minhas, Shan E Ahmed Raza, Hesham El Daly, Kishore
Gopalakrishnan, David Snead, Nasir Rajpoot
- Abstract summary: We propose a multi-stage annotation pipeline to enable the collection of large-scale datasets for histology image analysis.
We generate the largest known nuclear instance segmentation and classification dataset, containing nearly half a million labelled nuclei.
- Score: 4.642724910208435
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: The development of deep segmentation models for computational pathology
(CPath) can help foster the investigation of interpretable morphological
biomarkers. Yet, there is a major bottleneck in the success of such approaches
because supervised deep learning models require an abundance of accurately
labelled data. This issue is exacerbated in the field of CPath because the
generation of detailed annotations usually demands the input of a pathologist
to be able to distinguish between different tissue constructs and nuclei.
Manually labelling nuclei may not be a feasible approach for collecting
large-scale annotated datasets, especially when a single image region can
contain thousands of different cells. However, solely relying on automatic
generation of annotations will limit the accuracy and reliability of ground
truth. Therefore, to help overcome the above challenges, we propose a
multi-stage annotation pipeline to enable the collection of large-scale
datasets for histology image analysis, with pathologist-in-the-loop refinement
steps. Using this pipeline, we generate the largest known nuclear instance
segmentation and classification dataset, containing nearly half a million
labelled nuclei in H&E stained colon tissue. We have released the dataset and
encourage the research community to utilise it to drive forward the development
of downstream cell-based models in CPath.
Related papers
- Towards a Benchmark for Colorectal Cancer Segmentation in Endorectal Ultrasound Videos: Dataset and Model Development [59.74920439478643]
In this paper, we collect and annotated the first benchmark dataset that covers diverse ERUS scenarios.
Our ERUS-10K dataset comprises 77 videos and 10,000 high-resolution annotated frames.
We introduce a benchmark model for colorectal cancer segmentation, named the Adaptive Sparse-context TRansformer (ASTR)
arXiv Detail & Related papers (2024-08-19T15:04:42Z) - AWGUNET: Attention-Aided Wavelet Guided U-Net for Nuclei Segmentation in Histopathology Images [26.333686941245197]
We present a segmentation approach that combines the U-Net architecture with a DenseNet-121 backbone.
Our model introduces the Wavelet-guided channel attention module to enhance cell boundary delineation.
The experimental results conducted on two publicly accessible histopathology datasets, namely Monuseg and TNBC, underscore the superiority of our proposed model.
arXiv Detail & Related papers (2024-06-12T17:10:27Z) - UniCell: Universal Cell Nucleus Classification via Prompt Learning [76.11864242047074]
We propose a universal cell nucleus classification framework (UniCell)
It employs a novel prompt learning mechanism to uniformly predict the corresponding categories of pathological images from different dataset domains.
In particular, our framework adopts an end-to-end architecture for nuclei detection and classification, and utilizes flexible prediction heads for adapting various datasets.
arXiv Detail & Related papers (2024-02-20T11:50:27Z) - BoNuS: Boundary Mining for Nuclei Segmentation with Partial Point Labels [34.57288003249214]
We propose a weakly-supervised nuclei segmentation method that only requires partial point labels of nuclei.
Specifically, we propose a novel boundary mining framework for nuclei segmentation, named BoNuS, which simultaneously learns nuclei interior and boundary information from the point labels.
We consider a nuclei detection module with curriculum learning to detect the missing nuclei with prior morphological knowledge.
arXiv Detail & Related papers (2024-01-15T02:50:39Z) - Diffusion-based Data Augmentation for Nuclei Image Segmentation [68.28350341833526]
We introduce the first diffusion-based augmentation method for nuclei segmentation.
The idea is to synthesize a large number of labeled images to facilitate training the segmentation model.
The experimental results show that by augmenting 10% labeled real dataset with synthetic samples, one can achieve comparable segmentation results.
arXiv Detail & Related papers (2023-10-22T06:16:16Z) - PointNu-Net: Keypoint-assisted Convolutional Neural Network for
Simultaneous Multi-tissue Histology Nuclei Segmentation and Classification [23.466331358975044]
We study and design a new method to simultaneously detect, segment, and classify nuclei from Haematoxylin and Eosin stained histopathology data.
We demonstrate the superior performance of our proposed approach for nuclei segmentation and classification across 19 different tissue types.
arXiv Detail & Related papers (2021-11-01T08:29:40Z) - NuCLS: A scalable crowdsourcing, deep learning approach and dataset for
nucleus classification, localization and segmentation [0.0]
Deep learning algorithms can provide accurate mappings given large numbers of labeled instances for training and validation.
generating adequate volume of quality labels has emerged as a critical barrier in computational pathology.
We show how suggested annotations generated by a weak algorithm can improve the accuracy of annotations generated by non-experts.
arXiv Detail & Related papers (2021-02-18T01:17:17Z) - Towards an Automatic Analysis of CHO-K1 Suspension Growth in
Microfluidic Single-cell Cultivation [63.94623495501023]
We propose a novel Machine Learning architecture, which allows us to infuse a neural deep network with human-powered abstraction on the level of data.
Specifically, we train a generative model simultaneously on natural and synthetic data, so that it learns a shared representation, from which a target variable, such as the cell count, can be reliably estimated.
arXiv Detail & Related papers (2020-10-20T08:36:51Z) - Instance-aware Self-supervised Learning for Nuclei Segmentation [47.07869311690419]
We propose a novel self-supervised learning framework to exploit the capacity of convolutional neural networks (CNNs) on the nuclei instance segmentation task.
The proposed approach involves two sub-tasks, which enable neural networks to implicitly leverage the prior-knowledge of nuclei size and quantity.
Experimental results on the publicly available MoNuSeg dataset show that the proposed self-supervised learning approach can remarkably boost the segmentation accuracy of nuclei instance.
arXiv Detail & Related papers (2020-07-22T03:37:14Z) - Self-Supervised Nuclei Segmentation in Histopathological Images Using
Attention [6.3039500405009665]
We present a self-supervised approach for segmentation of nuclei for whole slide histopathology images.
Our method works on the assumption that the size and texture of nuclei can determine the magnification at which a patch is extracted.
Our experiments show that with standard post-processing, our method can outperform other unsupervised nuclei segmentation approaches.
arXiv Detail & Related papers (2020-07-16T14:49:20Z) - Weakly Supervised Deep Nuclei Segmentation Using Partial Points
Annotation in Histopathology Images [51.893494939675314]
We propose a novel weakly supervised segmentation framework based on partial points annotation.
We show that our method can achieve competitive performance compared to the fully supervised counterpart and the state-of-the-art methods.
arXiv Detail & Related papers (2020-07-10T15:41:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.