A Petri Dish for Histopathology Image Analysis
- URL: http://arxiv.org/abs/2101.12355v1
- Date: Fri, 29 Jan 2021 02:01:45 GMT
- Title: A Petri Dish for Histopathology Image Analysis
- Authors: Jerry Wei and Arief Suriawinata and Bing Ren and Xiaoying Liu and
Mikhail Lisovsky and Louis Vaickus and Charles Brown and Michael Baker and
Naofumi Tomita and Lorenzo Torresani and Jason Wei and Saeed Hassanpour
- Abstract summary: We introduce a minimalist histopathology image analysis dataset (MHIST)
MHIST is a binary classification dataset of 3,152 fixed-size images of colorectal polyps.
MHIST occupies less than 400 MB of disk space, and a ResNet-18 baseline can be trained to convergence on MHIST in just 6 minutes.
- Score: 25.424907516487327
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: With the rise of deep learning, there has been increased interest in using
neural networks for histopathology image analysis, a field that investigates
the properties of biopsy or resected specimens that are traditionally manually
examined under a microscope by pathologists. In histopathology image analysis,
however, challenges such as limited data, costly annotation, and processing
high-resolution and variable-size images create a high barrier of entry and
make it difficult to quickly iterate over model designs.
Throughout scientific history, many significant research directions have
leveraged small-scale experimental setups as petri dishes to efficiently
evaluate exploratory ideas, which are then validated in large-scale
applications. For instance, the Drosophila fruit fly in genetics and MNIST in
computer vision are well-known petri dishes. In this paper, we introduce a
minimalist histopathology image analysis dataset (MHIST), an analogous petri
dish for histopathology image analysis. MHIST is a binary classification
dataset of 3,152 fixed-size images of colorectal polyps, each with a
gold-standard label determined by the majority vote of seven board-certified
gastrointestinal pathologists and annotator agreement level. MHIST occupies
less than 400 MB of disk space, and a ResNet-18 baseline can be trained to
convergence on MHIST in just 6 minutes using 3.5 GB of memory on a NVIDIA RTX
3090. As example use cases, we use MHIST to study natural questions such as how
dataset size, network depth, transfer learning, and high-disagreement examples
affect model performance.
By introducing MHIST, we hope to not only help facilitate the work of current
histopathology imaging researchers, but also make histopathology image analysis
more accessible to the general computer vision community. Our dataset is
available at https://bmirds.github.io/MHIST.
Related papers
- From Pixels to Histopathology: A Graph-Based Framework for Interpretable Whole Slide Image Analysis [81.19923502845441]
We develop a graph-based framework that constructs WSI graph representations.
We build tissue representations (nodes) that follow biological boundaries rather than arbitrary patches.
In our method's final step, we solve the diagnostic task through a graph attention network.
arXiv Detail & Related papers (2025-03-14T20:15:04Z) - μ-Net: A Deep Learning-Based Architecture for μ-CT Segmentation [2.012378666405002]
X-ray computed microtomography (mu-CT) is a non-destructive technique that can generate high-resolution 3D images of the internal anatomy of medical and biological samples.
extracting relevant information from 3D images requires semantic segmentation of the regions of interest.
We propose a novel framework that uses a convolutional neural network (CNN) to automatically segment the full morphology of the heart of Carassius auratus.
arXiv Detail & Related papers (2024-06-24T15:29:08Z) - Learning Multimodal Volumetric Features for Large-Scale Neuron Tracing [72.45257414889478]
We aim to reduce human workload by predicting connectivity between over-segmented neuron pieces.
We first construct a dataset, named FlyTracing, that contains millions of pairwise connections of segments expanding the whole fly brain.
We propose a novel connectivity-aware contrastive learning method to generate dense volumetric EM image embedding.
arXiv Detail & Related papers (2024-01-05T19:45:12Z) - Rotation-Agnostic Image Representation Learning for Digital Pathology [0.8246494848934447]
This paper introduces a fast patch selection method, FPS, for whole-slide image (WSI) analysis.
It also presents PathDino, a lightweight histopathology feature extractor with a minimal configuration of five Transformer blocks.
We show that our compact model outperforms existing state-of-the-art histopathology-specific vision transformers on 12 diverse datasets.
arXiv Detail & Related papers (2023-11-14T18:01:15Z) - Data-Efficient Vision Transformers for Multi-Label Disease
Classification on Chest Radiographs [55.78588835407174]
Vision Transformers (ViTs) have not been applied to this task despite their high classification performance on generic images.
ViTs do not rely on convolutions but on patch-based self-attention and in contrast to CNNs, no prior knowledge of local connectivity is present.
Our results show that while the performance between ViTs and CNNs is on par with a small benefit for ViTs, DeiTs outperform the former if a reasonably large data set is available for training.
arXiv Detail & Related papers (2022-08-17T09:07:45Z) - Gram Barcodes for Histopathology Tissue Texture Retrieval [0.0]
Histopathology Image Retrieval (HIR) systems search through databases of biopsy images to find similar cases to a given query image.
We propose the application of Gram barcodes as image features for HIR systems.
arXiv Detail & Related papers (2021-11-28T17:59:42Z) - Pathological Analysis of Blood Cells Using Deep Learning Techniques [0.0]
A neural based network has been proposed for classification of blood cells images into various categories.
The performance of proposed model is better than existing standard architectures and work done by various researchers.
arXiv Detail & Related papers (2021-11-05T05:37:10Z) - A QuadTree Image Representation for Computational Pathology [1.8047694351309205]
Histopathology images are large and need to be split up into image tiles or patches so modern convolutional neural networks (CNNs) can process them.
We present a method to generate an interpretable image representation of computational pathology images using quadtrees and a pipeline.
arXiv Detail & Related papers (2021-08-24T17:53:19Z) - Machine Learning Methods for Histopathological Image Analysis: A Review [62.14548392474976]
Histopathological images (HIs) are the gold standard for evaluating some types of tumors for cancer diagnosis.
One of the ways of accelerating such an analysis is to use computer-aided diagnosis (CAD) systems.
arXiv Detail & Related papers (2021-02-07T19:12:32Z) - Generative Adversarial U-Net for Domain-free Medical Image Augmentation [49.72048151146307]
The shortage of annotated medical images is one of the biggest challenges in the field of medical image computing.
In this paper, we develop a novel generative method named generative adversarial U-Net.
Our newly designed model is domain-free and generalizable to various medical images.
arXiv Detail & Related papers (2021-01-12T23:02:26Z) - Deep Low-Shot Learning for Biological Image Classification and
Visualization from Limited Training Samples [52.549928980694695]
In situ hybridization (ISH) gene expression pattern images from the same developmental stage are compared.
labeling training data with precise stages is very time-consuming even for biologists.
We propose a deep two-step low-shot learning framework to accurately classify ISH images using limited training images.
arXiv Detail & Related papers (2020-10-20T06:06:06Z) - Interpretation of 3D CNNs for Brain MRI Data Classification [56.895060189929055]
We extend the previous findings in gender differences from diffusion-tensor imaging on T1 brain MRI scans.
We provide the voxel-wise 3D CNN interpretation comparing the results of three interpretation methods.
arXiv Detail & Related papers (2020-06-20T17:56:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.