Related papers: A Petri Dish for Histopathology Image Analysis

A Petri Dish for Histopathology Image Analysis

URL: http://arxiv.org/abs/2101.12355v1
Date: Fri, 29 Jan 2021 02:01:45 GMT
Title: A Petri Dish for Histopathology Image Analysis
Authors: Jerry Wei and Arief Suriawinata and Bing Ren and Xiaoying Liu and Mikhail Lisovsky and Louis Vaickus and Charles Brown and Michael Baker and Naofumi Tomita and Lorenzo Torresani and Jason Wei and Saeed Hassanpour
Abstract summary: We introduce a minimalist histopathology image analysis dataset (MHIST) MHIST is a binary classification dataset of 3,152 fixed-size images of colorectal polyps. MHIST occupies less than 400 MB of disk space, and a ResNet-18 baseline can be trained to convergence on MHIST in just 6 minutes.
Score: 25.424907516487327
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: With the rise of deep learning, there has been increased interest in using neural networks for histopathology image analysis, a field that investigates the properties of biopsy or resected specimens that are traditionally manually examined under a microscope by pathologists. In histopathology image analysis, however, challenges such as limited data, costly annotation, and processing high-resolution and variable-size images create a high barrier of entry and make it difficult to quickly iterate over model designs. Throughout scientific history, many significant research directions have leveraged small-scale experimental setups as petri dishes to efficiently evaluate exploratory ideas, which are then validated in large-scale applications. For instance, the Drosophila fruit fly in genetics and MNIST in computer vision are well-known petri dishes. In this paper, we introduce a minimalist histopathology image analysis dataset (MHIST), an analogous petri dish for histopathology image analysis. MHIST is a binary classification dataset of 3,152 fixed-size images of colorectal polyps, each with a gold-standard label determined by the majority vote of seven board-certified gastrointestinal pathologists and annotator agreement level. MHIST occupies less than 400 MB of disk space, and a ResNet-18 baseline can be trained to convergence on MHIST in just 6 minutes using 3.5 GB of memory on a NVIDIA RTX 3090. As example use cases, we use MHIST to study natural questions such as how dataset size, network depth, transfer learning, and high-disagreement examples affect model performance. By introducing MHIST, we hope to not only help facilitate the work of current histopathology imaging researchers, but also make histopathology image analysis more accessible to the general computer vision community. Our dataset is available at https://bmirds.github.io/MHIST.

Related papers

PixCell: A generative foundation model for digital histopathology images [49.00921097924924]
We introduce PixCell, the first diffusion-based generative foundation model for histopathology.<n>We train PixCell on PanCan-30M, a vast, diverse dataset derived from 69,184 H&E-stained whole slide images covering various cancer types.
arXiv Detail & Related papers (2025-06-05T15:14:32Z)
From Pixels to Histopathology: A Graph-Based Framework for Interpretable Whole Slide Image Analysis [81.19923502845441]
We develop a graph-based framework that constructs WSI graph representations. We build tissue representations (nodes) that follow biological boundaries rather than arbitrary patches. In our method's final step, we solve the diagnostic task through a graph attention network.
arXiv Detail & Related papers (2025-03-14T20:15:04Z)
μ-Net: A Deep Learning-Based Architecture for μ-CT Segmentation [2.012378666405002]
X-ray computed microtomography (mu-CT) is a non-destructive technique that can generate high-resolution 3D images of the internal anatomy of medical and biological samples. extracting relevant information from 3D images requires semantic segmentation of the regions of interest. We propose a novel framework that uses a convolutional neural network (CNN) to automatically segment the full morphology of the heart of Carassius auratus.
arXiv Detail & Related papers (2024-06-24T15:29:08Z)
Learning Multimodal Volumetric Features for Large-Scale Neuron Tracing [72.45257414889478]
We aim to reduce human workload by predicting connectivity between over-segmented neuron pieces. We first construct a dataset, named FlyTracing, that contains millions of pairwise connections of segments expanding the whole fly brain. We propose a novel connectivity-aware contrastive learning method to generate dense volumetric EM image embedding.
arXiv Detail & Related papers (2024-01-05T19:45:12Z)
Rotation-Agnostic Image Representation Learning for Digital Pathology [0.8246494848934447]
This paper introduces a fast patch selection method, FPS, for whole-slide image (WSI) analysis. It also presents PathDino, a lightweight histopathology feature extractor with a minimal configuration of five Transformer blocks. We show that our compact model outperforms existing state-of-the-art histopathology-specific vision transformers on 12 diverse datasets.
arXiv Detail & Related papers (2023-11-14T18:01:15Z)
Data-Efficient Vision Transformers for Multi-Label Disease Classification on Chest Radiographs [55.78588835407174]
Vision Transformers (ViTs) have not been applied to this task despite their high classification performance on generic images. ViTs do not rely on convolutions but on patch-based self-attention and in contrast to CNNs, no prior knowledge of local connectivity is present. Our results show that while the performance between ViTs and CNNs is on par with a small benefit for ViTs, DeiTs outperform the former if a reasonably large data set is available for training.
arXiv Detail & Related papers (2022-08-17T09:07:45Z)
Gram Barcodes for Histopathology Tissue Texture Retrieval [0.0]
Histopathology Image Retrieval (HIR) systems search through databases of biopsy images to find similar cases to a given query image. We propose the application of Gram barcodes as image features for HIR systems.
arXiv Detail & Related papers (2021-11-28T17:59:42Z)
Pathological Analysis of Blood Cells Using Deep Learning Techniques [0.0]
A neural based network has been proposed for classification of blood cells images into various categories. The performance of proposed model is better than existing standard architectures and work done by various researchers.
arXiv Detail & Related papers (2021-11-05T05:37:10Z)
A QuadTree Image Representation for Computational Pathology [1.8047694351309205]
Histopathology images are large and need to be split up into image tiles or patches so modern convolutional neural networks (CNNs) can process them. We present a method to generate an interpretable image representation of computational pathology images using quadtrees and a pipeline.
arXiv Detail & Related papers (2021-08-24T17:53:19Z)
Machine Learning Methods for Histopathological Image Analysis: A Review [62.14548392474976]
Histopathological images (HIs) are the gold standard for evaluating some types of tumors for cancer diagnosis. One of the ways of accelerating such an analysis is to use computer-aided diagnosis (CAD) systems.
arXiv Detail & Related papers (2021-02-07T19:12:32Z)
Generative Adversarial U-Net for Domain-free Medical Image Augmentation [49.72048151146307]
The shortage of annotated medical images is one of the biggest challenges in the field of medical image computing. In this paper, we develop a novel generative method named generative adversarial U-Net. Our newly designed model is domain-free and generalizable to various medical images.
arXiv Detail & Related papers (2021-01-12T23:02:26Z)
Deep Low-Shot Learning for Biological Image Classification and Visualization from Limited Training Samples [52.549928980694695]
In situ hybridization (ISH) gene expression pattern images from the same developmental stage are compared. labeling training data with precise stages is very time-consuming even for biologists. We propose a deep two-step low-shot learning framework to accurately classify ISH images using limited training images.
arXiv Detail & Related papers (2020-10-20T06:06:06Z)
Interpretation of 3D CNNs for Brain MRI Data Classification [56.895060189929055]
We extend the previous findings in gender differences from diffusion-tensor imaging on T1 brain MRI scans. We provide the voxel-wise 3D CNN interpretation comparing the results of three interpretation methods.
arXiv Detail & Related papers (2020-06-20T17:56:46Z)

This list is automatically generated from the titles and abstracts of the papers in this site.