DSNet: A Dual-Stream Framework for Weakly-Supervised Gigapixel Pathology
Image Analysis
- URL: http://arxiv.org/abs/2109.05788v1
- Date: Mon, 13 Sep 2021 09:10:43 GMT
- Title: DSNet: A Dual-Stream Framework for Weakly-Supervised Gigapixel Pathology
Image Analysis
- Authors: Tiange Xiang, Yang Song, Chaoyi Zhang, Dongnan Liu, Mei Chen, Fan
Zhang, Heng Huang, Lauren O'Donnell, Weidong Cai
- Abstract summary: We present a novel weakly-supervised framework for classifying whole slide images (WSIs)
WSIs are commonly processed by patch-wise classification with patch-level labels.
With image-level labels only, patch-wise classification would be sub-optimal due to inconsistency between the patch appearance and image-level label.
- Score: 78.78181964748144
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present a novel weakly-supervised framework for classifying whole slide
images (WSIs). WSIs, due to their gigapixel resolution, are commonly processed
by patch-wise classification with patch-level labels. However, patch-level
labels require precise annotations, which is expensive and usually unavailable
on clinical data. With image-level labels only, patch-wise classification would
be sub-optimal due to inconsistency between the patch appearance and
image-level label. To address this issue, we posit that WSI analysis can be
effectively conducted by integrating information at both high magnification
(local) and low magnification (regional) levels. We auto-encode the visual
signals in each patch into a latent embedding vector representing local
information, and down-sample the raw WSI to hardware-acceptable thumbnails
representing regional information. The WSI label is then predicted with a
Dual-Stream Network (DSNet), which takes the transformed local patch embeddings
and multi-scale thumbnail images as inputs and can be trained by the
image-level label only. Experiments conducted on two large-scale public
datasets demonstrate that our method outperforms all recent state-of-the-art
weakly-supervised WSI classification methods.
Related papers
- Label Filling via Mixed Supervision for Medical Image Segmentation from Noisy Annotations [22.910649758574852]
We propose a simple yet effective Label Filling framework, termed as LF-Net.
It predicts the groundtruth segmentation label given only noisy annotations during training.
Results on five datasets show that our LF-Net boosts segmentation accuracy in all datasets compared with state-of-the-art methods.
arXiv Detail & Related papers (2024-10-21T14:36:36Z) - FAST: A Dual-tier Few-Shot Learning Paradigm for Whole Slide Image Classification [23.323845050957196]
Existing few-shot WSI classification methods only utilize a small number of fine-grained labels or weakly supervised slide labels for training.
They lack sufficient mining of available WSIs, severely limiting WSI classification performance.
We propose a novel and efficient dual-tier few-shot learning paradigm for WSI classification, named FAST.
arXiv Detail & Related papers (2024-09-29T14:31:52Z) - Distilling Self-Supervised Vision Transformers for Weakly-Supervised
Few-Shot Classification & Segmentation [58.03255076119459]
We address the task of weakly-supervised few-shot image classification and segmentation, by leveraging a Vision Transformer (ViT)
Our proposed method takes token representations from the self-supervised ViT and leverages their correlations, via self-attention, to produce classification and segmentation predictions.
Experiments on Pascal-5i and COCO-20i demonstrate significant performance gains in a variety of supervision settings.
arXiv Detail & Related papers (2023-07-07T06:16:43Z) - Weakly Supervised Joint Whole-Slide Segmentation and Classification in
Prostate Cancer [8.790852468118208]
WholeSIGHT is a weakly-supervised method to segment and classify Whole-Slide images.
We evaluated WholeSIGHT on three public prostate cancer WSI datasets.
arXiv Detail & Related papers (2023-01-07T20:38:36Z) - Dual-Perspective Semantic-Aware Representation Blending for Multi-Label
Image Recognition with Partial Labels [70.36722026729859]
We propose a dual-perspective semantic-aware representation blending (DSRB) that blends multi-granularity category-specific semantic representation across different images.
The proposed DS consistently outperforms current state-of-the-art algorithms on all proportion label settings.
arXiv Detail & Related papers (2022-05-26T00:33:44Z) - Learning Self-Supervised Low-Rank Network for Single-Stage Weakly and
Semi-Supervised Semantic Segmentation [119.009033745244]
This paper presents a Self-supervised Low-Rank Network ( SLRNet) for single-stage weakly supervised semantic segmentation (WSSS) and semi-supervised semantic segmentation (SSSS)
SLRNet uses cross-view self-supervision, that is, it simultaneously predicts several attentive LR representations from different views of an image to learn precise pseudo-labels.
Experiments on the Pascal VOC 2012, COCO, and L2ID datasets demonstrate that our SLRNet outperforms both state-of-the-art WSSS and SSSS methods with a variety of different settings.
arXiv Detail & Related papers (2022-03-19T09:19:55Z) - Semantic-Aware Representation Blending for Multi-Label Image Recognition
with Partial Labels [86.17081952197788]
We propose to blend category-specific representation across different images to transfer information of known labels to complement unknown labels.
Experiments on the MS-COCO, Visual Genome, Pascal VOC 2007 datasets show that the proposed SARB framework obtains superior performance over current leading competitors.
arXiv Detail & Related papers (2022-03-04T07:56:16Z) - Semi-weakly Supervised Contrastive Representation Learning for Retinal
Fundus Images [0.2538209532048867]
We propose a semi-weakly supervised contrastive learning framework for representation learning using semi-weakly annotated images.
We empirically validate the transfer learning performance of SWCL on seven public retinal fundus datasets.
arXiv Detail & Related papers (2021-08-04T15:50:09Z) - Mixed Supervision Learning for Whole Slide Image Classification [88.31842052998319]
We propose a mixed supervision learning framework for super high-resolution images.
During the patch training stage, this framework can make use of coarse image-level labels to refine self-supervised learning.
A comprehensive strategy is proposed to suppress pixel-level false positives and false negatives.
arXiv Detail & Related papers (2021-07-02T09:46:06Z) - Pay Attention with Focus: A Novel Learning Scheme for Classification of
Whole Slide Images [8.416553728391309]
We propose a novel two-stage approach to analyze whole slide images (WSIs)
First, we extract a set of representative patches (called mosaic) from a WSI.
Each patch of a mosaic is encoded to a feature vector using a deep network.
In the second stage, a set of encoded patch-level features from a WSI is used to compute the primary diagnosis probability.
arXiv Detail & Related papers (2021-06-11T21:59:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.