HAMIL: Hierarchical Aggregation-Based Multi-Instance Learning for
Microscopy Image Classification
- URL: http://arxiv.org/abs/2103.09764v1
- Date: Wed, 17 Mar 2021 16:34:08 GMT
- Title: HAMIL: Hierarchical Aggregation-Based Multi-Instance Learning for
Microscopy Image Classification
- Authors: Yanlun Tu, Houchao Lei, Wei Long, Yang Yang
- Abstract summary: Multi-instance learning is common for computer vision tasks, especially in biomedical image processing.
In this study, we propose a hierarchical aggregation network for multi-instance learning, called HAMIL.
The hierarchical aggregation protocol enables feature fusion in a defined order, and the simple convolutional aggregation units lead to an efficient and flexible architecture.
- Score: 4.566276053984716
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Multi-instance learning is common for computer vision tasks, especially in
biomedical image processing. Traditional methods for multi-instance learning
focus on designing feature aggregation methods and multi-instance classifiers,
where the aggregation operation is performed either in feature extraction or
learning phase. As deep neural networks (DNNs) achieve great success in image
processing via automatic feature learning, certain feature aggregation
mechanisms need to be incorporated into common DNN architecture for
multi-instance learning. Moreover, flexibility and reliability are crucial
considerations to deal with varying quality and number of instances.
In this study, we propose a hierarchical aggregation network for
multi-instance learning, called HAMIL. The hierarchical aggregation protocol
enables feature fusion in a defined order, and the simple convolutional
aggregation units lead to an efficient and flexible architecture. We assess the
model performance on two microscopy image classification tasks, namely protein
subcellular localization using immunofluorescence images and gene annotation
using spatial gene expression images. The experimental results show that HAMIL
outperforms the state-of-the-art feature aggregation methods and the existing
models for addressing these two tasks. The visualization analyses also
demonstrate the ability of HAMIL to focus on high-quality instances.
Related papers
- Affine-Consistent Transformer for Multi-Class Cell Nuclei Detection [76.11864242047074]
We propose a novel Affine-Consistent Transformer (AC-Former), which directly yields a sequence of nucleus positions.
We introduce an Adaptive Affine Transformer (AAT) module, which can automatically learn the key spatial transformations to warp original images for local network training.
Experimental results demonstrate that the proposed method significantly outperforms existing state-of-the-art algorithms on various benchmarks.
arXiv Detail & Related papers (2023-10-22T02:27:02Z) - Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement
Learning [53.00683059396803]
Mask image model (MIM) has been widely used due to its simplicity and effectiveness in recovering original information from masked images.
We propose a decision-based MIM that utilizes reinforcement learning (RL) to automatically search for optimal image masking ratio and masking strategy.
Our approach has a significant advantage over alternative self-supervised methods on the task of neuron segmentation.
arXiv Detail & Related papers (2023-10-06T10:40:46Z) - Convolutional neural network based on sparse graph attention mechanism
for MRI super-resolution [0.34410212782758043]
Medical image super-resolution (SR) reconstruction using deep learning techniques can enhance lesion analysis and assist doctors in improving diagnostic efficiency and accuracy.
Existing deep learning-based SR methods rely on convolutional neural networks (CNNs), which inherently limit the expressive capabilities of these models.
We propose an A-network that utilizes multiple convolution operator feature extraction modules (MCO) for extracting image features.
arXiv Detail & Related papers (2023-05-29T06:14:22Z) - Multi-domain learning CNN model for microscopy image classification [3.2835754110596236]
We present a multi-domain learning architecture for the classification of microscopy images.
Unlike previous methods that are computationally intensive, we have developed a compact model, called Mobincep.
It surpasses state-of-the-art results and is robust for limited labeled data.
arXiv Detail & Related papers (2023-04-20T19:32:23Z) - Deep Image Clustering with Contrastive Learning and Multi-scale Graph
Convolutional Networks [58.868899595936476]
This paper presents a new deep clustering approach termed image clustering with contrastive learning and multi-scale graph convolutional networks (IcicleGCN)
Experiments on multiple image datasets demonstrate the superior clustering performance of IcicleGCN over the state-of-the-art.
arXiv Detail & Related papers (2022-07-14T19:16:56Z) - Learning multi-scale functional representations of proteins from
single-cell microscopy data [77.34726150561087]
We show that simple convolutional networks trained on localization classification can learn protein representations that encapsulate diverse functional information.
We also propose a robust evaluation strategy to assess quality of protein representations across different scales of biological function.
arXiv Detail & Related papers (2022-05-24T00:00:07Z) - Multi-level Second-order Few-shot Learning [111.0648869396828]
We propose a Multi-level Second-order (MlSo) few-shot learning network for supervised or unsupervised few-shot image classification and few-shot action recognition.
We leverage so-called power-normalized second-order base learner streams combined with features that express multiple levels of visual abstraction.
We demonstrate respectable results on standard datasets such as Omniglot, mini-ImageNet, tiered-ImageNet, Open MIC, fine-grained datasets such as CUB Birds, Stanford Dogs and Cars, and action recognition datasets such as HMDB51, UCF101, and mini-MIT.
arXiv Detail & Related papers (2022-01-15T19:49:00Z) - Microscopic fine-grained instance classification through deep attention [7.50282814989294]
Fine-grained classification of microscopic image data with limited samples is an open problem in computer vision and biomedical imaging.
We propose a simple yet effective deep network that performs two tasks simultaneously in an end-to-end manner.
The result is a robust but lightweight end-to-end trainable deep network that yields state-of-the-art results.
arXiv Detail & Related papers (2020-10-06T15:29:58Z) - A Trainable Optimal Transport Embedding for Feature Aggregation and its
Relationship to Attention [96.77554122595578]
We introduce a parametrized representation of fixed size, which embeds and then aggregates elements from a given input set according to the optimal transport plan between the set and a trainable reference.
Our approach scales to large datasets and allows end-to-end training of the reference, while also providing a simple unsupervised learning mechanism with small computational cost.
arXiv Detail & Related papers (2020-06-22T08:35:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.