Semantics-Aware Attention Guidance for Diagnosing Whole Slide Images
- URL: http://arxiv.org/abs/2404.10894v1
- Date: Tue, 16 Apr 2024 20:37:14 GMT
- Title: Semantics-Aware Attention Guidance for Diagnosing Whole Slide Images
- Authors: Kechun Liu, Wenjun Wu, Joann G. Elmore, Linda G. Shapiro
- Abstract summary: We introduce a novel framework named Semantics-Aware Attention Guidance (SAG).
SAG includes 1) a technique for converting diagnostically relevant entities into attention signals, and 2) a flexible attention loss that efficiently integrates semantically significant information.
Our experiments on two distinct cancer datasets demonstrate consistent improvements in accuracy, precision, and recall.
- Score: 5.856390270089738
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurate cancer diagnosis remains a critical challenge in digital pathology, largely due to the gigapixel size and complex spatial relationships present in whole slide images. Traditional multiple instance learning (MIL) methods often struggle with these intricacies, especially in preserving the necessary context for accurate diagnosis. In response, we introduce a novel framework named Semantics-Aware Attention Guidance (SAG), which includes 1) a technique for converting diagnostically relevant entities into attention signals, and 2) a flexible attention loss that efficiently integrates various semantically significant information, such as tissue anatomy and cancerous regions. Our experiments on two distinct cancer datasets demonstrate consistent improvements in accuracy, precision, and recall with two state-of-the-art baseline models. Qualitative analysis further reveals that the incorporation of heuristic guidance enables the model to focus on regions critical for diagnosis. SAG is not only effective for the models discussed here, but its adaptability extends to any attention-based diagnostic model. This opens up exciting possibilities for further improving the accuracy and efficiency of cancer diagnostics.
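The abstract does not give implementation details, but a minimal sketch can make the two SAG components concrete. The sketch below assumes an attention-based MIL model that outputs per-patch attention logits, treats per-patch semantic relevance scores (e.g., overlap with annotated cancerous regions or tissue types) as the "attention signal", and expresses the "attention loss" as a KL-divergence term added to the classification loss. All function names, the KL form, and the weighting are illustrative assumptions, not the authors' code.

```python
# Hedged sketch of semantics-aware attention guidance for an attention-based
# MIL model. The KL-divergence guidance term and the weighting scheme are
# assumptions for illustration only.
import torch
import torch.nn.functional as F

def semantic_attention_targets(patch_relevance: torch.Tensor, eps: float = 1e-8) -> torch.Tensor:
    """Convert per-patch semantic relevance scores (e.g., fraction of each patch
    covered by a cancerous region) into a target attention distribution."""
    relevance = patch_relevance.clamp(min=0.0)              # (num_patches,)
    return (relevance + eps) / (relevance + eps).sum()

def attention_guidance_loss(attn_logits: torch.Tensor, target_attn: torch.Tensor) -> torch.Tensor:
    """KL divergence between the model's attention distribution and the
    semantics-derived target distribution (one plausible 'attention loss')."""
    log_attn = F.log_softmax(attn_logits, dim=-1)           # (num_patches,)
    return F.kl_div(log_attn, target_attn, reduction="sum")

def total_loss(class_logits, label, attn_logits, patch_relevance, lam: float = 0.5):
    """Slide-level classification loss plus the weighted guidance term."""
    cls = F.cross_entropy(class_logits.unsqueeze(0), label.view(1))
    guide = attention_guidance_loss(attn_logits, semantic_attention_targets(patch_relevance))
    return cls + lam * guide
```

In this reading, "flexible" simply means the guidance term can take relevance maps from any semantic source (tissue anatomy, tumor masks) without changing the underlying MIL architecture.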
Related papers
- Potential of Multimodal Large Language Models for Data Mining of Medical Images and Free-text Reports [51.45762396192655]
Multimodal large language models (MLLMs) have recently transformed many domains, significantly affecting the medical field. Notably, Gemini-Vision-series (Gemini) and GPT-4-series (GPT-4) models have epitomized a paradigm shift in Artificial General Intelligence for computer vision.
This study exhaustively evaluated the performance of Gemini, GPT-4, and 4 other popular large models across 14 medical imaging datasets.
arXiv Detail & Related papers (2024-07-08T09:08:42Z)
- Advancing Histopathology-Based Breast Cancer Diagnosis: Insights into Multi-Modality and Explainability [2.8145472964232137]
Multi-modal techniques that integrate both image and non-image data mark a transformative advancement in breast cancer diagnosis.
This review examines how multi-modal data and explainability can enhance diagnostic accuracy, clinician confidence, and patient engagement.
arXiv Detail & Related papers (2024-06-07T19:23:22Z)
- Low-Resolution Chest X-ray Classification via Knowledge Distillation and Multi-task Learning [46.75992018094998]
This research addresses the challenges of diagnosing chest X-rays (CXRs) at low resolutions.
High-resolution CXR imaging is crucial for identifying small but critical anomalies, such as nodules or opacities.
This paper presents the Multilevel Collaborative Attention Knowledge (MLCAK) method.
arXiv Detail & Related papers (2024-05-22T06:10:54Z)
- Cross-Modal Domain Adaptation in Brain Disease Diagnosis: Maximum Mean Discrepancy-based Convolutional Neural Networks [0.0]
Brain disorders are a major challenge to global health, causing millions of deaths each year.
Accurate diagnosis of these diseases relies heavily on advanced medical imaging techniques such as MRI and CT.
The scarcity of annotated data poses a significant challenge in deploying machine learning models for medical diagnosis.
arXiv Detail & Related papers (2024-05-06T07:44:46Z)
- Deformable MRI Sequence Registration for AI-based Prostate Cancer Diagnosis [2.102189448685959]
The PI-CAI (Prostate Imaging: Cancer AI) challenge led to expert-level diagnostic algorithms for clinically significant prostate cancer detection.
These algorithms receive biparametric MRI scans as input, which consist of T2-weighted and diffusion-weighted scans.
These scans can be misaligned due to multiple factors in the scanning process.
Image registration can alleviate this issue by predicting the deformation between the sequences.
arXiv Detail & Related papers (2024-04-15T10:57:16Z)
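The registration entry above notes that misalignment is alleviated by predicting the deformation between sequences. As a minimal, hedged illustration of the resampling step only (not the challenge pipeline or any specific registration network), the sketch below warps a moving volume with an already-predicted dense displacement field; the array shapes and helper name are assumptions.

```python
# Apply a predicted dense deformation field to warp a diffusion-weighted
# volume into T2-weighted space, assuming the field was estimated elsewhere.
import numpy as np
from scipy.ndimage import map_coordinates

def warp_with_displacement(moving: np.ndarray, displacement: np.ndarray) -> np.ndarray:
    """moving: (D, H, W) volume; displacement: (3, D, H, W) voxel offsets.
    Returns the moving volume resampled at x + u(x)."""
    grid = np.stack(np.meshgrid(*[np.arange(s) for s in moving.shape], indexing="ij"))
    coords = grid + displacement                  # sample locations in the moving volume
    return map_coordinates(moving, coords, order=1, mode="nearest")

# Sanity check: an identity (zero) field leaves the volume unchanged.
vol = np.random.rand(8, 64, 64).astype(np.float32)
zero_field = np.zeros((3, 8, 64, 64), dtype=np.float32)
assert np.allclose(warp_with_displacement(vol, zero_field), vol, atol=1e-5)
```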
- SeATrans: Learning Segmentation-Assisted Diagnosis Model via Transformer [13.63128987400635]
We propose the Segmentation-Assisted diagnosis Transformer (SeATrans) to transfer segmentation knowledge to the disease diagnosis network.
We first propose an asymmetric multi-scale interaction strategy to correlate each single low-level diagnosis feature with multi-scale segmentation features.
To model the segmentation-diagnosis interaction, SeA-block first embeds the diagnosis feature based on the segmentation information via the encoder, and then transfers the embedding back to the diagnosis feature space by a decoder.
arXiv Detail & Related papers (2022-06-12T15:10:33Z)
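The SeATrans summary above describes a SeA-block that embeds the diagnosis feature using segmentation information via an encoder and maps the embedding back with a decoder. The sketch below is one hedged reading of that interaction using cross-attention; the module name, dimensions, and residual connection are illustrative assumptions rather than the authors' implementation.

```python
# One plausible reading of a segmentation-diagnosis interaction block:
# cross-attention encodes diagnosis tokens against segmentation tokens,
# then a small decoder projects the embedding back to the diagnosis space.
import torch
import torch.nn as nn

class SeABlockSketch(nn.Module):
    def __init__(self, diag_dim: int = 256, seg_dim: int = 256, embed_dim: int = 256, heads: int = 4):
        super().__init__()
        self.q_proj = nn.Linear(diag_dim, embed_dim)
        self.kv_proj = nn.Linear(seg_dim, embed_dim)
        self.encoder = nn.MultiheadAttention(embed_dim, heads, batch_first=True)
        self.decoder = nn.Sequential(nn.Linear(embed_dim, diag_dim), nn.GELU(),
                                     nn.Linear(diag_dim, diag_dim))

    def forward(self, diag_tokens: torch.Tensor, seg_tokens: torch.Tensor) -> torch.Tensor:
        # diag_tokens: (B, N, diag_dim) diagnosis features (queries)
        # seg_tokens:  (B, M, seg_dim) multi-scale segmentation features (keys/values)
        q = self.q_proj(diag_tokens)
        kv = self.kv_proj(seg_tokens)
        embedded, _ = self.encoder(q, kv, kv)       # embed diagnosis features w.r.t. segmentation
        return diag_tokens + self.decoder(embedded) # transfer back with a residual connection
```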
- BI-RADS-Net: An Explainable Multitask Learning Approach for Cancer Diagnosis in Breast Ultrasound Images [69.41441138140895]
This paper introduces BI-RADS-Net, a novel explainable deep learning approach for cancer detection in breast ultrasound images.
The proposed approach incorporates tasks for explaining and classifying breast tumors, by learning feature representations relevant to clinical diagnosis.
Explanations of the predictions (benign or malignant) are provided in terms of morphological features that are used by clinicians for diagnosis and reporting in medical practice.
arXiv Detail & Related papers (2021-10-05T19:14:46Z)
- Act Like a Radiologist: Towards Reliable Multi-view Correspondence Reasoning for Mammogram Mass Detection [49.14070210387509]
We propose an Anatomy-aware Graph convolutional Network (AGN) for mammogram mass detection.
AGN is tailored for mammogram mass detection and endows existing detection methods with multi-view reasoning ability.
Experiments on two standard benchmarks reveal that AGN significantly exceeds the state-of-the-art performance.
arXiv Detail & Related papers (2021-05-21T06:48:34Z)
- Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for Thoracic Disease Identification [83.6017225363714]
Deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z)
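The entry above names K-nearest neighbor smoothing without detailing it. As a generic, hedged illustration of the idea (not the paper's exact procedure), the sketch below averages each image's predicted probabilities with those of its K nearest neighbors in feature space; the function name, neighbor count, and uniform averaging rule are assumptions.

```python
# Generic K-nearest-neighbor smoothing of per-image predictions.
import numpy as np
from sklearn.neighbors import NearestNeighbors

def knn_smooth(features: np.ndarray, probs: np.ndarray, k: int = 5) -> np.ndarray:
    """features: (N, D) image embeddings; probs: (N, C) predicted probabilities."""
    nn_index = NearestNeighbors(n_neighbors=k + 1).fit(features)  # +1 includes the query itself
    _, idx = nn_index.kneighbors(features)
    return probs[idx].mean(axis=1)                # average each image with its neighbors

# Sanity check: a perfectly homogeneous neighborhood is left unchanged.
feats = np.random.rand(100, 32)
preds = np.full((100, 3), 1.0 / 3)
assert np.allclose(knn_smooth(feats, preds), preds)
```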
- Spatio-spectral Deep Learning Methods for In-vivo Hyperspectral Laryngeal Cancer Detection [49.32653090178743]
Early detection of head and neck tumors is crucial for patient survival.
Hyperspectral imaging (HSI) can be used for non-invasive detection of head and neck tumors.
We present multiple deep learning techniques for in-vivo laryngeal cancer detection based on HSI.
arXiv Detail & Related papers (2020-04-21T17:07:18Z)
This list is automatically generated from the titles and abstracts of the papers on this site.
This site does not guarantee the quality of the listed information and is not responsible for any consequences.