A Novel Multi-Task Model Imitating Dermatologists for Accurate
Differential Diagnosis of Skin Diseases in Clinical Images
- URL: http://arxiv.org/abs/2307.08308v1
- Date: Mon, 17 Jul 2023 08:05:30 GMT
- Title: A Novel Multi-Task Model Imitating Dermatologists for Accurate
Differential Diagnosis of Skin Diseases in Clinical Images
- Authors: Yan-Jie Zhou, Wei Liu, Yuan Gao, Jing Xu, Le Lu, Yuping Duan, Hao
Cheng, Na Jin, Xiaoyong Man, Shuang Zhao, Yu Wang
- Abstract summary: A novel multi-task model, namely DermImitFormer, is proposed to fill this gap by imitating dermatologists' diagnostic procedures and strategies.
The model simultaneously predicts body parts and lesion attributes in addition to the disease itself, enhancing diagnosis accuracy and improving diagnosis interpretability.
- Score: 27.546559936765863
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Skin diseases are among the most prevalent health issues, and accurate
computer-aided diagnosis methods are of importance for both dermatologists and
patients. However, most of the existing methods overlook the essential domain
knowledge required for skin disease diagnosis. A novel multi-task model, namely
DermImitFormer, is proposed to fill this gap by imitating dermatologists'
diagnostic procedures and strategies. Through multi-task learning, the model
simultaneously predicts body parts and lesion attributes in addition to the
disease itself, enhancing diagnosis accuracy and improving diagnosis
interpretability. The designed lesion selection module mimics dermatologists'
zoom-in action, effectively highlighting the local lesion features from noisy
backgrounds. Additionally, the presented cross-interaction module explicitly
models the complicated diagnostic reasoning between body parts, lesion
attributes, and diseases. To provide a more robust evaluation of the proposed
method, a large-scale clinical image dataset of skin diseases with
significantly more cases than existing datasets has been established. Extensive
experiments on three different datasets consistently demonstrate the
state-of-the-art recognition performance of the proposed approach.
Related papers
- A Multimodal Approach to The Detection and Classification of Skin Diseases [0.5755004576310334]
Many diseases are left undiagnosed and untreated, even if the disease shows many physical symptoms on the skin.
With the rise of AI, self-diagnosis and improved disease recognition have become more promising than ever.
This study incorporates readily available and easily accessible patient information via image and text for skin disease classification.
arXiv Detail & Related papers (2024-11-21T05:27:42Z) - Anatomy-Guided Radiology Report Generation with Pathology-Aware Regional Prompts [3.1019279528120363]
Radiology reporting generative AI holds significant potential to alleviate clinical workloads and streamline medical care.
Existing systems often fall short due to their reliance on fixed size, patch-level image features and insufficient incorporation of pathological information.
We propose an innovative approach that leverages pathology-aware regional prompts to explicitly integrate anatomical and pathological information of various scales.
arXiv Detail & Related papers (2024-11-16T12:36:20Z) - A General-Purpose Multimodal Foundation Model for Dermatology [14.114262475562846]
PanDerm is a multimodal dermatology foundation model pretrained through self-supervised learning on a dataset of over 2 million real-world images of skin diseases.
PanDerm achieved state-of-the-art performance across all evaluated tasks.
PanDerm could enhance the management of skin diseases and serve as a model for developing multimodal foundation models in other medical specialties.
arXiv Detail & Related papers (2024-10-19T08:48:01Z) - Potential of Multimodal Large Language Models for Data Mining of Medical Images and Free-text Reports [51.45762396192655]
Multimodal large language models (MLLMs) have recently transformed many domains, significantly affecting the medical field. Notably, Gemini-Vision-series (Gemini) and GPT-4-series (GPT-4) models have epitomized a paradigm shift in Artificial General Intelligence for computer vision.
This study evaluated the performance of the Gemini, GPT-4, and 4 popular large models for an exhaustive evaluation across 14 medical imaging datasets.
arXiv Detail & Related papers (2024-07-08T09:08:42Z) - A Clinical-oriented Multi-level Contrastive Learning Method for Disease Diagnosis in Low-quality Medical Images [4.576524795036682]
Disease diagnosis methods guided by contrastive learning (CL) have shown significant advantages in lesion feature representation.
We propose a clinical-oriented multi-level CL framework that aims to enhance the model's capacity to extract lesion features.
The proposed CL framework is validated on two public medical image datasets, EyeQ and Chest X-ray.
arXiv Detail & Related papers (2024-04-07T09:08:14Z) - Optimizing Skin Lesion Classification via Multimodal Data and Auxiliary
Task Integration [54.76511683427566]
This research introduces a novel multimodal method for classifying skin lesions, integrating smartphone-captured images with essential clinical and demographic information.
A distinctive aspect of this method is the integration of an auxiliary task focused on super-resolution image prediction.
The experimental evaluations have been conducted using the PAD-UFES20 dataset, applying various deep-learning architectures.
arXiv Detail & Related papers (2024-02-16T05:16:20Z) - A Transformer-based representation-learning model with unified
processing of multimodal input for clinical diagnostics [63.106382317917344]
We report a Transformer-based representation-learning model as a clinical diagnostic aid that processes multimodal input in a unified manner.
The unified model outperformed an image-only model and non-unified multimodal diagnosis models in the identification of pulmonary diseases.
arXiv Detail & Related papers (2023-06-01T16:23:47Z) - Malignancy Prediction and Lesion Identification from Clinical
Dermatological Images [65.1629311281062]
We consider machine-learning-based malignancy prediction and lesion identification from clinical dermatological images.
We first identify all lesions present in the image regardless of sub-type or likelihood of malignancy, then it estimates their likelihood of malignancy, and through aggregation, it also generates an image-level likelihood of malignancy.
arXiv Detail & Related papers (2021-04-02T20:52:05Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - Inheritance-guided Hierarchical Assignment for Clinical Automatic
Diagnosis [50.15205065710629]
Clinical diagnosis, which aims to assign diagnosis codes for a patient based on the clinical note, plays an essential role in clinical decision-making.
We propose a novel framework to combine the inheritance-guided hierarchical assignment and co-occurrence graph propagation for clinical automatic diagnosis.
arXiv Detail & Related papers (2021-01-27T13:16:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.