A Personalized Diagnostic Generation Framework Based on Multi-source
Heterogeneous Data
- URL: http://arxiv.org/abs/2110.13677v1
- Date: Tue, 26 Oct 2021 13:12:52 GMT
- Title: A Personalized Diagnostic Generation Framework Based on Multi-source
Heterogeneous Data
- Authors: Jialun Wu, Zeyu Gao, Haichuan Zhang, Ruonan Zhang, Tieliang Gong,
Chunbao Wang, and Chen Li
- Abstract summary: We propose a framework that combines pathological images and medical reports to generate a personalized diagnosis result for individual patient.
We use nuclei-level image feature similarity and content-based deep learning method to search for a personalized group of population with similar pathological characteristics.
- Score: 8.115713756776119
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Personalized diagnoses have not been possible due to sear amount of data
pathologists have to bear during the day-to-day routine. This lead to the
current generalized standards that are being continuously updated as new
findings are reported. It is noticeable that these effective standards are
developed based on a multi-source heterogeneous data, including whole-slide
images and pathology and clinical reports. In this study, we propose a
framework that combines pathological images and medical reports to generate a
personalized diagnosis result for individual patient. We use nuclei-level image
feature similarity and content-based deep learning method to search for a
personalized group of population with similar pathological characteristics,
extract structured prognostic information from descriptive pathology reports of
the similar patient population, and assign importance of different prognostic
factors to generate a personalized pathological diagnosis result. We use
multi-source heterogeneous data from TCGA (The Cancer Genome Atlas) database.
The result demonstrate that our framework matches the performance of
pathologists in the diagnosis of renal cell carcinoma. This framework is
designed to be generic, thus could be applied for other types of cancer. The
weights could provide insights to the known prognostic factors and further
guide more precise clinical treatment protocols.
Related papers
- A Lung Nodule Dataset with Histopathology-based Cancer Type Annotation [12.617587827105496]
This research aims to bridge the gap by providing publicly accessible datasets and reliable tools for medical diagnosis.
We curated a diverse dataset of lung Computed Tomography (CT) images, comprising 330 annotated nodules (nodules are labeled as bounding boxes) from 95 distinct patients.
These promising results demonstrate that the dataset has a feasible application and further facilitate intelligent auxiliary diagnosis.
arXiv Detail & Related papers (2024-06-26T06:39:11Z) - Radiology Report Generation Using Transformers Conditioned with
Non-imaging Data [55.17268696112258]
This paper proposes a novel multi-modal transformer network that integrates chest x-ray (CXR) images and associated patient demographic information.
The proposed network uses a convolutional neural network to extract visual features from CXRs and a transformer-based encoder-decoder network that combines the visual features with semantic text embeddings of patient demographic information.
arXiv Detail & Related papers (2023-11-18T14:52:26Z) - Hierarchical Classification System for Breast Cancer Specimen Report
(HCSBC) -- an end-to-end model for characterizing severity and diagnosis [3.4454444815042735]
We develop a hierarchical hybrid transformer-based pipeline (59 labels) - Hierarchical Classification System for Breast Cancer Specimen Report (HCSBC)
We trained the model on the EUH data and evaluated our model's performance on two external datasets - MGH and Mayo Clinic.
arXiv Detail & Related papers (2023-11-02T18:37:45Z) - An end-to-end framework for gene expression classification by
integrating a background knowledge graph: application to cancer prognosis
prediction [1.5484595752241122]
We proposed an end-to-end framework to handle secondary data to construct a classification model for primary data.
We applied this framework to cancer prognosis prediction using gene expression data and a biological network.
arXiv Detail & Related papers (2023-06-29T11:20:47Z) - Data and Knowledge Co-driving for Cancer Subtype Classification on
Multi-Scale Histopathological Slides [4.22412600279685]
We propose a Data and Knowledge Co-driving (D&K) model to replicate the process of cancer subtype classification on a histological slide like a pathologist.
Specifically, in the data-driven module, the bagging mechanism in ensemble learning is leveraged to integrate the histological features from various bags extracted by the embedding representation unit.
arXiv Detail & Related papers (2023-04-18T21:57:37Z) - Variational Knowledge Distillation for Disease Classification in Chest
X-Rays [102.04931207504173]
We propose itvariational knowledge distillation (VKD), which is a new probabilistic inference framework for disease classification based on X-rays.
We demonstrate the effectiveness of our method on three public benchmark datasets with paired X-ray images and EHRs.
arXiv Detail & Related papers (2021-03-19T14:13:56Z) - G-MIND: An End-to-End Multimodal Imaging-Genetics Framework for
Biomarker Identification and Disease Classification [49.53651166356737]
We propose a novel deep neural network architecture to integrate imaging and genetics data, as guided by diagnosis, that provides interpretable biomarkers.
We have evaluated our model on a population study of schizophrenia that includes two functional MRI (fMRI) paradigms and Single Nucleotide Polymorphism (SNP) data.
arXiv Detail & Related papers (2021-01-27T19:28:04Z) - Topological Data Analysis of copy number alterations in cancer [70.85487611525896]
We explore the potential to capture information contained in cancer genomic information using a novel topology-based approach.
We find that this technique has the potential to extract meaningful low-dimensional representations in cancer somatic genetic data.
arXiv Detail & Related papers (2020-11-22T17:31:23Z) - Trajectories, bifurcations and pseudotime in large clinical datasets:
applications to myocardial infarction and diabetes data [94.37521840642141]
We suggest a semi-supervised methodology for the analysis of large clinical datasets, characterized by mixed data types and missing values.
The methodology is based on application of elastic principal graphs which can address simultaneously the tasks of dimensionality reduction, data visualization, clustering, feature selection and quantifying the geodesic distances (pseudotime) in partially ordered sequences of observations.
arXiv Detail & Related papers (2020-07-07T21:04:55Z) - Semi-supervised Medical Image Classification with Relation-driven
Self-ensembling Model [71.80319052891817]
We present a relation-driven semi-supervised framework for medical image classification.
It exploits the unlabeled data by encouraging the prediction consistency of given input under perturbations.
Our method outperforms many state-of-the-art semi-supervised learning methods on both single-label and multi-label image classification scenarios.
arXiv Detail & Related papers (2020-05-15T06:57:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.