Interpretable Data Mining of Follicular Thyroid Cancer Ultrasound Features Using Enhanced Association Rules
- URL: http://arxiv.org/abs/2509.12238v1
- Date: Tue, 09 Sep 2025 03:02:45 GMT
- Title: Interpretable Data Mining of Follicular Thyroid Cancer Ultrasound Features Using Enhanced Association Rules
- Authors: Songlin Zhou, Tao Zhou, Xin Li, Stephen Shing-Toung Yau,
- Abstract summary: We analyzed the clinical data of follicular thyroid cancer based on a novel data mining tool.<n>Combination of Hashimoto's thyroiditis may also have a strong malignant association.
- Score: 11.458809910127627
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Purpose: Thyroid cancer has been a common cancer. Papillary thyroid cancer and follicular thyroid cancer are the two most common types of thyroid cancer. Follicular thyroid cancer lacks distinctive ultrasound signs and is more difficult to diagnose preoperatively than the more prevalent papillary thyroid cancer, and the clinical studies associated with it are less well established. We aimed to analyze the clinical data of follicular thyroid cancer based on a novel data mining tool to identify some clinical indications that may help in preoperative diagnosis. Methods: We performed a retrospective analysis based on case data collected by the Department of General Surgery of Peking University Third Hospital between 2010 and 2023. Unlike traditional statistical methods, we improved the association rule mining, a classical data mining method, and proposed new analytical metrics reflecting the malignant association between clinical indications and cancer with the help of the idea of SHAP method in interpretable machine learning. Results: The dataset was preprocessed to contain 1673 cases (in terms of nodes rather than patients), of which 1414 were benign and 259 were malignant nodes. Our analysis pointed out that in addition to some common indicators (e.g., irregular or lobulated nodal margins, uneven thickness halo, hypoechogenicity), there were also some indicators with strong malignant associations, such as nodule-in-nodule pattern, trabecular pattern, and low TSH scores. In addition, our results suggest that the combination of Hashimoto's thyroiditis may also have a strong malignant association. Conclusion: In the preoperative diagnosis of nodules suspected of follicular thyroid cancer, multiple clinical indications should be considered for a more accurate diagnosis. The diverse malignant associations identified in our study may serve as a reference for clinicians in related fields.
Related papers
- Enhancing Trust in Clinically Significant Prostate Cancer Prediction with Multiple Magnetic Resonance Imaging Modalities [61.36288157482697]
In the United States, prostate cancer is the second leading cause of deaths in males with a predicted 35,250 deaths in 2024.
In this paper, we investigate combining multiple MRI modalities to train a deep learning model to enhance trust in the models for clinically significant prostate cancer prediction.
arXiv Detail & Related papers (2024-11-07T12:48:27Z) - An Explainable AI Model for Predicting the Recurrence of Differentiated Thyroid Cancer [0.0]
This study employs machine learning, particularly deep learning models, to predict the recurrence of thyroid cancer.
By analysing a dataset containing clinicopathological features of patients, the model achieved remarkable accuracy rates of 98% during training and 96% during testing.
arXiv Detail & Related papers (2024-10-13T23:12:33Z) - Advancements in Radiomics and Artificial Intelligence for Thyroid Cancer Diagnosis [1.6687900252134487]
The application of AI and radiomics to thyroid cancer diagnosis is examined in this review.
A review of multiple databases was conducted in compliance with PRISMA guidelines until October 2023.
arXiv Detail & Related papers (2024-04-09T11:05:20Z) - RCdpia: A Renal Carcinoma Digital Pathology Image Annotation dataset based on pathologists [14.79279940958727]
We have compiled the TCGA digital pathological dataset with independent labeling of tumor regions and adjacent areas (RCdpia)
This dataset is now publicly accessible at http://39.171.241.18:8888/RCdpia/.
arXiv Detail & Related papers (2024-03-17T13:23:25Z) - ChatRadio-Valuer: A Chat Large Language Model for Generalizable
Radiology Report Generation Based on Multi-institution and Multi-system Data [115.0747462486285]
ChatRadio-Valuer is a tailored model for automatic radiology report generation that learns generalizable representations.
The clinical dataset utilized in this study encompasses a remarkable total of textbf332,673 observations.
ChatRadio-Valuer consistently outperforms state-of-the-art models, especially ChatGPT (GPT-3.5-Turbo) and GPT-4 et al.
arXiv Detail & Related papers (2023-10-08T17:23:17Z) - A new algorithm for Subgroup Set Discovery based on Information Gain [58.720142291102135]
Information Gained Subgroup Discovery (IGSD) is a new SD algorithm for pattern discovery.
We compare IGSD with two state-of-the-art SD algorithms: FSSD and SSD++.
IGSD provides better OR values than FSSD and SSD++, stating a higher dependence between patterns and targets.
arXiv Detail & Related papers (2023-07-26T21:42:34Z) - A Pathologist-Informed Workflow for Classification of Prostate Glands in
Histopathology [62.997667081978825]
Pathologists diagnose and grade prostate cancer by examining tissue from needle biopsies on glass slides.
Cancer's severity and risk of metastasis are determined by the Gleason grade, a score based on the organization and morphology of prostate cancer glands.
This paper proposes an automated workflow that follows pathologists' textitmodus operandi, isolating and classifying multi-scale patches of individual glands.
arXiv Detail & Related papers (2022-09-27T14:08:19Z) - Less is More: Adaptive Curriculum Learning for Thyroid Nodule Diagnosis [50.231954872304314]
We propose an Adaptive Curriculum Learning framework, which adaptively discovers and discards the samples with inconsistent labels.
We also contribute TNCD: a Thyroid Nodule Classification dataset.
arXiv Detail & Related papers (2022-07-02T11:50:02Z) - Improving The Diagnosis of Thyroid Cancer by Machine Learning and
Clinical Data [3.6985351289638957]
Thyroid cancer is a common endocrine carcinoma that occurs in the thyroid gland.
Current human assessment of thyroid nodule malignancy is prone to errors and may not guarantee an accurate preoperative diagnosis.
This study proposed a machine framework to predict thyroid nodule malignancy based on a novel clinical dataset.
arXiv Detail & Related papers (2022-03-27T17:37:18Z) - Topological Data Analysis of copy number alterations in cancer [70.85487611525896]
We explore the potential to capture information contained in cancer genomic information using a novel topology-based approach.
We find that this technique has the potential to extract meaningful low-dimensional representations in cancer somatic genetic data.
arXiv Detail & Related papers (2020-11-22T17:31:23Z) - A systematic review on the role of artificial intelligence in
sonographic diagnosis of thyroid cancer: Past, present and future [0.6523396727243321]
This review follows a methodology-based classification of the different techniques available for thyroid cancer diagnosis.
We reflect on the trends and challenges of the field of sonographic diagnosis of thyroid malignancies.
Machine learning will continue to play a fundamental role in the development of future thyroid cancer diagnosis frameworks.
arXiv Detail & Related papers (2020-06-10T14:38:05Z) - Segmentation for Classification of Screening Pancreatic Neuroendocrine
Tumors [72.65802386845002]
This work presents comprehensive results to detect in the early stage the pancreatic neuroendocrine tumors (PNETs) in abdominal CT scans.
To the best of our knowledge, this task has not been studied before as a computational task.
Our approach outperforms state-of-the-art segmentation networks and achieves a sensitivity of $89.47%$ at a specificity of $81.08%$.
arXiv Detail & Related papers (2020-04-04T21:21:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.