Extracting Thyroid Nodules Characteristics from Ultrasound Reports Using
Transformer-based Natural Language Processing Methods
- URL: http://arxiv.org/abs/2304.00115v1
- Date: Fri, 31 Mar 2023 20:23:58 GMT
- Title: Extracting Thyroid Nodules Characteristics from Ultrasound Reports Using
Transformer-based Natural Language Processing Methods
- Authors: Aman Pathak, Zehao Yu, Daniel Paredes, Elio Paul Monsour, Andrea Ortiz
Rocha, Juan P. Brito, Naykky Singh Ospina, Yonghui Wu
- Abstract summary: The characteristics of thyroid nodules are often documented in clinical narratives such as ultrasound reports.
To the best of our knowledge, this is the first study to systematically categorize and apply transformer-based NLP models to extract a large number of clinical relevant thyroid nodule characteristics from ultrasound reports.
- Score: 22.35979441935564
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The ultrasound characteristics of thyroid nodules guide the evaluation of
thyroid cancer in patients with thyroid nodules. However, the characteristics
of thyroid nodules are often documented in clinical narratives such as
ultrasound reports. Previous studies have examined natural language processing
(NLP) methods in extracting a limited number of characteristics (<9) using
rule-based NLP systems. In this study, a multidisciplinary team of NLP experts
and thyroid specialists, identified thyroid nodule characteristics that are
important for clinical care, composed annotation guidelines, developed a
corpus, and compared 5 state-of-the-art transformer-based NLP methods,
including BERT, RoBERTa, LongFormer, DeBERTa, and GatorTron, for extraction of
thyroid nodule characteristics from ultrasound reports. Our GatorTron model, a
transformer-based large language model trained using over 90 billion words of
text, achieved the best strict and lenient F1-score of 0.8851 and 0.9495 for
the extraction of a total number of 16 thyroid nodule characteristics, and
0.9321 for linking characteristics to nodules, outperforming other clinical
transformer models. To the best of our knowledge, this is the first study to
systematically categorize and apply transformer-based NLP models to extract a
large number of clinical relevant thyroid nodule characteristics from
ultrasound reports. This study lays ground for assessing the documentation
quality of thyroid ultrasound reports and examining outcomes of patients with
thyroid nodules using electronic health records.
Related papers
- Nodule-DETR: A Novel DETR Architecture with Frequency-Channel Attention for Ultrasound Thyroid Nodule Detection [11.368372526576879]
Thyroid cancer is the most common endocrine malignancy, and its incidence is rising globally.<n>We propose Nodule-DETR, a novel detection transformer (DETR) architecture designed for robust thyroid nodule detection in ultrasound images.<n>Nodule-DETR introduces three key innovations: a Multi-Spectral Frequency-domain Channel Attention (MSFCA) module that leverages frequency analysis to enhance features of low-contrast nodules; a Hierarchical Feature Fusion (HFF) module for efficient multi-scale integration, and Multi-Scale Deformable Attention (MSDA) to flexibly capture small and irregularly shaped nodules.
arXiv Detail & Related papers (2026-01-05T08:53:04Z) - Artificial Intelligence-Enabled Analysis of Radiology Reports: Epidemiology and Consequences of Incidental Thyroid Findings [1.6872046521957624]
Incidental thyroid findings (ITFs) are increasingly detected on imaging performed for non-thyroid indications.<n>Aim is to develop, validate, and deploy a natural language processing pipeline to identify ITFs in radiology reports.<n>Retrospective cohort of adults without prior thyroid disease undergoing thyroid-capturing imaging at Mayo Clinic sites.
arXiv Detail & Related papers (2025-10-30T00:15:07Z) - Accurate Thyroid Cancer Classification using a Novel Binary Pattern Driven Local Discrete Cosine Transform Descriptor [3.663197678470621]
We develop a new CAD system for accurate thyroid cancer classification with emphasis on feature extraction.<n>We term our novel descriptor as Binary Pattern Driven Local Discrete Cosine Transform (BPD-LDCT)
arXiv Detail & Related papers (2025-09-19T19:54:04Z) - Systematic Review of Pituitary Gland and Pituitary Adenoma Automatic Segmentation Techniques in Magnetic Resonance Imaging [40.16592757754337]
We reviewed 34 studies that employed automatic and semi-automatic segmentation methods.<n>The majority of reviewed studies utilized deep learning approaches, with U-Net-based models being the most prevalent.<n>Further improvements are needed to achieve consistently good performance in small structures like the normal pituitary gland.
arXiv Detail & Related papers (2025-06-24T17:05:01Z) - Semi-Supervised Learning for Dose Prediction in Targeted Radionuclide: A Synthetic Data Study [44.18078370879369]
Targeted Radionuclide Therapy (TRT) is a modern strategy in radiation oncology that aims to administer a potent radiation dose specifically to cancer cells.<n>Deep learning holds promise for personalizing TRT doses.<n>Current methods require large time series of SPECT imaging, which is hardly achievable in routine clinical practice.
arXiv Detail & Related papers (2025-03-07T12:21:09Z) - TopoTxR: A topology-guided deep convolutional network for breast parenchyma learning on DCE-MRIs [49.69047720285225]
We propose a novel topological approach that explicitly extracts multi-scale topological structures to better approximate breast parenchymal structures.
We empirically validate emphTopoTxR using the VICTRE phantom breast dataset.
Our qualitative and quantitative analyses suggest differential topological behavior of breast tissue in treatment-na"ive imaging.
arXiv Detail & Related papers (2024-11-05T19:35:10Z) - Reducing Overtreatment of Indeterminate Thyroid Nodules Using a Multimodal Deep Learning Model [3.4812887520451117]
Molecular testing (MT) classifies cytologically indeterminate thyroid nodules as benign or malignant with high sensitivity but low positive predictive value (PPV)
We address this limitation by applying attention multiple instance learning (AMIL) to US images.
A multi-modal deep learning AMIL model was developed, combining US images and MT to classify the nodules as benign or malignant.
arXiv Detail & Related papers (2024-09-27T22:38:03Z) - Use of natural language processing to extract and classify papillary thyroid cancer features from surgical pathology reports [9.200141008020484]
We analyzed 1,410 surgical pathology reports from adult papillary thyroid cancer patients at Mayo Clinic, Rochester, MN.
We developed ThyroPath, a rule-based NLP pipeline, to extract and classify thyroid cancer features into risk categories.
arXiv Detail & Related papers (2024-05-22T22:27:12Z) - AI-Generated Content Enhanced Computer-Aided Diagnosis Model for Thyroid
Nodules: A ChatGPT-Style Assistant [36.02145235227822]
An artificial intelligence-generated computer-aided diagnosis (AIGC-CAD) model, designated as ThyGPT, has been developed.
This model, inspired by the architecture of ChatGPT, could assist radiologists in assessing the risk of thyroid nodules through semantic-level human-machine interaction.
arXiv Detail & Related papers (2024-02-04T08:24:13Z) - ChatRadio-Valuer: A Chat Large Language Model for Generalizable
Radiology Report Generation Based on Multi-institution and Multi-system Data [115.0747462486285]
ChatRadio-Valuer is a tailored model for automatic radiology report generation that learns generalizable representations.
The clinical dataset utilized in this study encompasses a remarkable total of textbf332,673 observations.
ChatRadio-Valuer consistently outperforms state-of-the-art models, especially ChatGPT (GPT-3.5-Turbo) and GPT-4 et al.
arXiv Detail & Related papers (2023-10-08T17:23:17Z) - Less is More: Adaptive Curriculum Learning for Thyroid Nodule Diagnosis [50.231954872304314]
We propose an Adaptive Curriculum Learning framework, which adaptively discovers and discards the samples with inconsistent labels.
We also contribute TNCD: a Thyroid Nodule Classification dataset.
arXiv Detail & Related papers (2022-07-02T11:50:02Z) - Key-frame Guided Network for Thyroid Nodule Recognition using Ultrasound
Videos [13.765306481109988]
This paper proposes a novel method for the automated recognition of thyroid nodules through an exploration of ultrasound videos and key-frames.
We first propose a detection-localization framework to automatically identify the clinical key-frame with a typical nodule in each ultrasound video.
Based on the localized key-frame, we develop a key-frame guided video classification model for thyroid recognition.
arXiv Detail & Related papers (2022-06-27T14:03:26Z) - Improving The Diagnosis of Thyroid Cancer by Machine Learning and
Clinical Data [3.6985351289638957]
Thyroid cancer is a common endocrine carcinoma that occurs in the thyroid gland.
Current human assessment of thyroid nodule malignancy is prone to errors and may not guarantee an accurate preoperative diagnosis.
This study proposed a machine framework to predict thyroid nodule malignancy based on a novel clinical dataset.
arXiv Detail & Related papers (2022-03-27T17:37:18Z) - Comparison of Machine Learning Classifiers to Predict Patient Survival
and Genetics of GBM: Towards a Standardized Model for Clinical Implementation [44.02622933605018]
Radiomic models have been shown to outperform clinical data for outcome prediction in glioblastoma (GBM)
We aimed to compare nine machine learning classifiers to predict overall survival (OS), isocitrate dehydrogenase (IDH) mutation, O-6-methylguanine-DNA-methyltransferase (MGMT) promoter methylation, epidermal growth factor receptor (EGFR) VII amplification and Ki-67 expression in GBM patients.
xGB obtained maximum accuracy for OS (74.5%), AB for IDH mutation (88%), MGMT methylation (71,7%), Ki-67 expression (86,6%), and EGFR amplification (81,
arXiv Detail & Related papers (2021-02-10T15:10:37Z) - Identification of Ischemic Heart Disease by using machine learning
technique based on parameters measuring Heart Rate Variability [50.591267188664666]
In this study, 18 non-invasive features (age, gender, left ventricular ejection fraction and 15 obtained from HRV) of 243 subjects were used to train and validate a series of several ANN.
The best result was obtained using 7 input parameters and 7 hidden nodes with an accuracy of 98.9% and 82% for the training and validation dataset.
arXiv Detail & Related papers (2020-10-29T19:14:41Z) - Confidence-guided Lesion Mask-based Simultaneous Synthesis of Anatomic
and Molecular MR Images in Patients with Post-treatment Malignant Gliomas [65.64363834322333]
Confidence Guided SAMR (CG-SAMR) synthesizes data from lesion information to multi-modal anatomic sequences.
module guides the synthesis based on confidence measure about the intermediate results.
experiments on real clinical data demonstrate that the proposed model can perform better than the state-of-theart synthesis methods.
arXiv Detail & Related papers (2020-08-06T20:20:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.