Supervised Machine Learning Algorithm for Detecting Consistency between
Reported Findings and the Conclusions of Mammography Reports
- URL: http://arxiv.org/abs/2202.13618v1
- Date: Mon, 28 Feb 2022 08:59:04 GMT
- Title: Supervised Machine Learning Algorithm for Detecting Consistency between
Reported Findings and the Conclusions of Mammography Reports
- Authors: Alexander Berdichevsky, Mor Peleg, and Daniel L. Rubin
- Abstract summary: Mammography reports document the diagnosis of patients' conditions.
Many reports contain non-standard terms (non-BI-RADS descriptors) and incomplete statements.
Our aim was to develop a tool to detect such discrepancies by comparing the reported conclusions to those that would be expected based on the reported radiology findings.
- Score: 66.89977257992568
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Objective. Mammography reports document the diagnosis of patients'
conditions. However, many reports contain non-standard terms (non-BI-RADS
descriptors) and incomplete statements, which can lead to conclusions that are
not well-supported by the reported findings. Our aim was to develop a tool to
detect such discrepancies by comparing the reported conclusions to those that
would be expected based on the reported radiology findings. Materials and
Methods. A deidentified data set from an academic hospital containing 258
mammography reports supplemented by 120 reports found on the web was used for
training and evaluation. Spell checking and term normalization was used to
unambiguously determine the reported BI-RADS descriptors. The resulting data
were input into seven classifiers that classify mammography reports, based on
their Findings sections, into seven BI-RADS final assessment categories.
Finally, the semantic similarity score of a report to each BI-RADS category is
reported. Results. Our term normalization algorithm correctly identified 97% of
the BI-RADS descriptors in mammography reports. Our system provided 76%
precision and 83% recall in correctly classifying the reports according to
BI-RADS final assessment category. Discussion. The strength of our approach
relies on providing high importance to BI-RADS terms in the summarization
phase, on the semantic similarity that considers the complex data
representation, and on the classification into all seven BI-RADs categories.
Conclusion. BI-RADS descriptors and expected final assessment categories could
be automatically detected by our approach with fairly good accuracy, which
could be used to make users aware that their reported findings do not match
well with their conclusion.
Related papers
- Contrastive Learning with Counterfactual Explanations for Radiology Report Generation [83.30609465252441]
We propose a textbfCountertextbfFactual textbfExplanations-based framework (CoFE) for radiology report generation.
Counterfactual explanations serve as a potent tool for understanding how decisions made by algorithms can be changed by asking what if'' scenarios.
Experiments on two benchmarks demonstrate that leveraging the counterfactual explanations enables CoFE to generate semantically coherent and factually complete reports.
arXiv Detail & Related papers (2024-07-19T17:24:25Z) - RaTEScore: A Metric for Radiology Report Generation [59.37561810438641]
This paper introduces a novel, entity-aware metric, as Radiological Report (Text) Evaluation (RaTEScore)
RaTEScore emphasizes crucial medical entities such as diagnostic outcomes and anatomical details, and is robust against complex medical synonyms and sensitive to negation expressions.
Our evaluations demonstrate that RaTEScore aligns more closely with human preference than existing metrics, validated both on established public benchmarks and our newly proposed RaTE-Eval benchmark.
arXiv Detail & Related papers (2024-06-24T17:49:28Z) - Comparative Experimentation of Accuracy Metrics in Automated Medical
Reporting: The Case of Otitis Consultations [0.5242869847419834]
Generative Artificial Intelligence can be used to automatically generate medical reports based on transcripts of medical consultations.
The accuracy of the generated reports needs to be established to ensure their correctness and usefulness.
There are several metrics for measuring the accuracy of AI generated reports, but little work has been done towards the application of these metrics in medical reporting.
arXiv Detail & Related papers (2023-11-22T09:51:43Z) - Radiology Report Generation Using Transformers Conditioned with
Non-imaging Data [55.17268696112258]
This paper proposes a novel multi-modal transformer network that integrates chest x-ray (CXR) images and associated patient demographic information.
The proposed network uses a convolutional neural network to extract visual features from CXRs and a transformer-based encoder-decoder network that combines the visual features with semantic text embeddings of patient demographic information.
arXiv Detail & Related papers (2023-11-18T14:52:26Z) - BI-RADS BERT & Using Section Tokenization to Understand Radiology
Reports [0.18352113484137625]
Domain specific contextual word embeddings have been shown to achieve impressive accuracy at such natural language processing tasks in medicine.
BERT model pre-trained on breast radiology reports combined with section tokenization resulted in an overall accuracy of 95.9% in field extraction.
arXiv Detail & Related papers (2021-10-14T17:25:49Z) - BI-RADS-Net: An Explainable Multitask Learning Approach for Cancer
Diagnosis in Breast Ultrasound Images [69.41441138140895]
This paper introduces BI-RADS-Net, a novel explainable deep learning approach for cancer detection in breast ultrasound images.
The proposed approach incorporates tasks for explaining and classifying breast tumors, by learning feature representations relevant to clinical diagnosis.
Explanations of the predictions (benign or malignant) are provided in terms of morphological features that are used by clinicians for diagnosis and reporting in medical practice.
arXiv Detail & Related papers (2021-10-05T19:14:46Z) - A New Computer-Aided Diagnosis System with Modified Genetic Feature
Selection for BI-RADS Classification of Breast Masses in Mammograms [5.395050211492798]
The language used to describe abnormalities in mammographic reports is based on the breast Imaging Reporting and Data System (BI-RADS)
This paper proposes a new and effective computer-aided diagnosis (CAD) system to classify mammographic masses into four assessment categories in BI-RADS.
arXiv Detail & Related papers (2020-05-11T13:06:25Z) - Show, Describe and Conclude: On Exploiting the Structure Information of
Chest X-Ray Reports [5.6070625920019825]
Chest X-Ray (CXR) images are commonly used for clinical screening and diagnosis.
The complex structures between and within sections of the reports pose a great challenge to the automatic report generation.
We propose a novel framework that exploits the structure information between and within report sections for generating CXR imaging reports.
arXiv Detail & Related papers (2020-04-26T02:29:20Z) - CLARA: Clinical Report Auto-completion [56.206459591367405]
CLinicit Al it Report it Auto-completion (CLARA) is an interactive method that generates reports in a sentence by sentence fashion based on doctors' anchor words and partially completed sentences.
In our experimental evaluation, CLARA achieved 0.393 CIDEr and 0.248 BLEU-4 on X-ray reports and 0.482 CIDEr and 0.491 BLEU-4 for EEG reports for sentence-level generation.
arXiv Detail & Related papers (2020-02-26T18:45:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.