Re-defining Radiology Quality Assurance (QA) -- Artificial Intelligence
(AI)-Based QA by Restricted Investigation of Unequal Scores (AQUARIUS)
- URL: http://arxiv.org/abs/2205.00629v2
- Date: Tue, 3 May 2022 18:21:50 GMT
- Title: Re-defining Radiology Quality Assurance (QA) -- Artificial Intelligence
(AI)-Based QA by Restricted Investigation of Unequal Scores (AQUARIUS)
- Authors: Axel Wismueller, Larry Stockmaster, Ali Vosoughi
- Abstract summary: We present a novel approach, Artificial Intelligence (AI)-Based QUality Assurance by Restricted Investigation of Unequal Scores (AQUARIUS)
AQUARIUS typically includes automatic comparison of AI-based image analysis with natural language processing (NLP) on radiology reports.
Using AQUARIUS with NLP on final radiology reports and targeted expert neuroradiology review of only 29 discordantly classified cases reduced the human QA effort by 98.5%.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: There is an urgent need for streamlining radiology Quality Assurance (QA)
programs to make them better and faster. Here, we present a novel approach,
Artificial Intelligence (AI)-Based QUality Assurance by Restricted
Investigation of Unequal Scores (AQUARIUS), for re-defining radiology QA, which
reduces human effort by up to several orders of magnitude over existing
approaches. AQUARIUS typically includes automatic comparison of AI-based image
analysis with natural language processing (NLP) on radiology reports. Only the
usually small subset of cases with discordant reads is subsequently reviewed by
human experts. To demonstrate the clinical applicability of AQUARIUS, we
performed a clinical QA study on Intracranial Hemorrhage (ICH) detection in
1936 head CT scans from a large academic hospital. Immediately following image
acquisition, scans were automatically analyzed for ICH using a commercially
available software (Aidoc, Tel Aviv, Israel). Cases rated positive for ICH by
AI (ICH-AI+) were automatically flagged in radiologists' reading worklists,
where flagging was randomly switched off with probability 50%. Using AQUARIUS
with NLP on final radiology reports and targeted expert neuroradiology review
of only 29 discordantly classified cases reduced the human QA effort by 98.5%,
where we found a total of six non-reported true ICH+ cases, with radiologists'
missed ICH detection rates of 0.52% and 2.5% for flagged and non-flagged cases,
respectively. We conclude that AQUARIUS, by combining AI-based image analysis
with NLP-based pre-selection of cases for targeted human expert review, can
efficiently identify missed findings in radiology studies and significantly
expedite radiology QA programs in a hybrid human-machine interoperability
approach.
Related papers
- A Survey of Artificial Intelligence in Gait-Based Neurodegenerative Disease Diagnosis [51.07114445705692]
neurodegenerative diseases (NDs) traditionally require extensive healthcare resources and human effort for medical diagnosis and monitoring.
As a crucial disease-related motor symptom, human gait can be exploited to characterize different NDs.
The current advances in artificial intelligence (AI) models enable automatic gait analysis for NDs identification and classification.
arXiv Detail & Related papers (2024-05-21T06:44:40Z) - Artificial intelligence for abnormality detection in high volume neuroimaging: a systematic review and meta-analysis [0.5934394862891423]
Most studies evaluating artificial intelligence (AI) models that detect abnormalities in neuroimaging are tested on unrepresentative patient cohorts.
The aim was to determine the diagnostic test accuracy and summarise the evidence supporting the use of AI models performing first-line, high-volume neuroimaging tasks.
arXiv Detail & Related papers (2024-05-09T10:12:17Z) - A survey of recent methods for addressing AI fairness and bias in
biomedicine [48.46929081146017]
Artificial intelligence systems may perpetuate social inequities or demonstrate biases, such as those based on race or gender.
We surveyed recent publications on different debiasing methods in the fields of biomedical natural language processing (NLP) or computer vision (CV)
We performed a literature search on PubMed, ACM digital library, and IEEE Xplore of relevant articles published between January 2018 and December 2023 using multiple combinations of keywords.
We reviewed other potential methods from the general domain that could be applied to biomedicine to address bias and improve fairness.
arXiv Detail & Related papers (2024-02-13T06:38:46Z) - The Limits of Fair Medical Imaging AI In The Wild [43.97266228706059]
We investigate the extent to which medical AI utilizes demographic encodings.
We confirm that medical imaging AI leverages demographic shortcuts in disease classification.
We find that models with less encoding of demographic attributes are often most "globally optimal"
arXiv Detail & Related papers (2023-12-11T18:59:50Z) - Consensus, dissensus and synergy between clinicians and specialist
foundation models in radiology report generation [32.26270073540666]
The worldwide shortage of radiologists restricts access to expert care and imposes heavy workloads.
Recent progress in automated report generation with vision-language models offer clear potential in ameliorating the situation.
We build a state-of-the-art report generation system for chest radiographs, $textitFlamingo-CXR, by fine-tuning a well-known vision-language foundation model on radiology data.
arXiv Detail & Related papers (2023-11-30T05:38:34Z) - ChatRadio-Valuer: A Chat Large Language Model for Generalizable
Radiology Report Generation Based on Multi-institution and Multi-system Data [115.0747462486285]
ChatRadio-Valuer is a tailored model for automatic radiology report generation that learns generalizable representations.
The clinical dataset utilized in this study encompasses a remarkable total of textbf332,673 observations.
ChatRadio-Valuer consistently outperforms state-of-the-art models, especially ChatGPT (GPT-3.5-Turbo) and GPT-4 et al.
arXiv Detail & Related papers (2023-10-08T17:23:17Z) - Robustness of an Artificial Intelligence Solution for Diagnosis of
Normal Chest X-Rays [0.0]
This study evaluates the robustness of an AI solution for the diagnosis of normal chest X-rays (CXRs)
A total of 4,060 CXRs were sampled to represent a diverse dataset of NHS patients and care settings.
arXiv Detail & Related papers (2022-08-31T09:54:24Z) - StRegA: Unsupervised Anomaly Detection in Brain MRIs using a Compact
Context-encoding Variational Autoencoder [48.2010192865749]
Unsupervised anomaly detection (UAD) can learn a data distribution from an unlabelled dataset of healthy subjects and then be applied to detect out of distribution samples.
This research proposes a compact version of the "context-encoding" VAE (ceVAE) model, combined with pre and post-processing steps, creating a UAD pipeline (StRegA)
The proposed pipeline achieved a Dice score of 0.642$pm$0.101 while detecting tumours in T2w images of the BraTS dataset and 0.859$pm$0.112 while detecting artificially induced anomalies.
arXiv Detail & Related papers (2022-01-31T14:27:35Z) - An Explainable-AI approach for Diagnosis of COVID-19 using MALDI-ToF
Mass Spectrometry [0.9250974571641537]
Severe acute respiratory syndrome coronavirus type-2 (SARS-CoV-2) caused a global pandemic and immensely affected the global economy.
Recently, multiple alternative platforms for testing coronavirus disease 2019 (COVID-19) have been published that show high agreement with current gold standard real-time polymerase chain reaction (RT-PCR) results.
These new methods do away with nasopharyngeal (NP) swabs, eliminate the need for complicated reagents, and reduce the burden on RT-PCR test reagent supply.
In the present work, we have designed an artificial intelligence-based (AI) testing method to provide confidence in the results.
arXiv Detail & Related papers (2021-09-28T23:29:31Z) - Artificial Intelligence to Assist in Exclusion of Coronary
Atherosclerosis during CCTA Evaluation of Chest-Pain in the Emergency
Department: Preparing an Application for Real-World Use [6.9835031964130545]
We describe the development of an AI algorithm and workflow for assisting interpreting physicians in CCTA screening for the absence of coronary atherosclerosis.
The algorithm demonstrated strong performance with AUC-ROC = 0.96.
There is potential for this AI application to assist in CCTA interpretation to help extricate atherosclerosis from chest-pain presentations.
arXiv Detail & Related papers (2020-08-10T16:07:04Z) - Hemogram Data as a Tool for Decision-making in COVID-19 Management:
Applications to Resource Scarcity Scenarios [62.997667081978825]
COVID-19 pandemics has challenged emergency response systems worldwide, with widespread reports of essential services breakdown and collapse of health care structure.
This work describes a machine learning model derived from hemogram exam data performed in symptomatic patients.
Proposed models can predict COVID-19 qRT-PCR results in symptomatic individuals with high accuracy, sensitivity and specificity.
arXiv Detail & Related papers (2020-05-10T01:45:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.