Ensuring Ground Truth Accuracy in Healthcare with the EVINCE framework
- URL: http://arxiv.org/abs/2405.15808v2
- Date: Tue, 28 May 2024 05:11:50 GMT
- Title: Ensuring Ground Truth Accuracy in Healthcare with the EVINCE framework
- Authors: Edward Y. Chang,
- Abstract summary: The propagation of mislabeled data through machine learning models into clinical practice is unacceptable.
This paper proposes EVINCE, a system designed to 1) improve diagnosis accuracy and 2) rectify misdiagnoses and minimize training data errors.
- Score: 2.5200794639628032
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Misdiagnosis is a significant issue in healthcare, leading to harmful consequences for patients. The propagation of mislabeled data through machine learning models into clinical practice is unacceptable. This paper proposes EVINCE, a system designed to 1) improve diagnosis accuracy and 2) rectify misdiagnoses and minimize training data errors. EVINCE stands for Entropy Variation through Information Duality with Equal Competence, leveraging this novel theory to optimize the diagnostic process using multiple Large Language Models (LLMs) in a structured debate framework. Our empirical study verifies EVINCE to be effective in achieving its design goals.
Related papers
- Overcoming Uncertain Incompleteness for Robust Multimodal Sequential Diagnosis Prediction via Knowledge Distillation and Random Data Erasing [0.0]
We modify NECHO, designed in a diagnosis code-centric fashion, to handle uncertain modality representation dominance under imperfect data.
We develop a systematic knowledge distillation by employing the modified NECHO as both teacher and student.
We also utilise random erasing on individual data points within sequences during both training and distillation of the teacher to lightly simulate scenario with missing visit information.
arXiv Detail & Related papers (2024-07-28T17:14:27Z) - Automated Medical Coding on MIMIC-III and MIMIC-IV: A Critical Review
and Replicability Study [60.56194508762205]
We reproduce, compare, and analyze state-of-the-art automated medical coding machine learning models.
We show that several models underperform due to weak configurations, poorly sampled train-test splits, and insufficient evaluation.
We present the first comprehensive results on the newly released MIMIC-IV dataset using the reproduced models.
arXiv Detail & Related papers (2023-04-21T11:54:44Z) - Towards Reliable Medical Image Segmentation by utilizing Evidential Calibrated Uncertainty [52.03490691733464]
We introduce DEviS, an easily implementable foundational model that seamlessly integrates into various medical image segmentation networks.
By leveraging subjective logic theory, we explicitly model probability and uncertainty for the problem of medical image segmentation.
DeviS incorporates an uncertainty-aware filtering module, which utilizes the metric of uncertainty-calibrated error to filter reliable data.
arXiv Detail & Related papers (2023-01-01T05:02:46Z) - MIMO: Mutual Integration of Patient Journey and Medical Ontology for
Healthcare Representation Learning [49.57261599776167]
We propose an end-to-end robust Transformer-based solution, Mutual Integration of patient journey and Medical Ontology (MIMO) for healthcare representation learning and predictive analytics.
arXiv Detail & Related papers (2021-07-20T07:04:52Z) - Medical Profile Model: Scientific and Practical Applications in
Healthcare [1.718235998156457]
We present the patient histories as temporal sequences of diseases for which embeddings are learned in an unsupervised setup.
The embedding space includes demographic parameters which allow the creation of generalized patient profiles.
The training of such a medical profile model has been performed on a dataset of more than one million patients.
arXiv Detail & Related papers (2021-06-21T13:30:43Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - Clinical Outcome Prediction from Admission Notes using Self-Supervised
Knowledge Integration [55.88616573143478]
Outcome prediction from clinical text can prevent doctors from overlooking possible risks.
Diagnoses at discharge, procedures performed, in-hospital mortality and length-of-stay prediction are four common outcome prediction targets.
We propose clinical outcome pre-training to integrate knowledge about patient outcomes from multiple public sources.
arXiv Detail & Related papers (2021-02-08T10:26:44Z) - FIT: a Fast and Accurate Framework for Solving Medical Inquiring and
Diagnosing Tasks [10.687562550605739]
Self-diagnosis provides low-cost and accessible healthcare via an agent that queries the patient and makes predictions about possible diseases.
We propose a competitive framework, called FIT, which uses an information-theoretic reward to determine what data to collect next.
Our results in two simulated datasets show that FIT can effectively deal with large search space problems, outperforming existing baselines.
arXiv Detail & Related papers (2020-12-02T10:12:49Z) - A novel method for Causal Structure Discovery from EHR data, a
demonstration on type-2 diabetes mellitus [3.8171820752218997]
We propose a new data transformation method and a novel causal structure discovery algorithm.
We demonstrated the proposed methods on an application to type-2 diabetes mellitus.
arXiv Detail & Related papers (2020-11-11T00:50:04Z) - Predicting Clinical Diagnosis from Patients Electronic Health Records
Using BERT-based Neural Networks [62.9447303059342]
We show the importance of this problem in medical community.
We present a modification of Bidirectional Representations from Transformers (BERT) model for classification sequence.
We use a large-scale Russian EHR dataset consisting of about 4 million unique patient visits.
arXiv Detail & Related papers (2020-07-15T09:22:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.