AI-based Data Preparation and Data Analytics in Healthcare: The Case of
Diabetes
- URL: http://arxiv.org/abs/2206.06182v1
- Date: Mon, 13 Jun 2022 14:13:15 GMT
- Title: AI-based Data Preparation and Data Analytics in Healthcare: The Case of
Diabetes
- Authors: Marianna Maranghi, Aris Anagnostopoulos, Irene Cannistraci, Ioannis
Chatzigiannakis, Federico Croce, Giulia Di Teodoro, Michele Gentile, Giorgio
Grani, Maurizio Lenzerini, Stefano Leonardi, Andrea Mastropietro, Laura
Palagi, Massimiliano Pappa, Riccardo Rosati, Riccardo Valentini, Paola
Velardi
- Abstract summary: The Associazione Medici Diabetologi (AMD) collects and manages one of the largest worldwide-available collections of diabetic patient records, also known as the AMD database.
This paper presents the initial results of an ongoing project whose focus is the application of Artificial Intelligence and Machine Learning techniques for conceptualizing, cleaning, and analyzing such an important and valuable dataset.
- Score: 10.307863191143635
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: The Associazione Medici Diabetologi (AMD) collects and manages one of the
largest worldwide-available collections of diabetic patient records, also known
as the AMD database. This paper presents the initial results of an ongoing
project whose focus is the application of Artificial Intelligence and Machine
Learning techniques for conceptualizing, cleaning, and analyzing such an
important and valuable dataset, with the goal of providing predictive insights
to better support diabetologists in their diagnostic and therapeutic choices.
Related papers
- An adapted large language model facilitates multiple medical tasks in diabetes care [20.096444964141508]
Large language models (LLMs) have shown promise in various healthcare scenarios, but their effectiveness across a diverse range of diabetes tasks remains unproven.
This study introduced a framework to train and validate diabetes-specific LLMs.
arXiv Detail & Related papers (2024-09-20T03:47:54Z) - Federated Diabetes Prediction in Canadian Adults Using Real-world Cross-Province Primary Care Data [0.04090757602725897]
This paper introduces a federated learning approach, which amalgamates predictive models without centralized data storage and processing, thus avoiding privacy issues.
This marks the first application of federated learning to predict diabetes using real clinical datasets in Canada extracted from the Canadian Primary Care Sentinel Surveillance Network (CPCSSN) without crossprovince patient data sharing.
arXiv Detail & Related papers (2024-08-21T22:47:21Z) - TrialBench: Multi-Modal Artificial Intelligence-Ready Clinical Trial Datasets [57.067409211231244]
This paper presents meticulously curated AIready datasets covering multi-modal data (e.g., drug molecule, disease code, text, categorical/numerical features) and 8 crucial prediction challenges in clinical trial design.
We provide basic validation methods for each task to ensure the datasets' usability and reliability.
We anticipate that the availability of such open-access datasets will catalyze the development of advanced AI approaches for clinical trial design.
arXiv Detail & Related papers (2024-06-30T09:13:10Z) - Recent Advances in Predictive Modeling with Electronic Health Records [71.19967863320647]
utilizing EHR data for predictive modeling presents several challenges due to its unique characteristics.
Deep learning has demonstrated its superiority in various applications, including healthcare.
arXiv Detail & Related papers (2024-02-02T00:31:01Z) - Data-Centric Foundation Models in Computational Healthcare: A Survey [21.53211505568379]
Foundation models (FMs) as an emerging suite of AI techniques have struck a wave of opportunities in computational healthcare.
We discuss key perspectives in AI security, assessment, and alignment with human values.
We offer a promising outlook of FM-based analytics to enhance the performance of patient outcome and clinical workflow.
arXiv Detail & Related papers (2024-01-04T08:00:32Z) - Multimodal Pretraining of Medical Time Series and Notes [45.89025874396911]
Deep learning models show promise in extracting meaningful patterns, but they require extensive labeled data.
We propose a novel approach employing self-supervised pretraining, focusing on the alignment of clinical measurements and notes.
In downstream tasks, including in-hospital mortality prediction and phenotyping, our model outperforms baselines in settings where only a fraction of the data is labeled.
arXiv Detail & Related papers (2023-12-11T21:53:40Z) - DiaTrend: A dataset from advanced diabetes technology to enable
development of novel analytic solutions [0.0]
This dataset is composed of intensive longitudinal data from wearable medical devices, including a total of 27,561 days of continuous glucose monitor data and 8,220 days of insulin pump data from 54 patients with diabetes.
This dataset is useful for developing novel analytic solutions that can reduce the disease burden for people living with diabetes and increase knowledge on chronic condition management in outpatient settings.
arXiv Detail & Related papers (2023-04-04T00:59:04Z) - Medical Pathologies Prediction : Systematic Review and Proposed Approach [0.0]
We have analyzed and examined different works concerning the exploitation of the most recent technologies such as big data, artificial intelligence, machine learning, and deep learning for the improvement of health care.
We propose our general approach concentrating on the collection, preprocessing and clustering of medical data to facilitate access, after analysis, to the patients and health professionals to predict the most frequent pathologies with better precision within a notable timeframe.
arXiv Detail & Related papers (2023-04-01T13:35:17Z) - Unsupervised Pre-Training on Patient Population Graphs for Patient-Level
Predictions [48.02011627390706]
Pre-training has shown success in different areas of machine learning, such as Computer Vision (CV), Natural Language Processing (NLP) and medical imaging.
In this paper, we apply unsupervised pre-training to heterogeneous, multi-modal EHR data for patient outcome prediction.
We find that our proposed graph based pre-training method helps in modeling the data at a population level.
arXiv Detail & Related papers (2022-03-23T17:59:45Z) - MIMO: Mutual Integration of Patient Journey and Medical Ontology for
Healthcare Representation Learning [49.57261599776167]
We propose an end-to-end robust Transformer-based solution, Mutual Integration of patient journey and Medical Ontology (MIMO) for healthcare representation learning and predictive analytics.
arXiv Detail & Related papers (2021-07-20T07:04:52Z) - Trajectories, bifurcations and pseudotime in large clinical datasets:
applications to myocardial infarction and diabetes data [94.37521840642141]
We suggest a semi-supervised methodology for the analysis of large clinical datasets, characterized by mixed data types and missing values.
The methodology is based on application of elastic principal graphs which can address simultaneously the tasks of dimensionality reduction, data visualization, clustering, feature selection and quantifying the geodesic distances (pseudotime) in partially ordered sequences of observations.
arXiv Detail & Related papers (2020-07-07T21:04:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.