AZT1D: A Real-World Dataset for Type 1 Diabetes
- URL: http://arxiv.org/abs/2506.14789v1
- Date: Tue, 27 May 2025 21:54:53 GMT
- Title: AZT1D: A Real-World Dataset for Type 1 Diabetes
- Authors: Saman Khamesian, Asiful Arefeen, Bithika M. Thompson, Maria Adela Grando, Hassan Ghasemzadeh,
- Abstract summary: AZT1D is a dataset containing data collected from 25 individuals with type 1 diabetes (T1D) on automated insulin delivery systems.<n>By offering rich, naturalistic data, AZT1D supports a wide range of artificial intelligence and machine learning applications.
- Score: 6.102406188211489
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: High quality real world datasets are essential for advancing data driven approaches in type 1 diabetes (T1D) management, including personalized therapy design, digital twin systems, and glucose prediction models. However, progress in this area has been limited by the scarcity of publicly available datasets that offer detailed and comprehensive patient data. To address this gap, we present AZT1D, a dataset containing data collected from 25 individuals with T1D on automated insulin delivery (AID) systems. AZT1D includes continuous glucose monitoring (CGM) data, insulin pump and insulin administration data, carbohydrate intake, and device mode (regular, sleep, and exercise) obtained over 6 to 8 weeks for each patient. Notably, the dataset provides granular details on bolus insulin delivery (i.e., total dose, bolus type, correction specific amounts) features that are rarely found in existing datasets. By offering rich, naturalistic data, AZT1D supports a wide range of artificial intelligence and machine learning applications aimed at improving clinical decision making and individualized care in T1D.
Related papers
- Glucose-ML: A collection of longitudinal diabetes datasets for development of robust AI solutions [0.0]
Glucose-ML is a collection of 10 publicly available diabetes datasets.<n>The Glucose-ML collection comprises over 300,000 days of continuous glucose monitor (CGM) data.<n>We conduct a case study for the task of blood glucose prediction.
arXiv Detail & Related papers (2025-07-18T16:53:05Z) - SMILE-UHURA Challenge -- Small Vessel Segmentation at Mesoscopic Scale from Ultra-High Resolution 7T Magnetic Resonance Angiograms [60.35639972035727]
The lack of publicly available annotated datasets has impeded the development of robust, machine learning-driven segmentation algorithms.
The SMILE-UHURA challenge addresses the gap in publicly available annotated datasets by providing an annotated dataset of Time-of-Flight angiography acquired with 7T MRI.
Dice scores reached up to 0.838 $pm$ 0.066 and 0.716 $pm$ 0.125 on the respective datasets, with an average performance of up to 0.804 $pm$ 0.15.
arXiv Detail & Related papers (2024-11-14T17:06:00Z) - FedCVD: The First Real-World Federated Learning Benchmark on Cardiovascular Disease Data [52.55123685248105]
Cardiovascular diseases (CVDs) are currently the leading cause of death worldwide, highlighting the critical need for early diagnosis and treatment.
Machine learning (ML) methods can help diagnose CVDs early, but their performance relies on access to substantial data with high quality.
This paper presents the first real-world FL benchmark for cardiovascular disease detection, named FedCVD.
arXiv Detail & Related papers (2024-10-28T02:24:01Z) - MSSDA: Multi-Sub-Source Adaptation for Diabetic Foot Neuropathy Recognition [6.890510442626563]
Diabetic foot neuropathy (DFN) is a critical factor leading to diabetic foot ulcers.
Existing datasets do not directly derive from plantar data.
We have collected a novel dataset comprising continuous plantar pressure data to recognize DFN.
arXiv Detail & Related papers (2024-09-21T14:16:20Z) - From Glucose Patterns to Health Outcomes: A Generalizable Foundation Model for Continuous Glucose Monitor Data Analysis [47.23780364438969]
We present GluFormer, a generative foundation model for CGM data that learns nuanced glycemic patterns and translates them into predictive representations of metabolic health.<n>GluFormer generalizes to 19 external cohorts spanning different ethnicities and ages, 5 countries, 8 CGM devices, and diverse pathophysiological states.<n>In a longitudinal study of 580 adults with CGM data and 12-year follow-up, GluFormer identifies individuals at elevated risk of developing diabetes more effectively than blood HbA1C%.
arXiv Detail & Related papers (2024-08-20T13:19:06Z) - DiaTrend: A dataset from advanced diabetes technology to enable
development of novel analytic solutions [0.0]
This dataset is composed of intensive longitudinal data from wearable medical devices, including a total of 27,561 days of continuous glucose monitor data and 8,220 days of insulin pump data from 54 patients with diabetes.
This dataset is useful for developing novel analytic solutions that can reduce the disease burden for people living with diabetes and increase knowledge on chronic condition management in outpatient settings.
arXiv Detail & Related papers (2023-04-04T00:59:04Z) - Temporal patterns in insulin needs for Type 1 diabetes [0.0]
Type 1 Diabetes (T1D) is a chronic condition where the body produces little or no insulin.
Finding the right insulin dose and time remains a complex, challenging and as yet unsolved control task.
In this study, we use the OpenAPS Data Commons dataset to discover temporal patterns in insulin need driven by well-known factors.
arXiv Detail & Related papers (2022-11-14T14:19:50Z) - Unsupervised pre-training of graph transformers on patient population
graphs [48.02011627390706]
We propose a graph-transformer-based network to handle heterogeneous clinical data.
We show the benefit of our pre-training method in a self-supervised and a transfer learning setting.
arXiv Detail & Related papers (2022-07-21T16:59:09Z) - Unsupervised Pre-Training on Patient Population Graphs for Patient-Level
Predictions [48.02011627390706]
Pre-training has shown success in different areas of machine learning, such as Computer Vision (CV), Natural Language Processing (NLP) and medical imaging.
In this paper, we apply unsupervised pre-training to heterogeneous, multi-modal EHR data for patient outcome prediction.
We find that our proposed graph based pre-training method helps in modeling the data at a population level.
arXiv Detail & Related papers (2022-03-23T17:59:45Z) - Trajectories, bifurcations and pseudotime in large clinical datasets:
applications to myocardial infarction and diabetes data [94.37521840642141]
We suggest a semi-supervised methodology for the analysis of large clinical datasets, characterized by mixed data types and missing values.
The methodology is based on application of elastic principal graphs which can address simultaneously the tasks of dimensionality reduction, data visualization, clustering, feature selection and quantifying the geodesic distances (pseudotime) in partially ordered sequences of observations.
arXiv Detail & Related papers (2020-07-07T21:04:55Z) - GLYFE: Review and Benchmark of Personalized Glucose Predictive Models in
Type-1 Diabetes [4.17510581764131]
GLYFE is a benchmark of machine-learning-based glucose-predictive models.
The results of nine different models coming from the glucose-prediction literature are presented.
arXiv Detail & Related papers (2020-06-29T11:34:41Z) - DTR Bandit: Learning to Make Response-Adaptive Decisions With Low Regret [59.81290762273153]
Dynamic treatment regimes (DTRs) are personalized, adaptive, multi-stage treatment plans that adapt treatment decisions to an individual's initial features and to intermediate outcomes and features at each subsequent stage.
We propose a novel algorithm that, by carefully balancing exploration and exploitation, is guaranteed to achieve rate-optimal regret when the transition and reward models are linear.
arXiv Detail & Related papers (2020-05-06T13:03:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.