An Open-Source Dataset on Dietary Behaviors and DASH Eating Plan
Optimization Constraints
- URL: http://arxiv.org/abs/2010.07531v1
- Date: Thu, 15 Oct 2020 05:25:44 GMT
- Title: An Open-Source Dataset on Dietary Behaviors and DASH Eating Plan
Optimization Constraints
- Authors: Farzin Ahmadi, Fardin Ganjkhanloo, Kimia Ghobadi
- Abstract summary: We provide a modified dataset based on dietary behaviors of different groups of people, their demographics, and pre-existing conditions.
We additionally provide tailored datasets for hypertension and pre-diabetic patients as groups of interest who may benefit from targetted diets.
- Score: 0.29298205115761694
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Linear constrained optimization techniques have been applied to many
real-world settings. In recent years, inferring the unknown parameters and
functions inside an optimization model has also gained traction. This inference
is often based on existing observations and/or known parameters. Consequently,
such models require reliable, easily accessed, and easily interpreted examples
to be evaluated. To facilitate research in such directions, we provide a
modified dataset based on dietary behaviors of different groups of people,
their demographics, and pre-existing conditions, among other factors. This data
is gathered from the National Health and Nutrition Examination Survey (NHANES)
and complemented with the nutritional data from the United States Department of
Agriculture (USDA). We additionally provide tailored datasets for hypertension
and pre-diabetic patients as groups of interest who may benefit from targetted
diets such as the Dietary Approaches to Stop Hypertension (DASH) eating plan.
The data is compiled and curated in such a way that it is suitable as input to
linear optimization models. We hope that this data and its supplementary,
open-accessed materials can accelerate and simplify interpretations and
research on linear optimization and constrained inference models. The complete
dataset can be found in the following repository:
https://github.com/CSSEHealthcare/InverseLearning
Related papers
- From Glucose Patterns to Health Outcomes: A Generalizable Foundation Model for Continuous Glucose Monitor Data Analysis [50.80532910808962]
We present GluFormer, a generative foundation model on biomedical temporal data based on a transformer architecture.
GluFormer generalizes to 15 different external datasets, including 4936 individuals across 5 different geographical regions.
It can also predict onset of future health outcomes even 4 years in advance.
arXiv Detail & Related papers (2024-08-20T13:19:06Z) - RoDE: Linear Rectified Mixture of Diverse Experts for Food Large Multi-Modal Models [96.43285670458803]
Uni-Food is a unified food dataset that comprises over 100,000 images with various food labels.
Uni-Food is designed to provide a more holistic approach to food data analysis.
We introduce a novel Linear Rectification Mixture of Diverse Experts (RoDE) approach to address the inherent challenges of food-related multitasking.
arXiv Detail & Related papers (2024-07-17T16:49:34Z) - Enhancing Performance for Highly Imbalanced Medical Data via Data Regularization in a Federated Learning Setting [6.22153888560487]
The goal of the proposed method is to enhance model performance for cardiovascular disease prediction.
The method is evaluated across four datasets for cardiovascular disease prediction, which are scattered across different clients.
arXiv Detail & Related papers (2024-05-30T19:15:38Z) - Adaptive Data Analysis for Growing Data [19.68686581348877]
Reuse of data in adaptive poses challenges regarding overfitting and the statistical validity results.
We present the first generalization bounds for adaptive analysis in the dynamic data setting.
arXiv Detail & Related papers (2024-05-22T06:17:58Z) - Simulation-Enhanced Data Augmentation for Machine Learning Pathloss
Prediction [9.664420734674088]
This paper introduces a novel simulation-enhanced data augmentation method for machine learning pathloss prediction.
Our method integrates synthetic data generated from a cellular coverage simulator and independently collected real-world datasets.
The integration of synthetic data significantly improves the generalizability of the model in different environments.
arXiv Detail & Related papers (2024-02-03T00:38:08Z) - Collinear datasets augmentation using Procrustes validation sets [0.0]
We propose a new method for augmentation of numeric and mixed datasets.
The method generates additional data points by utilizing cross-validation resampling and latent variable modeling.
It is particularly efficient for datasets with moderate to high degrees of collinearity.
arXiv Detail & Related papers (2023-12-08T09:07:11Z) - NutritionVerse: Empirical Study of Various Dietary Intake Estimation Approaches [59.38343165508926]
Accurate dietary intake estimation is critical for informing policies and programs to support healthy eating.
Recent work has focused on using computer vision and machine learning to automatically estimate dietary intake from food images.
We introduce NutritionVerse- Synth, the first large-scale dataset of 84,984 synthetic 2D food images with associated dietary information.
We also collect a real image dataset, NutritionVerse-Real, containing 889 images of 251 dishes to evaluate realism.
arXiv Detail & Related papers (2023-09-14T13:29:41Z) - GastroVision: A Multi-class Endoscopy Image Dataset for Computer Aided
Gastrointestinal Disease Detection [6.231109933741383]
This dataset includes different anatomical landmarks, pathological abnormalities, polyp removal cases and normal findings from the GI tract.
It was annotated and verified by experienced GI endoscopists.
We believe our dataset can facilitate the development of AI-based algorithms for GI disease detection and classification.
arXiv Detail & Related papers (2023-07-16T19:36:03Z) - Prediction-Oriented Bayesian Active Learning [51.426960808684655]
Expected predictive information gain (EPIG) is an acquisition function that measures information gain in the space of predictions rather than parameters.
EPIG leads to stronger predictive performance compared with BALD across a range of datasets and models.
arXiv Detail & Related papers (2023-04-17T10:59:57Z) - DeepRite: Deep Recurrent Inverse TreatmEnt Weighting for Adjusting
Time-varying Confounding in Modern Longitudinal Observational Data [68.29870617697532]
We propose Deep Recurrent Inverse TreatmEnt weighting (DeepRite) for time-varying confounding in longitudinal data.
DeepRite is shown to recover the ground truth from synthetic data, and estimate unbiased treatment effects from real data.
arXiv Detail & Related papers (2020-10-28T15:05:08Z) - DTR Bandit: Learning to Make Response-Adaptive Decisions With Low Regret [59.81290762273153]
Dynamic treatment regimes (DTRs) are personalized, adaptive, multi-stage treatment plans that adapt treatment decisions to an individual's initial features and to intermediate outcomes and features at each subsequent stage.
We propose a novel algorithm that, by carefully balancing exploration and exploitation, is guaranteed to achieve rate-optimal regret when the transition and reward models are linear.
arXiv Detail & Related papers (2020-05-06T13:03:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.