Task-wise Split Gradient Boosting Trees for Multi-center Diabetes
Prediction
- URL: http://arxiv.org/abs/2108.07107v1
- Date: Mon, 16 Aug 2021 14:22:44 GMT
- Title: Task-wise Split Gradient Boosting Trees for Multi-center Diabetes
Prediction
- Authors: Mingcheng Chen, Zhenghui Wang, Zhiyun Zhao, Weinan Zhang, Xiawei Guo,
Jian Shen, Yanru Qu, Jieli Lu, Min Xu, Yu Xu, Tiange Wang, Mian Li, Wei-Wei
Tu, Yong Yu, Yufang Bi, Weiqing Wang, Guang Ning
- Abstract summary: Task-wise Split Gradient Boosting Trees (TSGB) is proposed for the multi-center diabetes prediction task.
TSGB achieves superior performance against several state-of-the-art methods.
The proposed TSGB method has been deployed as an online diabetes risk assessment software for early diagnosis.
- Score: 37.846368153741395
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Diabetes prediction is an important data science application in the social
healthcare domain. There exist two main challenges in the diabetes prediction
task: data heterogeneity since demographic and metabolic data are of different
types, data insufficiency since the number of diabetes cases in a single
medical center is usually limited. To tackle the above challenges, we employ
gradient boosting decision trees (GBDT) to handle data heterogeneity and
introduce multi-task learning (MTL) to solve data insufficiency. To this end,
Task-wise Split Gradient Boosting Trees (TSGB) is proposed for the multi-center
diabetes prediction task. Specifically, we firstly introduce task gain to
evaluate each task separately during tree construction, with a theoretical
analysis of GBDT's learning objective. Secondly, we reveal a problem when
directly applying GBDT in MTL, i.e., the negative task gain problem. Finally,
we propose a novel split method for GBDT in MTL based on the task gain
statistics, named task-wise split, as an alternative to standard feature-wise
split to overcome the mentioned negative task gain problem. Extensive
experiments on a large-scale real-world diabetes dataset and a commonly used
benchmark dataset demonstrate TSGB achieves superior performance against
several state-of-the-art methods. Detailed case studies further support our
analysis of negative task gain problems and provide insightful findings. The
proposed TSGB method has been deployed as an online diabetes risk assessment
software for early diagnosis.
Related papers
- Privacy Preserved Blood Glucose Level Cross-Prediction: An Asynchronous Decentralized Federated Learning Approach [13.363740869325646]
Newly diagnosed Type 1 Diabetes (T1D) patients often struggle to obtain effective Blood Glucose (BG) prediction models.
We propose "GluADFL", blood Glucose prediction by Asynchronous Decentralized Federated Learning.
arXiv Detail & Related papers (2024-06-21T17:57:39Z) - Interpretable Causal Inference for Analyzing Wearable, Sensor, and Distributional Data [62.56890808004615]
We develop an interpretable method for distributional data analysis that ensures trustworthy and robust decision-making.
We demonstrate ADD MALTS' utility by studying the effectiveness of continuous glucose monitors in mitigating diabetes risks.
arXiv Detail & Related papers (2023-12-17T00:42:42Z) - Generalizing to Unseen Domains in Diabetic Retinopathy Classification [8.59772105902647]
We study the problem of generalizing a model to unseen distributions or domains in diabetic retinopathy classification.
We propose a simple and effective domain generalization (DG) approach that achieves self-distillation in vision transformers.
We report the performance of several state-of-the-art DG methods on open-source DR classification datasets.
arXiv Detail & Related papers (2023-10-26T09:11:55Z) - Harvard Glaucoma Detection and Progression: A Multimodal Multitask
Dataset and Generalization-Reinforced Semi-Supervised Learning [16.465424871839627]
We develop a novel semi-supervised learning (SSL) model called pseudo supervisor to utilize unlabeled data.
Second, we release the Harvard Glaucoma Detection and Progression (Harvard-GDP) dataset.
This is the largest glaucoma detection dataset with 3D OCT imaging data and the first glaucoma progression forecasting dataset that is publicly available.
arXiv Detail & Related papers (2023-08-25T14:38:51Z) - SSM-DTA: Breaking the Barriers of Data Scarcity in Drug-Target Affinity
Prediction [127.43571146741984]
Drug-Target Affinity (DTA) is of vital importance in early-stage drug discovery.
wet experiments remain the most reliable method, but they are time-consuming and resource-intensive.
Existing methods have primarily focused on developing techniques based on the available DTA data, without adequately addressing the data scarcity issue.
We present the SSM-DTA framework, which incorporates three simple yet highly effective strategies.
arXiv Detail & Related papers (2022-06-20T14:53:25Z) - Unsupervised Pre-Training on Patient Population Graphs for Patient-Level
Predictions [48.02011627390706]
Pre-training has shown success in different areas of machine learning, such as Computer Vision (CV), Natural Language Processing (NLP) and medical imaging.
In this paper, we apply unsupervised pre-training to heterogeneous, multi-modal EHR data for patient outcome prediction.
We find that our proposed graph based pre-training method helps in modeling the data at a population level.
arXiv Detail & Related papers (2022-03-23T17:59:45Z) - Patient Outcome and Zero-shot Diagnosis Prediction with
Hypernetwork-guided Multitask Learning [3.392432412743858]
Multitask deep learning has been applied to patient outcome prediction from text.
Diagnose prediction among the multiple tasks has the generalizability issue due to rare diseases or unseen diagnoses.
We propose a hypernetwork-based approach that generates task-conditioned parameters and coefficients of multitask prediction heads.
arXiv Detail & Related papers (2021-09-07T12:52:26Z) - Improving Limited Labeled Dialogue State Tracking with Self-Supervision [91.68515201803986]
Existing dialogue state tracking (DST) models require plenty of labeled data.
We present and investigate two self-supervised objectives: preserving latent consistency and modeling conversational behavior.
Our proposed self-supervised signals can improve joint goal accuracy by 8.95% when only 1% labeled data is used.
arXiv Detail & Related papers (2020-10-26T21:57:42Z) - Multi-task Learning via Adaptation to Similar Tasks for Mortality
Prediction of Diverse Rare Diseases [10.020413101958944]
Mortality prediction of diverse rare diseases using electronic health record (EHR) data is a crucial task for intelligent healthcare.
Data insufficiency and the clinical diversity of rare diseases make it hard for directly training deep learning models on individual disease data.
We use Ada-Sit to train long short-term memory networks (LSTM) based prediction models on longitudinal EHR data.
arXiv Detail & Related papers (2020-04-11T06:15:23Z) - Short Term Blood Glucose Prediction based on Continuous Glucose
Monitoring Data [53.01543207478818]
This study explores the use of Continuous Glucose Monitoring (CGM) data as input for digital decision support tools.
We investigate how Recurrent Neural Networks (RNNs) can be used for Short Term Blood Glucose (STBG) prediction.
arXiv Detail & Related papers (2020-02-06T16:39:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.