iTARGET: Interpretable Tailored Age Regression for Grouped Epigenetic Traits
- URL: http://arxiv.org/abs/2501.02401v1
- Date: Sat, 04 Jan 2025 23:06:46 GMT
- Title: iTARGET: Interpretable Tailored Age Regression for Grouped Epigenetic Traits
- Authors: Zipeng Wu, Daniel Herring, Fabian Spill, James Andrews,
- Abstract summary: We propose a novel two-phase algorithm to accurately predict chronological age from DNA methylation patterns.<n>Our method not only improves prediction accuracy but also reveals key age-related CpG sites, detects age-specific changes in aging rates, and identifies pairwise interactions between CpG sites.<n> Experimental results show that our approach outperforms traditional epigenetic clocks and machine learning models.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Accurately predicting chronological age from DNA methylation patterns is crucial for advancing biological age estimation. However, this task is made challenging by Epigenetic Correlation Drift (ECD) and Heterogeneity Among CpGs (HAC), which reflect the dynamic relationship between methylation and age across different life stages. To address these issues, we propose a novel two-phase algorithm. The first phase employs similarity searching to cluster methylation profiles by age group, while the second phase uses Explainable Boosting Machines (EBM) for precise, group-specific prediction. Our method not only improves prediction accuracy but also reveals key age-related CpG sites, detects age-specific changes in aging rates, and identifies pairwise interactions between CpG sites. Experimental results show that our approach outperforms traditional epigenetic clocks and machine learning models, offering a more accurate and interpretable solution for biological age estimation with significant implications for aging research.
Related papers
- Investigating the Impact of Histopathological Foundation Models on Regressive Prediction of Homologous Recombination Deficiency [52.50039435394964]
We systematically evaluate foundation models for regression-based tasks.<n>We extract patch-level features from whole slide images (WSI) using five state-of-the-art foundation models.<n>Models are trained to predict continuous HRD scores based on these extracted features across breast, endometrial, and lung cancer cohorts.
arXiv Detail & Related papers (2026-01-29T14:06:50Z) - Phenome-Wide Multi-Omics Integration Uncovers Distinct Archetypes of Human Aging [28.20331959292183]
We developed and rigorously validated a multi-omics aging clock that robustly predicts diverse health outcomes and future disease risk.<n>Unotype clustering of the integrated molecular profiles from multi-omics uncovered distinct biological subtypes of aging.<n>These findings demonstrate the power of multi-omics integration to decode the molecular landscape of aging and lay the groundwork for personalized healthspan monitoring and precision strategies to prevent age-related diseases.
arXiv Detail & Related papers (2025-10-14T11:00:51Z) - A Machine Learning Approach to Predict Biological Age and its Longitudinal Drivers [22.162067953837653]
We develop a machine learning pipeline to predict age using a longitudinal cohort with data from two distinct time periods.<n>By engineering novel features that explicitly capture the rate of change (slope) of key biomarkers over time, we significantly improved model performance.<n>Our framework paves the way for clinical tools that dynamically track patient health trajectories, enabling early intervention and personalized prevention strategies.
arXiv Detail & Related papers (2025-08-13T12:22:12Z) - U-learning for Prediction Inference via Combinatory Multi-Subsampling: With Applications to LASSO and Neural Networks [5.587500517608073]
Epigenetic aging clocks play a pivotal role in estimating an individual's biological age through the examination of DNA methylation patterns.
We introduce a novel U-sampling approach via multi-sublearning for making ensemble predictions.
More specifically, our approach conceptualizes the ensemble estimators within the framework of generalized U-statistics.
We apply our approach to two commonly used predictive algorithms, Lasso and deep neural networks (DNNs), and illustrate the validity of inferences with extensive numerical studies.
arXiv Detail & Related papers (2024-07-22T00:03:51Z) - Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank [69.90493129893112]
Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals.
Here, we assess whether we can improve disease prediction across diverse ancestries using multiomic data.
arXiv Detail & Related papers (2024-04-26T16:39:50Z) - Longitudinal prediction of DNA methylation to forecast epigenetic
outcomes [2.5936539522838506]
We introduce a probabilistic and longitudinal machine learning framework based on multi-mean Gaussian processes (GPs)
Our model is trained on a birth cohort of children with methylation profiled at ages 0-4, and we demonstrated that the status of methylation sites for each child can be accurately predicted at ages 5-7.
This approach encourages epigenetic studies to move towards longitudinal design for investigating epigenetic changes during development, ageing and disease progression.
arXiv Detail & Related papers (2023-12-19T22:15:27Z) - Using explainable AI to investigate electrocardiogram changes during healthy aging -- from expert features to raw signals [0.8108972030676012]
We employ a deep-learning model and a tree-based model to analyze ECG data from a robust dataset of healthy individuals across varying ages.
Our analysis with tree-based classifiers reveals age-related declines in inferred breathing rates.
These findings shed new light on age-related ECG changes, offering insights that transcend traditional feature-based approaches.
arXiv Detail & Related papers (2023-10-11T13:05:28Z) - T-Phenotype: Discovering Phenotypes of Predictive Temporal Patterns in
Disease Progression [82.85825388788567]
We develop a novel temporal clustering method, T-Phenotype, to discover phenotypes of predictive temporal patterns from labeled time-series data.
We show that T-Phenotype achieves the best phenotype discovery performance over all the evaluated baselines.
arXiv Detail & Related papers (2023-02-24T13:30:35Z) - Benchmarking Machine Learning Robustness in Covid-19 Genome Sequence
Classification [109.81283748940696]
We introduce several ways to perturb SARS-CoV-2 genome sequences to mimic the error profiles of common sequencing platforms such as Illumina and PacBio.
We show that some simulation-based approaches are more robust (and accurate) than others for specific embedding methods to certain adversarial attacks to the input sequences.
arXiv Detail & Related papers (2022-07-18T19:16:56Z) - An Information-Theoretic Framework for Identifying Age-Related Genes
Using Human Dermal Fibroblast Transcriptome Data [0.8122270502556371]
We develop an information-theoretic framework for identifying genes that are associated with aging.
We use unsupervised and semi-supervised learning techniques on human dermal fibroblast gene expression data.
Performance assessment for both unsupervised and semi-supervised methods show the effectiveness of the framework.
arXiv Detail & Related papers (2021-11-04T02:41:33Z) - LAE : Long-tailed Age Estimation [52.5745217752147]
We first formulate a simple standard baseline and build a much strong one by collecting the tricks in pre-training, data augmentation, model architecture, and so on.
Compared with the standard baseline, the proposed one significantly decreases the estimation errors.
We propose a two-stage training method named Long-tailed Age Estimation (LAE), which decouples the learning procedure into representation learning and classification.
arXiv Detail & Related papers (2021-10-25T09:05:44Z) - FP-Age: Leveraging Face Parsing Attention for Facial Age Estimation in
the Wild [50.8865921538953]
We propose a method to explicitly incorporate facial semantics into age estimation.
We design a face parsing-based network to learn semantic information at different scales.
We show that our method consistently outperforms all existing age estimation methods.
arXiv Detail & Related papers (2021-06-21T14:31:32Z) - Integrated Age Estimation Mechanism [14.66142603273126]
The proposed age estimation mechanism achieves a good tradeoff effect of age estimation.
The mechanism is a framework mechanism that can be used to construct different specific age estimation algorithms.
arXiv Detail & Related papers (2021-03-11T09:14:10Z) - STELAR: Spatio-temporal Tensor Factorization with Latent Epidemiological
Regularization [76.57716281104938]
We develop a tensor method to predict the evolution of epidemic trends for many regions simultaneously.
STELAR enables long-term prediction by incorporating latent temporal regularization through a system of discrete-time difference equations.
We conduct experiments using both county- and state-level COVID-19 data and show that our model can identify interesting latent patterns of the epidemic.
arXiv Detail & Related papers (2020-12-08T21:21:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.