iTARGET: Interpretable Tailored Age Regression for Grouped Epigenetic Traits
- URL: http://arxiv.org/abs/2501.02401v1
- Date: Sat, 04 Jan 2025 23:06:46 GMT
- Title: iTARGET: Interpretable Tailored Age Regression for Grouped Epigenetic Traits
- Authors: Zipeng Wu, Daniel Herring, Fabian Spill, James Andrews,
- Abstract summary: We propose a novel two-phase algorithm to accurately predict chronological age from DNA methylation patterns.
Our method not only improves prediction accuracy but also reveals key age-related CpG sites, detects age-specific changes in aging rates, and identifies pairwise interactions between CpG sites.
Experimental results show that our approach outperforms traditional epigenetic clocks and machine learning models.
- Score: 0.0
- License:
- Abstract: Accurately predicting chronological age from DNA methylation patterns is crucial for advancing biological age estimation. However, this task is made challenging by Epigenetic Correlation Drift (ECD) and Heterogeneity Among CpGs (HAC), which reflect the dynamic relationship between methylation and age across different life stages. To address these issues, we propose a novel two-phase algorithm. The first phase employs similarity searching to cluster methylation profiles by age group, while the second phase uses Explainable Boosting Machines (EBM) for precise, group-specific prediction. Our method not only improves prediction accuracy but also reveals key age-related CpG sites, detects age-specific changes in aging rates, and identifies pairwise interactions between CpG sites. Experimental results show that our approach outperforms traditional epigenetic clocks and machine learning models, offering a more accurate and interpretable solution for biological age estimation with significant implications for aging research.
Related papers
- U-learning for Prediction Inference via Combinatory Multi-Subsampling: With Applications to LASSO and Neural Networks [5.587500517608073]
Epigenetic aging clocks play a pivotal role in estimating an individual's biological age through the examination of DNA methylation patterns.
We introduce a novel U-sampling approach via multi-sublearning for making ensemble predictions.
More specifically, our approach conceptualizes the ensemble estimators within the framework of generalized U-statistics.
We apply our approach to two commonly used predictive algorithms, Lasso and deep neural networks (DNNs), and illustrate the validity of inferences with extensive numerical studies.
arXiv Detail & Related papers (2024-07-22T00:03:51Z) - Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank [69.90493129893112]
Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals.
Here, we assess whether we can improve disease prediction across diverse ancestries using multiomic data.
arXiv Detail & Related papers (2024-04-26T16:39:50Z) - Longitudinal prediction of DNA methylation to forecast epigenetic
outcomes [2.5936539522838506]
We introduce a probabilistic and longitudinal machine learning framework based on multi-mean Gaussian processes (GPs)
Our model is trained on a birth cohort of children with methylation profiled at ages 0-4, and we demonstrated that the status of methylation sites for each child can be accurately predicted at ages 5-7.
This approach encourages epigenetic studies to move towards longitudinal design for investigating epigenetic changes during development, ageing and disease progression.
arXiv Detail & Related papers (2023-12-19T22:15:27Z) - Using explainable AI to investigate electrocardiogram changes during healthy aging -- from expert features to raw signals [0.8108972030676012]
We employ a deep-learning model and a tree-based model to analyze ECG data from a robust dataset of healthy individuals across varying ages.
Our analysis with tree-based classifiers reveals age-related declines in inferred breathing rates.
These findings shed new light on age-related ECG changes, offering insights that transcend traditional feature-based approaches.
arXiv Detail & Related papers (2023-10-11T13:05:28Z) - T-Phenotype: Discovering Phenotypes of Predictive Temporal Patterns in
Disease Progression [82.85825388788567]
We develop a novel temporal clustering method, T-Phenotype, to discover phenotypes of predictive temporal patterns from labeled time-series data.
We show that T-Phenotype achieves the best phenotype discovery performance over all the evaluated baselines.
arXiv Detail & Related papers (2023-02-24T13:30:35Z) - Benchmarking Machine Learning Robustness in Covid-19 Genome Sequence
Classification [109.81283748940696]
We introduce several ways to perturb SARS-CoV-2 genome sequences to mimic the error profiles of common sequencing platforms such as Illumina and PacBio.
We show that some simulation-based approaches are more robust (and accurate) than others for specific embedding methods to certain adversarial attacks to the input sequences.
arXiv Detail & Related papers (2022-07-18T19:16:56Z) - An Information-Theoretic Framework for Identifying Age-Related Genes
Using Human Dermal Fibroblast Transcriptome Data [0.8122270502556371]
We develop an information-theoretic framework for identifying genes that are associated with aging.
We use unsupervised and semi-supervised learning techniques on human dermal fibroblast gene expression data.
Performance assessment for both unsupervised and semi-supervised methods show the effectiveness of the framework.
arXiv Detail & Related papers (2021-11-04T02:41:33Z) - LAE : Long-tailed Age Estimation [52.5745217752147]
We first formulate a simple standard baseline and build a much strong one by collecting the tricks in pre-training, data augmentation, model architecture, and so on.
Compared with the standard baseline, the proposed one significantly decreases the estimation errors.
We propose a two-stage training method named Long-tailed Age Estimation (LAE), which decouples the learning procedure into representation learning and classification.
arXiv Detail & Related papers (2021-10-25T09:05:44Z) - FP-Age: Leveraging Face Parsing Attention for Facial Age Estimation in
the Wild [50.8865921538953]
We propose a method to explicitly incorporate facial semantics into age estimation.
We design a face parsing-based network to learn semantic information at different scales.
We show that our method consistently outperforms all existing age estimation methods.
arXiv Detail & Related papers (2021-06-21T14:31:32Z) - Integrated Age Estimation Mechanism [14.66142603273126]
The proposed age estimation mechanism achieves a good tradeoff effect of age estimation.
The mechanism is a framework mechanism that can be used to construct different specific age estimation algorithms.
arXiv Detail & Related papers (2021-03-11T09:14:10Z) - STELAR: Spatio-temporal Tensor Factorization with Latent Epidemiological
Regularization [76.57716281104938]
We develop a tensor method to predict the evolution of epidemic trends for many regions simultaneously.
STELAR enables long-term prediction by incorporating latent temporal regularization through a system of discrete-time difference equations.
We conduct experiments using both county- and state-level COVID-19 data and show that our model can identify interesting latent patterns of the epidemic.
arXiv Detail & Related papers (2020-12-08T21:21:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.