Emergenet: A Digital Twin of Sequence Evolution for Scalable Emergence Risk Assessment of Animal Influenza A Strains
- URL: http://arxiv.org/abs/2411.17154v1
- Date: Tue, 26 Nov 2024 06:50:18 GMT
- Title: Emergenet: A Digital Twin of Sequence Evolution for Scalable Emergence Risk Assessment of Animal Influenza A Strains
- Authors: Kevin Yuanbo Wu, Jin Li, Aaron Esser-Kahn, Ishanu Chattopadhyay,
- Abstract summary: This study introduces Emergenet, a tool to infer a digital twin of sequence evolution to chart how new variants might emerge in the wild.
Our predictions based on Emergenets built only using 220,151 Hemagglutinnin (HA) sequences consistently outperform WHO seasonal vaccine recommendations.
- Score: 3.6744384899193405
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Despite having triggered devastating pandemics in the past, our ability to quantitatively assess the emergence potential of individual strains of animal influenza viruses remains limited. This study introduces Emergenet, a tool to infer a digital twin of sequence evolution to chart how new variants might emerge in the wild. Our predictions based on Emergenets built only using 220,151 Hemagglutinnin (HA) sequences consistently outperform WHO seasonal vaccine recommendations for H1N1/H3N2 subtypes over two decades (average match-improvement: 3.73 AAs, 28.40\%), and are at par with state-of-the-art approaches that use more detailed phenotypic annotations. Finally, our generative models are used to scalably calculate the current odds of emergence of animal strains not yet in human circulation, which strongly correlates with CDC's expert-assessed Influenza Risk Assessment Tool (IRAT) scores (Pearson's $r = 0.721, p = 10^{-4}$). A minimum five orders of magnitude speedup over CDC's assessment (seconds vs months) then enabled us to analyze 6,354 animal strains collected post-2020 to identify 35 strains with high emergence scores ($> 7.7$). The Emergenet framework opens the door to preemptive pandemic mitigation through targeted inoculation of animal hosts before the first human infection.
Related papers
- PETRA: Pretrained Evolutionary Transformer for SARS-CoV-2 Mutation Prediction [6.649916450582501]
SARS-CoV-2 has demonstrated a rapid and unpredictable evolutionary trajectory.<n>This poses persistent challenges to public health and vaccine development.<n>We introduce PETRA, a novel transformer approach based on evolutionary trajectories derived from phylogenetic trees.
arXiv Detail & Related papers (2025-11-06T01:58:23Z) - Predictive Modeling and Explainable AI for Veterinary Safety Profiles, Residue Assessment, and Health Outcomes Using Real-World Data and Physicochemical Properties [4.53318808068234]
Adverse events (AEs) may signal unexpected or toxicokinetic effects, increasing the risk of violative residues in the food chain.<n>This study introduces a predictive framework for classifying outcomes (Death vs. Recovery) using 1.28 million reports from the U.S. FDA's OpenFDA Center for Veterinary Medicine.
arXiv Detail & Related papers (2025-10-01T23:34:46Z) - Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank [69.90493129893112]
Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals.
Here, we assess whether we can improve disease prediction across diverse ancestries using multiomic data.
arXiv Detail & Related papers (2024-04-26T16:39:50Z) - Forecast reconciliation for vaccine supply chain optimization [61.13962963550403]
Vaccine supply chain optimization can benefit from hierarchical time series forecasting.
Forecasts of different hierarchy levels become incoherent when higher levels do not match the sum of the lower levels forecasts.
We tackle the vaccine sale forecasting problem by modeling sales data from GSK between 2010 and 2021 as a hierarchical time series.
arXiv Detail & Related papers (2023-05-02T14:34:34Z) - Hospital transfer risk prediction for COVID-19 patients from a
medicalized hotel based on Diffusion GraphSAGE [7.021489981474361]
Medicalized hotels were established in Taiwan as quarantine facilities for COVID-19 patients with no or mild symptoms.
Due to limited medical care available at these hotels, it is of paramount importance to identify patients at risk of clinical deterioration.
This study aimed to develop and evaluate a graph-based deep learning approach for progressive hospital transfer risk prediction in a medicalized hotel setting.
arXiv Detail & Related papers (2022-12-31T14:59:35Z) - Scalable Pathogen Detection from Next Generation DNA Sequencing with
Deep Learning [3.8175773487333857]
We propose MG2Vec, a deep learning-based solution that uses the transformer network as its backbone.
We show that the proposed approach can help detect pathogens from uncurated, real-world clinical samples.
We provide a comprehensive evaluation of a novel representation learning framework for metagenome-based disease diagnostics with deep learning.
arXiv Detail & Related papers (2022-11-30T00:13:59Z) - Using Deep Learning Sequence Models to Identify SARS-CoV-2 Divergence [1.9573380763700707]
SARS-CoV-2 is an upper respiratory system RNA virus that has caused over 3 million deaths and infecting over 150 million worldwide as of May 2021.
We propose a neural network model that leverages recurrent and convolutional units to take in amino acid sequences of spike proteins and classify corresponding clades.
arXiv Detail & Related papers (2021-11-12T07:52:11Z) - PhyloTransformer: A Discriminative Model for Mutation Prediction Based
on a Multi-head Self-attention Mechanism [10.468453827172477]
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has caused an ongoing pandemic infecting 219 million people as of 10/19/21, with a 3.6% mortality rate.
Here we developed PhyloTransformer, a Transformer-based discriminative model that engages a multi-head self-attention mechanism to model genetic mutations that may lead to viral reproductive advantage.
arXiv Detail & Related papers (2021-11-03T01:30:57Z) - Modeling the geospatial evolution of COVID-19 using spatio-temporal
convolutional sequence-to-sequence neural networks [48.7576911714538]
Portugal was the country in the world with the largest incidence rate, with 14-days incidence rates per 100,000 inhabitants in excess of 1000.
Despite its importance, accurate prediction of the geospatial evolution of COVID-19 remains a challenge.
arXiv Detail & Related papers (2021-05-06T15:24:00Z) - STELAR: Spatio-temporal Tensor Factorization with Latent Epidemiological
Regularization [76.57716281104938]
We develop a tensor method to predict the evolution of epidemic trends for many regions simultaneously.
STELAR enables long-term prediction by incorporating latent temporal regularization through a system of discrete-time difference equations.
We conduct experiments using both county- and state-level COVID-19 data and show that our model can identify interesting latent patterns of the epidemic.
arXiv Detail & Related papers (2020-12-08T21:21:47Z) - CovidDeep: SARS-CoV-2/COVID-19 Test Based on Wearable Medical Sensors
and Efficient Neural Networks [51.589769497681175]
The novel coronavirus (SARS-CoV-2) has led to a pandemic.
The current testing regime based on Reverse Transcription-Polymerase Chain Reaction for SARS-CoV-2 has been unable to keep up with testing demands.
We propose a framework called CovidDeep that combines efficient DNNs with commercially available WMSs for pervasive testing of the virus.
arXiv Detail & Related papers (2020-07-20T21:47:28Z) - A Deep Q-learning/genetic Algorithms Based Novel Methodology For
Optimizing Covid-19 Pandemic Government Actions [63.669642197519934]
We use the SEIR epidemiological model to represent the evolution of the virus COVID-19 over time in the population.
The sequences of actions (confinement, self-isolation, two-meter distance or not taking restrictions) are evaluated according to a reward system.
We prove that our methodology is a valid tool to discover actions governments can take to reduce the negative effects of a pandemic in both senses.
arXiv Detail & Related papers (2020-05-15T17:17:45Z) - Coronavirus Optimization Algorithm: A bioinspired metaheuristic based on
the COVID-19 propagation model [0.0]
A novel bioinspired metaheuristic is proposed in this work, simulating how the coronavirus spreads and infects healthy people.
The input parameters are already set according to the disease statistics, preventing researchers from initializing them with arbitrary values.
A parallel multi-virus version is proposed in which several coronavirus strains evolve over time and explore wider search space areas in less iterations.
arXiv Detail & Related papers (2020-03-30T17:10:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.