Deep learning forward and reverse primer design to detect SARS-CoV-2
emerging variants
- URL: http://arxiv.org/abs/2209.13591v1
- Date: Sun, 25 Sep 2022 20:09:22 GMT
- Title: Deep learning forward and reverse primer design to detect SARS-CoV-2
emerging variants
- Authors: Hanyu Wang and Emmanuel K. Tsinda and Anthony J. Dunn and Francis
Chikweto and Nusreen Ahmed and Emanuela Pelosi and Alain B. Zemkoho
- Abstract summary: Surges in the number of COVID-19 cases are associated with the emergence of SARS-CoV-2 variants.
We develop a semi-automated method to design both forward and reverse primer sets to detect SARS-CoV-2 variants.
- Score: 1.92488183401128
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Surges that have been observed at different periods in the number of COVID-19
cases are associated with the emergence of multiple SARS-CoV-2 (Severe Acute
Respiratory Virus) variants. The design of methods to support laboratory
detection are crucial in the monitoring of these variants. Hence, in this
paper, we develop a semi-automated method to design both forward and reverse
primer sets to detect SARS-CoV-2 variants. To proceed, we train deep
Convolution Neural Networks (CNNs) to classify labelled SARS-CoV-2 variants and
identify partial genomic features needed for the forward and reverse Polymerase
Chain Reaction (PCR) primer design. Our proposed approach supplements existing
ones while promoting the emerging concept of neural network assisted primer
design for PCR. Our CNN model was trained using a database of SARS-CoV-2
full-length genomes from GISAID and tested on a separate dataset from NCBI,
with 98\% accuracy for the classification of variants. This result is based on
the development of three different methods of feature extraction, and the
selected primer sequences for each SARS-CoV-2 variant detection (except
Omicron) were present in more than 95 \% of sequences in an independent set of
5000 same variant sequences, and below 5 \% in other independent datasets with
5000 sequences of each variant. In total, we obtain 22 forward and reverse
primer pairs with flexible length sizes (18-25 base pairs) with an expected
amplicon length ranging between 42 and 3322 nucleotides. Besides the feature
appearance, in-silico primer checks confirmed that the identified primer pairs
are suitable for accurate SARS-CoV-2 variant detection by means of PCR tests.
Related papers
- Primer C-VAE: An interpretable deep learning primer design method to detect emerging virus variants [0.5821597945324924]
We developed Primer C-VAE, a model based on a Variational Auto-Encoder framework with Convolutional Neural Networks.
Using SARS-CoV-2, our model classified variants with 98% accuracy and generated variant-specific primers.
The model also generated effective primers for organisms with longer gene sequences like E. coli and Shigella flexneri.
arXiv Detail & Related papers (2025-03-03T12:17:19Z) - Hard-Attention Gates with Gradient Routing for Endoscopic Image Computing [3.146247125118741]
We introduce FeatureSelection Gates (FSG) or Hard-Attention Gates (HAG) for dynamic feature selection.
This technique aims to boost Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) by promoting sparse connectivity.
We show that our HAG-enhanced networks substantially enhance performance in both binary and triclass classification tasks related to polyp sizing.
arXiv Detail & Related papers (2024-07-05T10:20:24Z) - Benchmarking Machine Learning Robustness in Covid-19 Genome Sequence
Classification [109.81283748940696]
We introduce several ways to perturb SARS-CoV-2 genome sequences to mimic the error profiles of common sequencing platforms such as Illumina and PacBio.
We show that some simulation-based approaches are more robust (and accurate) than others for specific embedding methods to certain adversarial attacks to the input sequences.
arXiv Detail & Related papers (2022-07-18T19:16:56Z) - Low-complexity deep learning frameworks for acoustic scene
classification [64.22762153453175]
We present low-complexity deep learning frameworks for acoustic scene classification (ASC)
The proposed frameworks can be separated into four main steps: Front-end spectrogram extraction, online data augmentation, back-end classification, and late fusion of predicted probabilities.
Our experiments conducted on DCASE 2022 Task 1 Development dataset have fullfiled the requirement of low-complexity and achieved the best classification accuracy of 60.1%.
arXiv Detail & Related papers (2022-06-13T11:41:39Z) - Using Deep Learning Sequence Models to Identify SARS-CoV-2 Divergence [1.9573380763700707]
SARS-CoV-2 is an upper respiratory system RNA virus that has caused over 3 million deaths and infecting over 150 million worldwide as of May 2021.
We propose a neural network model that leverages recurrent and convolutional units to take in amino acid sequences of spike proteins and classify corresponding clades.
arXiv Detail & Related papers (2021-11-12T07:52:11Z) - Robust Representation and Efficient Feature Selection Allows for
Effective Clustering of SARS-CoV-2 Variants [0.0]
The SARS-CoV-2 virus contains different variants, each of them having different mutations.
Much of the variation in the SARS-CoV-2 genome happens disproportionately in the spike region of the genome sequence.
We propose an approach to cluster spike protein sequences in order to study the behavior of different known variants.
arXiv Detail & Related papers (2021-10-18T21:18:52Z) - Effective and scalable clustering of SARS-CoV-2 sequences [0.41998444721319206]
SARS-CoV-2 continues to mutate as it spreads, according to an evolutionary process.
The number of currently available sequences of SARS-CoV-2 in public databases such as GISAID is already several million.
We propose an approach based on clustering sequences to identify the current major SARS-CoV-2 variants.
arXiv Detail & Related papers (2021-08-18T13:32:43Z) - A k-mer Based Approach for SARS-CoV-2 Variant Identification [55.78588835407174]
We show that preserving the order of the amino acids helps the underlying classifiers to achieve better performance.
We also show the importance of the different amino acids which play a key role in identifying variants and how they coincide with those reported by the USA's Centers for Disease Control and Prevention (CDC)
arXiv Detail & Related papers (2021-08-07T15:08:15Z) - End-2-End COVID-19 Detection from Breath & Cough Audio [68.41471917650571]
We demonstrate the first attempt to diagnose COVID-19 using end-to-end deep learning from a crowd-sourced dataset of audio samples.
We introduce a novel modelling strategy using a custom deep neural network to diagnose COVID-19 from a joint breath and cough representation.
arXiv Detail & Related papers (2021-01-07T01:13:00Z) - CovidDeep: SARS-CoV-2/COVID-19 Test Based on Wearable Medical Sensors
and Efficient Neural Networks [51.589769497681175]
The novel coronavirus (SARS-CoV-2) has led to a pandemic.
The current testing regime based on Reverse Transcription-Polymerase Chain Reaction for SARS-CoV-2 has been unable to keep up with testing demands.
We propose a framework called CovidDeep that combines efficient DNNs with commercially available WMSs for pervasive testing of the virus.
arXiv Detail & Related papers (2020-07-20T21:47:28Z) - COVID-Net S: Towards computer-aided severity assessment via training and
validation of deep neural networks for geographic extent and opacity extent
scoring of chest X-rays for SARS-CoV-2 lung disease severity [58.23203766439791]
Chest x-rays (CXRs) are often used to assess SARS-CoV-2 severity.
In this study, we assess the feasibility of computer-aided scoring of CXRs of SARS-CoV-2 lung disease severity using a deep learning system.
arXiv Detail & Related papers (2020-05-26T16:33:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.