Guide-Guard: Off-Target Predicting in CRISPR Applications
- URL: http://arxiv.org/abs/2602.16327v1
- Date: Wed, 18 Feb 2026 10:06:54 GMT
- Title: Guide-Guard: Off-Target Predicting in CRISPR Applications
- Authors: Joseph Bingham, Netanel Arussy, Saman Zonouz,
- Abstract summary: We explore the underlying biological and chemical model from a data driven perspective.<n>We present a machine learning based solution named textitGuide-Guard to predict the behavior of the system given a gRNA in the CRISPR gene-editing process with 84% accuracy.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: With the introduction of cyber-physical genome sequencing and editing technologies, such as CRISPR, researchers can more easily access tools to investigate and create remedies for a variety of topics in genetics and health science (e.g. agriculture and medicine). As the field advances and grows, new concerns present themselves in the ability to predict the off-target behavior. In this work, we explore the underlying biological and chemical model from a data driven perspective. Additionally, we present a machine learning based solution named \textit{Guide-Guard} to predict the behavior of the system given a gRNA in the CRISPR gene-editing process with 84\% accuracy. This solution is able to be trained on multiple different genes at the same time while retaining accuracy.
Related papers
- Machine Learning Methods for Gene Regulatory Network Inference [1.3654846342364308]
Gene Regulatory Networks (GRNs) control gene expression and regulation in response to environmental and developmental cues.<n>Advances in computational biology, coupled with high throughput sequencing technologies, have significantly improved the accuracy of GRN inference.<n>Modern approaches increasingly leverage artificial intelligence (AI) to analyze large scale omics data and uncover regulatory gene interactions.
arXiv Detail & Related papers (2025-04-17T03:19:49Z) - Artificial Intelligence and Deep Learning Algorithms for Epigenetic Sequence Analysis: A Review for Epigeneticists and AI Experts [2.5367788916245964]
Epigenetics encompasses mechanisms that can alter the expression of genes without changing the underlying genetic sequence.<n>The changes in gene regulation and expression can manifest in the form of various diseases and disorders such as cancer and congenital deformities.<n>Machine learning and artificial intelligence (AI) approaches have been extensively used for mapping epigenetic modifications to their phenotypic manifestations.
arXiv Detail & Related papers (2025-04-01T01:02:34Z) - GENERator: A Long-Context Generative Genomic Foundation Model [66.46537421135996]
We present GENERator, a generative genomic foundation model featuring a context length of 98k base pairs (bp) and 1.2B parameters.<n>Trained on an expansive dataset comprising 386B bp of DNA, the GENERator demonstrates state-of-the-art performance across both established and newly proposed benchmarks.<n>It also shows significant promise in sequence optimization, particularly through the prompt-responsive generation of enhancer sequences with specific activity profiles.
arXiv Detail & Related papers (2025-02-11T05:39:49Z) - BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation Experiments [112.25067497985447]
We introduce BioDiscoveryAgent, an agent that designs new experiments, reasons about their outcomes, and efficiently navigates the hypothesis space to reach desired solutions.<n>BioDiscoveryAgent can uniquely design new experiments without the need to train a machine learning model.<n>It achieves an average of 21% improvement in predicting relevant genetic perturbations across six datasets.
arXiv Detail & Related papers (2024-05-27T19:57:17Z) - CRISPR-GPT for Agentic Automation of Gene-editing Experiments [57.10950429181712]
Large Language Models (LLMs) have shown promise in various tasks, but they often lack specific knowledge and struggle to accurately solve biological design problems.<n>In this work, we introduce CRISPR-GPT, an LLM agent augmented with domain knowledge and external tools to automate and enhance the design process of CRISPR-based gene-editing experiments.<n>We showcase the potential of CRISPR-GPT for assisting non-expert researchers with gene-editing experiments from scratch and validate the agent's effectiveness in a real-world use case.
arXiv Detail & Related papers (2024-04-27T22:59:17Z) - CRISPR: Ensemble Model [20.268713698436326]
We propose a novel ensemble learning method for sgRNA design that is accurate and generalizable.
Our results suggest that our method can be used to design sgRNAs with high sensitivity and specificity, even for new genes or cells.
arXiv Detail & Related papers (2024-03-05T14:55:14Z) - Advancing Gene Selection in Oncology: A Fusion of Deep Learning and
Sparsity for Precision Gene Selection [4.093503153499691]
This paper introduces two gene selection strategies for deep learning-based survival prediction models.
The first strategy uses a sparsity-inducing method while the second one uses importance based gene selection for identifying relevant genes.
arXiv Detail & Related papers (2024-03-04T10:44:57Z) - Causal machine learning for single-cell genomics [94.28105176231739]
We discuss the application of machine learning techniques to single-cell genomics and their challenges.
We first present the model that underlies most of current causal approaches to single-cell biology.
We then identify open problems in the application of causal approaches to single-cell data.
arXiv Detail & Related papers (2023-10-23T13:35:24Z) - Deep metric learning improves lab of origin prediction of genetically
engineered plasmids [63.05016513788047]
Genetic engineering attribution (GEA) is the ability to make sequence-lab associations.
We propose a method, based on metric learning, that ranks the most likely labs-of-origin.
We are able to extract key signatures in plasmid sequences for particular labs, allowing for an interpretable examination of the model's outputs.
arXiv Detail & Related papers (2021-11-24T16:29:03Z) - Machine Learning Applications for Therapeutic Tasks with Genomics Data [49.98249191161107]
We review the literature on machine learning applications for genomics through the lens of therapeutic development.
We identify twenty-two machine learning in genomics applications across the entire therapeutics pipeline.
We pinpoint seven important challenges in this field with opportunities for expansion and impact.
arXiv Detail & Related papers (2021-05-03T21:20:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.