Pharmacoprint -- a combination of pharmacophore fingerprint and
artificial intelligence as a tool for computer-aided drug design
- URL: http://arxiv.org/abs/2110.01339v2
- Date: Tue, 31 Oct 2023 09:30:08 GMT
- Title: Pharmacoprint -- a combination of pharmacophore fingerprint and
artificial intelligence as a tool for computer-aided drug design
- Authors: Dawid Warszycki, {\L}ukasz Struski, Marek \'Smieja, Rafa{\l} Kafel,
Rafa{\l} Kurczab
- Abstract summary: We propose a high-resolution, pharmacophore fingerprint called Pharmacoprint.
It encodes the presence, types, and relationships between pharmacophore features of a molecule.
- Score: 6.053347262128918
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Structural fingerprints and pharmacophore modeling are methodologies that
have been used for at least two decades in various fields of cheminformatics:
from similarity searching to machine learning (ML). Advances in silico
techniques consequently led to combining both these methodologies into a new
approach known as pharmacophore fingerprint. Herein, we propose a
high-resolution, pharmacophore fingerprint called Pharmacoprint that encodes
the presence, types, and relationships between pharmacophore features of a
molecule. Pharmacoprint was evaluated in classification experiments by using ML
algorithms (logistic regression, support vector machines, linear support vector
machines, and neural networks) and outperformed other popular molecular
fingerprints (i.e., Estate, MACCS, PubChem, Substructure, Klekotha-Roth, CDK,
Extended, and GraphOnly) and ChemAxon Pharmacophoric Features fingerprint.
Pharmacoprint consisted of 39973 bits; several methods were applied for
dimensionality reduction, and the best algorithm not only reduced the length of
bit string but also improved the efficiency of ML tests. Further optimization
allowed us to define the best parameter settings for using Pharmacoprint in
discrimination tests and for maximizing statistical parameters. Finally,
Pharmacoprint generated for 3D structures with defined hydrogens as input data
was applied to neural networks with a supervised autoencoder for selecting the
most important bits and allowed to maximize Matthews Correlation Coefficient up
to 0.962. The results show the potential of Pharmacoprint as a new, perspective
tool for computer-aided drug design.
Related papers
- Physical formula enhanced multi-task learning for pharmacokinetics prediction [54.13787789006417]
A major challenge for AI-driven drug discovery is the scarcity of high-quality data.
We develop a formula enhanced mul-ti-task learning (PEMAL) method that predicts four key parameters of pharmacokinetics simultaneously.
Our experiments reveal that PEMAL significantly lowers the data demand, compared to typical Graph Neural Networks.
arXiv Detail & Related papers (2024-04-16T07:42:55Z) - ADMET property prediction through combinations of molecular fingerprints [0.0]
Random forests or support vector machines paired with extended-connectivity fingerprints consistently outperformed recently developed methods.
A detailed investigation into regression algorithms and molecular fingerprints revealed gradient-boosted decision trees.
We successfully validated our model across 22 Therapeutics Data Commons ADMET benchmarks.
arXiv Detail & Related papers (2023-09-29T22:39:18Z) - Efficient Prediction of Peptide Self-assembly through Sequential and
Graphical Encoding [57.89530563948755]
This work provides a benchmark analysis of peptide encoding with advanced deep learning models.
It serves as a guide for a wide range of peptide-related predictions such as isoelectric points, hydration free energy, etc.
arXiv Detail & Related papers (2023-07-17T00:43:33Z) - Drug Synergistic Combinations Predictions via Large-Scale Pre-Training
and Graph Structure Learning [82.93806087715507]
Drug combination therapy is a well-established strategy for disease treatment with better effectiveness and less safety degradation.
Deep learning models have emerged as an efficient way to discover synergistic combinations.
Our framework achieves state-of-the-art results in comparison with other deep learning-based methods.
arXiv Detail & Related papers (2023-01-14T15:07:43Z) - Molecular Fingerprints for Robust and Efficient ML-Driven Molecular
Generation [0.0]
We propose a novel molecular fingerprint-based variational autoencoder applied for molecular generation on real-world drug molecules.
We observe a substantial improvement in chemical synthetic accessibility ($DeltabarSAS$ = -0.83) and in computational efficiency up to 5.9x in comparison to an existing state-of-the-art SMILES-based architecture.
arXiv Detail & Related papers (2022-11-16T18:07:43Z) - ToDD: Topological Compound Fingerprinting in Computer-Aided Drug
Discovery [8.620443111346523]
In computer-aided drug discovery (CADD), virtual screening is used for identifying the drug candidates that are most likely to bind to a molecular target in a large library of compounds.
To address this problem, we developed a novel method using multi parameter persistence (MP) homology that produces topological fingerprints of the compounds as multidimensional vectors.
We show that the margin loss fine-tuning of pretrained Triplet networks attains highly competitive results in differentiating between compounds in the embedding space and ranking their likelihood of becoming effective drug candidates.
arXiv Detail & Related papers (2022-11-07T19:00:05Z) - Tailoring Molecules for Protein Pockets: a Transformer-based Generative
Solution for Structured-based Drug Design [133.1268990638971]
De novo drug design based on the structure of a target protein can provide novel drug candidates.
We present a generative solution named TamGent that can directly generate candidate drugs from scratch for a given target.
arXiv Detail & Related papers (2022-08-30T09:32:39Z) - MOOMIN: Deep Molecular Omics Network for Anti-Cancer Drug Combination
Therapy [2.446672595462589]
We propose a multimodal graph neural network that can predict the synergistic effect of drug combinations for cancer treatment.
Our model captures the representation based on the context of drugs at multiple scales based on a drug-protein interaction network and metadata.
We demonstrate that the model makes high-quality predictions over a wide range of cancer cell line tissues.
arXiv Detail & Related papers (2021-10-28T13:10:25Z) - Lung Cancer Lesion Detection in Histopathology Images Using Graph-Based
Sparse PCA Network [93.22587316229954]
We propose a graph-based sparse principal component analysis (GS-PCA) network, for automated detection of cancerous lesions on histological lung slides stained by hematoxylin and eosin (H&E)
We evaluate the performance of the proposed algorithm on H&E slides obtained from an SVM K-rasG12D lung cancer mouse model using precision/recall rates, F-score, Tanimoto coefficient, and area under the curve (AUC) of the receiver operator characteristic (ROC)
arXiv Detail & Related papers (2021-10-27T19:28:36Z) - Predicting Potential Drug Targets Using Tensor Factorisation and
Knowledge Graph Embeddings [4.415977307120617]
We have developed a new tensor factorisation model to predict potential drug targets (i.e.,genes or proteins) for diseases.
We enriched the data with gene representations learned from a drug discovery-oriented knowledge graph and applied our proposed method to predict the clinical outcomes for unseen target and dis-ease pairs.
arXiv Detail & Related papers (2021-05-20T16:19:00Z) - A Systematic Approach to Featurization for Cancer Drug Sensitivity
Predictions with Deep Learning [49.86828302591469]
We train >35,000 neural network models, sweeping over common featurization techniques.
We found the RNA-seq to be highly redundant and informative even with subsets larger than 128 features.
arXiv Detail & Related papers (2020-04-30T20:42:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.