Uncovering the Mechanism of Hepatotoxiciy of PFAS Targeting L-FABP Using GCN and Computational Modeling
- URL: http://arxiv.org/abs/2409.10370v1
- Date: Mon, 16 Sep 2024 15:13:39 GMT
- Title: Uncovering the Mechanism of Hepatotoxiciy of PFAS Targeting L-FABP Using GCN and Computational Modeling
- Authors: Lucas Jividen, Tibo Duran, Xi-Zhi Niu, Jun Bai,
- Abstract summary: Per- and polyfluoroalkyl substances (PFAS) are persistent environmental pollutants with known toxicity and bioaccumulation issues.
This study advances the predictive modeling of PFAS toxicity by combining semi-supervised graph convolutional networks (GCNs) with molecular descriptors and fingerprints.
- Score: 1.6249398255272316
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Per- and polyfluoroalkyl substances (PFAS) are persistent environmental pollutants with known toxicity and bioaccumulation issues. Their widespread industrial use and resistance to degradation have led to global environmental contamination and significant health concerns. While a minority of PFAS have been extensively studied, the toxicity of many PFAS remains poorly understood due to limited direct toxicological data. This study advances the predictive modeling of PFAS toxicity by combining semi-supervised graph convolutional networks (GCNs) with molecular descriptors and fingerprints. We propose a novel approach to enhance the prediction of PFAS binding affinities by isolating molecular fingerprints to construct graphs where then descriptors are set as the node features. This approach specifically captures the structural, physicochemical, and topological features of PFAS without overfitting due to an abundance of features. Unsupervised clustering then identifies representative compounds for detailed binding studies. Our results provide a more accurate ability to estimate PFAS hepatotoxicity to provide guidance in chemical discovery of new PFAS and the development of new safety regulations.
Related papers
- Unveiling Covert Toxicity in Multimodal Data via Toxicity Association Graphs: A Graph-Based Metric and Interpretable Detection Framework [58.01529356381494]
We propose a novel detection framework based on Toxicity Association Graphs (TAGs)<n>We introduce the first quantifiable metric for hidden toxicity, the Multimodal Toxicity Covertness (MTC)<n>Our approach enables precise identification of covert toxicity while preserving full interpretability of the decision-making process.
arXiv Detail & Related papers (2026-02-03T08:54:25Z) - Rapid Machine Learning-Driven Detection of Pesticides and Dyes Using Raman Spectroscopy [0.5002873541686897]
pesticides and synthetic dyes pose critical threats to food safety, human health, and environmental sustainability.<n>Raman spectroscopy offers molecularly specific fingerprints but suffers from spectral noise, fluorescence background, and band overlap.<n>Here, we propose a deep learning framework based on ResNet-18 feature extraction to detect pesticides and dyes from Raman spectroscopy, called MLRaman.
arXiv Detail & Related papers (2025-11-15T11:35:55Z) - Knowledge-Augmented Long-CoT Generation for Complex Biomolecular Reasoning [51.673503054645415]
Biomolecular mechanisms require multi-step reasoning across molecular interactions, signaling cascades, and metabolic pathways.<n>Existing approaches often exacerbate these issues: reasoning steps may deviate from biological facts or fail to capture long mechanistic dependencies.<n>We propose a Knowledge-Augmented Long-CoT Reasoning framework that integrates LLMs with knowledge graph-based multi-hop reasoning chains.
arXiv Detail & Related papers (2025-11-11T09:26:32Z) - Rethinking Toxicity Evaluation in Large Language Models: A Multi-Label Perspective [104.09817371557476]
Large language models (LLMs) have achieved impressive results across a range of natural language processing tasks.<n>Their potential to generate harmful content has raised serious safety concerns.<n>We introduce three novel multi-label benchmarks for toxicity detection.
arXiv Detail & Related papers (2025-10-16T06:50:33Z) - PGCLODA: Prompt-Guided Graph Contrastive Learning for Oligopeptide-Infectious Disease Association Prediction [11.867921399312701]
Infectious diseases continue to pose a serious threat to public health, underscoring the urgent need for effective computational approaches.<n>This study introduces a prompt-guided graph-based contrast learning framework (PGODA) to uncover potential associations.<n>Case studies further validate the ability of PGODA and its potential to uncover novel, biologically relevant associations.
arXiv Detail & Related papers (2025-09-24T16:25:13Z) - Geometric Multi-color Message Passing Graph Neural Networks for Blood-brain Barrier Permeability Prediction [1.488392495573075]
This paper introduces the geometric multi-color message-passing graph neural network (GMC-MPNN)<n>Our model constructs weighted colored subgraphs based on atom types to capture the spatial relationships and chemical context that govern blood-brain barrier permeability.
arXiv Detail & Related papers (2025-07-25T03:38:46Z) - Conditional Chemical Language Models are Versatile Tools in Drug Discovery [0.0]
We present SAFE-T, a chemical modeling framework that conditions on biological context to prioritize molecules.<n>It supports principled scoring of molecules across tasks such as virtual screening, drug-target interaction prediction, and activity cliff detection.<n>It consistently achieves performance comparable to or better than existing approaches.
arXiv Detail & Related papers (2025-07-14T13:42:39Z) - Structure-Aware Compound-Protein Affinity Prediction via Graph Neural Network with Group Lasso Regularization [11.595051456139021]
We build end-to-end explainable machine learning models for structure-activity relationship (SAR) modeling for compound property prediction.<n>We implement graph neural network (GNN) methods to obtain atom-level feature information and predict compound-protein affinity.<n>We also utilize group lasso and sparse group lasso to prune and highlight molecular subgraphs and enhance the structure-specific model explainability.
arXiv Detail & Related papers (2025-07-04T06:12:18Z) - Breaking Bad Molecules: Are MLLMs Ready for Structure-Level Molecular Detoxification? [19.700175505235876]
ToxiMol is the first benchmark task for general-purpose Multimodal Large Language Models (MLLMs) focused on molecular toxicity repair.<n>We construct a standardized dataset covering 11 primary tasks and 560 representative toxic molecules spanning diverse mechanisms and granularities.
arXiv Detail & Related papers (2025-06-12T17:25:53Z) - GloSS over Toxicity: Understanding and Mitigating Toxicity in LLMs via Global Toxic Subspace [62.68664365246247]
This paper investigates the underlying mechanisms of toxicity generation in Large Language Models (LLMs)<n>We propose GloSS (Global Toxic Subspace Suppression), a lightweight, four-stage method that mitigates toxicity by identifying and removing the global toxic subspace from the parameters of FFN.
arXiv Detail & Related papers (2025-05-20T08:29:11Z) - AI-guided Antibiotic Discovery Pipeline from Target Selection to Compound Identification [0.0]
Recent advances in protein structure prediction and machine learning offer a promising opportunity to accelerate drug discovery.
We develop an end-to-end, guided realization realization of 3D-structure-aware generative models.
This work provides a comparative benchmark and blueprint for deploying artificial intelligence in early-stage antibiotic development.
arXiv Detail & Related papers (2025-04-15T11:36:27Z) - Addressing Model Overcomplexity in Drug-Drug Interaction Prediction With Molecular Fingerprints [0.0]
Accurately predicting drug-drug interactions (DDIs) is crucial for pharmaceutical research and clinical safety.
Recent deep learning models often suffer from high computational costs and limited generalization across datasets.
In this study, we investigate a simpler yet effective approach using molecular representations such as Morgan fingerprints (S), graph-based embeddings from graph convolutional networks (GCNs), and transformer-derived embeddings from MoLFormer integrated into a straightforward neural network.
arXiv Detail & Related papers (2025-03-30T18:27:01Z) - Pretraining Generative Flow Networks with Inexpensive Rewards for Molecular Graph Generation [6.495442425890008]
Generative Flow Networks (GFlowNets) have recently emerged as a suitable framework for generating diverse and high-quality molecular structures.
In this work, we introduce Atomic GFlowNets (A-GFNs), a foundational generative model leveraging individual atoms as building blocks.
We propose an unsupervised pre-training approach using drug-like molecule datasets, which teaches A-GFNs about inexpensive yet informative molecular descriptors.
arXiv Detail & Related papers (2025-03-08T20:41:07Z) - Dumpling GNN: Hybrid GNN Enables Better ADC Payload Activity Prediction Based on Chemical Structure [53.76752789814785]
DumplingGNN is a hybrid Graph Neural Network architecture specifically designed for predicting ADC payload activity based on chemical structure.
We evaluate it on a comprehensive ADC payload dataset focusing on DNA Topoisomerase I inhibitors.
It demonstrates exceptional accuracy (91.48%), sensitivity (95.08%), and specificity (97.54%) on our specialized ADC payload dataset.
arXiv Detail & Related papers (2024-09-23T17:11:04Z) - GFlowNet Pretraining with Inexpensive Rewards [2.924067540644439]
We introduce Atomic GFlowNets (A-GFNs), a foundational generative model leveraging individual atoms as building blocks to explore drug-like chemical space more comprehensively.
We propose an unsupervised pre-training approach using offline drug-like molecule datasets, which conditions A-GFNs on inexpensive yet informative molecular descriptors.
We further our method by implementing a goal-conditioned fine-tuning process, which adapts A-GFNs to optimize for specific target properties.
arXiv Detail & Related papers (2024-09-15T11:42:17Z) - Unlocking Potential Binders: Multimodal Pretraining DEL-Fusion for Denoising DNA-Encoded Libraries [51.72836644350993]
Multimodal Pretraining DEL-Fusion model (MPDF)
We develop pretraining tasks applying contrastive objectives between different compound representations and their text descriptions.
We propose a novel DEL-fusion framework that amalgamates compound information at the atomic, submolecular, and molecular levels.
arXiv Detail & Related papers (2024-09-07T17:32:21Z) - A Gaussian Process Model for Ordinal Data with Applications to Chemoinformatics [0.0]
We present conditional Gaussian process models to predict ordinal outcomes from chemical experiments.
A novel aspect of our model is that the kernel contains a scaling parameter, that controls the strength of the correlation between elements of the chemical space.
We present a genetic algorithm for the facilitation of chemical discovery and identification of important features to the compound's efficacy.
arXiv Detail & Related papers (2024-05-16T11:18:32Z) - Learning to Denoise Biomedical Knowledge Graph for Robust Molecular Interaction Prediction [50.7901190642594]
We propose BioKDN (Biomedical Knowledge Graph Denoising Network) for robust molecular interaction prediction.
BioKDN refines the reliable structure of local subgraphs by denoising noisy links in a learnable manner.
It maintains consistent and robust semantics by smoothing relations around the target interaction.
arXiv Detail & Related papers (2023-12-09T07:08:00Z) - Benchmarking Toxic Molecule Classification using Graph Neural Networks
and Few Shot Learning [0.0]
Traditional Graph Convolutional Networks (GCNs) face challenges with limited data and class imbalance.
We harness the power of Graph Isomorphic Networks, Multi Headed Attention and Free Large-scale Adversarial Augmentation separately on Graphs.
We incorporate Few-Shot Learning to improve the model's generalization with limited annotated samples.
arXiv Detail & Related papers (2023-11-22T16:07:32Z) - Interpreting the Mechanism of Synergism for Drug Combinations Using
Attention-Based Hierarchical Graph Pooling [10.898133007285638]
We develop an interpretable graph neural network (GNN) that reveals the underlying essential therapeutic targets and the mechanism of the synergy (MoS)
The proposed GNN model provides a systematic way to predict and interpret the drug combination synergism based on the detected crucial sub-molecular network.
arXiv Detail & Related papers (2022-09-19T11:18:45Z) - Toxicity Detection in Drug Candidates using Simplified Molecular-Input
Line-Entry System [0.0]
The need for analysis of toxicity in new drug candidates and the requirement of doing it fast have asked the consideration of scientists.
This paper goes for the study of simplified Molecular Input Line-Entry System (SMILES) as a parameter to develop Long short term memory (LSTM) based models.
arXiv Detail & Related papers (2021-01-21T07:02:21Z) - Accelerating Antimicrobial Discovery with Controllable Deep Generative
Models and Molecular Dynamics [109.70543391923344]
CLaSS (Controlled Latent attribute Space Sampling) is an efficient computational method for attribute-controlled generation of molecules.
We screen the generated molecules for additional key attributes by using deep learning classifiers in conjunction with novel features derived from atomistic simulations.
The proposed approach is demonstrated for designing non-toxic antimicrobial peptides (AMPs) with strong broad-spectrum potency.
arXiv Detail & Related papers (2020-05-22T15:57:58Z) - Learning To Navigate The Synthetically Accessible Chemical Space Using
Reinforcement Learning [75.95376096628135]
We propose a novel forward synthesis framework powered by reinforcement learning (RL) for de novo drug design.
In this setup, the agent learns to navigate through the immense synthetically accessible chemical space.
We describe how the end-to-end training in this study represents an important paradigm in radically expanding the synthesizable chemical space.
arXiv Detail & Related papers (2020-04-26T21:40:03Z) - Explainable Deep Relational Networks for Predicting Compound-Protein
Affinities and Contacts [80.69440684790925]
DeepRelations is a physics-inspired deep relational network with intrinsically explainable architecture.
It shows superior interpretability to the state-of-the-art.
It boosts the AUPRC of contact prediction 9.5, 16.9, 19.3 and 5.7-fold for the test, compound-unique, protein-unique, and both-unique sets.
arXiv Detail & Related papers (2019-12-29T00:14:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.