Towards Rational Pesticide Design with Graph Machine Learning Models for Ecotoxicology
- URL: http://arxiv.org/abs/2509.18703v1
- Date: Tue, 23 Sep 2025 06:38:05 GMT
- Title: Towards Rational Pesticide Design with Graph Machine Learning Models for Ecotoxicology
- Authors: Jakub Adamczyk,
- Abstract summary: This research focuses on rational pesticide design, using graph machine learning to accelerate the development of safer, eco-friendly agrochemicals.<n>With an emphasis on ecotoxicology, the initial contributions include the creation of ApisTox, the largest curated dataset on pesticide toxicity to honey bees.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This research focuses on rational pesticide design, using graph machine learning to accelerate the development of safer, eco-friendly agrochemicals, inspired by in silico methods in drug discovery. With an emphasis on ecotoxicology, the initial contributions include the creation of ApisTox, the largest curated dataset on pesticide toxicity to honey bees. We conducted a broad evaluation of machine learning (ML) models for molecular graph classification, including molecular fingerprints, graph kernels, GNNs, and pretrained transformers. The results show that methods successful in medicinal chemistry often fail to generalize to agrochemicals, underscoring the need for domain-specific models and benchmarks. Future work will focus on developing a comprehensive benchmarking suite and designing ML models tailored to the unique challenges of pesticide discovery.
Related papers
- Foundation Models for Discovery and Exploration in Chemical Space [57.97784111110166]
MIST is a family of molecular foundation models trained on large unlabeled datasets.<n>We demonstrate the ability of these models to solve real-world problems across chemical space.
arXiv Detail & Related papers (2025-10-20T17:56:01Z) - Breaking Bad Molecules: Are MLLMs Ready for Structure-Level Molecular Detoxification? [19.700175505235876]
ToxiMol is the first benchmark task for general-purpose Multimodal Large Language Models (MLLMs) focused on molecular toxicity repair.<n>We construct a standardized dataset covering 11 primary tasks and 560 representative toxic molecules spanning diverse mechanisms and granularities.
arXiv Detail & Related papers (2025-06-12T17:25:53Z) - Evaluating machine learning models for predicting pesticides toxicity to honey bees [0.0]
ApisTox is the most comprehensive dataset of experimentally validated chemical toxicity to the honey bee.<n>We evaluate ApisTox using a diverse suite of machine learning approaches, including molecular fingerprints, graph kernels, and graph neural networks.
arXiv Detail & Related papers (2025-03-31T16:51:12Z) - Pesti-Gen: Unleashing a Generative Molecule Approach for Toxicity Aware Pesticide Design [0.0]
Climate change has reduced crop resilience and pesticide efficacy, making reliance on synthetic pesticides inevitable.<n>We propose Pesti-Gen, a novel generative model based on variational auto-encoders to generate pesticide candidates with optimized properties.<n>Pesti-Gen achieves approximately 68% structural validity in generating new molecular structures.
arXiv Detail & Related papers (2025-01-24T13:00:54Z) - BatGPT-Chem: A Foundation Large Model For Retrosynthesis Prediction [65.93303145891628]
BatGPT-Chem is a large language model with 15 billion parameters, tailored for enhanced retrosynthesis prediction.
Our model captures a broad spectrum of chemical knowledge, enabling precise prediction of reaction conditions.
This development empowers chemists to adeptly address novel compounds, potentially expediting the innovation cycle in drug manufacturing and materials science.
arXiv Detail & Related papers (2024-08-19T05:17:40Z) - Benchmarking Toxic Molecule Classification using Graph Neural Networks
and Few Shot Learning [0.0]
Traditional Graph Convolutional Networks (GCNs) face challenges with limited data and class imbalance.
We harness the power of Graph Isomorphic Networks, Multi Headed Attention and Free Large-scale Adversarial Augmentation separately on Graphs.
We incorporate Few-Shot Learning to improve the model's generalization with limited annotated samples.
arXiv Detail & Related papers (2023-11-22T16:07:32Z) - Tertiary Lymphoid Structures Generation through Graph-based Diffusion [54.37503714313661]
In this work, we leverage state-of-the-art graph-based diffusion models to generate biologically meaningful cell-graphs.
We show that the adopted graph diffusion model is able to accurately learn the distribution of cells in terms of their tertiary lymphoid structures (TLS) content.
arXiv Detail & Related papers (2023-10-10T14:37:17Z) - Bi-level Contrastive Learning for Knowledge-Enhanced Molecule Representations [68.32093648671496]
We introduce GODE, which accounts for the dual-level structure inherent in molecules.<n> Molecules possess an intrinsic graph structure and simultaneously function as nodes within a broader molecular knowledge graph.<n>By pre-training two GNNs on different graph structures, GODE effectively fuses molecular structures with their corresponding knowledge graph substructures.
arXiv Detail & Related papers (2023-06-02T15:49:45Z) - Semantic Image Segmentation with Deep Learning for Vine Leaf Phenotyping [59.0626764544669]
In this study, we use Deep Learning methods to semantically segment grapevine leaves images in order to develop an automated object detection system for leaf phenotyping.
Our work contributes to plant lifecycle monitoring through which dynamic traits such as growth and development can be captured and quantified.
arXiv Detail & Related papers (2022-10-24T14:37:09Z) - Graph Machine Learning for Design of High-Octane Fuels [47.43758223690195]
Computer-aided molecular design (CAMD) can identify molecules with desired autoignition properties.
We propose a modular graph-ML CAMD framework that integrates generative graph-ML models with graph neural networks and optimization.
We experimentally investigate and use to illustrate the need for further auto-ignition training data.
arXiv Detail & Related papers (2022-06-01T16:43:04Z) - Utilising Graph Machine Learning within Drug Discovery and Development [19.21101749270075]
Graph Machine Learning (GML) is receiving growing interest within the pharmaceutical and biotechnology industries for its ability to model biomolecular structures.
Herein, we present a multidisciplinary academic-industrial review of the topic within the context of drug discovery and development.
After introducing key terms and modelling approaches, we move chronologically through the drug development pipeline to identify and summarise work incorporating: target identification, design of small molecules and biologics, and drug repurposing.
arXiv Detail & Related papers (2020-12-09T10:12:33Z) - Graph Neural Networks for the Prediction of Substrate-Specific Organic
Reaction Conditions [79.45090959869124]
We present a systematic investigation using graph neural networks (GNNs) to model organic chemical reactions.
We evaluate seven different GNN architectures for classification tasks pertaining to the identification of experimental reagents and conditions.
arXiv Detail & Related papers (2020-07-08T17:21:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.