ProteinPNet: Prototypical Part Networks for Concept Learning in Spatial Proteomics
- URL: http://arxiv.org/abs/2512.02983v1
- Date: Tue, 02 Dec 2025 18:00:03 GMT
- Title: ProteinPNet: Prototypical Part Networks for Concept Learning in Spatial Proteomics
- Authors: Louis McConnell, Jieran Sun, Theo Maffei, Raphael Gottardo, Marianna Rapsomaniki,
- Abstract summary: We present ProteinPNet, a novel framework based on part networks that discovers TME motifs from spatial data.<n>ProteinPNet directly learns discriminative, interpretable, faithful spatial prototypes through supervised training.<n>Our results highlight the potential prototype-based learning to reveal interpretable biomarkers within the TME, with implications for mechanistic discovery in omics.
- Score: 0.27185251060695437
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Understanding the spatial architecture of the tumor microenvironment (TME) is critical to advance precision oncology. We present ProteinPNet, a novel framework based on prototypical part networks that discovers TME motifs from spatial proteomics data. Unlike traditional post-hoc explanability models, ProteinPNet directly learns discriminative, interpretable, faithful spatial prototypes through supervised training. We validate our approach on synthetic datasets with ground truth motifs, and further test it on a real-world lung cancer spatial proteomics dataset. ProteinPNet consistently identifies biologically meaningful prototypes aligned with different tumor subtypes. Through graphical and morphological analyses, we show that these prototypes capture interpretable features pointing to differences in immune infiltration and tissue modularity. Our results highlight the potential of prototype-based learning to reveal interpretable spatial biomarkers within the TME, with implications for mechanistic discovery in spatial omics.
Related papers
- ConStruct: Structural Distillation of Foundation Models for Prototype-Based Weakly Supervised Histopathology Segmentation [16.733170895296343]
Weakly supervised semantic segmentation (WSSS) in histopathology relies heavily on classification backbones.<n>We propose a prototype learning framework that integrates morphology-aware representations from CONCH, multi-scale structural cues from SegFormer, and text-guided semantic alignment.<n>Our approach produces high-quality pseudo masks without pixel-level annotations, improves localization completeness, and enhances semantic consistency across tissue types.
arXiv Detail & Related papers (2025-12-11T06:08:29Z) - Towards Open-Ended Visual Scientific Discovery with Sparse Autoencoders [11.190791003373322]
We ask whether sparse autoencoders can enable open-ended feature discovery from foundation model representations.<n>Applying to ecological imagery, the same procedure surfaces fine-grained anatomical structure without access to segmentation or part labels.<n>Our results indicate that sparse decomposition provides a practical instrument for exploring what scientific foundation models have learned.
arXiv Detail & Related papers (2025-11-21T19:38:07Z) - CellPainTR: Generalizable Representation Learning for Cross-Dataset Cell Painting Analysis [51.56484100374058]
We introduce CellPainTR, a Transformer-based architecture designed to learn foundational representations of cellular morphology.<n>Our work represents a significant step towards creating truly foundational models for image-based profiling, enabling more reliable and scalable cross-study biological analysis.
arXiv Detail & Related papers (2025-09-02T03:30:07Z) - Neural Proteomics Fields for Super-resolved Spatial Proteomics Prediction [8.424059461071614]
We introduce the novel task of spatial super-resolution for sequencing-based spatial (seq-SP)<n>Neural Proteomics Fields (NPF) formulates seq-SP as a protein reconstruction problem in continuous space by training a dedicated network for each tissue.<n>NPF achieves state-of-the-art performance with fewer learnable parameters, underscoring its potential for advancing spatial research.
arXiv Detail & Related papers (2025-08-24T14:53:12Z) - PRING: Rethinking Protein-Protein Interaction Prediction from Pairs to Graphs [88.98041407783502]
PRING is the first benchmark that evaluates protein-protein interaction prediction from a graph-level perspective.<n> PRING curates a high-quality, multi-species PPI network dataset comprising 21,484 proteins and 186,818 interactions.
arXiv Detail & Related papers (2025-07-07T15:21:05Z) - AI-powered virtual tissues from spatial proteomics for clinical diagnostics and biomedical discovery [4.608421774700912]
We present Virtual Tissues (VirTues), a foundation model framework for biological tissues that operates across the molecular, cellular and tissue scale.<n>VirTues introduces innovations in transformer architecture design, including a novel tokenization scheme that captures both spatial and marker dimensions.<n>As a generalist model, VirTues outperforms existing approaches across clinical diagnostics, biological discovery and patient case retrieval tasks.
arXiv Detail & Related papers (2025-01-10T15:17:27Z) - MeToken: Uniform Micro-environment Token Boosts Post-Translational Modification Prediction [65.33218256339151]
Post-translational modifications (PTMs) profoundly expand the complexity and functionality of the proteome.
Existing computational approaches predominantly focus on protein sequences to predict PTM sites, driven by the recognition of sequence-dependent motifs.
We introduce the MeToken model, which tokenizes the micro-environment of each acid, integrating both sequence and structural information into unified discrete tokens.
arXiv Detail & Related papers (2024-11-04T07:14:28Z) - Integration of Pre-trained Protein Language Models into Geometric Deep
Learning Networks [68.90692290665648]
We integrate knowledge learned by protein language models into several state-of-the-art geometric networks.
Our findings show an overall improvement of 20% over baselines.
Strong evidence indicates that the incorporation of protein language models' knowledge enhances geometric networks' capacity by a significant margin.
arXiv Detail & Related papers (2022-12-07T04:04:04Z) - Learning Geometrically Disentangled Representations of Protein Folding
Simulations [72.03095377508856]
This work focuses on learning a generative neural network on a structural ensemble of a drug-target protein.
Model tasks involve characterizing the distinct structural fluctuations of the protein bound to various drug molecules.
Results show that our geometric learning-based method enjoys both accuracy and efficiency for generating complex structural variations.
arXiv Detail & Related papers (2022-05-20T19:38:00Z) - G-MIND: An End-to-End Multimodal Imaging-Genetics Framework for
Biomarker Identification and Disease Classification [49.53651166356737]
We propose a novel deep neural network architecture to integrate imaging and genetics data, as guided by diagnosis, that provides interpretable biomarkers.
We have evaluated our model on a population study of schizophrenia that includes two functional MRI (fMRI) paradigms and Single Nucleotide Polymorphism (SNP) data.
arXiv Detail & Related papers (2021-01-27T19:28:04Z) - Characterizing the Latent Space of Molecular Deep Generative Models with
Persistent Homology Metrics [21.95240820041655]
Variational Autos (VAEs) are generative models in which encoder-decoder network pairs are trained to reconstruct training data distributions.
We propose a method for measuring how well the latent space of deep generative models is able to encode structural and chemical features.
arXiv Detail & Related papers (2020-10-18T13:33:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.