C$^2$MIL: Synchronizing Semantic and Topological Causalities in Multiple Instance Learning for Robust and Interpretable Survival Analysis
- URL: http://arxiv.org/abs/2509.20152v1
- Date: Wed, 24 Sep 2025 14:17:39 GMT
- Title: C$^2$MIL: Synchronizing Semantic and Topological Causalities in Multiple Instance Learning for Robust and Interpretable Survival Analysis
- Authors: Min Cen, Zhenfeng Zhuang, Yuzhe Zhang, Min Zeng, Baptiste Magnier, Lequan Yu, Hong Zhang, Liansheng Wang,
- Abstract summary: Graph-based Multiple Instance Learning (MIL) is widely used in survival analysis with Hematoxylin and Eosin (H&E)-stained whole slide images (WSIs)<n> variations in staining and scanning can introduce semantic bias, while topological subgraphs that are not relevant to the causal relationships can create noise.<n>We propose a novel and interpretable dual causal graph-based MIL model, C$2$MIL.
- Score: 40.54380946239706
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Graph-based Multiple Instance Learning (MIL) is widely used in survival analysis with Hematoxylin and Eosin (H\&E)-stained whole slide images (WSIs) due to its ability to capture topological information. However, variations in staining and scanning can introduce semantic bias, while topological subgraphs that are not relevant to the causal relationships can create noise, resulting in biased slide-level representations. These issues can hinder both the interpretability and generalization of the analysis. To tackle this, we introduce a dual structural causal model as the theoretical foundation and propose a novel and interpretable dual causal graph-based MIL model, C$^2$MIL. C$^2$MIL incorporates a novel cross-scale adaptive feature disentangling module for semantic causal intervention and a new Bernoulli differentiable causal subgraph sampling method for topological causal discovery. A joint optimization strategy combining disentangling supervision and contrastive learning enables simultaneous refinement of both semantic and topological causalities. Experiments demonstrate that C$^2$MIL consistently improves generalization and interpretability over existing methods and can serve as a causal enhancement for diverse MIL baselines. The code is available at https://github.com/mimic0127/C2MIL.
Related papers
- OTSurv: A Novel Multiple Instance Learning Framework for Survival Prediction with Heterogeneity-aware Optimal Transport [58.5751036808652]
Survival prediction using whole slide images (WSIs) can be formulated as a multiple instance learning (MIL) problem.<n>We propose OTSurv, a novel MIL framework from an optimal transport perspective.<n>We set new results across six popular benchmarks, achieving an absolute 3.6% improvement in average C-index.
arXiv Detail & Related papers (2025-06-25T18:09:42Z) - Few-Shot Learning from Gigapixel Images via Hierarchical Vision-Language Alignment and Modeling [45.67022392509926]
HiVE-MIL is a vision-language framework that constructs a unified graph consisting of parent-child links between coarse (5x) and fine (20x) visual/textual nodes.<n>To further enhance semantic consistency, HiVE-MIL incorporates a two-stage, text-guided dynamic filtering mechanism.<n>Experiments on TCGA breast, lung, and kidney cancer datasets demonstrate that HiVE-MIL consistently outperforms both traditional MIL and recent VLM-based MIL approaches.
arXiv Detail & Related papers (2025-05-23T14:48:32Z) - Multi-View Causal Discovery without Non-Gaussianity: Identifiability and Algorithms [59.15672758767244]
Causal discovery is a difficult problem that typically relies on strong assumptions on the data-generating model, such as non-Gaussianity.<n>Here, we leverage this multi-view structure to achieve causal discovery with weak assumptions.<n>We propose several multi-view causal discovery algorithms, inspired by single-view algorithms.
arXiv Detail & Related papers (2025-02-27T14:06:14Z) - Raising the Bar in Graph OOD Generalization: Invariant Learning Beyond Explicit Environment Modeling [61.222803636132554]
Real-world graph data often exhibit diverse and shifting environments that traditional models fail to generalize across.<n>We propose a novel method termed Multi-Prototype Hyperspherical Invariant Learning (MPHIL)<n>MPHIL achieves state-of-the-art performance, significantly outperforming existing methods across graph data from various domains and with different distribution shifts.
arXiv Detail & Related papers (2025-02-15T07:40:14Z) - The Role of Graph-based MIL and Interventional Training in the Generalization of WSI Classifiers [8.867734798489037]
Whole Slide Imaging (WSI), which involves high-resolution digital scans of pathology slides, has become the gold standard for cancer diagnosis.<n>Its gigapixel resolution and the scarcity of annotated datasets present challenges for deep learning models.<n>We introduce a new framework, Graph-based Multiple Instance Learning with Interventional Training (GMIL-IT) for WSI classification.
arXiv Detail & Related papers (2025-01-31T11:21:08Z) - PRAGA: Prototype-aware Graph Adaptive Aggregation for Spatial Multi-modal Omics Analysis [1.1619559582563954]
We propose PRototype-Aware Graph Adaptative Aggregation for Spatial Multi-modal Omics Analysis (PRAGA)<n> PRAGA constructs a dynamic graph to capture latent semantic relations and comprehensively integrate spatial information and feature semantics.<n>The learnable graph structure can also denoise perturbations by learning cross-modal knowledge.
arXiv Detail & Related papers (2024-09-19T12:53:29Z) - MamMIL: Multiple Instance Learning for Whole Slide Images with State Space Models [56.37780601189795]
We propose a framework named MamMIL for WSI analysis.
We represent each WSI as an undirected graph.
To address the problem that Mamba can only process 1D sequences, we propose a topology-aware scanning mechanism.
arXiv Detail & Related papers (2024-03-08T09:02:13Z) - CausalGym: Benchmarking causal interpretability methods on linguistic
tasks [52.61917615039112]
We use CausalGym to benchmark the ability of interpretability methods to causally affect model behaviour.
We study the pythia models (14M--6.9B) and assess the causal efficacy of a wide range of interpretability methods.
We find that DAS outperforms the other methods, and so we use it to study the learning trajectory of two difficult linguistic phenomena.
arXiv Detail & Related papers (2024-02-19T21:35:56Z) - SI-MIL: Taming Deep MIL for Self-Interpretability in Gigapixel Histopathology [31.21142367010005]
Self-Interpretable MIL (SI-MIL) is a method intrinsically designed for interpretability from the very outset.
SI-MIL employs a deep MIL framework to guide an interpretable branch grounded on handcrafted pathological features.
With its linear prediction constraints, SI-MIL challenges the prevalent myth of an inevitable trade-off between model interpretability and performance.
arXiv Detail & Related papers (2023-12-22T18:59:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.