GLOBE-CE: A Translation-Based Approach for Global Counterfactual
Explanations
- URL: http://arxiv.org/abs/2305.17021v2
- Date: Sun, 17 Dec 2023 20:26:53 GMT
- Title: GLOBE-CE: A Translation-Based Approach for Global Counterfactual
Explanations
- Authors: Dan Ley, Saumitra Mishra, Daniele Magazzeni
- Abstract summary: Global & Efficient Counterfactual Explanations (GLOBE-CE) is a flexible framework that tackles the reliability and scalability issues associated with the current state-of-the-art.
We provide a unique mathematical analysis of categorical feature translations, utilising it in our method.
Experimental evaluation with publicly available datasets and user studies demonstrate that GLOBE-CE performs significantly better than the current state-of-the-art.
- Score: 10.276136171459731
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Counterfactual explanations have been widely studied in explainability, with
a range of application-dependent methods prominent in fairness, recourse and
model understanding. The major shortcoming associated with these methods,
however, is their inability to provide explanations beyond the local or
instance-level. While many works touch upon the notion of a global explanation,
typically proposing to aggregate masses of local explanations in the hope of
ascertaining global properties, few provide frameworks that are both reliable
and computationally tractable. Meanwhile, practitioners are requesting more
efficient and interactive explainability tools. We take this opportunity to
propose Global & Efficient Counterfactual Explanations (GLOBE-CE), a flexible
framework that tackles the reliability and scalability issues associated with
the current state-of-the-art, particularly on higher-dimensional datasets and in
the presence of continuous features. Furthermore, we provide a unique
mathematical analysis of categorical feature translations, utilising it in our
method. Experimental evaluation with publicly available datasets and user
studies demonstrate that GLOBE-CE performs significantly better than the
current state-of-the-art across multiple metrics (e.g., speed, reliability).
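To make the translation idea concrete, the sketch below shows one way a single global translation could be evaluated: a shared direction is added to every negatively classified instance at increasing magnitudes, trading coverage (the fraction of predictions that flip to the desired class) against cost. This is a minimal illustration under assumed names (`global_translation_stats`, `model`, `X_neg`, `delta`, `magnitudes`), not the paper's actual API.

```python
import numpy as np

def global_translation_stats(model, X_neg, delta, magnitudes):
    """Apply one shared translation direction `delta` to all negatively
    classified inputs `X_neg` at several magnitudes, and report the
    coverage and cost achieved at each magnitude.

    Hypothetical sketch: assumes a sklearn-style `model.predict` and
    2-D numpy input.
    """
    stats = []
    for k in magnitudes:
        X_cf = X_neg + k * delta             # translate every instance at once
        flipped = model.predict(X_cf) == 1   # 1 = desired class, e.g. loan approved
        coverage = flipped.mean()            # fraction of recourses that succeed
        cost = k * np.linalg.norm(delta)     # one shared cost per magnitude
        stats.append((k, coverage, cost))
    return stats
```

For one-hot encoded categorical features, one intuitive reading of such a translation (an interpretation assumed here, not quoted from the paper) is a rule of the form "if feature F is not value v, change it to v", obtained by re-taking the argmax within each one-hot group after the translation is applied.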
Related papers
- GLEAMS: Bridging the Gap Between Local and Global Explanations [6.329021279685856]
We propose GLEAMS, a novel method that partitions the input space and learns an interpretable model within each sub-region.
We demonstrate GLEAMS' effectiveness on both synthetic and real-world data, highlighting its desirable properties and human-understandable insights.
arXiv Detail & Related papers (2024-08-09T13:30:37Z)
- Robust Explainable Recommendation [10.186029242664931]
We present a general framework for feature-aware explainable recommenders that can withstand external attacks.
Our framework is simple to implement and supports different methods regardless of the internal model structure and intrinsic utility within any model.
arXiv Detail & Related papers (2024-05-03T05:03:07Z)
- Scalable and Efficient Hierarchical Visual Topological Mapping [3.114470292106496]
We evaluate state-of-the-art hand-crafted and learned global descriptors using a hierarchical topological mapping technique.
Based on our empirical analysis of multiple runs, we identify that continuity and distinctiveness are crucial characteristics for an optimal global descriptor.
arXiv Detail & Related papers (2024-04-07T17:30:57Z)
- Prospector Heads: Generalized Feature Attribution for Large Models & Data [82.02696069543454]
We introduce prospector heads, an efficient and interpretable alternative to explanation-based attribution methods.
We demonstrate how prospector heads enable improved interpretation and discovery of class-specific patterns in input data.
arXiv Detail & Related papers (2024-02-18T23:01:28Z)
- Sparsity-Guided Holistic Explanation for LLMs with Interpretable Inference-Time Intervention [53.896974148579346]
Large Language Models (LLMs) have achieved unprecedented breakthroughs in various natural language processing domains.
The enigmatic "black-box" nature of LLMs remains a significant challenge for interpretability, hampering transparent and accountable applications.
We propose a novel methodology anchored in sparsity-guided techniques, aiming to provide a holistic interpretation of LLMs.
arXiv Detail & Related papers (2023-12-22T19:55:58Z)
- Extracting Interpretable Local and Global Representations from Attention on Time Series [0.135975510645475]
This paper targets two transformer-attention-based interpretability methods, working with local abstraction and global representation.
We distinguish local and global contexts, and provide a comprehensive framework for both general interpretation options.
arXiv Detail & Related papers (2023-09-16T00:51:49Z)
- Advancing Counterfactual Inference through Nonlinear Quantile Regression [77.28323341329461]
We propose a framework for efficient and effective counterfactual inference implemented with neural networks.
The proposed approach enhances the capacity to generalize estimated counterfactual outcomes to unseen data.
Empirical results conducted on multiple datasets offer compelling support for our theoretical assertions.
arXiv Detail & Related papers (2023-06-09T08:30:51Z)
- Variational Distillation for Multi-View Learning [104.17551354374821]
We design several variational information bottlenecks to exploit two key characteristics for multi-view representation learning.
Under rigorous theoretical guarantees, our approach enables IB to grasp the intrinsic correlation between observations and semantic labels.
arXiv Detail & Related papers (2022-06-20T03:09:46Z)
- MACE: An Efficient Model-Agnostic Framework for Counterfactual Explanation [132.77005365032468]
We propose a novel framework, Model-Agnostic Counterfactual Explanation (MACE).
MACE pairs an RL-based method for finding good counterfactual examples with a gradient-less descent method for improving proximity; a generic sketch of such gradient-free refinement appears after this list.
Experiments on public datasets validate the effectiveness with better validity, sparsity and proximity.
arXiv Detail & Related papers (2022-05-31T04:57:06Z)
- Global Counterfactual Explanations: Investigations, Implementations and Improvements [12.343333815270402]
Actionable Recourse Summaries (AReS) is the only known global counterfactual explanation framework for recourse.
This paper focuses on implementing and improving AReS.
arXiv Detail & Related papers (2022-04-14T12:21:23Z)
- Generative Counterfactuals for Neural Networks via Attribute-Informed Perturbation [51.29486247405601]
We design a framework to generate counterfactuals for raw data instances with the proposed Attribute-Informed Perturbation (AIP).
By utilizing generative models conditioned with different attributes, counterfactuals with desired labels can be obtained effectively and efficiently.
Experimental results on real-world texts and images demonstrate the effectiveness, sample quality as well as efficiency of our designed framework.
arXiv Detail & Related papers (2021-01-18T08:37:13Z)
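As referenced in the MACE entry above, below is a generic, assumption-laden sketch of gradient-free proximity refinement: starting from a counterfactual that already flips the model, binary-search along the straight line back toward the original instance for the closest point that still receives the desired prediction. This illustrates the general technique only; it is not MACE's actual algorithm, and all names (`refine_proximity`, `model`, `x_orig`, `x_cf`) are hypothetical.

```python
import numpy as np

def refine_proximity(model, x_orig, x_cf, desired=1, steps=20):
    """Binary-search the segment between `x_orig` and a valid counterfactual
    `x_cf` for the point closest to `x_orig` that still predicts `desired`.

    Hypothetical sketch: assumes `x_orig` and `x_cf` are 1-D numpy arrays
    and `model.predict` takes a 2-D array (sklearn-style).
    """
    lo, hi = 0.0, 1.0                        # interpolation weight toward x_cf
    for _ in range(steps):
        mid = (lo + hi) / 2.0
        candidate = (1.0 - mid) * x_orig + mid * x_cf
        if model.predict(candidate[None, :])[0] == desired:
            hi = mid                         # still valid: try moving closer to x_orig
        else:
            lo = mid                         # prediction reverted: back off toward x_cf
    return (1.0 - hi) * x_orig + hi * x_cf   # closest point found that stays valid
```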