Interpretable Machine Learning for Self-Service High-Risk
Decision-Making
- URL: http://arxiv.org/abs/2205.04032v1
- Date: Mon, 9 May 2022 04:37:07 GMT
- Title: Interpretable Machine Learning for Self-Service High-Risk
Decision-Making
- Authors: Charles Recaido, Boris Kovalerchuk
- Abstract summary: This paper contributes to interpretable machine learning via visual knowledge discovery in general line coordinates.
The concepts of hyperblocks as interpretable dataset units and general line coordinates are combined to create a visual self-service machine learning model.
- Score: 7.005458308454871
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper contributes to interpretable machine learning via visual knowledge
discovery in general line coordinates (GLC). The concepts of hyperblocks as
interpretable dataset units and general line coordinates are combined to create
a visual self-service machine learning model. The DSC1 and DSC2 lossless
multidimensional coordinate systems are proposed. DSC1 and DSC2 can map
multiple dataset attributes to a single two-dimensional (X, Y) Cartesian plane
using a graph construction algorithm. The hyperblock analysis was used to
determine visually appealing dataset attribute orders and to reduce line
occlusion. It is shown that hyperblocks can generalize decision tree rules and
a series of DSC1 or DSC2 plots can visualize a decision tree. The DSC1 and DSC2
plots were tested on benchmark datasets from the UCI ML repository. They
allowed for visual classification of data. Additionally, areas of hyperblock
impurity were discovered and used to establish dataset splits that highlight
the upper estimate of worst-case model accuracy to guide model selection for
high-risk decision-making. Major benefits of DSC1 and DSC2 is their highly
interpretable nature. They allow domain experts to control or establish new
machine learning models through visual pattern discovery.
Related papers
- Dissecting embedding method: learning higher-order structures from data [0.0]
Geometric deep learning methods for data learning often include set of assumptions on the geometry of the feature space.
These assumptions together with data being discrete and finite can cause some generalisations, which are likely to create wrong interpretations of the data and models outputs.
arXiv Detail & Related papers (2024-10-14T08:19:39Z) - Deep Contrastive Graph Learning with Clustering-Oriented Guidance [61.103996105756394]
Graph Convolutional Network (GCN) has exhibited remarkable potential in improving graph-based clustering.
Models estimate an initial graph beforehand to apply GCN.
Deep Contrastive Graph Learning (DCGL) model is proposed for general data clustering.
arXiv Detail & Related papers (2024-02-25T07:03:37Z) - Full High-Dimensional Intelligible Learning In 2-D Lossless
Visualization Space [7.005458308454871]
This study explores a new methodology for machine learning classification tasks in 2-D visualization space (2-D ML)
It is shown that this is a full machine learning approach that does not require processing n-dimensional data in an abstract n-dimensional space.
It enables discovering n-D patterns in 2-D space without loss of n-D information using graph representations of n-D data in 2-D.
arXiv Detail & Related papers (2023-05-29T00:21:56Z) - Interactive Decision Tree Creation and Enhancement with Complete
Visualization for Explainable Modeling [0.0]
Decision Trees (DTs) are essential in machine learning (ML) because they are used to understand many black box ML models.
Two new methods for creation and enhancement with complete visualizing Decision Trees are suggested.
arXiv Detail & Related papers (2023-05-28T23:44:15Z) - Two-Stream Graph Convolutional Network for Intra-oral Scanner Image
Segmentation [133.02190910009384]
We propose a two-stream graph convolutional network (i.e., TSGCN) to handle inter-view confusion between different raw attributes.
Our TSGCN significantly outperforms state-of-the-art methods in 3D tooth (surface) segmentation.
arXiv Detail & Related papers (2022-04-19T10:41:09Z) - Visual Cluster Separation Using High-Dimensional Sharpened
Dimensionality Reduction [65.80631307271705]
High-Dimensional Sharpened DR' (HD-SDR) is tested on both synthetic and real-world data sets.
Our method achieves good quality (measured by quality metrics) and scales computationally well with large high-dimensional data.
To illustrate its concrete applications, we further apply HD-SDR on a recent astronomical catalog.
arXiv Detail & Related papers (2021-10-01T11:13:51Z) - Spatial-spectral Hyperspectral Image Classification via Multiple Random
Anchor Graphs Ensemble Learning [88.60285937702304]
This paper proposes a novel spatial-spectral HSI classification method via multiple random anchor graphs ensemble learning (RAGE)
Firstly, the local binary pattern is adopted to extract the more descriptive features on each selected band, which preserves local structures and subtle changes of a region.
Secondly, the adaptive neighbors assignment is introduced in the construction of anchor graph, to reduce the computational complexity.
arXiv Detail & Related papers (2021-03-25T09:31:41Z) - Dual Adversarial Auto-Encoders for Clustering [152.84443014554745]
We propose Dual Adversarial Auto-encoder (Dual-AAE) for unsupervised clustering.
By performing variational inference on the objective function of Dual-AAE, we derive a new reconstruction loss which can be optimized by training a pair of Auto-encoders.
Experiments on four benchmarks show that Dual-AAE achieves superior performance over state-of-the-art clustering methods.
arXiv Detail & Related papers (2020-08-23T13:16:34Z) - Supervised Dimensionality Reduction and Visualization using
Centroid-encoder [1.2487990897680423]
Centroid-Encoder (CE) method is similar to autoencoder but incorporates label information to keep objects of a class close together in reduced visualization space.
CE exploits nonlinearity and labels to encode high variance in low dimensions while capturing the global structure of the data.
We show that when the data variance is spread across multiple modalities, centroid-encoder extracts a significant amount of information from the data in low dimensional space.
arXiv Detail & Related papers (2020-02-27T06:08:22Z) - Learning Bijective Feature Maps for Linear ICA [73.85904548374575]
We show that existing probabilistic deep generative models (DGMs) which are tailor-made for image data, underperform on non-linear ICA tasks.
To address this, we propose a DGM which combines bijective feature maps with a linear ICA model to learn interpretable latent structures for high-dimensional data.
We create models that converge quickly, are easy to train, and achieve better unsupervised latent factor discovery than flow-based models, linear ICA, and Variational Autoencoders on images.
arXiv Detail & Related papers (2020-02-18T17:58:07Z) - CAE-LO: LiDAR Odometry Leveraging Fully Unsupervised Convolutional
Auto-Encoder for Interest Point Detection and Feature Description [10.73965992177754]
We propose a fully unsupervised Conal Auto-Encoder based LiDAR Odometry (CAE-LO) that detects interest points from spherical ring data using 2D CAE and extracts features from multi-resolution voxel model using 3D CAE.
We make several key contributions: 1) experiments based on KITTI dataset show that our interest points can capture more local details to improve the matching success rate on unstructured scenarios and our features outperform state-of-the-art by more than 50% in matching inlier ratio.
arXiv Detail & Related papers (2020-01-06T01:26:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.