An Entropic Relevance Measure for Stochastic Conformance Checking in
Process Mining
- URL: http://arxiv.org/abs/2007.09310v2
- Date: Wed, 26 Aug 2020 15:57:16 GMT
- Title: An Entropic Relevance Measure for Stochastic Conformance Checking in
Process Mining
- Authors: Artem Polyvyanyy, Alistair Moffat, Luciano Garc\'ia-Ba\~nuelos
- Abstract summary: We present an entropic relevance measure for conformance checking, computed as the average number of bits required to compress each of the log's traces.
We show that entropic relevance is computable in time linear in the size of the log, and provide evaluation outcomes that demonstrate the feasibility of using the new approach in industrial settings.
- Score: 9.302180124254338
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Given an event log as a collection of recorded real-world process traces,
process mining aims to automatically construct a process model that is both
simple and provides a useful explanation of the traces. Conformance checking
techniques are then employed to characterize and quantify commonalities and
discrepancies between the log's traces and the candidate models. Recent
approaches to conformance checking acknowledge that the elements being compared
are inherently stochastic - for example, some traces occur frequently and
others infrequently - and seek to incorporate this knowledge in their analyses.
Here we present an entropic relevance measure for stochastic conformance
checking, computed as the average number of bits required to compress each of
the log's traces, based on the structure and information about relative
likelihoods provided by the model. The measure penalizes traces from the event
log not captured by the model and traces described by the model but absent in
the event log, thus addressing both precision and recall quality criteria at
the same time. We further show that entropic relevance is computable in time
linear in the size of the log, and provide evaluation outcomes that demonstrate
the feasibility of using the new approach in industrial settings.
Related papers
- Optimizing OOD Detection in Molecular Graphs: A Novel Approach with Diffusion Models [71.39421638547164]
We propose to detect OOD molecules by adopting an auxiliary diffusion model-based framework, which compares similarities between input molecules and reconstructed graphs.
Due to the generative bias towards reconstructing ID training samples, the similarity scores of OOD molecules will be much lower to facilitate detection.
Our research pioneers an approach of Prototypical Graph Reconstruction for Molecular OOD Detection, dubbed as PGR-MOOD and hinges on three innovations.
arXiv Detail & Related papers (2024-04-24T03:25:53Z) - Object-Centric Conformance Alignments with Synchronization (Extended Version) [57.76661079749309]
We present a new formalism that combines the ability of object-centric Petri nets to capture one-to-many relations and the one of Petri nets with identifiers to compare and synchronize objects based on their identity.
We propose a conformance checking approach for such nets based on an encoding in satisfiability modulo theories (SMT)
arXiv Detail & Related papers (2023-12-13T21:53:32Z) - Stochastic Directly-Follows Process Discovery Using Grammatical Inference [8.196011179587304]
We propose a new approach for discovering sound Directly-Follows Graphs that is grounded in grammatical inference over the input traces.
Experiments over real-world datasets confirm that our new approach can construct smaller models that represent the input traces and their frequencies more accurately than the state-of-the-art technique.
arXiv Detail & Related papers (2023-12-09T01:56:25Z) - Score-based Continuous-time Discrete Diffusion Models [102.65769839899315]
We extend diffusion models to discrete variables by introducing a Markov jump process where the reverse process denoises via a continuous-time Markov chain.
We show that an unbiased estimator can be obtained via simple matching the conditional marginal distributions.
We demonstrate the effectiveness of the proposed method on a set of synthetic and real-world music and image benchmarks.
arXiv Detail & Related papers (2022-11-30T05:33:29Z) - Alignment-based conformance checking over probabilistic events [4.060731229044571]
We introduce a weighted trace model and weighted alignment cost function, and a custom threshold parameter that controls the level of confidence on the event data.
The resulting algorithm considers activities of lower but sufficiently high probability that better align with the process model.
arXiv Detail & Related papers (2022-09-09T14:07:37Z) - Conformance Checking Over Stochastically Known Logs [7.882975068446842]
Data logs may become uncertain due to, e.g., sensor reading inaccuracies or incorrect interpretation of readings by processing programs.
In this work we focus on conformance checking, which compares a process model with an event log.
We mathematically define a trace model, a synchronous product, and a cost function that reflects the uncertainty of events in a log.
arXiv Detail & Related papers (2022-03-14T21:33:06Z) - CoCoMoT: Conformance Checking of Multi-Perspective Processes via SMT
(Extended Version) [62.96267257163426]
We introduce the CoCoMoT (Computing Conformance Modulo Theories) framework.
First, we show how SAT-based encodings studied in the pure control-flow setting can be lifted to our data-aware case.
Second, we introduce a novel preprocessing technique based on a notion of property-preserving clustering.
arXiv Detail & Related papers (2021-03-18T20:22:50Z) - Contrastive learning of strong-mixing continuous-time stochastic
processes [53.82893653745542]
Contrastive learning is a family of self-supervised methods where a model is trained to solve a classification task constructed from unlabeled data.
We show that a properly constructed contrastive learning task can be used to estimate the transition kernel for small-to-mid-range intervals in the diffusion case.
arXiv Detail & Related papers (2021-03-03T23:06:47Z) - Goal-directed Generation of Discrete Structures with Conditional
Generative Models [85.51463588099556]
We introduce a novel approach to directly optimize a reinforcement learning objective, maximizing an expected reward.
We test our methodology on two tasks: generating molecules with user-defined properties and identifying short python expressions which evaluate to a given target value.
arXiv Detail & Related papers (2020-10-05T20:03:13Z) - Entropia: A Family of Entropy-Based Conformance Checking Measures for
Process Mining [13.204011949483867]
This paper presents a family of conformance checking measures for process mining founded on the notion of entropy from information theory.
The measures allow classical non-deterministic precision and recall quality criteria for quantifying process models automatically discovered from traces executed by IT-systems and recorded in their event logs.
arXiv Detail & Related papers (2020-08-21T15:54:47Z) - Efficient Conformance Checking using Approximate Alignment Computation
with Tandem Repeats [0.03222802562733786]
Conformance checking aims to find and describe the differences between a process model capturing the expected process behavior and a corresponding event log recording the observed behavior.
Alignments are an established technique to compute the distance between a trace in the event log and the closest execution trace of a corresponding process model.
We propose a novel approximate technique that uses pre- and post-processing steps to compress the length of a trace and recompute the alignment cost.
arXiv Detail & Related papers (2020-04-02T03:50:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.