Conformance Checking Over Stochastically Known Logs
- URL: http://arxiv.org/abs/2203.07507v1
- Date: Mon, 14 Mar 2022 21:33:06 GMT
- Title: Conformance Checking Over Stochastically Known Logs
- Authors: Eli Bogdanov, Izack Cohen, Avigdor Gal
- Abstract summary: Data logs may become uncertain due to, e.g., sensor reading inaccuracies or incorrect interpretation of readings by processing programs.
In this work we focus on conformance checking, which compares a process model with an event log.
We mathematically define a trace model, a synchronous product, and a cost function that reflects the uncertainty of events in a log.
- Score: 7.882975068446842
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: With the growing number of devices, sensors and digital systems, data logs
may become uncertain due to, e.g., sensor reading inaccuracies or incorrect
interpretation of readings by processing programs. At times, such uncertainties
can be captured stochastically, especially when using probabilistic data
classification models. In this work we focus on conformance checking, which
compares a process model with an event log, when event logs are stochastically
known. Building on existing alignment-based conformance checking fundamentals,
we mathematically define a stochastic trace model, a stochastic synchronous
product, and a cost function that reflects the uncertainty of events in a log.
Then, we search for an optimal alignment over the reachability graph of the
stochastic synchronous product for finding an optimal alignment between a model
and a stochastic process observation. Via structured experiments with two
well-known process mining benchmarks, we explore the behavior of the suggested
stochastic conformance checking approach and compare it to a standard
alignment-based approach as well as to an approach that creates a lower bound
on performance. We envision the proposed stochastic conformance checking
approach as a viable process mining component for future analysis of stochastic
event logs.
Related papers
- Logistic-beta processes for dependent random probabilities with beta marginals [58.91121576998588]
We propose a novel process called the logistic-beta process, whose logistic transformation yields a process with common beta marginals.
It can model dependence on both discrete and continuous domains, such as space or time, and has a flexible dependence structure through correlation kernels.
We illustrate the benefits through nonparametric binary regression and conditional density estimation examples, both in simulation studies and in a pregnancy outcome application.
arXiv Detail & Related papers (2024-02-10T21:41:32Z) - Data-Driven Reachability Analysis of Stochastic Dynamical Systems with
Conformal Inference [1.446438366123305]
We consider data-driven reachability analysis of discrete-time dynamical systems using conformal inference.
We focus on learning-enabled control systems with complex closed-loop dynamics.
arXiv Detail & Related papers (2023-09-17T07:23:01Z) - Learning Unnormalized Statistical Models via Compositional Optimization [73.30514599338407]
Noise-contrastive estimation(NCE) has been proposed by formulating the objective as the logistic loss of the real data and the artificial noise.
In this paper, we study it a direct approach for optimizing the negative log-likelihood of unnormalized models.
arXiv Detail & Related papers (2023-06-13T01:18:16Z) - Score-based Continuous-time Discrete Diffusion Models [102.65769839899315]
We extend diffusion models to discrete variables by introducing a Markov jump process where the reverse process denoises via a continuous-time Markov chain.
We show that an unbiased estimator can be obtained via simple matching the conditional marginal distributions.
We demonstrate the effectiveness of the proposed method on a set of synthetic and real-world music and image benchmarks.
arXiv Detail & Related papers (2022-11-30T05:33:29Z) - Alignment-based conformance checking over probabilistic events [4.060731229044571]
We introduce a weighted trace model and weighted alignment cost function, and a custom threshold parameter that controls the level of confidence on the event data.
The resulting algorithm considers activities of lower but sufficiently high probability that better align with the process model.
arXiv Detail & Related papers (2022-09-09T14:07:37Z) - Online Time Series Anomaly Detection with State Space Gaussian Processes [12.483273106706623]
R-ssGPFA is an unsupervised online anomaly detection model for uni- and multivariate time series.
For high-dimensional time series, we propose an extension of Gaussian process factor analysis to identify the common latent processes of the time series.
Our model's robustness is improved by using a simple to skip Kalman updates when encountering anomalous observations.
arXiv Detail & Related papers (2022-01-18T06:43:32Z) - Sequential Recommendation via Stochastic Self-Attention [68.52192964559829]
Transformer-based approaches embed items as vectors and use dot-product self-attention to measure the relationship between items.
We propose a novel textbfSTOchastic textbfSelf-textbfAttention(STOSA) to overcome these issues.
We devise a novel Wasserstein Self-Attention module to characterize item-item position-wise relationships in sequences.
arXiv Detail & Related papers (2022-01-16T12:38:45Z) - CoCoMoT: Conformance Checking of Multi-Perspective Processes via SMT
(Extended Version) [62.96267257163426]
We introduce the CoCoMoT (Computing Conformance Modulo Theories) framework.
First, we show how SAT-based encodings studied in the pure control-flow setting can be lifted to our data-aware case.
Second, we introduce a novel preprocessing technique based on a notion of property-preserving clustering.
arXiv Detail & Related papers (2021-03-18T20:22:50Z) - An Entropic Relevance Measure for Stochastic Conformance Checking in
Process Mining [9.302180124254338]
We present an entropic relevance measure for conformance checking, computed as the average number of bits required to compress each of the log's traces.
We show that entropic relevance is computable in time linear in the size of the log, and provide evaluation outcomes that demonstrate the feasibility of using the new approach in industrial settings.
arXiv Detail & Related papers (2020-07-18T02:25:33Z) - Tracking Performance of Online Stochastic Learners [57.14673504239551]
Online algorithms are popular in large-scale learning settings due to their ability to compute updates on the fly, without the need to store and process data in large batches.
When a constant step-size is used, these algorithms also have the ability to adapt to drifts in problem parameters, such as data or model properties, and track the optimal solution with reasonable accuracy.
We establish a link between steady-state performance derived under stationarity assumptions and the tracking performance of online learners under random walk models.
arXiv Detail & Related papers (2020-04-04T14:16:27Z) - Efficient Conformance Checking using Approximate Alignment Computation
with Tandem Repeats [0.03222802562733786]
Conformance checking aims to find and describe the differences between a process model capturing the expected process behavior and a corresponding event log recording the observed behavior.
Alignments are an established technique to compute the distance between a trace in the event log and the closest execution trace of a corresponding process model.
We propose a novel approximate technique that uses pre- and post-processing steps to compress the length of a trace and recompute the alignment cost.
arXiv Detail & Related papers (2020-04-02T03:50:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.