SaCoFa: Semantics-aware Control-flow Anonymization for Process Mining
- URL: http://arxiv.org/abs/2109.08501v1
- Date: Fri, 17 Sep 2021 12:26:49 GMT
- Title: SaCoFa: Semantics-aware Control-flow Anonymization for Process Mining
- Authors: Stephan A. Fahrenkrog-Petersen, Martin Kabierski, Fabian R\"osel, Han
van der Aa, Matthias Weidlich
- Abstract summary: We argue for privacy preservation that incorporates a process semantics.
We show how, based on the exponential mechanism, semantic constraints are incorporated to ensure differential privacy of the query result.
- Score: 4.806322013167162
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Privacy-preserving process mining enables the analysis of business processes
using event logs, while giving guarantees on the protection of sensitive
information on process stakeholders. To this end, existing approaches add noise
to the results of queries that extract properties of an event log, such as the
frequency distribution of trace variants, for analysis.Noise insertion neglects
the semantics of the process, though, and may generate traces not present in
the original log. This is problematic. It lowers the utility of the published
data and makes noise easily identifiable, as some traces will violate
well-known semantic constraints.In this paper, we therefore argue for privacy
preservation that incorporates a process semantics. For common trace-variant
queries, we show how, based on the exponential mechanism, semantic constraints
are incorporated to ensure differential privacy of the query result.
Experiments demonstrate that our semantics-aware anonymization yields event
logs of significantly higher utility than existing approaches.
Related papers
- Detecting Anomalous Events in Object-centric Business Processes via
Graph Neural Networks [55.583478485027]
This study proposes a novel framework for anomaly detection in business processes.
We first reconstruct the process dependencies of the object-centric event logs as attributed graphs.
We then employ a graph convolutional autoencoder architecture to detect anomalous events.
arXiv Detail & Related papers (2024-02-14T14:17:56Z) - Enjoy the Silence: Analysis of Stochastic Petri Nets with Silent
Transitions [4.163635746713724]
Capturing behaviors in business and work processes is essential to quantitatively understand how nondeterminism is resolved when taking decisions within the process.
This is of special interest in process mining, where event data tracking the actual execution of the process are related to process models.
Variants of Petri nets provide a natural formal basis for this, but they need to be labelled with (possibly duplicated) activities and equipped with silent transitions.
We show that all such analysis tasks can be solved analytically, in particular reducing them to a single method that combines automata-based techniques to single out the behaviors of interest within a LSP
arXiv Detail & Related papers (2023-06-10T07:57:24Z) - Measuring Rule-based LTLf Process Specifications: A Probabilistic
Data-driven Approach [2.5407767658470726]
Declarative process specifications define the behavior of processes by means of rules based on Linear Temporal Logic on Finite Traces.
In a mining context, these specifications are inferred from, and checked on, multi-sets of runs recorded by information systems.
We propose a technique that measures the degree of satisfaction of specifications over event logs.
arXiv Detail & Related papers (2023-05-09T13:07:01Z) - Towards Knowledge-Centric Process Mining [5.429166905724047]
We present an approach that permits process analytics techniques to deliver value in the face of noisy/incomplete event logs.
Our approach leverages knowledge graphs to mitigate the effects of noise in event logs while supporting process analysts in understanding variability associated with event logs.
arXiv Detail & Related papers (2023-01-26T04:23:04Z) - Robust Semantic Communications with Masked VQ-VAE Enabled Codebook [56.63571713657059]
We propose a framework for the robust end-to-end semantic communication systems to combat the semantic noise.
To combat the semantic noise, the adversarial training with weight is developed to incorporate the samples with semantic noise in the training dataset.
We develop a feature importance module (FIM) to suppress the noise-related and task-unrelated features.
arXiv Detail & Related papers (2022-06-08T16:58:47Z) - FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality
Assessment [93.09267863425492]
We argue that understanding both high-level semantics and internal temporal structures of actions in competitive sports videos is the key to making predictions accurate and interpretable.
We construct a new fine-grained dataset, called FineDiving, developed on diverse diving events with detailed annotations on action procedures.
arXiv Detail & Related papers (2022-04-07T17:59:32Z) - A Distance Measure for Privacy-preserving Process Mining based on
Feature Learning [5.250561515565923]
We show how embeddings of events enable the definition of a distance measure for traces to guide event log anonymization.
Our experiments with real-world data indicate that anonymization using this measure, compared to a syntactic one, yields logs that are closer to the original log in various dimensions.
arXiv Detail & Related papers (2021-07-14T09:44:28Z) - Approximating Instance-Dependent Noise via Instance-Confidence Embedding [87.65718705642819]
Label noise in multiclass classification is a major obstacle to the deployment of learning systems.
We investigate the instance-dependent noise (IDN) model and propose an efficient approximation of IDN to capture the instance-specific label corruption.
arXiv Detail & Related papers (2021-03-25T02:33:30Z) - Extracting Semantic Process Information from the Natural Language in
Event Logs [0.1827510863075184]
We present an approach that achieves this through so-called semantic role labeling of event data.
In this manner, our approach extracts information about up to eight semantic roles per event.
arXiv Detail & Related papers (2021-03-06T08:39:04Z) - Tackling Instance-Dependent Label Noise via a Universal Probabilistic
Model [80.91927573604438]
This paper proposes a simple yet universal probabilistic model, which explicitly relates noisy labels to their instances.
Experiments on datasets with both synthetic and real-world label noise verify that the proposed method yields significant improvements on robustness.
arXiv Detail & Related papers (2021-01-14T05:43:51Z) - Bias and Variance of Post-processing in Differential Privacy [53.29035917495491]
Post-processing immunity is a fundamental property of differential privacy.
It is often argued that post-processing may introduce bias and increase variance.
This paper takes a first step towards understanding the properties of post-processing.
arXiv Detail & Related papers (2020-10-09T02:12:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.