Related papers: Hawk: DevOps-driven Transparency and Accountability in Cloud Native Systems

Hawk: DevOps-driven Transparency and Accountability in Cloud Native Systems

URL: http://arxiv.org/abs/2306.02496v1
Date: Sun, 4 Jun 2023 22:09:42 GMT
Title: Hawk: DevOps-driven Transparency and Accountability in Cloud Native Systems
Authors: Elias Gr\"unewald, Jannis Kiesel, Siar-Remzi Akbayin, Frank Pallas
Abstract summary: Transparency is one of the most important principles of modern privacy regulations. Data controllers must provide data subjects with precise information about the collection, processing, storage, and transfer of personal data.
Score: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Transparency is one of the most important principles of modern privacy regulations, such as the GDPR or CCPA. To be compliant with such regulatory frameworks, data controllers must provide data subjects with precise information about the collection, processing, storage, and transfer of personal data. To do so, respective facts and details must be compiled and always kept up to date. In traditional, rather static system environments, this inventory (including details such as the purposes of processing or the storage duration for each system component) could be done manually. In current circumstances of agile, DevOps-driven, and cloud-native information systems engineering, however, such manual practices do not suit anymore, making it increasingly hard for data controllers to achieve regulatory compliance. To allow for proper collection and maintenance of always up-to-date transparency information smoothly integrating into DevOps practices, we herein propose a set of novel approaches explicitly tailored to specific phases of the DevOps lifecycle most relevant in matters of privacy-related transparency and accountability at runtime: Release, Operation, and Monitoring. For each of these phases, we examine the specific challenges arising in determining the details of personal data processing, develop a distinct approach and provide respective proof of concept implementations that can easily be applied in cloud native systems. We also demonstrate how these components can be integrated with each other to establish transparency information comprising design- and runtime-elements. Furthermore, our experimental evaluation indicates reasonable overheads. On this basis, data controllers can fulfill their regulatory transparency obligations in line with actual engineering practices.

Related papers

Data-Juicer 2.0: Cloud-Scale Adaptive Data Processing for Foundation Models [64.28420991770382]
We present Data-Juicer 2.0, a new system offering fruitful data processing capabilities backed by over a hundred operators. The system is publicly available, actively maintained, and broadly adopted in diverse research endeavors, practical applications, and real-world products such as Alibaba Cloud PAI.
arXiv Detail & Related papers (2024-12-23T08:29:57Z)
Flex: End-to-End Text-Instructed Visual Navigation with Foundation Models [59.892436892964376]
We investigate the minimal data requirements and architectural adaptations necessary to achieve robust closed-loop performance with vision-based control policies. Our findings are synthesized in Flex (Fly-lexically), a framework that uses pre-trained Vision Language Models (VLMs) as frozen patch-wise feature extractors. We demonstrate the effectiveness of this approach on quadrotor fly-to-target tasks, where agents trained via behavior cloning successfully generalize to real-world scenes.
arXiv Detail & Related papers (2024-10-16T19:59:31Z)
Extending Business Process Management for Regulatory Transparency [0.0]
We bridge the gap between business processes and application systems by providing a plug-in extension to BPMN featuring regulatory transparency information. We leverage process mining techniques to discover and analyze personal data flows in business processes.
arXiv Detail & Related papers (2024-06-14T12:08:34Z)
Semantic Modelling of Organizational Knowledge as a Basis for Enterprise Data Governance 4.0 -- Application to a Unified Clinical Data Model [6.302916372143144]
We establish a simple, cost-efficient framework that enables metadata-driven, agile and (semi-automated) data governance. We explain how we implement and use this framework to integrate 25 years of clinical study data at an enterprise scale in a fully productive environment.
arXiv Detail & Related papers (2023-10-20T19:36:03Z)
Auditing and Generating Synthetic Data with Controllable Trust Trade-offs [54.262044436203965]
We introduce a holistic auditing framework that comprehensively evaluates synthetic datasets and AI models. It focuses on preventing bias and discrimination, ensures fidelity to the source data, assesses utility, robustness, and privacy preservation. We demonstrate the framework's effectiveness by auditing various generative models across diverse use cases.
arXiv Detail & Related papers (2023-04-21T09:03:18Z)
Enabling Versatile Privacy Interfaces Using Machine-Readable Transparency Information [0.0]
We argue that privacy shall incorporate the context of display, personal preferences, and individual competences of data subjects. We provide a general model of how transparency information can be provided from a data controller to data subjects. We show how transparency can be enhanced using machine-readable transparency information and how data controllers can meet respective regulatory obligations.
arXiv Detail & Related papers (2023-02-21T20:40:26Z)
Scalable Discovery and Continuous Inventory of Personal Data at Rest in Cloud Native Systems [0.0]
Cloud native systems are processing large amounts of personal data through numerous and possibly multi-paradigmatic data stores. From a privacy engineering perspective, a core challenge is to keep track of all exact locations, where personal data is being stored. We present Teiresias, comprising i) a workflow pattern for scalable discovery of personal data at rest, and ii) a cloud native system architecture and open source prototype implementation of said workflow pattern.
arXiv Detail & Related papers (2022-09-09T10:45:34Z)
SOLIS -- The MLOps journey from data acquisition to actionable insights [62.997667081978825]
In this paper we present a unified deployment pipeline and freedom-to-operate approach that supports all requirements while using basic cross-platform tensor framework and script language engines. This approach however does not supply the needed procedures and pipelines for the actual deployment of machine learning capabilities in real production grade systems.
arXiv Detail & Related papers (2021-12-22T14:45:37Z)
Learning to Limit Data Collection via Scaling Laws: Data Minimization Compliance in Practice [62.44110411199835]
We build on literature in machine learning law to propose framework for limiting collection based on data interpretation that ties data to system performance. We formalize a data minimization criterion based on performance curve derivatives and provide an effective and interpretable piecewise power law technique.
arXiv Detail & Related papers (2021-07-16T19:59:01Z)
TIRA: An OpenAPI Extension and Toolbox for GDPR Transparency in RESTful Architectures [0.0]
Transparency provides information about what personal data is collected for which purposes, how long it is stored, or to which parties it is transferred. Technical approaches for implementing transparency in practice are, however, only rarely considered. We introduce 1) a transparency-focused extension of OpenAPI specifications that allows individual service descriptions to be enriched with transparency-related annotations in a bottom-up fashion and 2) a set of higher-order tools for aggregating respective information across multiple, interdependent services and for coherently integrating our approach into automated CI/CD-pipelines.
arXiv Detail & Related papers (2021-06-10T18:42:50Z)
Trustworthy Transparency by Design [57.67333075002697]
We propose a transparency framework for software design, incorporating research on user trust and experience. Our framework enables developing software that incorporates transparency in its design.
arXiv Detail & Related papers (2021-03-19T12:34:01Z)
Petri Nets with Parameterised Data: Modelling and Verification (Extended Version) [67.99023219822564]
We introduce and study an extension of coloured Petri nets, called catalog-nets, providing two key features to capture this type of processes. We show that fresh-value injection is a particularly complex feature to handle, and discuss strategies to tame it.
arXiv Detail & Related papers (2020-06-11T17:26:08Z)

This list is automatically generated from the titles and abstracts of the papers in this site.