Federated and Privacy-Preserving Learning of Accounting Data in
Financial Statement Audits
- URL: http://arxiv.org/abs/2208.12708v1
- Date: Fri, 26 Aug 2022 15:09:18 GMT
- Title: Federated and Privacy-Preserving Learning of Accounting Data in
Financial Statement Audits
- Authors: Marco Schreyer, Timur Sattarov, Damian Borth
- Abstract summary: We propose a Federated Learning framework to train DL models on auditing relevant accounting data of multiple clients.
We evaluate our approach to detect accounting anomalies in three real-world datasets of city payments.
- Score: 1.4986031916712106
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: The ongoing 'digital transformation' fundamentally changes audit evidence's
nature, recording, and volume. Nowadays, the International Standards on
Auditing (ISA) requires auditors to examine vast volumes of a financial
statement's underlying digital accounting records. As a result, audit firms
also 'digitize' their analytical capabilities and invest in Deep Learning (DL),
a successful sub-discipline of Machine Learning. The application of DL offers
the ability to learn specialized audit models from data of multiple clients,
e.g., organizations operating in the same industry or jurisdiction. In general,
regulations require auditors to adhere to strict data confidentiality measures.
At the same time, recent intriguing discoveries showed that large-scale DL
models are vulnerable to leaking sensitive training data information. Today, it
often remains unclear how audit firms can apply DL models while complying with
data protection regulations. In this work, we propose a Federated Learning
framework to train DL models on auditing relevant accounting data of multiple
clients. The framework encompasses Differential Privacy and Split Learning
capabilities to mitigate data confidentiality risks at model inference. We
evaluate our approach to detect accounting anomalies in three real-world
datasets of city payments. Our results provide empirical evidence that auditors
can benefit from DL models that accumulate knowledge from multiple sources of
proprietary client data.
Related papers
- Advancing Anomaly Detection: Non-Semantic Financial Data Encoding with LLMs [49.57641083688934]
We introduce a novel approach to anomaly detection in financial data using Large Language Models (LLMs) embeddings.
Our experiments demonstrate that LLMs contribute valuable information to anomaly detection as our models outperform the baselines.
arXiv Detail & Related papers (2024-06-05T20:19:09Z) - Trustless Audits without Revealing Data or Models [49.23322187919369]
We show that it is possible to allow model providers to keep their model weights (but not architecture) and data secret while allowing other parties to trustlessly audit model and data properties.
We do this by designing a protocol called ZkAudit in which model providers publish cryptographic commitments of datasets and model weights.
arXiv Detail & Related papers (2024-04-06T04:43:06Z) - Privacy-Preserving Financial Anomaly Detection via Federated Learning & Multi-Party Computation [17.314619091307343]
We describe a privacy-preserving framework that allows financial institutions to jointly train highly accurate anomaly detection models.
We show that our solution enables the network to train a highly accurate anomaly detection model while preserving privacy of customer data.
arXiv Detail & Related papers (2023-10-06T19:16:41Z) - FedSOV: Federated Model Secure Ownership Verification with Unforgeable
Signature [60.99054146321459]
Federated learning allows multiple parties to collaborate in learning a global model without revealing private data.
We propose a cryptographic signature-based federated learning model ownership verification scheme named FedSOV.
arXiv Detail & Related papers (2023-05-10T12:10:02Z) - Auditing and Generating Synthetic Data with Controllable Trust Trade-offs [54.262044436203965]
We introduce a holistic auditing framework that comprehensively evaluates synthetic datasets and AI models.
It focuses on preventing bias and discrimination, ensures fidelity to the source data, assesses utility, robustness, and privacy preservation.
We demonstrate the framework's effectiveness by auditing various generative models across diverse use cases.
arXiv Detail & Related papers (2023-04-21T09:03:18Z) - Tight Auditing of Differentially Private Machine Learning [77.38590306275877]
For private machine learning, existing auditing mechanisms are tight.
They only give tight estimates under implausible worst-case assumptions.
We design an improved auditing scheme that yields tight privacy estimates for natural (not adversarially crafted) datasets.
arXiv Detail & Related papers (2023-02-15T21:40:33Z) - Federated Continual Learning to Detect Accounting Anomalies in Financial
Auditing [1.2205797997133396]
We propose a Federated Continual Learning framework enabling auditors to learn audit models from decentral clients continuously.
We evaluate the framework's ability to detect accounting anomalies in common scenarios of organizational activity.
arXiv Detail & Related papers (2022-10-26T21:33:08Z) - RESHAPE: Explaining Accounting Anomalies in Financial Statement Audits
by enhancing SHapley Additive exPlanations [1.3333957453318743]
We propose (RESHAPE) which explains the model output on an aggregated attribute-level.
Our results show empirical evidence that RESHAPE results in versatile explanations compared to state-of-the-art baselines.
We envision such attribute-level explanations as a necessary next step in the adoption of unsupervised DL techniques in financial auditing.
arXiv Detail & Related papers (2022-09-19T16:23:43Z) - Continual Learning for Unsupervised Anomaly Detection in Continuous
Auditing of Financial Accounting Data [1.9659095632676094]
International audit standards require the direct assessment of a financial statement's underlying accounting journal entries.
Deep-learning inspired audit techniques emerged to examine vast quantities of journal entry data.
This work proposes a continual anomaly detection framework to overcome both challenges and designed to learn from a stream of journal entry data experiences.
arXiv Detail & Related papers (2021-12-25T09:21:14Z) - Multi-view Contrastive Self-Supervised Learning of Accounting Data
Representations for Downstream Audit Tasks [1.9659095632676094]
International audit standards require the direct assessment of a financial statement's underlying accounting transactions, referred to as journal entries.
Deep learning inspired audit techniques have emerged in the field of auditing vast quantities of journal entry data.
We propose a contrastive self-supervised learning framework designed to learn audit task invariant accounting data representations.
arXiv Detail & Related papers (2021-09-23T08:16:31Z) - RoFL: Attestable Robustness for Secure Federated Learning [59.63865074749391]
Federated Learning allows a large number of clients to train a joint model without the need to share their private data.
To ensure the confidentiality of the client updates, Federated Learning systems employ secure aggregation.
We present RoFL, a secure Federated Learning system that improves robustness against malicious clients.
arXiv Detail & Related papers (2021-07-07T15:42:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.