The Saudi Privacy Policy Dataset
- URL: http://arxiv.org/abs/2304.02757v1
- Date: Wed, 5 Apr 2023 21:40:37 GMT
- Title: The Saudi Privacy Policy Dataset
- Authors: Hend Al-Khalifa, Malak Mashaabi, Ghadi Al-Yahya and Raghad Alnashwan
- Abstract summary: The paper introduces a diverse compilation of privacy policies from various sectors in Saudi Arabia.
The final dataset includes 1,000 websites belonging to 7 sectors, 4,638 lines of text, 775,370 tokens, and a corpus size of 8,353 KB.
The paper aims to further research and development in the areas of privacy policy analysis, natural language processing, and machine learning applications related to privacy and data protection.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: This paper introduces the Saudi Privacy Policy Dataset, a diverse compilation
of Arabic privacy policies from various sectors in Saudi Arabia, annotated
according to the 10 principles of the Personal Data Protection Law (PDPL); the
PDPL was established to be compatible with General Data Protection Regulation
(GDPR); one of the most comprehensive data regulations worldwide. Data were
collected from multiple sources, including the Saudi Central Bank, the Saudi
Arabia National United Platform, the Council of Health Insurance, and general
websites using Google and Wikipedia. The final dataset includes 1,000 websites
belonging to 7 sectors, 4,638 lines of text, 775,370 tokens, and a corpus size
of 8,353 KB. The annotated dataset offers significant reuse potential for
assessing privacy policy compliance, benchmarking privacy practices across
industries, and developing automated tools for monitoring adherence to data
protection regulations. By providing a comprehensive and annotated dataset of
privacy policies, this paper aims to facilitate further research and
development in the areas of privacy policy analysis, natural language
processing, and machine learning applications related to privacy and data
protection, while also serving as an essential resource for researchers,
policymakers, and industry professionals interested in understanding and
promoting compliance with privacy regulations in Saudi Arabia.
Related papers
- Differential Privacy Overview and Fundamental Techniques [63.0409690498569]
This chapter is meant to be part of the book "Differential Privacy in Artificial Intelligence: From Theory to Practice"
It starts by illustrating various attempts to protect data privacy, emphasizing where and why they failed.
It then defines the key actors, tasks, and scopes that make up the domain of privacy-preserving data analysis.
arXiv Detail & Related papers (2024-11-07T13:52:11Z) - Interactive GDPR-Compliant Privacy Policy Generation for Software Applications [6.189770781546807]
To use software applications users are sometimes requested to provide their personal information.
As privacy has become a significant concern many protection regulations exist worldwide.
We propose an approach that generates comprehensive and compliant privacy policy.
arXiv Detail & Related papers (2024-10-04T01:22:16Z) - A BERT-based Empirical Study of Privacy Policies' Compliance with GDPR [9.676166100354282]
This study aims to address challenge of compliance analysis between privacy policies for 5G networks.
We manually collected privacy policies from almost 70 different MNOs and we utilized an automated BERT-based model for classification.
In addition, we present first empirical evidence on the readability of privacy policies for 5G network. we adopted incorporates various established readability metrics.
arXiv Detail & Related papers (2024-07-09T11:47:52Z) - Collection, usage and privacy of mobility data in the enterprise and public administrations [55.2480439325792]
Security measures such as anonymization are needed to protect individuals' privacy.
Within our study, we conducted expert interviews to gain insights into practices in the field.
We survey privacy-enhancing methods in use, which generally do not comply with state-of-the-art standards of differential privacy.
arXiv Detail & Related papers (2024-07-04T08:29:27Z) - SoK: The Gap Between Data Rights Ideals and Reality [46.14715472341707]
Do rights-based privacy laws effectively empower individuals over their data?
This paper scrutinizes these approaches by reviewing empirical studies, news articles, and blog posts.
arXiv Detail & Related papers (2023-12-03T21:52:51Z) - Advancing Differential Privacy: Where We Are Now and Future Directions for Real-World Deployment [100.1798289103163]
We present a detailed review of current practices and state-of-the-art methodologies in the field of differential privacy (DP)
Key points and high-level contents of the article were originated from the discussions from "Differential Privacy (DP): Challenges Towards the Next Frontier"
This article aims to provide a reference point for the algorithmic and design decisions within the realm of privacy, highlighting important challenges and potential research directions.
arXiv Detail & Related papers (2023-04-14T05:29:18Z) - PLUE: Language Understanding Evaluation Benchmark for Privacy Policies
in English [77.79102359580702]
We introduce the Privacy Policy Language Understanding Evaluation benchmark, a multi-task benchmark for evaluating the privacy policy language understanding.
We also collect a large corpus of privacy policies to enable privacy policy domain-specific language model pre-training.
We demonstrate that domain-specific continual pre-training offers performance improvements across all tasks.
arXiv Detail & Related papers (2022-12-20T05:58:32Z) - A Fine-grained Chinese Software Privacy Policy Dataset for Sequence
Labeling and Regulation Compliant Identification [23.14031861460124]
We construct the first Chinese privacy policy dataset, CA4P-483, to facilitate the sequence labeling tasks and regulation compliance identification.
Our dataset includes 483 Chinese Android application privacy policies, over 11K sentences, and 52K fine-grained annotations.
arXiv Detail & Related papers (2022-12-04T05:59:59Z) - Associating eHealth Policies and National Data Privacy Regulations [1.713291434132985]
This project aims to evaluate and highlight associations between systems' policies and privacy regulations.
Using bias-corrected Cramer's V and Thiel's U tests we found weak zero associations between e-health systems' rules protections for data and personal privacy.
arXiv Detail & Related papers (2022-02-27T21:22:48Z) - Detecting Compliance of Privacy Policies with Data Protection Laws [0.0]
Privacy policies are often written in extensive legal jargon that is difficult to understand.
We aim to bridge that gap by providing a framework that analyzes privacy policies in light of various data protection laws.
By using such a tool, users would be better equipped to understand how their personal data is managed.
arXiv Detail & Related papers (2021-02-21T09:15:15Z) - Beyond privacy regulations: an ethical approach to data usage in
transportation [64.86110095869176]
We describe how Federated Machine Learning can be applied to the transportation sector.
We see Federated Learning as a method that enables us to process privacy-sensitive data, while respecting customer's privacy.
arXiv Detail & Related papers (2020-04-01T15:10:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.