SoK: Content Moderation for End-to-End Encryption
- URL: http://arxiv.org/abs/2303.03979v1
- Date: Tue, 7 Mar 2023 15:26:41 GMT
- Title: SoK: Content Moderation for End-to-End Encryption
- Authors: Sarah Scheffler and Jonathan Mayer
- Abstract summary: Messaging applications now enable end-to-end-encryption (E2EE) by default, and E2EE data storage is becoming common.
These important advances for security and privacy create new content moderation challenges for online services.
We bridge literature that is diverse in both content moderation subject matter, such as malware, spam, hate speech, terrorist content, and enterprise policy compliance.
- Score: 2.66512000865131
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Popular messaging applications now enable end-to-end-encryption (E2EE) by
default, and E2EE data storage is becoming common. These important advances for
security and privacy create new content moderation challenges for online
services, because services can no longer directly access plaintext content.
While ongoing public policy debates about E2EE and content moderation in the
United States and European Union emphasize child sexual abuse material and
misinformation in messaging and storage, we identify and synthesize a wealth of
scholarship that goes far beyond those topics. We bridge literature that is
diverse in both content moderation subject matter, such as malware, spam, hate
speech, terrorist content, and enterprise policy compliance, as well as
intended deployments, including not only privacy-preserving content moderation
for messaging, email, and cloud storage, but also private introspection of
encrypted web traffic by middleboxes. In this work, we systematize the study of
content moderation in E2EE settings. We set out a process pipeline for content
moderation, drawing on a broad interdisciplinary literature that is not
specific to E2EE. We examine cryptography and policy design choices at all
stages of this pipeline, and we suggest areas of future research to fill gaps
in literature and better understand possible paths forward.
Related papers
- StopHC: A Harmful Content Detection and Mitigation Architecture for Social Media Platforms [0.46289929100614996]
textscStopHC is a harmful content detection and mitigation architecture for social media platforms.
Our solution contains two modules, one that employs deep neural network architecture for harmful content detection, and one that uses a network immunization algorithm to block toxic nodes and stop the spread of harmful content.
arXiv Detail & Related papers (2024-11-09T10:23:22Z) - Private Hierarchical Governance for Encrypted Messaging [20.838090769270607]
We propose private hierarchical governance systems using E2EE messaging for privacy.
We show how an extension to the message layer security protocol suffices for achieving a rich set of governance policies.
We build a prototype E2EE messaging system called MlsGov that supports content-based community and platform moderation, elections of community moderators, votes to remove abusive users, and more.
arXiv Detail & Related papers (2024-06-27T17:33:23Z) - Content Moderation on Social Media in the EU: Insights From the DSA
Transparency Database [0.0]
Digital Services Act (DSA) requires large social media platforms in the EU to provide clear and specific information whenever they restrict access to certain content.
Statements of Reasons (SoRs) are collected in the DSA Transparency Database to ensure transparency and scrutiny of content moderation decisions.
We empirically analyze 156 million SoRs within an observation period of two months to provide an early look at content moderation decisions of social media platforms in the EU.
arXiv Detail & Related papers (2023-12-07T16:56:19Z) - Why Should This Article Be Deleted? Transparent Stance Detection in
Multilingual Wikipedia Editor Discussions [47.944081120226905]
We construct a novel dataset of Wikipedia editor discussions along with their reasoning in three languages.
The dataset contains the stances of the editors (keep, delete, merge, comment), along with the stated reason, and a content moderation policy, for each edit decision.
We demonstrate that stance and corresponding reason (policy) can be predicted jointly with a high degree of accuracy, adding transparency to the decision-making process.
arXiv Detail & Related papers (2023-10-09T15:11:02Z) - Enhancing End-to-End Conversational Speech Translation Through Target
Language Context Utilization [73.85027121522295]
We introduce target language context in E2E-ST, enhancing coherence and overcoming memory constraints of extended audio segments.
Our proposed contextual E2E-ST outperforms the isolated utterance-based E2E-ST approach.
arXiv Detail & Related papers (2023-09-27T14:32:30Z) - A Unified Framework for Integrating Semantic Communication and
AI-Generated Content in Metaverse [57.317580645602895]
Integrated Semantic Communication and AI-Generated Content (ISGC) has attracted a lot of attentions recently.
ISGC transfers semantic information from user inputs, generates digital content, and renders graphics for Metaverse.
We introduce a unified framework that captures ISGC two primary benefits, including integration gain for optimized resource allocation.
arXiv Detail & Related papers (2023-05-18T02:02:36Z) - Advancing Differential Privacy: Where We Are Now and Future Directions for Real-World Deployment [100.1798289103163]
We present a detailed review of current practices and state-of-the-art methodologies in the field of differential privacy (DP)
Key points and high-level contents of the article were originated from the discussions from "Differential Privacy (DP): Challenges Towards the Next Frontier"
This article aims to provide a reference point for the algorithmic and design decisions within the realm of privacy, highlighting important challenges and potential research directions.
arXiv Detail & Related papers (2023-04-14T05:29:18Z) - Leveraging Large Text Corpora for End-to-End Speech Summarization [58.673480990374635]
End-to-end speech summarization (E2E SSum) is a technique to directly generate summary sentences from speech.
We present two novel methods that leverage a large amount of external text summarization data for E2E SSum training.
arXiv Detail & Related papers (2023-03-02T05:19:49Z) - SoK: Content Moderation Schemes in End-to-End Encrypted Systems [0.6138671548064355]
We study the unique features of some content moderation techniques, such as message franking and perceptual hashing.
This has led researchers to develop remediations and design new security primitives to make content moderation compatible with end-to-end encryption systems.
arXiv Detail & Related papers (2022-08-23T18:27:28Z) - Abstractive Summarization of Spoken and Written Instructions with BERT [66.14755043607776]
We present the first application of the BERTSum model to conversational language.
We generate abstractive summaries of narrated instructional videos across a wide variety of topics.
We envision this integrated as a feature in intelligent virtual assistants, enabling them to summarize both written and spoken instructional content upon request.
arXiv Detail & Related papers (2020-08-21T20:59:34Z) - WAC: A Corpus of Wikipedia Conversations for Online Abuse Detection [0.0]
We propose an original framework, based on the Wikipedia Comment corpus, with comment-level annotations of different types.
This large corpus of more than 380k annotated messages opens perspectives for online abuse detection and especially for context-based approaches.
We also propose, in addition to this corpus, a complete benchmarking platform to stimulate and fairly compare scientific works around the problem of content abuse detection.
arXiv Detail & Related papers (2020-03-13T10:26:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.