Datastore Design for Analysis of Police Broadcast Audio at Scale
- URL: http://arxiv.org/abs/2310.16956v1
- Date: Wed, 25 Oct 2023 19:52:19 GMT
- Title: Datastore Design for Analysis of Police Broadcast Audio at Scale
- Authors: Ayah Ahmad (1), Christopher Graziul (2), Margaret Beale Spencer (2)
((1) University of California, Berkeley, (2) University of Chicago)
- Abstract summary: We describe preliminary work towards enabling Speech Emotion Recognition (SER) in an analysis of the Chicago Police Department's (CPD)
We demonstrate the pipelined creation of a datastore to enable a multimodal analysis of composed raw audio files.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: With policing coming under greater scrutiny in recent years, researchers have
begun to more thoroughly study the effects of contact between police and
minority communities. Despite data archives of hundreds of thousands of
recorded Broadcast Police Communications (BPC) being openly available to the
public, a closer look at a large-scale analysis of the language of policing has
remained largely unexplored. While this research is critical in understanding a
"pre-reflective" notion of policing, the large quantity of data presents
numerous challenges in its organization and analysis.
In this paper, we describe preliminary work towards enabling Speech Emotion
Recognition (SER) in an analysis of the Chicago Police Department's (CPD) BPC
by demonstrating the pipelined creation of a datastore to enable a multimodal
analysis of composed raw audio files.
Related papers
- Understanding Audiovisual Deepfake Detection: Techniques, Challenges, Human Factors and Perceptual Insights [49.81915942821647]
Deep Learning has been successfully applied in diverse fields, and its impact on deepfake detection is no exception.
Deepfakes are fake yet realistic synthetic content that can be used deceitfully for political impersonation, phishing, slandering, or spreading misinformation.
This paper aims to improve the effectiveness of deepfake detection strategies and guide future research in cybersecurity and media integrity.
arXiv Detail & Related papers (2024-11-12T09:02:11Z) - Race and Privacy in Broadcast Police Communications [3.034710104407876]
We examine the Chicago Police Department's (CPD's) use of broadcast police communications (BPC) to coordinate activity of law enforcement officers (LEOs) in the city.
From a recently assembled archive of 80,775 hours of BPC associated with CPD operations, we analyze text transcripts of radio transmissions broadcast 9:00 AM to 5:00 PM on August 10th, 2018 in one majority Black, one majority white, and one majority Hispanic area.
We explore the vocabulary and speech acts used by police in BPC, comparing mentions of personal characteristics to local demographics, the personal information shared over BPC, and the
arXiv Detail & Related papers (2024-07-01T21:34:51Z) - A Multi-Perspective Machine Learning Approach to Evaluate Police-Driver
Interaction in Los Angeles [18.379058918856717]
Police officers, the most visible and contacted agents of the state, interact with the public more than 20 million times a year during traffic stops.
Body-worn cameras (BWCs) are lauded as a means to enhance police accountability and improve police-public interactions.
This article proposes an approach to developing new multi-perspective, multimodal machine learning (ML) tools to analyze the audio, video, and transcript information from this BWC footage.
arXiv Detail & Related papers (2024-01-24T19:56:20Z) - SER_AMPEL: a multi-source dataset for speech emotion recognition of
Italian older adults [58.49386651361823]
SER_AMPEL is a multi-source dataset for speech emotion recognition (SER)
It is collected with the aim of providing a reference for speech emotion recognition in case of Italian older adults.
The evidence of the need for such a dataset emerges from the analysis of the state of the art.
arXiv Detail & Related papers (2023-11-24T13:47:25Z) - Developing Speech Processing Pipelines for Police Accountability [22.711149484932527]
Police body-worn cameras have the potential to improve accountability and transparency in policing. Yet in practice, they result in millions of hours of footage that is never reviewed.
We investigate the potential of large pre-trained speech models for facilitating reviews, focusing on ASR and officer speech detection in footage from traffic stops.
Our proposed pipeline includes training data alignment and filtering, fine-tuning with resource constraints, and combining officer speech detection with ASR for a fully automated approach.
arXiv Detail & Related papers (2023-06-09T17:48:58Z) - Leveraging Large Language Models for Topic Classification in the Domain
of Public Affairs [65.9077733300329]
Large Language Models (LLMs) have the potential to greatly enhance the analysis of public affairs documents.
LLMs can be of great use to process domain-specific documents, such as those in the domain of public affairs.
arXiv Detail & Related papers (2023-06-05T13:35:01Z) - Retrieval Enhanced Data Augmentation for Question Answering on Privacy
Policies [74.01792675564218]
We develop a data augmentation framework based on ensembling retriever models that captures relevant text segments from unlabeled policy documents.
To improve the diversity and quality of the augmented data, we leverage multiple pre-trained language models (LMs) and cascade them with noise reduction filter models.
Using our augmented data on the PrivacyQA benchmark, we elevate the existing baseline by a large margin (10% F1) and achieve a new state-of-the-art F1 score of 50%.
arXiv Detail & Related papers (2022-04-19T15:45:23Z) - Librarian-in-the-Loop: A Natural Language Processing Paradigm for
Detecting Informal Mentions of Research Data in Academic Literature [1.4190701053683017]
We propose a natural language processing paradigm to support the human task of identifying informal mentions made to research datasets.
The work of discovering informal data mentions is currently performed by librarians and their staff in the Inter-university Consortium for Political and Social Research.
arXiv Detail & Related papers (2022-03-10T02:11:30Z) - APES: Audiovisual Person Search in Untrimmed Video [87.4124877066541]
We present the Audiovisual Person Search dataset (APES)
APES contains over 1.9K identities labeled along 36 hours of video.
A key property of APES is that it includes dense temporal annotations that link faces to speech segments of the same identity.
arXiv Detail & Related papers (2021-06-03T08:16:42Z) - Multi-Modal Video Forensic Platform for Investigating Post-Terrorist
Attack Scenarios [55.82693757287532]
Large scale Video Analytic Platforms (VAP) assist law enforcement agencies (LEA) in identifying suspects and securing evidence.
We present a video analytic platform that integrates visual and audio analytic modules and fuses information from surveillance cameras and video uploads from eyewitnesses.
arXiv Detail & Related papers (2020-04-02T14:29:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.