Whose Tweets are Surveilled for the Police: An Audit of Social-Media
Monitoring Tool via Log Files
- URL: http://arxiv.org/abs/2001.08777v1
- Date: Thu, 23 Jan 2020 19:35:12 GMT
- Title: Whose Tweets are Surveilled for the Police: An Audit of Social-Media
Monitoring Tool via Log Files
- Authors: Glencora Borradaile, Brett Burkhardt, Alexandria LeClerc
- Abstract summary: We obtained log files from the Corvallis (Oregon) Police Department's use of social media monitoring software called DigitalStakeout.
These log files include the results of proprietary searches by DigitalStakeout that were running over a period of 13 months and include 7240 social media posts.
We observe differences in the demographics of the users whose Tweets are flagged by DigitalStakeout compared to the demographics of the Twitter users in the region.
- Score: 69.02688684221265
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Social media monitoring by law enforcement is becoming commonplace, but
little is known about what software packages for it do. Through public records
requests, we obtained log files from the Corvallis (Oregon) Police Department's
use of social media monitoring software called DigitalStakeout. These log files
include the results of proprietary searches by DigitalStakeout that were
running over a period of 13 months and include 7240 social media posts. In this
paper, we focus on the Tweets logged in this data and consider the racial and
ethnic identity (through manual coding) of the users that are therein flagged
by DigitalStakeout. We observe differences in the demographics of the users
whose Tweets are flagged by DigitalStakeout compared to the demographics of the
Twitter users in the region, however, our sample size is too small to determine
significance. Further, the demographics of the Twitter users in the region do
not seem to reflect that of the residents of the region, with an apparent
higher representation of Black and Hispanic people. We also reconstruct the
keywords related to a Narcotics report set up by DigitalStakeout for the
Corvallis Police Department and find that these keywords flag Tweets unrelated
to narcotics or flag Tweets related to marijuana, a drug that is legal for
recreational use in Oregon. Almost all of the keywords have a common meaning
unrelated to narcotics (e.g.\ broken, snow, hop, high) that call into question
the utility that such a keyword based search could have to law enforcement.
Related papers
- Semantic Similarity Models for Depression Severity Estimation [53.72188878602294]
This paper presents an efficient semantic pipeline to study depression severity in individuals based on their social media writings.
We use test user sentences for producing semantic rankings over an index of representative training sentences corresponding to depressive symptoms and severity levels.
We evaluate our methods on two Reddit-based benchmarks, achieving 30% improvement over state of the art in terms of measuring depression severity.
arXiv Detail & Related papers (2022-11-14T18:47:26Z) - Manipulating Twitter Through Deletions [64.33261764633504]
Research into influence campaigns on Twitter has mostly relied on identifying malicious activities from tweets obtained via public APIs.
Here, we provide the first exhaustive, large-scale analysis of anomalous deletion patterns involving more than a billion deletions by over 11 million accounts.
We find that a small fraction of accounts delete a large number of tweets daily.
First, limits on tweet volume are circumvented, allowing certain accounts to flood the network with over 26 thousand daily tweets.
Second, coordinated networks of accounts engage in repetitive likes and unlikes of content that is eventually deleted, which can manipulate ranking algorithms.
arXiv Detail & Related papers (2022-03-25T20:07:08Z) - Semi-supervised Stance Detection of Tweets Via Distant Network
Supervision [32.86421107987556]
Homophily properties over the social network provide strong signal of coarse-grained user-level stance.
We present SANDS, a new semi-supervised stance detector.
Sands achieves a macro-F1 score of 0.55 (0.49) on US (India)-based datasets.
arXiv Detail & Related papers (2022-01-03T13:04:54Z) - Analyzing the Impact of COVID-19 on Economy from the Perspective of
Users Reviews [0.0]
A large number of Coronavirus-related tweets are considered and analyzed using natural language processing and information retrieval science.
From the analysis of the charts, we learn that the reason for publishing economic tweets is not only the increase in the number of people infected with the Coronavirus but also imposed restrictions and lockdowns in countries.
arXiv Detail & Related papers (2021-10-05T17:44:41Z) - News consumption and social media regulations policy [70.31753171707005]
We analyze two social media that enforced opposite moderation methods, Twitter and Gab, to assess the interplay between news consumption and content regulation.
Our results show that the presence of moderation pursued by Twitter produces a significant reduction of questionable content.
The lack of clear regulation on Gab results in the tendency of the user to engage with both types of content, showing a slight preference for the questionable ones which may account for a dissing/endorsement behavior.
arXiv Detail & Related papers (2021-06-07T19:26:32Z) - Machine Translation, Sentiment Analysis, Text Similarity, Topic
Modelling, and Tweets: Understanding Social Media Usage Among Police and
Gendarmerie Organizations [0.0]
The study was aimed to investigate the use of social media by the gendarmerie and police organizations operating in Turkey.
It was found that Jandarma (Turkey) has the highest power of influence in the twitter sample.
arXiv Detail & Related papers (2021-01-29T18:26:24Z) - Intersectional Bias in Hate Speech and Abusive Language Datasets [0.3149883354098941]
African American tweets were up to 3.7 times more likely to be labeled as abusive.
African American male tweets were up to 77% more likely to be labeled as hateful.
This study provides the first systematic evidence on intersectional bias in datasets of hate speech and abusive language.
arXiv Detail & Related papers (2020-05-12T16:58:48Z) - Word frequency and sentiment analysis of twitter messages during
Coronavirus pandemic [0.0]
The social networking site, Twitter, demonstrated similar effect with the number of posts related to coronavirus showing an unprecedented growth in a very short span of time.
This paper presents a statistical analysis of the twitter messages related to this disease posted since January 2020.
Results showed that the majority of the tweets had a positive polarity and only about 15% were negative.
arXiv Detail & Related papers (2020-04-08T10:45:08Z) - Forensic Authorship Analysis of Microblogging Texts Using N-Grams and
Stylometric Features [63.48764893706088]
This work aims at identifying authors of tweet messages, which are limited to 280 characters.
We use for our experiments a self-captured database of 40 users, with 120 to 200 tweets per user.
Results using this small set are promising, with the different features providing a classification accuracy between 92% and 98.5%.
arXiv Detail & Related papers (2020-03-24T19:32:11Z) - #MeToo on Campus: Studying College Sexual Assault at Scale Using Data
Reported on Social Media [71.74529365205053]
We analyze the influence of the # trend on a pool of college followers.
The results show that the majority of topics embedded in those # tweets detail sexual harassment stories.
There exists a significant correlation between the prevalence of this trend and official reports on several major geographical regions.
arXiv Detail & Related papers (2020-01-16T18:05:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.