Related papers: Application of Data Science to Discover Violence-Related Issues in Iraq

URL: http://arxiv.org/abs/2006.07980v1
Date: Sun, 14 Jun 2020 18:58:25 GMT
Title: Application of Data Science to Discover Violence-Related Issues in Iraq
Authors: Merari González, Germán H. Alférez
Abstract summary: There is a lack of governmental open data to discover social issues in Iraq. Our contribution is the application of data science to open non-governmental big data to discover violence-related social issues in Iraq.
Score: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Data science has been satisfactorily used to discover social issues in several parts of the world. However, there is a lack of governmental open data to discover those issues in countries such as Iraq. This situation raises the following questions: how can data science principles be applied to discover social issues despite the lack of open data in Iraq? How can the available data be used to make predictions in places without data? Our contribution is the application of data science to open non-governmental big data from the Global Database of Events, Language, and Tone (GDELT) to discover particular violence-related social issues in Iraq. Specifically, we applied the K-Nearest Neighbors, Naive Bayes, Decision Trees, and Logistic Regression classification algorithms to discover the following issues: refugees, humanitarian aid, violent protests, fights with artillery and tanks, and mass killings. The best results were obtained with the Decision Trees algorithm to discover areas with refugee crises and artillery fights. The accuracy for these two events is 0.7629. The precision to discover the locations of refugee crises is 0.76, the recall is 0.76, and the F1-score is 0.76. Also, our approach discovers the locations of artillery fights with a precision of 0.74, a recall of 0.75, and an F1-score of 0.75.
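The abstract reports precision, recall, and F1-score for each event type. As a reminder of how these metrics relate, here is a minimal sketch using their standard definitions. The `Counts` class, the helper names, and the confusion counts are hypothetical and chosen only for illustration; they are not taken from the paper, which may have computed averaged or weighted variants of these scores.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Counts:
    """Hypothetical confusion counts for one event class (not from the paper)."""
    tp: int  # true positives: locations correctly flagged
    fp: int  # false positives: locations flagged by mistake
    fn: int  # false negatives: locations that were missed


def precision(c: Counts) -> float:
    """Fraction of flagged locations that were correct."""
    flagged = c.tp + c.fp
    return c.tp / flagged if flagged else 0.0


def recall(c: Counts) -> float:
    """Fraction of actual locations that were flagged."""
    actual = c.tp + c.fn
    return c.tp / actual if actual else 0.0


def f1_score(p: float, r: float) -> float:
    """Harmonic mean of precision and recall."""
    return 2 * p * r / (p + r) if (p + r) else 0.0


# Illustrative counts only, picked so that precision and recall both equal 0.76.
example = Counts(tp=76, fp=24, fn=24)
p, r = precision(example), recall(example)
print(f"precision={p:.2f} recall={r:.2f} f1={f1_score(p, r):.2f}")
```

When precision and recall are equal, the harmonic mean equals that same value, which is consistent with the 0.76 / 0.76 / 0.76 figures reported for refugee crises.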

Related papers

The Landscape of Causal Discovery Data: Grounding Causal Discovery in Real-World Applications [47.62544556500003]
Causal discovery aims to automatically uncover causal relationships from data. Current methods often rely on unrealistic assumptions and are evaluated only on simple synthetic toy datasets. We present applications in biology, neuroscience, and Earth sciences.
arXiv Detail & Related papers (2024-12-02T20:26:29Z)
Causal Micro-Narratives [62.47217054314046]
We present a novel approach to classify causal micro-narratives from text. These narratives are sentence-level explanations of the cause(s) and/or effect(s) of a target subject.
arXiv Detail & Related papers (2024-10-07T17:55:10Z)
On Responsible Machine Learning Datasets with Fairness, Privacy, and Regulatory Norms [56.119374302685934]
There have been severe concerns over the trustworthiness of AI technologies. Machine and deep learning algorithms depend heavily on the data used during their development. We propose a framework to evaluate the datasets through a responsible rubric.
arXiv Detail & Related papers (2023-10-24T14:01:53Z)
High Accuracy Location Information Extraction from Social Network Texts Using Natural Language Processing [0.0]
This paper is part of a research project that uses text from social networks to extract necessary information to build an adequate dataset for terrorist attack prediction. We collected a set of 3000 social network texts about terrorism in Burkina Faso and used a subset to experiment with existing NLP solutions. The experiment reveals that existing solutions have poor accuracy for location recognition, which our solution resolves.
arXiv Detail & Related papers (2023-08-31T10:21:24Z)
Xenophobic Events vs. Refugee Population -- Using GDELT to Identify Countries with Disproportionate Coverage [0.3867363075280544]
We used the Global Database of Events, Language, and Tone (GDELT) database to examine xenophobic events reported in the media during 2022. We collected a dataset of 2,778 unique events and created a choropleth map illustrating the frequency of events scaled by the refugee population's proportion in each host country. Contrary to the belief that hosting a significant number of forced migrants results in higher xenophobic incidents, our findings indicate a potential connection to political factors.
arXiv Detail & Related papers (2023-08-09T16:10:05Z)
Word Sense Disambiguation as a Game of Neurosymbolic Darts [3.0572129477925727]
We propose a novel neurosymbolic methodology to push the F1 score above 90%. The core of our methodology is a neurosymbolic sense embedding, in terms of a configuration of nested balls in n-dimensional space. We trained a Transformer to learn the mapping from a contextualized word embedding to its sense ball embedding, just like playing the game of darts.
arXiv Detail & Related papers (2023-07-25T07:22:57Z)
One-Shot Learning for Periocular Recognition: Exploring the Effect of Domain Adaptation and Data Bias on Deep Representations [59.17685450892182]
We investigate the behavior of deep representations in widely used CNN models under extreme data scarcity for One-Shot periocular recognition. We improved state-of-the-art results that made use of networks trained with biometric datasets with millions of images. Traditional algorithms like SIFT can outperform CNNs in situations with limited data.
arXiv Detail & Related papers (2023-07-11T09:10:16Z)
MLGWSC-1: The first Machine Learning Gravitational-Wave Search Mock Data Challenge [110.7678032481059]
We present the results of the first Machine Learning Gravitational-Wave Search Mock Data Challenge (MLGWSC-1). For this challenge, participating groups had to identify gravitational-wave signals from binary black hole mergers of increasing complexity and duration embedded in progressively more realistic noise. Our results show that current machine learning search algorithms may already be sensitive enough in limited parameter regions to be useful for some production settings.
arXiv Detail & Related papers (2022-09-22T16:44:59Z)
Knowledge Graph Question Answering Leaderboard: A Community Resource to Prevent a Replication Crisis [61.740077541531726]
We provide a new central and open leaderboard for any KGQA benchmark dataset as a focal point for the community. Our analysis highlights existing problems during the evaluation of KGQA systems.
arXiv Detail & Related papers (2022-01-20T13:46:01Z)
Understanding peacefulness through the world news [1.6975704972827304]
We exploit information extracted from the Global Data on Events, Location, and Tone (GDELT) digital news database to capture peacefulness through the Global Peace Index (GPI). Applying predictive machine learning models, we demonstrate that news media attention from GDELT can be used as a proxy for measuring GPI at a monthly level.
arXiv Detail & Related papers (2021-06-01T08:24:57Z)
AutoSpace: Neural Architecture Search with Less Human Interference [84.42680793945007]
Current neural architecture search (NAS) algorithms still require expert knowledge and effort to design a search space for network construction. We propose a novel differentiable evolutionary framework named AutoSpace, which evolves the search space to an optimal one. With the learned search space, the performance of recent NAS algorithms can be improved significantly compared with using previously manually designed spaces.
arXiv Detail & Related papers (2021-03-22T13:28:56Z)
A Comparative Study on Crime in Denver City Based on Machine Learning and Data Mining [0.0]
I analyzed a real-world crime and accident dataset of Denver County, USA, from January 2014 to May 2019. This project aims to predict and highlight trends of occurrence that will, in turn, help law enforcement agencies and government discover preventive measures. The outcomes are captured using two popular test methods: train-test split and k-fold cross-validation.
arXiv Detail & Related papers (2020-01-09T01:36:11Z)

This list is automatically generated from the titles and abstracts of the papers on this site.