Application of Data Science to Discover Violence-Related Issues in Iraq
- URL: http://arxiv.org/abs/2006.07980v1
- Date: Sun, 14 Jun 2020 18:58:25 GMT
- Title: Application of Data Science to Discover Violence-Related Issues in Iraq
- Authors: Merari Gonz\'alez, Germ\'an H. Alf\'erez
- Abstract summary: There is a lack of governmental open data to discover social issues in Iraq.
Our contribution is the application of data science to open non-governmental big data to discover violence-related social issues in Iraq.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Data science has been satisfactorily used to discover social issues in
several parts of the world. However, there is a lack of governmental open data
to discover those issues in countries such as Iraq. This situation arises the
following questions: how to apply data science principles to discover social
issues despite the lack of open data in Iraq? How to use the available data to
make predictions in places without data? Our contribution is the application of
data science to open non-governmental big data from the Global Database of
Events, Language, and Tone (GDELT) to discover particular violence-related
social issues in Iraq. Specifically we applied the K-Nearest Neighbors, N\"aive
Bayes, Decision Trees, and Logistic Regression classification algorithms to
discover the following issues: refugees, humanitarian aid, violent protests,
fights with artillery and tanks, and mass killings. The best results were
obtained with the Decision Trees algorithm to discover areas with refugee
crises and artillery fights. The accuracy for these two events is 0.7629. The
precision to discover the locations of refugee crises is 0.76, the recall is
0.76, and the F1-score is 0.76. Also, our approach discovers the locations of
artillery fights with a precision of 0.74, a recall of 0.75, and a F1-score of
0.75.
Related papers
- To Search or Not to Search: Aligning the Decision Boundary of Deep Search Agents via Causal Intervention [61.82680155643223]
We identify the root cause of misaligned decision boundaries, the threshold determining when accumulated information suffices to answer.<n>This causes over-search (redundant searching despite sufficient knowledge) and under-search (premature termination yielding incorrect answers.<n>We propose a comprehensive framework comprising two key components. First, we introduce causal intervention-based diagnosis that identifies boundary errors.<n>Second, we develop Decision Boundary Alignment for Deep Search agents (DAS)<n>Our DAS method effectively calibrates these boundaries, mitigating both over-search and under-search to achieve substantial gains in accuracy and efficiency.
arXiv Detail & Related papers (2026-02-03T09:29:06Z) - Shoot First, Ask Questions Later? Building Rational Agents that Explore and Act Like People [81.63702981397408]
Given limited resources, to what extent do agents based on language models (LMs) act rationally?<n>We develop methods to benchmark and enhance agentic information-seeking, drawing on insights from human behavior.<n>For Spotter agents, our approach boosts accuracy by up to 14.7% absolute over LM-only baselines; for Captain agents, it raises expected information gain (EIG) by up to 0.227 bits (94.2% of the achievable noise ceiling)
arXiv Detail & Related papers (2025-10-23T17:57:28Z) - BRIGHT: A globally distributed multimodal building damage assessment dataset with very-high-resolution for all-weather disaster response [50.76124284445902]
Building damage assessment (BDA) is an essential capability in the aftermath of a disaster to reduce human casualties.<n>Recent research focuses on the development of AI models to achieve accurate mapping of unseen disaster events.<n>We present a BDA dataset using veRy-hIGH-resoluTion optical and SAR imagery (BRIGHT) to support AI-based all-weather disaster response.
arXiv Detail & Related papers (2025-01-10T14:57:18Z) - The Landscape of Causal Discovery Data: Grounding Causal Discovery in Real-World Applications [47.62544556500003]
Causal discovery aims to automatically uncover causal relationships from data.
Current methods often rely on unrealistic assumptions and are evaluated only on simple synthetic toy datasets.
We present applications in biology, neuroscience, and Earth sciences.
arXiv Detail & Related papers (2024-12-02T20:26:29Z) - Causal Micro-Narratives [62.47217054314046]
We present a novel approach to classify causal micro-narratives from text.
These narratives are sentence-level explanations of the cause(s) and/or effect(s) of a target subject.
arXiv Detail & Related papers (2024-10-07T17:55:10Z) - On Responsible Machine Learning Datasets with Fairness, Privacy, and Regulatory Norms [56.119374302685934]
There have been severe concerns over the trustworthiness of AI technologies.
Machine and deep learning algorithms depend heavily on the data used during their development.
We propose a framework to evaluate the datasets through a responsible rubric.
arXiv Detail & Related papers (2023-10-24T14:01:53Z) - High Accuracy Location Information Extraction from Social Network Texts
Using Natural Language Processing [0.0]
This paper is part of a research project that uses text from social networks to extract necessary information to build an adequate dataset for terrorist attack prediction.
We collected a set of 3000 social network texts about terrorism in Burkina Faso and used a subset to experiment with existing NLP solutions.
The experiment reveals that existing solutions have poor accuracy for location recognition, which our solution resolves.
arXiv Detail & Related papers (2023-08-31T10:21:24Z) - Xenophobic Events vs. Refugee Population -- Using GDELT to Identify
Countries with Disproportionate Coverage [0.3867363075280544]
We used the Global Database of Events, Language, and Tone (GDELT) database to examine xenophobic events reported in the media during 2022.
We collected a dataset of 2,778 unique events and created a choropleth map illustrating the frequency of events scaled by the refugee population's proportion in each host country.
Contrary to the belief that hosting a significant number of forced migrants results in higher xenophobic incidents, our findings indicate a potential connection to political factors.
arXiv Detail & Related papers (2023-08-09T16:10:05Z) - Word Sense Disambiguation as a Game of Neurosymbolic Darts [3.0572129477925727]
We propose a novel neurosymbolic methodology to push the F1 score above 90%.
The core of our methodology is a neurosymbolic sense embedding, in terms of a configuration of nested balls in n-dimensional space.
We trained a Transformer to learn the mapping from a contextualized word embedding to its sense ball embedding, just like playing the game of darts.
arXiv Detail & Related papers (2023-07-25T07:22:57Z) - One-Shot Learning for Periocular Recognition: Exploring the Effect of
Domain Adaptation and Data Bias on Deep Representations [59.17685450892182]
We investigate the behavior of deep representations in widely used CNN models under extreme data scarcity for One-Shot periocular recognition.
We improved state-of-the-art results that made use of networks trained with biometric datasets with millions of images.
Traditional algorithms like SIFT can outperform CNNs in situations with limited data.
arXiv Detail & Related papers (2023-07-11T09:10:16Z) - MLGWSC-1: The first Machine Learning Gravitational-Wave Search Mock Data
Challenge [110.7678032481059]
We present the results of the first Machine Learning Gravitational-Wave Search Mock Data Challenge (MLGWSC-1).
For this challenge, participating groups had to identify gravitational-wave signals from binary black hole mergers of increasing complexity and duration embedded in progressively more realistic noise.
Our results show that current machine learning search algorithms may already be sensitive enough in limited parameter regions to be useful for some production settings.
arXiv Detail & Related papers (2022-09-22T16:44:59Z) - Knowledge Graph Question Answering Leaderboard: A Community Resource to
Prevent a Replication Crisis [61.740077541531726]
We provide a new central and open leaderboard for any KGQA benchmark dataset as a focal point for the community.
Our analysis highlights existing problems during the evaluation of KGQA systems.
arXiv Detail & Related papers (2022-01-20T13:46:01Z) - Understanding peacefulness through the world news [1.6975704972827304]
We exploit information extracted from Global Data on Events, Location, and Tone (GDELT) digital news database to capture peacefulness through the Global Peace Index (GPI)
Applying predictive machine learning models, we demonstrate that news media attention from GDELT can be used as a proxy for measuring GPI at a monthly level.
arXiv Detail & Related papers (2021-06-01T08:24:57Z) - AutoSpace: Neural Architecture Search with Less Human Interference [84.42680793945007]
Current neural architecture search (NAS) algorithms still require expert knowledge and effort to design a search space for network construction.
We propose a novel differentiable evolutionary framework named AutoSpace, which evolves the search space to an optimal one.
With the learned search space, the performance of recent NAS algorithms can be improved significantly compared with using previously manually designed spaces.
arXiv Detail & Related papers (2021-03-22T13:28:56Z) - A Comparative Study on Crime in Denver City Based on Machine Learning
and Data Mining [0.0]
I analyzed a real-world crime and accident dataset of Denver county, USA, from January 2014 to May 2019.
This project aims to predict and highlights the trends of occurrence that will, in return, support the law enforcement agencies and government to discover the preventive measures.
The outcomes are captured using two popular test methods: train-test split, and k-fold crossvalidation.
arXiv Detail & Related papers (2020-01-09T01:36:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.