A Framework for Pre-processing of Social Media Feeds based on Integrated
Local Knowledge Base
- URL: http://arxiv.org/abs/2006.15854v1
- Date: Mon, 29 Jun 2020 07:56:22 GMT
- Title: A Framework for Pre-processing of Social Media Feeds based on Integrated
Local Knowledge Base
- Authors: Taiwo Kolajo, Olawande Daramola, Ayodele Adebiyi, Seth Aaditeshwar
- Abstract summary: This paper proposes an improved framework for pre-processing of social media feeds for better performance.
The framework had an accuracy of 94.07% on a standardized dataset, and 99.78% on localised dataset when used to extract sentiments from tweets.
- Score: 1.5749416770494706
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Most of the previous studies on the semantic analysis of social media feeds
have not considered the issue of ambiguity that is associated with slangs,
abbreviations, and acronyms that are embedded in social media posts. These
noisy terms have implicit meanings and form part of the rich semantic context
that must be analysed to gain complete insights from social media feeds. This
paper proposes an improved framework for pre-processing of social media feeds
for better performance. To do this, the use of an integrated knowledge base
(ikb) which comprises a local knowledge source (Naijalingo), urban dictionary
and internet slang was combined with the adapted Lesk algorithm to facilitate
semantic analysis of social media feeds. Experimental results showed that the
proposed approach performed better than existing methods when it was tested on
three machine learning models, which are support vector machines, multilayer
perceptron, and convolutional neural networks. The framework had an accuracy of
94.07% on a standardized dataset, and 99.78% on localised dataset when used to
extract sentiments from tweets. The improved performance on the localised
dataset reveals the advantage of integrating the use of local knowledge sources
into the process of analysing social media feeds particularly in interpreting
slangs/acronyms/abbreviations that have contextually rooted meanings.
Related papers
- Text-Video Retrieval with Global-Local Semantic Consistent Learning [122.15339128463715]
We propose a simple yet effective method, Global-Local Semantic Consistent Learning (GLSCL)
GLSCL capitalizes on latent shared semantics across modalities for text-video retrieval.
Our method achieves comparable performance with SOTA as well as being nearly 220 times faster in terms of computational cost.
arXiv Detail & Related papers (2024-05-21T11:59:36Z) - Informed Meta-Learning [55.2480439325792]
Meta-learning and informed ML stand out as two approaches for incorporating prior knowledge into ML pipelines.
We formalise a hybrid paradigm, informed meta-learning, facilitating the incorporation of priors from unstructured knowledge representations.
We demonstrate the potential benefits of informed meta-learning in improving data efficiency, robustness to observational noise and task distribution shifts.
arXiv Detail & Related papers (2024-02-25T15:08:37Z) - HICL: Hashtag-Driven In-Context Learning for Social Media Natural
Language Understanding [15.743523533234224]
In this paper, we propose a novel hashtag-driven in-context learning framework for natural language understanding on social media.
Our objective is to enable a model #Encoder to incorporate topic-related semantic information, which allows it to retrieve topic-related posts.
For empirical studies, we collected 45M tweets to set up an in-context NLU benchmark, and the experimental results on seven downstream tasks show that HICL substantially advances the previous state-of-the-art results.
arXiv Detail & Related papers (2023-08-19T11:31:45Z) - Unsupervised Sentiment Analysis of Plastic Surgery Social Media Posts [91.3755431537592]
The massive collection of user posts across social media platforms is primarily untapped for artificial intelligence (AI) use cases.
Natural language processing (NLP) is a subfield of AI that leverages bodies of documents, known as corpora, to train computers in human-like language understanding.
This study demonstrates that the applied results of unsupervised analysis allow a computer to predict either negative, positive, or neutral user sentiment towards plastic surgery.
arXiv Detail & Related papers (2023-07-05T20:16:20Z) - Comparative Study of Sentiment Analysis for Multi-Sourced Social Media
Platforms [0.0]
In this paper, we provide a comparative analysis using techniques of lexicon-based, machine learning and deep learning approaches.
The dataset we used was a multi-source dataset from the comment section of various social networking sites like Twitter, Reddit, etc.
arXiv Detail & Related papers (2022-12-09T06:33:49Z) - Contextual information integration for stance detection via
cross-attention [59.662413798388485]
Stance detection deals with identifying an author's stance towards a target.
Most existing stance detection models are limited because they do not consider relevant contextual information.
We propose an approach to integrate contextual information as text.
arXiv Detail & Related papers (2022-11-03T15:04:29Z) - Data Expansion using Back Translation and Paraphrasing for Hate Speech
Detection [1.192436948211501]
We present a new deep learning-based method that fuses a Back Translation method, and a Paraphrasing technique for data augmentation.
We evaluate our proposal on five publicly available datasets; namely, AskFm corpus, Formspring dataset, Warner and Waseem dataset, Olid, and Wikipedia toxic comments dataset.
arXiv Detail & Related papers (2021-05-25T09:52:42Z) - Named Entity Recognition for Social Media Texts with Semantic
Augmentation [70.44281443975554]
Existing approaches for named entity recognition suffer from data sparsity problems when conducted on short and informal texts.
We propose a neural-based approach to NER for social media texts where both local (from running text) and augmented semantics are taken into account.
arXiv Detail & Related papers (2020-10-29T10:06:46Z) - Meta-Learning with Context-Agnostic Initialisations [86.47040878540139]
We introduce a context-adversarial component into the meta-learning process.
This produces an initialisation for fine-tuning to target which is context-agnostic and task-generalised.
We evaluate our approach on three commonly used meta-learning algorithms and two problems.
arXiv Detail & Related papers (2020-07-29T08:08:38Z) - Convolutional Neural Networks for Sentiment Analysis in Persian Social
Media [6.51882364384472]
We propose a sentiment analysis method for Persian text using Convolutional Neural Network (CNN)
We evaluate the method on three different datasets of Persian social media texts using Area under Curve metric.
arXiv Detail & Related papers (2020-02-14T19:52:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.