Multilevel sentiment analysis in arabic
- URL: http://arxiv.org/abs/2205.12328v1
- Date: Tue, 24 May 2022 19:16:06 GMT
- Title: Multilevel sentiment analysis in arabic
- Authors: Ahmed Nassar, Ebru Sezer
- Abstract summary: The average F-score achieved in the term level SA for both positive and negative testing classes is 0.92.
In the document level SA, the average F-score for positive testing classes is 0.94, while for negative classes is 0.93.
- Score: 1.4467794332678539
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In this study, we aimed to improve the performance results of Arabic
sentiment analysis. This can be achieved by investigating the most successful
machine learning method and the most useful feature vector to classify
sentiments in both term and document levels into two (positive or negative)
categories. Moreover, specification of one polarity degree for the term that
has more than one is investigated. Also to handle the negations and
intensifications, some rules are developed. According to the obtained results,
Artificial Neural Network classifier is nominated as the best classifier in
both term and document level sentiment analysis (SA) for Arabic Language.
Furthermore, the average F-score achieved in the term level SA for both
positive and negative testing classes is 0.92. In the document level SA, the
average F-score for positive testing classes is 0.94, while for negative
classes is 0.93.
Related papers
- You Shall Know a Tool by the Traces it Leaves: The Predictability of Sentiment Analysis Tools [74.98850427240464]
We show that sentiment analysis tools disagree on the same dataset.
We show that the sentiment tool used for sentiment annotation can even be predicted from its outcome.
arXiv Detail & Related papers (2024-10-18T17:27:38Z) - Achieving More with Less: A Tensor-Optimization-Powered Ensemble Method [53.170053108447455]
Ensemble learning is a method that leverages weak learners to produce a strong learner.
We design a smooth and convex objective function that leverages the concept of margin, making the strong learner more discriminative.
We then compare our algorithm with random forests of ten times the size and other classical methods across numerous datasets.
arXiv Detail & Related papers (2024-08-06T03:42:38Z) - Strategies for Arabic Readability Modeling [9.976720880041688]
Automatic readability assessment is relevant to building NLP applications for education, content analysis, and accessibility.
We present a set of experimental results on Arabic readability assessment using a diverse range of approaches.
arXiv Detail & Related papers (2024-07-03T11:54:11Z) - Comparative Analysis of Libraries for the Sentimental Analysis [0.0]
This study is main goal is to provide a comparative comparison of libraries using machine learning methods.
Five Python and R libraries NLTK, Textlob Vader, Transformers (GPT and BERT pretrained), and Tidytext will be used in the study to apply sentiment analysis techniques.
arXiv Detail & Related papers (2023-07-26T17:21:53Z) - Language Model Classifier Aligns Better with Physician Word Sensitivity
than XGBoost on Readmission Prediction [86.15787587540132]
We introduce sensitivity score, a metric that scrutinizes models' behaviors at the vocabulary level.
Our experiments compare the decision-making logic of clinicians and classifiers based on rank correlations of sensitivity scores.
arXiv Detail & Related papers (2022-11-13T23:59:11Z) - Enabling Classifiers to Make Judgements Explicitly Aligned with Human
Values [73.82043713141142]
Many NLP classification tasks, such as sexism/racism detection or toxicity detection, are based on human values.
We introduce a framework for value-aligned classification that performs prediction based on explicitly written human values in the command.
arXiv Detail & Related papers (2022-10-14T09:10:49Z) - Negation Handling in Machine Learning-Based Sentiment Classification for
Colloquial Arabic [0.0]
The role of negation in Arabic sentiment analysis has been explored only to a limited extent, especially for colloquial Arabic.
We propose a simple rule-based algorithm for handling the problem; the rules were crafted based on observing many cases of negation.
arXiv Detail & Related papers (2021-07-24T13:12:37Z) - Automatic Difficulty Classification of Arabic Sentences [0.0]
The accuracy of our 3-way CEFR classification is F-1 of 0.80 and 0.75 for Arabic-Bert and XLM-R classification respectively and 0.71 Spearman correlation for regression.
We compare the use of sentence embeddings of different kinds (fastText, mBERT, XLM-R and Arabic-BERT) as well as traditional language features such as POS tags, dependency trees, readability scores and frequency lists for language learners.
arXiv Detail & Related papers (2021-03-07T16:02:04Z) - Negative Margin Matters: Understanding Margin in Few-shot Classification [72.85978953262004]
This paper introduces a negative margin loss to metric learning based few-shot learning methods.
The negative margin loss significantly outperforms regular softmax loss, and state-of-the-art accuracy on three standard few-shot classification benchmarks.
arXiv Detail & Related papers (2020-03-26T17:59:05Z) - On Positive-Unlabeled Classification in GAN [130.43248168149432]
This paper defines a positive and unlabeled classification problem for standard GANs.
It then leads to a novel technique to stabilize the training of the discriminator in GANs.
arXiv Detail & Related papers (2020-02-04T05:59:37Z) - Text Complexity Classification Based on Linguistic Information:
Application to Intelligent Tutoring of ESL [0.0]
The goal of this work is to build a classifier that can identify text complexity within the context of teaching reading to English as a Second Language ( ESL) learners.
Using a corpus of 6171 texts, which had already been classified into three different levels of difficulty by ESL experts, different experiments were conducted with five machine learning algorithms.
The results showed that the adopted linguistic features provide a good overall classification performance.
arXiv Detail & Related papers (2020-01-07T02:42:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.