Reed at SemEval-2020 Task 9: Fine-Tuning and Bag-of-Words Approaches to
Code-Mixed Sentiment Analysis
- URL: http://arxiv.org/abs/2007.13061v2
- Date: Tue, 4 Aug 2020 05:13:57 GMT
- Title: Reed at SemEval-2020 Task 9: Fine-Tuning and Bag-of-Words Approaches to
Code-Mixed Sentiment Analysis
- Authors: Vinay Gopalan, Mark Hopkins
- Abstract summary: We explore the task of sentiment analysis on Hinglish (code-mixed Hindi-English) tweets as participants of Task 9 of the SemEval-2020 competition, known as the SentiMix task.
We had two main approaches: 1) applying transfer learning by fine-tuning pre-trained BERT models and 2) training feedforward neural networks on bag-of-words representations.
During the evaluation phase of the competition, we obtained an F-score of 71.3% with our best model, which placed $4th$ out of 62 entries in the official system rankings.
- Score: 1.2147145617662432
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We explore the task of sentiment analysis on Hinglish (code-mixed
Hindi-English) tweets as participants of Task 9 of the SemEval-2020
competition, known as the SentiMix task. We had two main approaches: 1)
applying transfer learning by fine-tuning pre-trained BERT models and 2)
training feedforward neural networks on bag-of-words representations. During
the evaluation phase of the competition, we obtained an F-score of 71.3% with
our best model, which placed $4^{th}$ out of 62 entries in the official system
rankings.
Related papers
- SmurfCat at SemEval-2024 Task 6: Leveraging Synthetic Data for Hallucination Detection [51.99159169107426]
We present our novel systems developed for the SemEval-2024 hallucination detection task.
Our investigation spans a range of strategies to compare model predictions with reference standards.
We introduce three distinct methods that exhibit strong performance metrics.
arXiv Detail & Related papers (2024-04-09T09:03:44Z) - BJTU-WeChat's Systems for the WMT22 Chat Translation Task [66.81525961469494]
This paper introduces the joint submission of the Beijing Jiaotong University and WeChat AI to the WMT'22 chat translation task for English-German.
Based on the Transformer, we apply several effective variants.
Our systems achieve 0.810 and 0.946 COMET scores.
arXiv Detail & Related papers (2022-11-28T02:35:04Z) - UrduFake@FIRE2020: Shared Track on Fake News Identification in Urdu [62.6928395368204]
This paper gives the overview of the first shared task at FIRE 2020 on fake news detection in the Urdu language.
The goal is to identify fake news using a dataset composed of 900 annotated news articles for training and 400 news articles for testing.
The dataset contains news in five domains: (i) Health, (ii) Sports, (iii) Showbiz, (iv) Technology, and (v) Business.
arXiv Detail & Related papers (2022-07-25T03:46:51Z) - Overview of the Shared Task on Fake News Detection in Urdu at FIRE 2020 [62.6928395368204]
Task was posed as a binary classification task, in which the goal is to differentiate between real and fake news.
We provided a dataset divided into 900 annotated news articles for training and 400 news articles for testing.
42 teams from 6 different countries (India, China, Egypt, Germany, Pakistan, and the UK) registered for the task.
arXiv Detail & Related papers (2022-07-25T03:41:32Z) - RuArg-2022: Argument Mining Evaluation [69.87149207721035]
This paper is a report of the organizers on the first competition of argumentation analysis systems dealing with Russian language texts.
A corpus containing 9,550 sentences (comments on social media posts) on three topics related to the COVID-19 pandemic was prepared.
The system that won the first place in both tasks used the NLI (Natural Language Inference) variant of the BERT architecture.
arXiv Detail & Related papers (2022-06-18T17:13:37Z) - Palomino-Ochoa at SemEval-2020 Task 9: Robust System based on
Transformer for Code-Mixed Sentiment Classification [1.6244541005112747]
We present a transfer learning system to perform a mixed Spanish-English sentiment classification task.
Our proposal uses the state-of-the-art language model BERT and embed it within a ULMFiT transfer learning pipeline.
arXiv Detail & Related papers (2020-11-18T18:25:58Z) - LT3 at SemEval-2020 Task 9: Cross-lingual Embeddings for Sentiment
Analysis of Hinglish Social Media Text [1.0152838128195465]
We investigate two approaches to solve the task of Hinglish sentiment analysis.
The first approach uses cross-lingual embeddings resulting from projecting Hinglish and pre-trained English FastText word embeddings.
The second approach incorporates pre-trained English embeddings that are incrementally retrained with a set of Hinglish tweets.
arXiv Detail & Related papers (2020-10-21T14:03:16Z) - WESSA at SemEval-2020 Task 9: Code-Mixed Sentiment Analysis using
Transformers [0.0]
We describe our system submitted for SemEval 2020 Task 9, Sentiment Analysis for Code-Mixed Social Media Text.
Our best performing system is a Transfer Learning-based model that fine-tunes "XLM-RoBERTa"
For later submissions, our system manages to achieve a 75.9% average F1-Score on the test set using CodaLab username "ahmed0sultan"
arXiv Detail & Related papers (2020-09-21T13:59:24Z) - SemEval-2020 Task 9: Overview of Sentiment Analysis of Code-Mixed Tweets [29.74702868712367]
We present the results of the SemEval-2020 Task 9 on Sentiment Analysis of Code-Mixed Tweets (SentiMix 2020)
We release and describe our Hinglish (Hindi-English) and Spanglish (Spanish-English) corpora annotated with word-level language identification and sentence-level sentiment labels.
arXiv Detail & Related papers (2020-08-10T17:17:52Z) - Deep Learning Brasil -- NLP at SemEval-2020 Task 9: Overview of
Sentiment Analysis of Code-Mixed Tweets [0.2294014185517203]
In this paper, we describe a methodology to predict sentiment in code-mixed tweets (hindi-english)
Our team called verissimo.manoel in CodaLab developed an approach based on an ensemble of four models.
The final classification algorithm was an ensemble of some predictions of all softmax values from these four models.
arXiv Detail & Related papers (2020-07-28T16:42:41Z) - Emotion Recognition for In-the-wild Videos [92.01434273996097]
This paper is a brief introduction to our submission to the seven basic expression classification track of Affective Behavior Analysis in-the-wild Competition held in conjunction with the IEEE International Conference on Automatic Face and Gesture Recognition (FG) 2020.
Our method combines Deep Residual Network (ResNet) and Bidirectional Long Short-Term Memory Network (BLSTM), achieving 64.3% accuracy and 43.4% final metric on the validation set.
arXiv Detail & Related papers (2020-02-13T11:29:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.