Detecting Hostile Posts using Relational Graph Convolutional Network
- URL: http://arxiv.org/abs/2101.03485v1
- Date: Sun, 10 Jan 2021 06:50:22 GMT
- Title: Detecting Hostile Posts using Relational Graph Convolutional Network
- Authors: Sarthak, Shikhar Shukla, Govind Mittal, Karm Veer Arya
- Abstract summary: This work is based on the submission to competition conducted by AAAI@2021 for detection of hostile posts in Hindi on social media platforms.
Here, a model is presented for classification of hostile posts using Convolutional Networks.
The proposed model is performing at par with Google's XLM-RoBERTa on the given dataset.
Among all submissions to the challenge, our classification system with XLMRoberta secured 2nd rank on fine-grained classification.
- Score: 1.8734449181723827
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This work is based on the submission to the competition Hindi Constraint
conducted by AAAI@2021 for detection of hostile posts in Hindi on social media
platforms. Here, a model is presented for detection and classification of
hostile posts and further classify into fake, offensive, hate and defamation
using Relational Graph Convolutional Networks. Unlike other existing work, our
approach is focused on using semantic meaning along with contextutal
information for better classification. The results from AAAI@2021 indicates
that the proposed model is performing at par with Google's XLM-RoBERTa on the
given dataset. Our best submission with RGCN achieves an F1 score of 0.97 (7th
Rank) on coarse-grained evaluation and achieved best performance on identifying
fake posts. Among all submissions to the challenge, our classification system
with XLM-Roberta secured 2nd rank on fine-grained classification.
Related papers
- Text Augmentations with R-drop for Classification of Tweets Self
Reporting Covid-19 [28.91836510067532]
This paper presents models created for the Social Media Mining for Health 2023 shared task.
Our approach involves a classification model that incorporates diverse textual augmentations.
Our system achieves an impressive F1 score of 0.877 on the test set.
arXiv Detail & Related papers (2023-11-06T14:18:16Z) - Fine-grained Recognition with Learnable Semantic Data Augmentation [68.48892326854494]
Fine-grained image recognition is a longstanding computer vision challenge.
We propose diversifying the training data at the feature-level to alleviate the discriminative region loss problem.
Our method significantly improves the generalization performance on several popular classification networks.
arXiv Detail & Related papers (2023-09-01T11:15:50Z) - Okapi: Generalising Better by Making Statistical Matches Match [7.392460712829188]
Okapi is a simple, efficient, and general method for robust semi-supervised learning based on online statistical matching.
Our method uses a nearest-neighbours-based matching procedure to generate cross-domain views for a consistency loss.
We show that it is in fact possible to leverage additional unlabelled data to improve upon empirical risk minimisation.
arXiv Detail & Related papers (2022-11-07T12:41:17Z) - No Fear of Heterogeneity: Classifier Calibration for Federated Learning
with Non-IID Data [78.69828864672978]
A central challenge in training classification models in the real-world federated system is learning with non-IID data.
We propose a novel and simple algorithm called Virtual Representations (CCVR), which adjusts the classifier using virtual representations sampled from an approximated ssian mixture model.
Experimental results demonstrate that CCVR state-of-the-art performance on popular federated learning benchmarks including CIFAR-10, CIFAR-100, and CINIC-10.
arXiv Detail & Related papers (2021-06-09T12:02:29Z) - KBCNMUJAL@HASOC-Dravidian-CodeMix-FIRE2020: Using Machine Learning for
Detection of Hate Speech and Offensive Code-Mixed Social Media text [1.0499611180329804]
This paper describes the system submitted by our team, KBCNMUJAL, for Task 2 of the shared task Hate Speech and Offensive Content Identification in Indo-European languages.
The datasets of two Dravidian languages Viz. Malayalam and Tamil of size 4000 observations, each were shared by the HASOC organizers.
The best performing classification models developed for both languages are applied on test datasets.
arXiv Detail & Related papers (2021-02-19T11:08:02Z) - Divide and Conquer: An Ensemble Approach for Hostile Post Detection in
Hindi [25.723773314371947]
The data for this task is provided in Hindi Devanagari script which was collected from Twitter and Facebook.
It is a multi-label multi-class classification problem where each data instance is annotated into one or more of the five classes: fake, hate, offensive, defamation, and non-hostile.
Our team 'Albatross', scored 0.9709 Coarse grained hostility F1 score measure on Hostile Post Detection in Hindi subtask and secured 2nd rank out of 45 teams for the task.
arXiv Detail & Related papers (2021-01-20T05:38:07Z) - LaDiff ULMFiT: A Layer Differentiated training approach for ULMFiT [0.0]
We propose a Layer Differentiated training procedure for training a pre-trained ULMFiT arXiv:1801.06146 model.
We used special tokens to annotate specific parts of the tweets to improve language understanding and gain insights on the model.
The proposed approach ranked 61st out of 164 in the sub-task "COVID19 Fake News Detection in English"
arXiv Detail & Related papers (2021-01-13T09:52:04Z) - Anomaly Detection-Based Unknown Face Presentation Attack Detection [74.4918294453537]
Anomaly detection-based spoof attack detection is a recent development in face Presentation Attack Detection.
In this paper, we present a deep-learning solution for anomaly detection-based spoof attack detection.
The proposed approach benefits from the representation learning power of the CNNs and learns better features for fPAD task.
arXiv Detail & Related papers (2020-07-11T21:20:55Z) - ScoreGAN: A Fraud Review Detector based on Multi Task Learning of
Regulated GAN with Data Augmentation [50.779498955162644]
We propose ScoreGAN for fraud review detection that makes use of both review text and review rating scores in the generation and detection process.
Results show that the proposed framework outperformed the existing state-of-the-art framework, namely FakeGAN, in terms of AP by 7%, and 5% on the Yelp and TripAdvisor datasets.
arXiv Detail & Related papers (2020-06-11T16:15:06Z) - SCAN: Learning to Classify Images without Labels [73.69513783788622]
We advocate a two-step approach where feature learning and clustering are decoupled.
A self-supervised task from representation learning is employed to obtain semantically meaningful features.
We obtain promising results on ImageNet, and outperform several semi-supervised learning methods in the low-data regime.
arXiv Detail & Related papers (2020-05-25T18:12:33Z) - High-Order Information Matters: Learning Relation and Topology for
Occluded Person Re-Identification [84.43394420267794]
We propose a novel framework by learning high-order relation and topology information for discriminative features and robust alignment.
Our framework significantly outperforms state-of-the-art by6.5%mAP scores on Occluded-Duke dataset.
arXiv Detail & Related papers (2020-03-18T12:18:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.