BPGC at SemEval-2020 Task 11: Propaganda Detection in News Articles with
Multi-Granularity Knowledge Sharing and Linguistic Features based Ensemble
Learning
- URL: http://arxiv.org/abs/2006.00593v2
- Date: Mon, 24 Aug 2020 12:34:38 GMT
- Title: BPGC at SemEval-2020 Task 11: Propaganda Detection in News Articles with
Multi-Granularity Knowledge Sharing and Linguistic Features based Ensemble
Learning
- Authors: Rajaswa Patil, Somesh Singh and Swati Agarwal
- Abstract summary: SemEval 2020 Task-11 aims to design automated systems for news propaganda detection.
Task-11 consists of two sub-tasks, namely, Span Identification and Technique Classification.
- Score: 2.8913142991383114
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Propaganda spreads the ideology and beliefs of like-minded people,
brainwashing their audiences, and sometimes leading to violence. SemEval 2020
Task-11 aims to design automated systems for news propaganda detection. Task-11
consists of two sub-tasks, namely, Span Identification - given any news
article, the system tags those specific fragments which contain at least one
propaganda technique; and Technique Classification - correctly classify a given
propagandist statement amongst 14 propaganda techniques. For sub-task 1, we use
contextual embeddings extracted from pre-trained transformer models to
represent the text data at various granularities and propose a
multi-granularity knowledge sharing approach. For sub-task 2, we use an
ensemble of BERT and logistic regression classifiers with linguistic features.
Our results reveal that the linguistic features are the strong indicators for
covering minority classes in a highly imbalanced dataset.
Related papers
- PropaInsight: Toward Deeper Understanding of Propaganda in Terms of Techniques, Appeals, and Intent [71.20471076045916]
Propaganda plays a critical role in shaping public opinion and fueling disinformation.
Propainsight systematically dissects propaganda into techniques, arousal appeals, and underlying intent.
Propagaze combines human-annotated data with high-quality synthetic data.
arXiv Detail & Related papers (2024-09-19T06:28:18Z) - HuBERTopic: Enhancing Semantic Representation of HuBERT through
Self-supervision Utilizing Topic Model [62.995175485416]
We propose a new approach to enrich the semantic representation of HuBERT.
An auxiliary topic classification task is added to HuBERT by using topic labels as teachers.
Experimental results demonstrate that our method achieves comparable or better performance than the baseline in most tasks.
arXiv Detail & Related papers (2023-10-06T02:19:09Z) - IITD at the WANLP 2022 Shared Task: Multilingual Multi-Granularity
Network for Propaganda Detection [25.536546272915427]
We present our system for two subtasks of the shared task on propaganda detection in Arabic.
Subtask 1 is a multi-label classification problem to find the propaganda techniques used in a given tweet.
Subtask 2 asks to identify the textual span for each instance of each technique that is present in the tweet.
arXiv Detail & Related papers (2022-10-31T10:14:43Z) - Faking Fake News for Real Fake News Detection: Propaganda-loaded
Training Data Generation [105.20743048379387]
We propose a novel framework for generating training examples informed by the known styles and strategies of human-authored propaganda.
Specifically, we perform self-critical sequence training guided by natural language inference to ensure the validity of the generated articles.
Our experimental results show that fake news detectors trained on PropaNews are better at detecting human-written disinformation by 3.62 - 7.69% F1 score on two public datasets.
arXiv Detail & Related papers (2022-03-10T14:24:19Z) - Cross-Domain Learning for Classifying Propaganda in Online Contents [67.10699378370752]
We present an approach to leverage cross-domain learning, based on labeled documents and sentences from news and tweets, as well as political speeches with a clear difference in their degrees of being propagandistic.
Our experiments demonstrate the usefulness of this approach, and identify difficulties and limitations in various configurations of sources and targets for the transfer step.
arXiv Detail & Related papers (2020-11-13T10:19:13Z) - SemEval-2020 Task 11: Detection of Propaganda Techniques in News
Articles [0.6999740786886536]
We present the results of SemEval-2020 Task 11 on Detection of Propaganda Techniques in News Articles.
The task featured two subtasks: Span Identification and Technique Classification.
For both subtasks, the best systems used pre-trained Transformers and ensembles.
arXiv Detail & Related papers (2020-09-06T10:05:43Z) - DUTH at SemEval-2020 Task 11: BERT with Entity Mapping for Propaganda
Classification [1.5469452301122173]
This report describes the methods employed by the Democritus University of Thrace (DUTH) team for participating in SemEval-2020 Task 11: Detection of Propaganda Techniques in News Articles.
arXiv Detail & Related papers (2020-08-22T18:18:02Z) - LTIatCMU at SemEval-2020 Task 11: Incorporating Multi-Level Features for
Multi-Granular Propaganda Span Identification [70.1903083747775]
This paper describes our submission for the task of Propaganda Span Identification in news articles.
We introduce a BERT-BiLSTM based span-level propaganda classification model that identifies which token spans within the sentence are indicative of propaganda.
arXiv Detail & Related papers (2020-08-11T16:14:47Z) - newsSweeper at SemEval-2020 Task 11: Context-Aware Rich Feature
Representations For Propaganda Classification [2.0491741153610334]
This paper describes our submissions to SemEval 2020 Task 11: Detection of Propaganda Techniques in News Articles.
We make use of pre-trained BERT language model enhanced with tagging techniques developed for the task of Named Entity Recognition.
For the second subtask, we incorporate contextual features in a pre-trained RoBERTa model for the classification of propaganda techniques.
arXiv Detail & Related papers (2020-07-21T14:06:59Z) - Leveraging Declarative Knowledge in Text and First-Order Logic for
Fine-Grained Propaganda Detection [139.3415751957195]
We study the detection of propagandistic text fragments in news articles.
We introduce an approach to inject declarative knowledge of fine-grained propaganda techniques.
arXiv Detail & Related papers (2020-04-29T13:46:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.