TechTexC: Classification of Technical Texts using Convolution and
Bidirectional Long Short Term Memory Network
- URL: http://arxiv.org/abs/2012.11420v1
- Date: Mon, 21 Dec 2020 15:22:47 GMT
- Title: TechTexC: Classification of Technical Texts using Convolution and
Bidirectional Long Short Term Memory Network
- Authors: Omar Sharif, Eftekhar Hossain, Mohammed Moshiul Hoque
- Abstract summary: A classification system (called 'TechTexC') is developed to perform the classification task using three techniques.
Results show that CNN with BiLSTM model outperforms the other techniques concerning task-1 of sub-tasks (a, b, c and g) and task-2a.
In the case of test set, the combined CNN with BiLSTM approach achieved that higher accuracy for the subtasks 1a (70.76%), 1b (79.97%), 1c (65.45%), 1g (49.23%) and 2a (70.14%)
- Score: 0.0
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: This paper illustrates the details description of technical text
classification system and its results that developed as a part of participation
in the shared task TechDofication 2020. The shared task consists of two
sub-tasks: (i) first task identify the coarse-grained technical domain of given
text in a specified language and (ii) the second task classify a text of
computer science domain into fine-grained sub-domains. A classification system
(called 'TechTexC') is developed to perform the classification task using three
techniques: convolution neural network (CNN), bidirectional long short term
memory (BiLSTM) network, and combined CNN with BiLSTM. Results show that CNN
with BiLSTM model outperforms the other techniques concerning task-1 of
sub-tasks (a, b, c and g) and task-2a. This combined model obtained f1 scores
of 82.63 (sub-task a), 81.95 (sub-task b), 82.39 (sub-task c), 84.37 (sub-task
g), and 67.44 (task-2a) on the development dataset. Moreover, in the case of
test set, the combined CNN with BiLSTM approach achieved that higher accuracy
for the subtasks 1a (70.76%), 1b (79.97%), 1c (65.45%), 1g (49.23%) and 2a
(70.14%).
Related papers
- SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection [68.858931667807]
Subtask A is a binary classification task determining whether a text is written by a human or generated by a machine.
Subtask B is to detect the exact source of a text, discerning whether it is written by a human or generated by a specific LLM.
Subtask C aims to identify the changing point within a text, at which the authorship transitions from human to machine.
arXiv Detail & Related papers (2024-04-22T13:56:07Z) - Provable Multi-Task Representation Learning by Two-Layer ReLU Neural Networks [69.38572074372392]
We present the first results proving that feature learning occurs during training with a nonlinear model on multiple tasks.
Our key insight is that multi-task pretraining induces a pseudo-contrastive loss that favors representations that align points that typically have the same label across tasks.
arXiv Detail & Related papers (2023-07-13T16:39:08Z) - Multi-Task Classification of Sewer Pipe Defects and Properties using a
Cross-Task Graph Neural Network Decoder [56.673599764041384]
We present a novel decoder-focused multi-task classification architecture Cross-Task Graph Neural Network (CT-GNN)
CT-GNN refines the disjointed per-task predictions using cross-task information.
We achieve state-of-the-art performance on all four classification tasks in the Sewer-ML dataset.
arXiv Detail & Related papers (2021-11-15T15:36:50Z) - UPB at SemEval-2021 Task 7: Adversarial Multi-Task Learning for
Detecting and Rating Humor and Offense [0.6404122934568858]
We describe our adversarial multi-task network, AMTL-Humor, used to detect and rate humor and offensive texts.
Our best model consists of an ensemble of all tested configurations, and achieves a 95.66% F1-score and 94.70% accuracy for Task 1a.
arXiv Detail & Related papers (2021-04-13T09:59:05Z) - Environmental sound analysis with mixup based multitask learning and
cross-task fusion [0.12891210250935145]
acoustic scene classification and acoustic event classification are two closely related tasks.
In this letter, a two-stage method is proposed for the above tasks.
The proposed method has confirmed the complementary characteristics of acoustic scene and acoustic event classifications.
arXiv Detail & Related papers (2021-03-30T05:11:53Z) - YNU-HPCC at SemEval-2020 Task 11: LSTM Network for Detection of
Propaganda Techniques in News Articles [5.352512345142247]
This paper summarizes our studies on propaganda detection techniques for news articles in the SemEval-2020 task 11.
We implement the GloVe word representation, the BERT pretraining model, and the LSTM model architecture to accomplish this task.
Our method significantly outperforms the officially released baseline method, and the SI and TC subtasks rank 17th and 22nd, respectively, for the test set.
arXiv Detail & Related papers (2020-08-24T02:42:12Z) - CyberWallE at SemEval-2020 Task 11: An Analysis of Feature Engineering
for Ensemble Models for Propaganda Detection [0.0]
We use a bi-LSTM architecture in the Span Identification subtask and train a complex ensemble model for the Technique Classification subtask.
Our systems achieve a rank of 8 out of 35 teams in the SI subtask and 8 out of 31 teams in the TC subtask.
arXiv Detail & Related papers (2020-08-22T15:51:16Z) - Device-Robust Acoustic Scene Classification Based on Two-Stage
Categorization and Data Augmentation [63.98724740606457]
We present a joint effort of four groups, namely GT, USTC, Tencent, and UKE, to tackle Task 1 - Acoustic Scene Classification (ASC) in the DCASE 2020 Challenge.
Task 1a focuses on ASC of audio signals recorded with multiple (real and simulated) devices into ten different fine-grained classes.
Task 1b concerns with classification of data into three higher-level classes using low-complexity solutions.
arXiv Detail & Related papers (2020-07-16T15:07:14Z) - Boundary-assisted Region Proposal Networks for Nucleus Segmentation [89.69059532088129]
Machine learning models cannot perform well because of large amount of crowded nuclei.
We devise a Boundary-assisted Region Proposal Network (BRP-Net) that achieves robust instance-level nucleus segmentation.
arXiv Detail & Related papers (2020-06-04T08:26:38Z) - MetricUNet: Synergistic Image- and Voxel-Level Learning for Precise CT
Prostate Segmentation via Online Sampling [66.01558025094333]
We propose a two-stage framework, with the first stage to quickly localize the prostate region and the second stage to precisely segment the prostate.
We introduce a novel online metric learning module through voxel-wise sampling in the multi-task network.
Our method can effectively learn more representative voxel-level features compared with the conventional learning methods with cross-entropy or Dice loss.
arXiv Detail & Related papers (2020-05-15T10:37:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.