Shayona@SMM4H23: COVID-19 Self diagnosis classification using BERT and
LightGBM models
- URL: http://arxiv.org/abs/2401.02158v1
- Date: Thu, 4 Jan 2024 09:13:18 GMT
- Title: Shayona@SMM4H23: COVID-19 Self diagnosis classification using BERT and
LightGBM models
- Authors: Rushi Chavda, Darshan Makwana, Vraj Patel, Anupam Shukla
- Abstract summary: This paper describes approaches and results for shared Task 1 and 4 of SMMH4-23 by Team Shayona.
Our team has achieved the highest f1-score 0.94 in Task-1 among all participants.
- Score: 1.5566524830295307
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper describes approaches and results for shared Task 1 and 4 of
SMMH4-23 by Team Shayona. Shared Task-1 was binary classification of english
tweets self-reporting a COVID-19 diagnosis, and Shared Task-4 was Binary
classification of English Reddit posts self-reporting a social anxiety disorder
diagnosis. Our team has achieved the highest f1-score 0.94 in Task-1 among all
participants. We have leveraged the Transformer model (BERT) in combination
with the LightGBM model for both tasks.
Related papers
- LT4SG@SMM4H24: Tweets Classification for Digital Epidemiology of Childhood Health Outcomes Using Pre-Trained Language Models [1.0312118123538199]
This paper presents our approaches for the SMM4H24 Shared Task 5 on the binary classification of English tweets reporting children's medical disorders.
Our best-performing system achieves an F1-score of 0.938 on test data, outperforming the benchmark by 1.18%.
arXiv Detail & Related papers (2024-06-11T22:48:18Z) - ThangDLU at #SMM4H 2024: Encoder-decoder models for classifying text data on social disorders in children and adolescents [49.00494558898933]
This paper describes our participation in Task 3 and Task 5 of the #SMM4H (Social Media Mining for Health) 2024 Workshop.
Task 3 is a multi-class classification task centered on tweets discussing the impact of outdoor environments on symptoms of social anxiety.
Task 5 involves a binary classification task focusing on tweets reporting medical disorders in children.
We applied transfer learning from pre-trained encoder-decoder models such as BART-base and T5-small to identify the labels of a set of given tweets.
arXiv Detail & Related papers (2024-04-30T17:06:20Z) - SemEval-2024 Shared Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes [48.83290963506378]
This paper presents the results of the SHROOM, a shared task focused on detecting hallucinations.
We observe a number of key trends in how this approach was tackled.
While a majority of the teams did outperform our proposed baseline system, the performances of top-scoring systems are still consistent with a random handling of the more challenging items.
arXiv Detail & Related papers (2024-03-12T15:06:22Z) - MANTIS at #SMM4H 2023: Leveraging Hybrid and Ensemble Models for
Detection of Social Anxiety Disorder on Reddit [29.857322234373317]
This paper presents our system employed for the Social Media Mining for Health 2023 Shared Task 4: Binary classification of English Reddit posts self-reporting a social anxiety disorder diagnosis.
We investigate and contrast the efficacy of hybrid and ensemble models that harness specialized medical domain-adapted transformers in conjunction with BiLSTM neural networks.
arXiv Detail & Related papers (2023-11-28T09:33:41Z) - tmn at #SMM4H 2023: Comparing Text Preprocessing Techniques for
Detecting Tweets Self-reporting a COVID-19 Diagnosis [1.8492669447784602]
The paper describes a system developed for Task 1 at SMM4H 2023.
The goal of the task is to automatically distinguish tweets that self-report a COVID-19 diagnosis from those that do not.
arXiv Detail & Related papers (2023-11-01T07:41:23Z) - Customizing General-Purpose Foundation Models for Medical Report
Generation [64.31265734687182]
The scarcity of labelled medical image-report pairs presents great challenges in the development of deep and large-scale neural networks.
We propose customizing off-the-shelf general-purpose large-scale pre-trained models, i.e., foundation models (FMs) in computer vision and natural language processing.
arXiv Detail & Related papers (2023-06-09T03:02:36Z) - When Accuracy Meets Privacy: Two-Stage Federated Transfer Learning
Framework in Classification of Medical Images on Limited Data: A COVID-19
Case Study [77.34726150561087]
COVID-19 pandemic has spread rapidly and caused a shortage of global medical resources.
CNN has been widely utilized and verified in analyzing medical images.
arXiv Detail & Related papers (2022-03-24T02:09:41Z) - The Report on China-Spain Joint Clinical Testing for Rapid COVID-19 Risk
Screening by Eye-region Manifestations [59.48245489413308]
We developed and tested a COVID-19 rapid prescreening model using the eye-region images captured in China and Spain with cellphone cameras.
The performance was measured using area under receiver-operating-characteristic curve (AUC), sensitivity, specificity, accuracy, and F1.
arXiv Detail & Related papers (2021-09-18T02:28:01Z) - Fine-Tuning Transformers for Identifying Self-Reporting Potential Cases
and Symptoms of COVID-19 in Tweets [5.425235965110337]
We describe our straight-forward approach for Tasks 5 and 6 of 2021 Social Media Mining for Health Applications (SMM4H) shared tasks.
Our system is based on fine-tuning Distill- BERT on each task, as well as first fine-tuning the model on the other task.
arXiv Detail & Related papers (2021-04-12T14:31:51Z) - CO-Search: COVID-19 Information Retrieval with Semantic Search, Question
Answering, and Abstractive Summarization [53.67205506042232]
CO-Search is a retriever-ranker semantic search engine designed to handle complex queries over the COVID-19 literature.
To account for the domain-specific and relatively limited dataset, we generate a bipartite graph of document paragraphs and citations.
We evaluate our system on the data of the TREC-COVID information retrieval challenge.
arXiv Detail & Related papers (2020-06-17T01:32:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.