UQ at #SMM4H 2023: ALEX for Public Health Analysis with Social Media
- URL: http://arxiv.org/abs/2309.04213v2
- Date: Tue, 12 Sep 2023 07:19:22 GMT
- Title: UQ at #SMM4H 2023: ALEX for Public Health Analysis with Social Media
- Authors: Yan Jiang, Ruihong Qiu, Yi Zhang, Zi Huang
- Abstract summary: Current techniques for public health analysis involve popular models such as BERT and large language models (LLMs)
In this paper, a novel ALEX framework is proposed to improve the performance of public health analysis on social media.
- Score: 33.081637097464146
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: As social media becomes increasingly popular, more and more activities
related to public health emerge. Current techniques for public health analysis
involve popular models such as BERT and large language models (LLMs). However,
the costs of training in-domain LLMs for public health are especially
expensive. Furthermore, such kinds of in-domain datasets from social media are
generally imbalanced. To tackle these challenges, the data imbalance issue can
be overcome by data augmentation and balanced training. Moreover, the ability
of the LLMs can be effectively utilized by prompting the model properly. In
this paper, a novel ALEX framework is proposed to improve the performance of
public health analysis on social media by adopting an LLMs explanation
mechanism. Results show that our ALEX model got the best performance among all
submissions in both Task 2 and Task 4 with a high score in Task 1 in Social
Media Mining for Health 2023 (SMM4H)[1]. Our code has been released at https://
github.com/YanJiangJerry/ALEX.
Related papers
- CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs [62.84082370758761]
CharXiv is a comprehensive evaluation suite involving 2,323 charts from arXiv papers.
To ensure quality, all charts and questions are handpicked, curated, and verified by human experts.
Results reveal a substantial, previously underestimated gap between the reasoning skills of the strongest proprietary model.
arXiv Detail & Related papers (2024-06-26T17:50:11Z) - SMP Challenge: An Overview and Analysis of Social Media Prediction Challenge [63.311045291016555]
Social Media Popularity Prediction (SMPP) is a crucial task that involves automatically predicting future popularity values of online posts.
This paper summarizes the challenging task, data, and research progress.
arXiv Detail & Related papers (2024-05-17T02:36:14Z) - Explorers at #SMM4H 2023: Enhancing BERT for Health Applications through
Knowledge and Model Fusion [3.386401892906348]
Social media has become a valuable data resource for studying human health.
This paper outlines the methods in our participation in the #SMM4H 2023 Shared Tasks.
arXiv Detail & Related papers (2023-12-17T08:52:05Z) - Countering Misinformation via Emotional Response Generation [15.383062216223971]
proliferation of misinformation on social media platforms (SMPs) poses a significant danger to public health, social cohesion and democracy.
Previous research has shown how social correction can be an effective way to curb misinformation.
We present VerMouth, the first large-scale dataset comprising roughly 12 thousand claim-response pairs.
arXiv Detail & Related papers (2023-11-17T15:37:18Z) - Balanced and Explainable Social Media Analysis for Public Health with
Large Language Models [13.977401672173533]
Current techniques for public health analysis involve popular models such as BERT and large language models (LLMs)
To tackle these challenges, the data imbalance issue can be overcome by sophisticated data augmentation methods for social media datasets.
In this paper, a novel ALEX framework is proposed for social media analysis on public health.
arXiv Detail & Related papers (2023-09-12T04:15:34Z) - A Review on Knowledge Graphs for Healthcare: Resources, Applications,
and Promises [53.48844796428081]
This work provides the first comprehensive review of healthcare knowledge graphs (HKGs)
It summarizes the pipeline and key techniques for HKG construction, as well as the common utilization approaches.
At the application level, we delve into the successful integration of HKGs across various health domains.
arXiv Detail & Related papers (2023-06-07T21:51:56Z) - ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media [74.93847489218008]
We present a novel task, identifying manipulation of news on social media, which aims to detect manipulation in social media posts and identify manipulated or inserted information.
To study this task, we have proposed a data collection schema and curated a dataset called ManiTweet, consisting of 3.6K pairs of tweets and corresponding articles.
Our analysis demonstrates that this task is highly challenging, with large language models (LLMs) yielding unsatisfactory performance.
arXiv Detail & Related papers (2023-05-23T16:40:07Z) - Connecting Fairness in Machine Learning with Public Health Equity [0.0]
biases in data and model design can result in disparities for certain protected groups and amplify existing inequalities in healthcare.
This study summarizes seminal literature on ML fairness and presents a framework for identifying and mitigating biases in the data and model.
Case studies suggest how the framework can be used to prevent these biases and highlight the need for fair and equitable ML models in public health.
arXiv Detail & Related papers (2023-04-08T10:21:49Z) - Benchmarking for Public Health Surveillance tasks on Social Media with a
Domain-Specific Pretrained Language Model [9.070482285386387]
We present PHS-BERT, a transformer-based language model to identify tasks related to public health surveillance on social media.
Compared with existing PLMs that are mainly evaluated on limited tasks, PHS-BERT achieved state-of-the-art performance on all 25 tested datasets.
arXiv Detail & Related papers (2022-04-09T18:01:18Z) - MET: Multimodal Perception of Engagement for Telehealth [52.54282887530756]
We present MET, a learning-based algorithm for perceiving a human's level of engagement from videos.
We release a new dataset, MEDICA, for mental health patient engagement detection.
arXiv Detail & Related papers (2020-11-17T15:18:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.