Named Entity Recognition in COVID-19 tweets with Entity Knowledge Augmentation
- URL: http://arxiv.org/abs/2510.04001v1
- Date: Sun, 05 Oct 2025 02:22:26 GMT
- Title: Named Entity Recognition in COVID-19 tweets with Entity Knowledge Augmentation
- Authors: Xuankang Zhang, Jiangming Liu,
- Abstract summary: We propose a novel entity knowledge augmentation approach for COVID-19.<n>Our proposed entity knowledge augmentation improves NER performance in both fully-supervised and few-shot settings.
- Score: 3.8390058921374615
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The COVID-19 pandemic causes severe social and economic disruption around the world, raising various subjects that are discussed over social media. Identifying pandemic-related named entities as expressed on social media is fundamental and important to understand the discussions about the pandemic. However, there is limited work on named entity recognition on this topic due to the following challenges: 1) COVID-19 texts in social media are informal and their annotations are rare and insufficient to train a robust recognition model, and 2) named entity recognition in COVID-19 requires extensive domain-specific knowledge. To address these issues, we propose a novel entity knowledge augmentation approach for COVID-19, which can also be applied in general biomedical named entity recognition in both informal text format and formal text format. Experiments carried out on the COVID-19 tweets dataset and PubMed dataset show that our proposed entity knowledge augmentation improves NER performance in both fully-supervised and few-shot settings. Our source code is publicly available: https://github.com/kkkenshi/LLM-EKA/tree/master
Related papers
- Exploring a Hybrid Deep Learning Framework to Automatically Discover
Topic and Sentiment in COVID-19 Tweets [2.3940819037450987]
COVID-19 has created a major public health problem worldwide and other problems such as economic crisis, unemployment, mental distress, etc.
The pandemic is deadly in the world and involves many people not only with infection but also with problems, stress, wonder, fear, resentment, and hatred.
Twitter is a highly influential social media platform and a significant source of health-related information, news, opinion and public sentiment.
arXiv Detail & Related papers (2023-12-02T16:58:17Z) - EasyNER: A Customizable Easy-to-Use Pipeline for Deep Learning- and
Dictionary-based Named Entity Recognition from Medical Text [32.73124984242397]
We develop an easy-to-use end-to-end pipeline for deep learning- and dictionary-based named entity recognition.
The pipeline can access and process large medical research article collections ( CORD-19) or raw text.
The output consists of publication-ready ranked lists and graphs of detected entities and files containing the annotated texts.
arXiv Detail & Related papers (2023-04-16T15:17:56Z) - RHO ($\
ho$): Reducing Hallucination in Open-domain Dialogues with
Knowledge Grounding [57.46495388734495]
This paper presents RHO ($rho$) utilizing the representations of linked entities and relation predicates from a knowledge graph (KG)
We propose (1) local knowledge grounding to combine textual embeddings with the corresponding KG embeddings; and (2) global knowledge grounding to equip RHO with multi-hop reasoning abilities via the attention mechanism.
arXiv Detail & Related papers (2022-12-03T10:36:34Z) - METS-CoV: A Dataset of Medical Entity and Targeted Sentiment on COVID-19
Related Tweets [13.35986397208115]
This paper releases METS-CoV, a dataset containing medical entities and targeted sentiments from COVID-19-related tweets.
To the best of our knowledge, METS-CoV is the first dataset to collect medical entities and corresponding sentiments of COVID-19-related tweets.
arXiv Detail & Related papers (2022-09-28T01:55:14Z) - "COVID-19 was a FIFA conspiracy #curropt": An Investigation into the
Viral Spread of COVID-19 Misinformation [60.268682953952506]
We estimate the extent to which misinformation has influenced the course of the COVID-19 pandemic using natural language processing models.
We provide a strategy to combat social media posts that are likely to cause widespread harm.
arXiv Detail & Related papers (2022-06-12T19:41:01Z) - Knowledge-Rich Self-Supervised Entity Linking [58.838404666183656]
Knowledge-RIch Self-Supervision ($tt KRISSBERT$) is a universal entity linker for four million UMLS entities.
Our approach subsumes zero-shot and few-shot methods, and can easily incorporate entity descriptions and gold mention labels if available.
Without using any labeled information, our method produces $tt KRISSBERT$, a universal entity linker for four million UMLS entities.
arXiv Detail & Related papers (2021-12-15T05:05:12Z) - A Dynamic Topic Identification and Labeling Approach of COVID-19 Tweets [3.097385298197292]
The COVID-19 epidemic has affected the use of social media by many people across the globe.
This paper formulates the problem of dynamically identifying key topics with proper labels from COVID-19 Tweets to provide an overview of wider public opinion.
arXiv Detail & Related papers (2021-08-13T16:51:04Z) - COVID-19 and Big Data: Multi-faceted Analysis for Spatio-temporal
Understanding of the Pandemic with Social Media Conversations [4.07452542897703]
Social media platforms have served as a vehicle for the global conversation about COVID-19.
We present a framework for analysis, mining, and tracking the critical content and characteristics of social media conversations around the pandemic.
arXiv Detail & Related papers (2021-04-22T00:45:50Z) - Named Entity Recognition for Social Media Texts with Semantic
Augmentation [70.44281443975554]
Existing approaches for named entity recognition suffer from data sparsity problems when conducted on short and informal texts.
We propose a neural-based approach to NER for social media texts where both local (from running text) and augmented semantics are taken into account.
arXiv Detail & Related papers (2020-10-29T10:06:46Z) - Artificial Intelligence (AI) in Action: Addressing the COVID-19 Pandemic
with Natural Language Processing (NLP) [8.281080540533559]
Natural language processing can be applied to address many of the information needs made urgent by the COVID-19 pandemic.
This review surveys approximately 150 NLP studies and more than 50 systems and datasets addressing the COVID-19 pandemic.
arXiv Detail & Related papers (2020-10-09T22:10:43Z) - CO-Search: COVID-19 Information Retrieval with Semantic Search, Question
Answering, and Abstractive Summarization [53.67205506042232]
CO-Search is a retriever-ranker semantic search engine designed to handle complex queries over the COVID-19 literature.
To account for the domain-specific and relatively limited dataset, we generate a bipartite graph of document paragraphs and citations.
We evaluate our system on the data of the TREC-COVID information retrieval challenge.
arXiv Detail & Related papers (2020-06-17T01:32:48Z) - Mapping the Landscape of Artificial Intelligence Applications against
COVID-19 [59.30734371401316]
COVID-19, the disease caused by the SARS-CoV-2 virus, has been declared a pandemic by the World Health Organization.
We present an overview of recent studies using Machine Learning and, more broadly, Artificial Intelligence to tackle many aspects of the COVID-19 crisis.
arXiv Detail & Related papers (2020-03-25T12:30:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.