Identifying Health Risks from Family History: A Survey of Natural Language Processing Techniques
- URL: http://arxiv.org/abs/2403.09997v1
- Date: Fri, 15 Mar 2024 03:43:07 GMT
- Title: Identifying Health Risks from Family History: A Survey of Natural Language Processing Techniques
- Authors: Xiang Dai, Sarvnaz Karimi, Nathan O'Callaghan
- Abstract summary: We survey the literature on the techniques that have been developed to utilise digital health records to identify risks of familial diseases.
We highlight that rule-based methods are heavily investigated and are still actively used for family history extraction.
More recent efforts have been put into building neural models based on large-scale pre-trained language models.
- Score: 10.121264712810616
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Electronic health records include information on patients' status and medical history, which could cover the history of diseases and disorders that could be hereditary. One important use of family history information is in precision health, where the goal is to keep the population healthy with preventative measures. Natural Language Processing (NLP) and machine learning techniques can help surface information that assists health professionals in identifying health risks before a condition develops in a patient's later years, saving lives and reducing healthcare costs. We survey the literature on the techniques from the NLP field that have been developed to utilise digital health records to identify risks of familial diseases. We highlight that rule-based methods are heavily investigated and are still actively used for family history extraction. Still, more recent efforts have been put into building neural models based on large-scale pre-trained language models. In addition to the areas where NLP has successfully been utilised, we also identify the areas where more research is needed to unlock the value of patients' records regarding data collection, task formulation and downstream applications.
Related papers
- A Survey of Artificial Intelligence in Gait-Based Neurodegenerative Disease Diagnosis [51.07114445705692]
Neurodegenerative diseases (NDs) traditionally require extensive healthcare resources and human effort for medical diagnosis and monitoring.
As a crucial disease-related motor symptom, human gait can be exploited to characterize different NDs.
Current advances in artificial intelligence (AI) models enable automatic gait analysis for ND identification and classification.
arXiv Detail & Related papers (2024-05-21T06:44:40Z)
- De-identification of clinical free text using natural language processing: A systematic review of current approaches [48.343430343213896]
Natural language processing has repeatedly demonstrated its feasibility in automating the de-identification process.
Our study aims to provide systematic evidence on how the de-identification of clinical free text has evolved in the last thirteen years.
arXiv Detail & Related papers (2023-11-28T13:20:41Z)
- Deep Reinforcement Learning Framework for Thoracic Diseases Classification via Prior Knowledge Guidance [49.87607548975686]
The scarcity of labeled data for related diseases poses a huge challenge to an accurate diagnosis.
We propose a novel deep reinforcement learning framework, which introduces prior knowledge to direct the learning of diagnostic agents.
Our approach's performance was demonstrated using the well-known NIH ChestX-ray 14 and CheXpert datasets.
arXiv Detail & Related papers (2023-06-02T01:46:31Z)
- Developing a Robust Computable Phenotype Definition Workflow to Describe Health and Disease in Observational Health Research [0.6465251961564604]
Health informatics is built upon patient health data.
Standardization is required to compute population statistics that are common metrics used in fields such as epidemiology.
While standards exist to structure and analyze patient data, analogous best practices for rigorously defining patient populations do not exist.
arXiv Detail & Related papers (2023-03-30T15:29:54Z)
- Privacy-preserving machine learning for healthcare: open challenges and future perspectives [72.43506759789861]
We conduct a review of recent literature concerning Privacy-Preserving Machine Learning (PPML) for healthcare.
We primarily focus on privacy-preserving training and inference-as-a-service.
The aim of this review is to guide the development of private and efficient ML models in healthcare.
arXiv Detail & Related papers (2023-03-27T19:20:51Z)
- Retrieval-Augmented and Knowledge-Grounded Language Models for Faithful Clinical Medicine [68.7814360102644]
We propose the Re$^3$Writer method with retrieval-augmented generation and knowledge-grounded reasoning.
We demonstrate the effectiveness of our method in generating patient discharge instructions.
arXiv Detail & Related papers (2022-10-23T16:34:39Z)
- Ontology-Driven Self-Supervision for Adverse Childhood Experiences Identification Using Social Media Datasets [1.0349800230036503]
Adverse Childhood Experiences (ACEs) have been shown to be associated with increased risks of mental health diseases or other abnormal behaviours later in life.
The identification of ACEs from textual data with Natural Language Processing (NLP) is challenging because there are no NLP-ready ACE ontologies.
We present an ontology-driven self-supervised approach for producing a publicly available resource that would support large-scale machine learning.
arXiv Detail & Related papers (2022-08-24T12:23:01Z)
- On Curating Responsible and Representative Healthcare Video Recommendations for Patient Education and Health Literacy: An Augmented Intelligence Approach [5.545277272908999]
One in three U.S. adults use the Internet to diagnose or learn about a health concern.
Health literacy divides can be exacerbated by algorithmic recommendations.
arXiv Detail & Related papers (2022-07-13T01:54:59Z)
- Intelligent Sight and Sound: A Chronic Cancer Pain Dataset [74.77784420691937]
This paper introduces the first chronic cancer pain dataset, collected as part of the Intelligent Sight and Sound (ISS) clinical trial.
The data collected to date consists of 29 patients, 509 smartphone videos, 189,999 frames, and self-reported affective and activity pain scores.
Using static images and multi-modal data to predict self-reported pain levels, early models show significant gaps between current methods available to predict pain.
arXiv Detail & Related papers (2022-04-07T22:14:37Z)
- Multilingual Medical Question Answering and Information Retrieval for Rural Health Intelligence Access [1.0499611180329804]
In rural regions of several developing countries, access to quality healthcare, medical infrastructure, and professional diagnosis is largely unavailable.
Several deaths resulting from this lack of medical access, the absence of patients' previous health records, and the supplanting of information in indigenous languages can be easily prevented.
We describe an approach leveraging the phenomenal progress in Machine Learning and NLP (Natural Language Processing) techniques to design a model that is low-resource, multilingual, and a preliminary first-point-of-contact medical assistant.
arXiv Detail & Related papers (2021-06-02T16:05:24Z)
- A Systematic Review of Natural Language Processing for Knowledge Management in Healthcare [0.6193838300896449]
The objective of this paper is to identify the potential of NLP, especially how NLP is used to support the knowledge management process in the healthcare domain.
This paper provides a comprehensive survey of the state-of-the-art NLP research with a particular focus on how knowledge is created, captured, shared, and applied in the healthcare domain.
arXiv Detail & Related papers (2020-07-17T17:50:50Z)
This list is automatically generated from the titles and abstracts of the papers on this site.