Linking Health News to Research Literature
- URL: http://arxiv.org/abs/2107.06472v1
- Date: Wed, 14 Jul 2021 03:50:51 GMT
- Title: Linking Health News to Research Literature
- Authors: Jun Wang, Bei Yu
- Abstract summary: Accurately linking news articles to scientific research works is a critical component in a number of applications.
Although the lack of links between news and literature has been a challenge in these applications, it is a relatively unexplored research problem.
- Score: 12.80865601729801
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurately linking news articles to scientific research works is a critical
component in a number of applications, such as measuring the social impact of a
research work and detecting inaccuracies or distortions in science news.
Although the lack of links between news and literature has been a challenge in
these applications, it is a relatively unexplored research problem. In this
paper we designed and evaluated a new approach that consists of (1) augmenting
latest named-entity recognition techniques to extract various metadata, and (2)
designing a new elastic search engine that can facilitate the use of enriched
metadata queries. To evaluate our approach, we constructed two datasets of
paired news articles and research papers: one is used for training models to
extract metadata, and the other for evaluation. Our experiments showed that the
new approach performed significantly better than a baseline approach used by
altmetric.com (0.89 vs 0.32 in terms of top-1 accuracy). To further demonstrate
the effectiveness of the approach, we also conducted a study on 37,600
health-related press releases published on EurekAlert!, which showed that our
approach was able to identify the corresponding research papers with a top-1
accuracy of at least 0.97.
Related papers
- Multi-objective Representation for Numbers in Clinical Narratives Using CamemBERT-bio [0.9208007322096533]
This research aims to classify numerical values extracted from medical documents across seven physiological categories.
We introduce two main innovations: integrating keyword embeddings into the model and adopting a number-agnostic strategy.
We show substantial improvements in the effectiveness of CamemBERT-bio, surpassing conventional methods with an F1 score of 0.89.
arXiv Detail & Related papers (2024-05-28T01:15:21Z) - Investigating Persuasion Techniques in Arabic: An Empirical Study Leveraging Large Language Models [0.13980986259786224]
This paper presents a comprehensive empirical study focused on identifying persuasive techniques in Arabic social media content.
We utilize Pre-trained Language Models (PLMs) and leverage the ArAlEval dataset.
Our study explores three different learning approaches by harnessing the power of PLMs.
arXiv Detail & Related papers (2024-05-21T15:55:09Z) - Application of Transformers based methods in Electronic Medical Records:
A Systematic Literature Review [77.34726150561087]
This work presents a systematic literature review of state-of-the-art advances using transformer-based methods on electronic medical records (EMRs) in different NLP tasks.
arXiv Detail & Related papers (2023-04-05T22:19:42Z) - Tag-Aware Document Representation for Research Paper Recommendation [68.8204255655161]
We propose a hybrid approach that leverages deep semantic representation of research papers based on social tags assigned by users.
The proposed model is effective in recommending research papers even when the rating data is very sparse.
arXiv Detail & Related papers (2022-09-08T09:13:07Z) - Research Trends and Applications of Data Augmentation Algorithms [77.34726150561087]
We identify the main areas of application of data augmentation algorithms, the types of algorithms used, significant research trends, their progression over time and research gaps in data augmentation literature.
We expect readers to understand the potential of data augmentation, as well as identify future research directions and open questions within data augmentation research.
arXiv Detail & Related papers (2022-07-18T11:38:32Z) - SciLander: Mapping the Scientific News Landscape [8.504643390943409]
We introduce SciLander, a method for learning representations of news sources reporting on science-based topics.
We evaluate our method on a novel COVID-19 dataset containing nearly 1M news articles from 500 sources spanning a period of 18 months since the beginning of the pandemic in 2020.
arXiv Detail & Related papers (2022-05-16T20:20:43Z) - FPSRS: A Fusion Approach for Paper Submission Recommendation System [0.0]
This paper presents two newer approaches for recommending scientific articles.
The first approach employs RNN structures besides using Conv1D.
We also introduce a new method, namely DistilBertAims, using DistillBert for two cases of uppercase and lower-case words to vectorize features such as Title, Abstract, and Keywords.
The experimental results show that the second approach could obtain a better performance, which is 62.46% and 12.44% higher than the best of the previous study.
arXiv Detail & Related papers (2022-05-12T09:06:56Z) - CitationIE: Leveraging the Citation Graph for Scientific Information
Extraction [89.33938657493765]
We use the citation graph of referential links between citing and cited papers.
We observe a sizable improvement in end-to-end information extraction over the state-of-the-art.
arXiv Detail & Related papers (2021-06-03T03:00:12Z) - What's New? Summarizing Contributions in Scientific Literature [85.95906677964815]
We introduce a new task of disentangled paper summarization, which seeks to generate separate summaries for the paper contributions and the context of the work.
We extend the S2ORC corpus of academic articles by adding disentangled "contribution" and "context" reference labels.
We propose a comprehensive automatic evaluation protocol which reports the relevance, novelty, and disentanglement of generated outputs.
arXiv Detail & Related papers (2020-11-06T02:23:01Z) - Opportunities and Challenges of Deep Learning Methods for
Electrocardiogram Data: A Systematic Review [62.490310870300746]
The electrocardiogram (ECG) is one of the most commonly used diagnostic tools in medicine and healthcare.
Deep learning methods have achieved promising results on predictive healthcare tasks using ECG signals.
This paper presents a systematic review of deep learning methods for ECG data from both modeling and application perspectives.
arXiv Detail & Related papers (2019-12-28T02:44:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.