Semi-automated extraction of research topics and trends from NCI funding
in radiological sciences from 2000-2020
- URL: http://arxiv.org/abs/2306.13075v1
- Date: Thu, 22 Jun 2023 17:47:42 GMT
- Title: Semi-automated extraction of research topics and trends from NCI funding
in radiological sciences from 2000-2020
- Authors: Mark Nguyen, Peter Beidler, Joseph Tsai, August Anderson, Daniel Chen,
Paul Kinahan, John Kang
- Abstract summary: We developed a semi-automated approach to extract and name research topics.
We applied this to $1.9B of NCI funding over 21 years in the radiological sciences.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Investigators, funders, and the public desire knowledge on topics and trends
in publicly funded research but current efforts in manual categorization are
limited in scale and understanding. We developed a semi-automated approach to
extract and name research topics, and applied this to \$1.9B of NCI funding
over 21 years in the radiological sciences to determine micro- and macro-scale
research topics and funding trends. Our method relies on sequential clustering
of existing biomedical-based word embeddings, naming using subject matter
experts, and visualization to discover trends at a macroscopic scale above
individual topics. We present results using 15 and 60 cluster topics, where we
found that 2D projection of grant embeddings reveals two dominant axes:
physics-biology and therapeutic-diagnostic. For our dataset, we found that
funding for therapeutics- and physics-based research have outpaced diagnostics-
and biology-based research, respectively. We hope these results may (1) give
insight to funders on the appropriateness of their funding allocation, (2)
assist investigators in contextualizing their work and explore neighboring
research domains, and (3) allow the public to review where their tax dollars
are being allocated.
Related papers
- Rapid Biomedical Research Classification: The Pandemic PACT Advanced Categorisation Engine [10.692728349388297]
Pandemic PACT project aims to track and analyse research funding and clinical evidence for a wide range of diseases with outbreak potential.
This paper introduces the Pandemic PACT Advanced Categorisation Engine (PPACE) along with its associated dataset.
arXiv Detail & Related papers (2024-07-14T05:22:53Z) - De-identification of clinical free text using natural language
processing: A systematic review of current approaches [48.343430343213896]
Natural language processing has repeatedly demonstrated its feasibility in automating the de-identification process.
Our study aims to provide systematic evidence on how the de-identification of clinical free text has evolved in the last thirteen years.
arXiv Detail & Related papers (2023-11-28T13:20:41Z) - A Diachronic Analysis of Paradigm Shifts in NLP Research: When, How, and
Why? [84.46288849132634]
We propose a systematic framework for analyzing the evolution of research topics in a scientific field using causal discovery and inference techniques.
We define three variables to encompass diverse facets of the evolution of research topics within NLP.
We utilize a causal discovery algorithm to unveil the causal connections among these variables using observational data.
arXiv Detail & Related papers (2023-05-22T11:08:00Z) - Research Trends and Applications of Data Augmentation Algorithms [77.34726150561087]
We identify the main areas of application of data augmentation algorithms, the types of algorithms used, significant research trends, their progression over time and research gaps in data augmentation literature.
We expect readers to understand the potential of data augmentation, as well as identify future research directions and open questions within data augmentation research.
arXiv Detail & Related papers (2022-07-18T11:38:32Z) - Award rate inequities in biomedical research [55.850540873687386]
The authors performed an analysis of 14,263 biomedical research proposals with proposed start dates between 2010-2022 from the University of Michigan Medical School.
There is a clear relationship between race/ethnicity and rates of proposal award.
Black/African American and Asian researchers appear disadvantaged across all submission categories relative to White researchers.
arXiv Detail & Related papers (2022-06-14T14:05:39Z) - Analyzing Scientific Publications using Domain-Specific Word Embedding
and Topic Modelling [0.6308539010172307]
This paper presents a framework for conducting scientific analyses of academic publications.
It combines various techniques of Natural Language Processing, such as word embedding and topic modelling.
We propose two novel scientific publication embedding, i.e., PUB-G and PUB-W, which are capable of learning semantic meanings of general as well as domain-specific words.
arXiv Detail & Related papers (2021-12-24T04:25:34Z) - Machine Learning Applications for Therapeutic Tasks with Genomics Data [49.98249191161107]
We review the literature on machine learning applications for genomics through the lens of therapeutic development.
We identify twenty-two machine learning in genomics applications across the entire therapeutics pipeline.
We pinpoint seven important challenges in this field with opportunities for expansion and impact.
arXiv Detail & Related papers (2021-05-03T21:20:20Z) - A field guide to cultivating computational biology [1.040598660564506]
Biomedical research centers can empower basic discovery and novel therapeutic strategies by leveraging their large-scale datasets from experiments and patients.
This data, together with new technologies to create and analyze it, has ushered in an era of data-driven discovery which requires moving beyond the traditional individual, single-discipline investigator research model.
We propose solutions for individual scientists, institutions, journal publishers, funding agencies, and educators.
arXiv Detail & Related papers (2021-04-23T01:24:21Z) - Topic Diffusion Discovery Based on Deep Non-negative Autoencoder [0.0]
We propose using a Deep Non-negative Autoencoder with information divergence measurement to monitor topic diffusion.
The proposed approach is able to identify the evolution of research topics as well as to discover topic diffusions in online fashions.
arXiv Detail & Related papers (2020-10-08T00:58:10Z) - SciSight: Combining faceted navigation and research group detection for
COVID-19 exploratory scientific search [40.76512878641023]
The COVID-19 pandemic has sparked unprecedented mobilization of scientists, generating a deluge of papers that makes it hard for researchers to keep track and explore new directions.
In this paper, we present SciSight, a system for exploratory search of COVID-19 research integrating two key capabilities.
First, exploring associations between biomedical facets automatically extracted from papers (e.g., genes, drugs, diseases, patient outcomes); second, combining textual and network information to search and visualize groups of researchers and their ties.
arXiv Detail & Related papers (2020-05-20T08:56:21Z) - Mapping the Landscape of Artificial Intelligence Applications against
COVID-19 [59.30734371401316]
COVID-19, the disease caused by the SARS-CoV-2 virus, has been declared a pandemic by the World Health Organization.
We present an overview of recent studies using Machine Learning and, more broadly, Artificial Intelligence to tackle many aspects of the COVID-19 crisis.
arXiv Detail & Related papers (2020-03-25T12:30:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.