Biomedical and Clinical English Model Packages in the Stanza Python NLP
Library
- URL: http://arxiv.org/abs/2007.14640v1
- Date: Wed, 29 Jul 2020 07:27:41 GMT
- Title: Biomedical and Clinical English Model Packages in the Stanza Python NLP
Library
- Authors: Yuhao Zhang, Yuhui Zhang, Peng Qi, Christopher D. Manning, Curtis P.
Langlotz
- Abstract summary: We introduce biomedical and clinical English model packages for the Stanza Python NLP library.
These packages offer accurate syntactic analysis and named entity recognition capabilities for biomedical and clinical text.
We show via extensive experiments that our packages achieve syntactic analysis and named entity recognition performance that is on par with or surpasses state-of-the-art results.
- Score: 47.47381610312517
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We introduce biomedical and clinical English model packages for the Stanza
Python NLP library. These packages offer accurate syntactic analysis and named
entity recognition capabilities for biomedical and clinical text, by combining
Stanza's fully neural architecture with a wide variety of open datasets as well
as large-scale unsupervised biomedical and clinical text data. We show via
extensive experiments that our packages achieve syntactic analysis and named
entity recognition performance that is on par with or surpasses
state-of-the-art results. We further show that these models do not compromise
speed compared to existing toolkits when GPU acceleration is available, and are
made easy to download and use with Stanza's Python interface. A demonstration
of our packages is available at: http://stanza.run/bio.
Related papers
- pyvene: A Library for Understanding and Improving PyTorch Models via
Interventions [79.72930339711478]
$textbfpyvene$ is an open-source library that supports customizable interventions on a range of different PyTorch modules.
We show how $textbfpyvene$ provides a unified framework for performing interventions on neural models and sharing the intervened upon models with others.
arXiv Detail & Related papers (2024-03-12T16:46:54Z) - PyPOTS: A Python Toolbox for Data Mining on Partially-Observed Time
Series [0.0]
PyPOTS is an open-source Python library dedicated to data mining and analysis on partially-observed time series.
It provides easy access to diverse algorithms categorized into four tasks: imputation, classification, clustering, and forecasting.
arXiv Detail & Related papers (2023-05-30T07:57:05Z) - Multimodal Interactive Lung Lesion Segmentation: A Framework for
Annotating PET/CT Images based on Physiological and Anatomical Cues [16.159693927845975]
Deep learning has enabled the accurate segmentation of various diseases in medical imaging.
These performances, however, typically demand large amounts of manual voxel annotations.
We propose a multimodal interactive segmentation framework that mitigates these issues by combining anatomical and physiological cues from PET/CT data.
arXiv Detail & Related papers (2023-01-24T10:50:45Z) - DeeProb-kit: a Python Library for Deep Probabilistic Modelling [0.0]
DeeProb-kit is a unified library written in Python consisting of a collection of deep probabilistic models (DPMs)
It includes efficiently implemented learning techniques, inference routines, statistical algorithms, and provides high-quality fully-documented APIs.
arXiv Detail & Related papers (2022-12-08T17:02:16Z) - PyTorch Connectomics: A Scalable and Flexible Segmentation Framework for
EM Connectomics [26.612749327414335]
PyTorch Connectomics (PyTC) is an open-source deep-learning framework for the semantic and instance segmentation of volumetric microscopy images.
PyTC aims to segment neurons, synapses, and other organelles like mitochondria at nanometer resolution for understanding neuronal communication, metabolism, and development in animal brains.
arXiv Detail & Related papers (2021-12-10T04:02:23Z) - Scikit-dimension: a Python package for intrinsic dimension estimation [58.8599521537]
This technical note introduces textttscikit-dimension, an open-source Python package for intrinsic dimension estimation.
textttscikit-dimension package provides a uniform implementation of most of the known ID estimators based on scikit-learn application programming interface.
We briefly describe the package and demonstrate its use in a large-scale (more than 500 datasets) benchmarking of methods for ID estimation in real-life and synthetic data.
arXiv Detail & Related papers (2021-09-06T16:46:38Z) - CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark [51.38557174322772]
We present the first Chinese Biomedical Language Understanding Evaluation benchmark.
It is a collection of natural language understanding tasks including named entity recognition, information extraction, clinical diagnosis normalization, single-sentence/sentence-pair classification.
We report empirical results with the current 11 pre-trained Chinese models, and experimental results show that state-of-the-art neural models perform by far worse than the human ceiling.
arXiv Detail & Related papers (2021-06-15T12:25:30Z) - PyHealth: A Python Library for Health Predictive Models [53.848478115284195]
PyHealth is an open-source Python toolbox for developing various predictive models on healthcare data.
The data preprocessing module enables the transformation of complex healthcare datasets into machine learning friendly formats.
The predictive modeling module provides more than 30 machine learning models, including established ensemble trees and deep neural network-based approaches.
arXiv Detail & Related papers (2021-01-11T22:02:08Z) - Pythonic Black-box Electronic Structure Tool (PyBEST). An open-source
Python platform for electronic structure calculations at the interface
between chemistry and physics [52.77024349608834]
Pythonic Black-box Electronic Structure Tool (PyBEST) was developed at Nicolaus Copernicus University in Toru'n.
PyBEST is written primarily in the Python3 programming language with additional parts written in C++.
The capability of PyBEST to perform large-scale electronic structure calculations is demonstrated for the model vitamin B12 compound.
arXiv Detail & Related papers (2020-10-12T07:10:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.