HiRID-ICU-Benchmark -- A Comprehensive Machine Learning Benchmark on
High-resolution ICU Data
- URL: http://arxiv.org/abs/2111.08536v3
- Date: Thu, 18 Nov 2021 09:00:45 GMT
- Title: HiRID-ICU-Benchmark -- A Comprehensive Machine Learning Benchmark on
High-resolution ICU Data
- Authors: Hugo Yèche, Rita Kuznetsova, Marc Zimmermann, Matthias Hüser,
Xinrui Lyu, Martin Faltys, Gunnar Rätsch
- Abstract summary: We aim to provide a benchmark covering a large spectrum of ICU-related tasks.
Using the HiRID dataset, we define multiple clinically relevant tasks developed in collaboration with clinicians.
We provide an in-depth analysis of current state-of-the-art sequence modeling methods, highlighting some limitations of deep learning approaches for this type of data.
- Score: 0.8418021941792283
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The recent success of machine learning methods applied to time series
collected from Intensive Care Units (ICU) exposes the lack of standardized
machine learning benchmarks for developing and comparing such methods. While
raw datasets, such as MIMIC-IV or eICU, can be freely accessed on PhysioNet,
the tasks and pre-processing are often chosen ad hoc for each
publication, limiting comparability across publications. In this work, we aim
to improve this situation by providing a benchmark covering a large spectrum of
ICU-related tasks. Using the HiRID dataset, we define multiple clinically
relevant tasks developed in collaboration with clinicians. In addition, we
provide a reproducible end-to-end pipeline to construct both data and labels.
Finally, we provide an in-depth analysis of current state-of-the-art sequence
modeling methods, highlighting some limitations of deep learning approaches for
this type of data. With this benchmark, we hope to give the research community
the possibility of a fair comparison of their work.
Related papers
- LESEN: Label-Efficient deep learning for Multi-parametric MRI-based
Visual Pathway Segmentation [5.726588626363204]
We propose a label-efficient deep learning method with self-ensembling (LESEN).
LESEN incorporates supervised and unsupervised losses, enabling the student and teacher models to mutually learn from each other.
Our experiments on the human connectome project (HCP) dataset demonstrate the superior performance of our method.
arXiv Detail & Related papers (2024-01-03T10:22:13Z)
- On the Importance of Step-wise Embeddings for Heterogeneous Clinical
Time-Series [1.3285222309805063]
Recent advances in deep learning for sequence modeling have not fully transferred to tasks handling time-series from electronic health records.
In particular, in problems related to the Intensive Care Unit (ICU), the state-of-the-art remains to tackle sequence classification in a tabular manner with tree-based methods.
arXiv Detail & Related papers (2023-11-15T12:18:15Z)
- Yet Another ICU Benchmark: A Flexible Multi-Center Framework for Clinical ML [0.7982607013768545]
Yet Another ICU Benchmark (YAIB) is a modular framework that allows researchers to define reproducible and comparable clinical ML experiments.
YAIB supports most open-access ICU datasets (MIMIC III/IV, eICU, HiRID, AUMCdb) and is easily adaptable to future ICU datasets.
We demonstrate that the choice of dataset, cohort definition, and preprocessing have a major impact on the prediction performance.
arXiv Detail & Related papers (2023-06-08T11:16:20Z)
- Time Associated Meta Learning for Clinical Prediction [78.99422473394029]
We propose a novel time associated meta learning (TAML) method to make effective predictions at multiple future time points.
To address the sparsity problem after task splitting, TAML employs a temporal information sharing strategy to augment the number of positive samples.
We demonstrate the effectiveness of TAML on multiple clinical datasets, where it consistently outperforms a range of strong baselines.
arXiv Detail & Related papers (2023-03-05T03:54:54Z)
- NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision
Research [96.53307645791179]
We introduce the Never-Ending VIsual-classification Stream (NEVIS'22), a benchmark consisting of a stream of over 100 visual classification tasks.
Despite being limited to classification, the resulting stream has a rich diversity of tasks from OCR, to texture analysis, scene recognition, and so forth.
Overall, NEVIS'22 poses an unprecedented challenge for current sequential learning approaches due to the scale and diversity of tasks.
arXiv Detail & Related papers (2022-11-15T18:57:46Z)
- An Extensive Data Processing Pipeline for MIMIC-IV [0.20326203100766121]
We provide an end-to-end fully customizable pipeline to extract, clean, and pre-process data.
We predict and evaluate the fourth version of the MIMIC dataset (MIMIC-IV) for ICU and non-ICU-related clinical time-series prediction tasks.
arXiv Detail & Related papers (2022-04-29T01:09:38Z)
- LifeLonger: A Benchmark for Continual Disease Classification [59.13735398630546]
We introduce LifeLonger, a benchmark for continual disease classification on the MedMNIST collection.
Task and class incremental learning of diseases address the issue of classifying new samples without re-training the models from scratch.
Cross-domain incremental learning addresses the issue of dealing with datasets originating from different institutions while retaining the previously obtained knowledge.
arXiv Detail & Related papers (2022-04-12T12:25:05Z)
- Federated Cycling (FedCy): Semi-supervised Federated Learning of
Surgical Phases [57.90226879210227]
FedCy is a federated semi-supervised learning (FSSL) method that combines FL and self-supervised learning to exploit a decentralized dataset of both labeled and unlabeled videos.
We demonstrate significant performance gains over state-of-the-art FSSL methods on the task of automatic recognition of surgical phases.
arXiv Detail & Related papers (2022-03-14T17:44:53Z)
- Continual Learning for Recurrent Neural Networks: a Review and Empirical
Evaluation [12.27992745065497]
Continual Learning with recurrent neural networks could pave the way to a large number of applications where incoming data is non-stationary.
We organize the literature on CL for sequential data processing by providing a categorization of the contributions and a review of the benchmarks.
We propose two new benchmarks for CL with sequential data based on existing datasets, whose characteristics resemble real-world applications.
arXiv Detail & Related papers (2021-03-12T19:25:28Z)
- A Meta-embedding-based Ensemble Approach for ICD Coding Prediction [64.42386426730695]
International Classification of Diseases (ICD) codes are the de facto standard used globally for clinical coding.
These codes enable healthcare providers to claim reimbursement and facilitate efficient storage and retrieval of diagnostic information.
Our proposed approach enhances the performance of neural models by effectively training word vectors using routine medical data as well as external knowledge from scientific articles.
arXiv Detail & Related papers (2021-02-26T17:49:58Z)
- Few-Shot Named Entity Recognition: A Comprehensive Study [92.40991050806544]
We investigate three schemes to improve the model generalization ability for few-shot settings.
We perform empirical comparisons on 10 public NER datasets with various proportions of labeled data.
We create new state-of-the-art results on both few-shot and training-free settings.
arXiv Detail & Related papers (2020-12-29T23:43:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.