Semi-supervised Optimal Transport with Self-paced Ensemble for
Cross-hospital Sepsis Early Detection
- URL: http://arxiv.org/abs/2106.10352v1
- Date: Fri, 18 Jun 2021 20:54:18 GMT
- Title: Semi-supervised Optimal Transport with Self-paced Ensemble for
Cross-hospital Sepsis Early Detection
- Authors: Ruiqing Ding, Yu Zhou, Jie Xu, Yan Xie, Qiqiang Liang, He Ren, Yixuan
Wang, Yanlin Chen, Leye Wang and Man Huang
- Abstract summary: State-of-the-art methods require large amounts of labeled medical data for supervised learning.
Lack of labeled data will cause enormous obstacles if one hospital wants to deploy a new Sepsis detection system.
We propose a semi-supervised optimal transport with self-paced ensemble framework for Sepsis early detection.
- Score: 12.704730765459257
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The utilization of computer technology to solve problems in medical scenarios
has attracted considerable attention in recent years, which still has great
potential and space for exploration. Among them, machine learning has been
widely used in the prediction, diagnosis and even treatment of Sepsis. However,
state-of-the-art methods require large amounts of labeled medical data for
supervised learning. In real-world applications, the lack of labeled data will
cause enormous obstacles if one hospital wants to deploy a new Sepsis detection
system. Different from the supervised learning setting, we need to use known
information (e.g., from another hospital with rich labeled data) to help build
a model with acceptable performance, i.e., transfer learning. In this paper, we
propose a semi-supervised optimal transport with self-paced ensemble framework
for Sepsis early detection, called SPSSOT, to transfer knowledge from the other
that has rich labeled data. In SPSSOT, we first extract the same clinical
indicators from the source domain (e.g., hospital with rich labeled data) and
the target domain (e.g., hospital with little labeled data), then we combine
the semi-supervised domain adaptation based on optimal transport theory with
self-paced under-sampling to avoid a negative transfer possibly caused by
covariate shift and class imbalance. On the whole, SPSSOT is an end-to-end
transfer learning method for Sepsis early detection which can automatically
select suitable samples from two domains respectively according to the number
of iterations and align feature space of two domains. Extensive experiments on
two open clinical datasets demonstrate that comparing with other methods, our
proposed SPSSOT, can significantly improve the AUC values with only 1% labeled
data in the target domain in two transfer learning scenarios, MIMIC
$rightarrow$ Challenge and Challenge $rightarrow$ MIMIC.
Related papers
- Extending Machine Learning-Based Early Sepsis Detection to Different
Demographics [1.2724528787590168]
We compare two ensemble learning methods, LightGBM and XGBoost, using the public eICU-CRD dataset and a private South Korean St. Mary's Hospital's dataset.
Our analysis reveals the effectiveness of these methods in addressing healthcare data imbalance and enhancing sepsis detection.
arXiv Detail & Related papers (2023-11-07T20:02:52Z) - Domain Adaptive Synapse Detection with Weak Point Annotations [63.97144211520869]
We present AdaSyn, a framework for domain adaptive synapse detection with weak point annotations.
In the WASPSYN challenge at I SBI 2023, our method ranks the 1st place.
arXiv Detail & Related papers (2023-08-31T05:05:53Z) - Improving Multiple Sclerosis Lesion Segmentation Across Clinical Sites:
A Federated Learning Approach with Noise-Resilient Training [75.40980802817349]
Deep learning models have shown promise for automatically segmenting MS lesions, but the scarcity of accurately annotated data hinders progress in this area.
We introduce a Decoupled Hard Label Correction (DHLC) strategy that considers the imbalanced distribution and fuzzy boundaries of MS lesions.
We also introduce a Centrally Enhanced Label Correction (CELC) strategy, which leverages the aggregated central model as a correction teacher for all sites.
arXiv Detail & Related papers (2023-08-31T00:36:10Z) - Source-Free Domain Adaptation for Medical Image Segmentation via
Prototype-Anchored Feature Alignment and Contrastive Learning [57.43322536718131]
We present a two-stage source-free domain adaptation (SFDA) framework for medical image segmentation.
In the prototype-anchored feature alignment stage, we first utilize the weights of the pre-trained pixel-wise classifier as source prototypes.
Then, we introduce the bi-directional transport to align the target features with class prototypes by minimizing its expected cost.
arXiv Detail & Related papers (2023-07-19T06:07:12Z) - Universal Semi-Supervised Learning for Medical Image Classification [21.781201758182135]
Semi-supervised learning (SSL) has attracted much attention since it reduces the expensive costs of collecting adequate well-labeled training data.
Traditional SSL is built upon an assumption that labeled and unlabeled data should be from the same distribution.
We propose a unified framework to leverage unseen unlabeled data for open-scenario semi-supervised medical image classification.
arXiv Detail & Related papers (2023-04-08T16:12:36Z) - 2021 BEETL Competition: Advancing Transfer Learning for Subject
Independence & Heterogenous EEG Data Sets [89.84774119537087]
We design two transfer learning challenges around diagnostics and Brain-Computer-Interfacing (BCI)
Task 1 is centred on medical diagnostics, addressing automatic sleep stage annotation across subjects.
Task 2 is centred on Brain-Computer Interfacing (BCI), addressing motor imagery decoding across both subjects and data sets.
arXiv Detail & Related papers (2022-02-14T12:12:20Z) - Cross-Site Severity Assessment of COVID-19 from CT Images via Domain
Adaptation [64.59521853145368]
Early and accurate severity assessment of Coronavirus disease 2019 (COVID-19) based on computed tomography (CT) images offers a great help to the estimation of intensive care unit event.
To augment the labeled data and improve the generalization ability of the classification model, it is necessary to aggregate data from multiple sites.
This task faces several challenges including class imbalance between mild and severe infections, domain distribution discrepancy between sites, and presence of heterogeneous features.
arXiv Detail & Related papers (2021-09-08T07:56:51Z) - Self-transfer learning via patches: A prostate cancer triage approach
based on bi-parametric MRI [1.3934382972253603]
Prostate cancer (PCa) is the second most common cancer diagnosed among men worldwide.
The current PCa diagnostic pathway comes at the cost of substantial overdiagnosis, leading to unnecessary treatment and further testing.
We present a patch-based pre-training strategy to distinguish between clinically significant (cS) and non-clinically significant (ncS) lesions.
arXiv Detail & Related papers (2021-07-22T17:02:38Z) - Deep Semi-supervised Metric Learning with Dual Alignment for Cervical
Cancer Cell Detection [49.78612417406883]
We propose a novel semi-supervised deep metric learning method for cervical cancer cell detection.
Our model learns an embedding metric space and conducts dual alignment of semantic features on both the proposal and prototype levels.
We construct a large-scale dataset for semi-supervised cervical cancer cell detection for the first time, consisting of 240,860 cervical cell images.
arXiv Detail & Related papers (2021-04-07T17:11:27Z) - Deep Transfer Learning for Infectious Disease Case Detection Using
Electronic Medical Records [0.0]
During an infectious disease pandemic, it is critical to share electronic medical records or models (learned from these records) across regions.
Applying one region's data/model to another region often have distribution shift issues that violate the assumptions of traditional machine learning techniques.
To explore the potential of deep transfer learning algorithms, we applied two data-based algorithms and model-based transfer learning algorithms to infectious disease detection tasks.
arXiv Detail & Related papers (2021-03-08T01:53:29Z) - Federated Semi-Supervised Learning for COVID Region Segmentation in
Chest CT using Multi-National Data from China, Italy, Japan [14.776338073000526]
COVID-19 has led to urgent needs for reliable diagnosis and management of SARS-CoV-2 infection.
Recent efforts have focused on computer-aided characterization and diagnosis.
domain shift of data across clinical data centers poses a serious challenge when deploying learning-based models.
arXiv Detail & Related papers (2020-11-23T21:51:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.