NORA: A Nephrology-Oriented Representation Learning Approach Towards Chronic Kidney Disease Classification
- URL: http://arxiv.org/abs/2509.12704v1
- Date: Tue, 16 Sep 2025 05:54:33 GMT
- Title: NORA: A Nephrology-Oriented Representation Learning Approach Towards Chronic Kidney Disease Classification
- Authors: Mohammad Abdul Hafeez Khan, Twisha Bhattacharyya, Omar Khan, Noorah Khan, Alina Aziz Fatima Khan, Mohammed Qutub Khan, Sujoy Ghosh Hajra,
- Abstract summary: Chronic Kidney Disease (CKD) affects millions of people worldwide, yet its early detection remains challenging.<n>We introduce the Nephrology-Oriented Representation leArning (NORA) approach, which combines supervised contrastive learning with a nonlinear Random Forest classifier.<n>Our results demonstrated that NORA improves class separability and overall classification performance, particularly enhancing the F1-score for early-stage CKD.
- Score: 2.785096184515774
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Chronic Kidney Disease (CKD) affects millions of people worldwide, yet its early detection remains challenging, especially in outpatient settings where laboratory-based renal biomarkers are often unavailable. In this work, we investigate the predictive potential of routinely collected non-renal clinical variables for CKD classification, including sociodemographic factors, comorbid conditions, and urinalysis findings. We introduce the Nephrology-Oriented Representation leArning (NORA) approach, which combines supervised contrastive learning with a nonlinear Random Forest classifier. NORA first derives discriminative patient representations from tabular EHR data, which are then used for downstream CKD classification. We evaluated NORA on a clinic-based EHR dataset from Riverside Nephrology Physicians. Our results demonstrated that NORA improves class separability and overall classification performance, particularly enhancing the F1-score for early-stage CKD. Additionally, we assessed the generalizability of NORA on the UCI CKD dataset, demonstrating its effectiveness for CKD risk stratification across distinct patient cohorts.
Related papers
- A Disease-Centric Vision-Language Foundation Model for Precision Oncology in Kidney Cancer [54.58205672910646]
RenalCLIP is a visual-language foundation model for characterization, diagnosis and prognosis of renal mass.<n>It achieved better performance and superior generalizability across 10 core tasks spanning the full clinical workflow of kidney cancer.
arXiv Detail & Related papers (2025-08-22T17:48:19Z) - Testing Causal Explanations: A Case Study for Understanding the Effect of Interventions on Chronic Kidney Disease [1.2449538970962482]
We developed a methodology that uses a large observational electronic health record dataset.
Principles of regression discontinuity were used to derive randomized data subsets to test expert-driven interventions.
This methodology demonstrates how real-world EHR data can be used to provide population-level insights to inform improved healthcare delivery.
arXiv Detail & Related papers (2024-10-15T20:34:36Z) - AI-Driven Predictive Analytics Approach for Early Prognosis of Chronic Kidney Disease Using Ensemble Learning and Explainable AI [0.2399911126932527]
Chronic Kidney Disease (CKD) is a heterogeneous disorder that significantly impacts kidney structure and functions, eventually leading to kidney failure.<n>The goal of this research is to visualize dominating features, feature scores, and values exhibited for early prognosis and detection of CKD using ensemble learning and explainable AI.
arXiv Detail & Related papers (2024-06-10T18:46:14Z) - Explainable Machine Learning System for Predicting Chronic Kidney Disease in High-Risk Cardiovascular Patients [0.0]
This research developed an explainable machine learning system for predicting Chronic Kidney Disease (CKD) in patients with cardiovascular risks.
The Random Forest model achieved the highest sensitivity of 88.2%.
arXiv Detail & Related papers (2024-04-17T07:59:33Z) - Learning to diagnose cirrhosis from radiological and histological labels
with joint self and weakly-supervised pretraining strategies [62.840338941861134]
We propose to leverage transfer learning from large datasets annotated by radiologists, to predict the histological score available on a small annex dataset.
We compare different pretraining methods, namely weakly-supervised and self-supervised ones, to improve the prediction of the cirrhosis.
This method outperforms the baseline classification of the METAVIR score, reaching an AUC of 0.84 and a balanced accuracy of 0.75.
arXiv Detail & Related papers (2023-02-16T17:06:23Z) - RoS-KD: A Robust Stochastic Knowledge Distillation Approach for Noisy
Medical Imaging [67.02500668641831]
Deep learning models trained on noisy datasets are sensitive to the noise type and lead to less generalization on unseen samples.
We propose a Robust Knowledge Distillation (RoS-KD) framework which mimics the notion of learning a topic from multiple sources to ensure deterrence in learning noisy information.
RoS-KD learns a smooth, well-informed, and robust student manifold by distilling knowledge from multiple teachers trained on overlapping subsets of training data.
arXiv Detail & Related papers (2022-10-15T22:32:20Z) - SurvLatent ODE : A Neural ODE based time-to-event model with competing
risks for longitudinal data improves cancer-associated Deep Vein Thrombosis
(DVT) prediction [68.8204255655161]
We propose a generative time-to-event model, SurvLatent ODE, which parameterizes a latent representation under irregularly sampled data.
Our model then utilizes the latent representation to flexibly estimate survival times for multiple competing events without specifying shapes of event-specific hazard function.
SurvLatent ODE outperforms the current clinical standard Khorana Risk scores for stratifying DVT risk groups.
arXiv Detail & Related papers (2022-04-20T17:28:08Z) - Intelligent Sight and Sound: A Chronic Cancer Pain Dataset [74.77784420691937]
This paper introduces the first chronic cancer pain dataset, collected as part of the Intelligent Sight and Sound (ISS) clinical trial.
The data collected to date consists of 29 patients, 509 smartphone videos, 189,999 frames, and self-reported affective and activity pain scores.
Using static images and multi-modal data to predict self-reported pain levels, early models show significant gaps between current methods available to predict pain.
arXiv Detail & Related papers (2022-04-07T22:14:37Z) - An Explainable Classification Model for Chronic Kidney Disease Patients [0.0]
Chronic Kidney Disease (CKD) is experiencing a globally increasing incidence and high cost to health systems.
The employment of data mining to discover subtle patterns in CKD indicators would contribute to an early diagnosis.
This work develops a classifier model that would support healthcare professionals in the early diagnosis of CKD patients.
arXiv Detail & Related papers (2021-05-21T14:09:43Z) - Variational Knowledge Distillation for Disease Classification in Chest
X-Rays [102.04931207504173]
We propose itvariational knowledge distillation (VKD), which is a new probabilistic inference framework for disease classification based on X-rays.
We demonstrate the effectiveness of our method on three public benchmark datasets with paired X-ray images and EHRs.
arXiv Detail & Related papers (2021-03-19T14:13:56Z) - Sparse Longitudinal Representations of Electronic Health Record Data for
the Early Detection of Chronic Kidney Disease in Diabetic Patients [6.040252097102974]
Chronic kidney disease (CKD) is a gradual loss of renal function over time.
We propose a novel framework to learn sparse longitudinal representations of patients' medical records.
arXiv Detail & Related papers (2020-11-09T22:07:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.